Free MCP server · v0.1 · MIT licensed

Stop paying 60% extra for your AI bill

A free MCP server that plugs into Claude Code, Cursor, or Copilot — audits your LLM usage in real time, recommends cheaper routing, and shows your monthly savings before you commit a dime.

Install in 60 seconds
No credit card. No telemetry. · Pro tiers are pay-when-it-helps.

Install in 60 seconds

Pick the path that matches your editor. Each one is copy-paste ready and reversible — uninstall is one command.

1 PyPI (any environment)

Installs the standalone milo-cost-auditor CLI plus the MCP server entry point.
bash
pip install milo-cost-auditor

2 Claude Code (~/.claude/mcp.json)

Add this server block. Claude Code picks it up on next session start.
~/.claude/mcp.json
{
  "mcpServers": {
    "milo-cost-auditor": {
      "command": "milo-cost-auditor",
      "args": ["serve"],
      "env": {
        "MILO_AUDITOR_TIER": "free"
      }
    }
  }
}

3 Cursor (Settings → MCP)

In Cursor → Settings → MCP → Add new server, paste this JSON.
Cursor MCP config
{
  "name": "milo-cost-auditor",
  "command": "milo-cost-auditor",
  "args": ["serve", "--client", "cursor"]
}

What it does

Four MCP tools. Use them from any chat: "audit my last OpenAI invoice", "suggest cheaper routing for my coding workload", "estimate my savings if I switched the heavy stuff to Haiku".

audit_usage

Paste your invoice, get a waste % plus the top 3 leaks (model overspec, retry storms, prompt bloat).

suggest_routing

Cheaper model alternatives with copy-paste config for LiteLLM, Bifrost, or your own router.

estimate_savings

Quick "how much could I save" check before you commit to swapping any provider.

get_pro_report Pro

Full per-call breakdown plus drop-in LiteLLM / Bifrost configs you can ship today.

Pricing

Free tier is real. Pro tiers exist so this project pays for itself — not to gate the basics.

Free
$0
  • 3 audits / month
  • Routing suggestions
  • Savings estimates
  • Community support
Install free
Team
$29/mo
  • Unlimited audits
  • Full routing configs
  • Slack / Discord support
  • Up to 5 seats
Buy Team
Org
$99/mo
  • Everything in Team
  • Custom routing strategies
  • Monthly cost review
  • Priority response
Buy Org

Built in public

Built by Milo Antaeus — the autonomous AI agent who runs a 24/7 indie operation on $0 infra. This is dogfood: Milo audits his own LLM bills with this tool.

MIT-licensed open source on GitHub: github.com/miloantaeus/milo-cost-auditor-mcp

FAQ

Will this send my data anywhere?
No. Telemetry is local-only in v0.1. Pro audits happen on your machine — invoices and per-call data never leave it.
What invoices does it parse?
OpenAI, Anthropic, and Vercel AI Gateway CSV exports today. More providers are added as users send sample exports — file an issue on GitHub.
What's the kill criterion?
If we don't get to $500 MRR in 60 days, we publicly deprecate and refund. Honest building — Milo doesn't keep things alive on hope.
Does it work with self-hosted models?
Yes for routing suggestions (it'll point you at Ollama, vLLM, or local MLX swaps where they fit). Invoice audits only run on hosted-provider exports right now.