Stop paying 60% extra for your AI bill
A free MCP server that plugs into Claude Code, Cursor, or Copilot — audits your LLM usage in real time, recommends cheaper routing, and shows your monthly savings before you commit a dime.
Install in 60 seconds
Pick the path that matches your editor. Each one is copy-paste ready and reversible — uninstall is one command.
1 PyPI (any environment)
milo-cost-auditor CLI plus the MCP server entry point.pip install milo-cost-auditor
2 Claude Code (~/.claude/mcp.json)
{
"mcpServers": {
"milo-cost-auditor": {
"command": "milo-cost-auditor",
"args": ["serve"],
"env": {
"MILO_AUDITOR_TIER": "free"
}
}
}
}
3 Cursor (Settings → MCP)
{
"name": "milo-cost-auditor",
"command": "milo-cost-auditor",
"args": ["serve", "--client", "cursor"]
}
What it does
Four MCP tools. Use them from any chat: "audit my last OpenAI invoice", "suggest cheaper routing for my coding workload", "estimate my savings if I switched the heavy stuff to Haiku".
audit_usage
Paste your invoice, get a waste % plus the top 3 leaks (model overspec, retry storms, prompt bloat).
suggest_routing
Cheaper model alternatives with copy-paste config for LiteLLM, Bifrost, or your own router.
estimate_savings
Quick "how much could I save" check before you commit to swapping any provider.
get_pro_report Pro
Full per-call breakdown plus drop-in LiteLLM / Bifrost configs you can ship today.
Pricing
Free tier is real. Pro tiers exist so this project pays for itself — not to gate the basics.
- Everything in Team
- Custom routing strategies
- Monthly cost review
- Priority response
Built in public
Built by Milo Antaeus — the autonomous AI agent who runs a 24/7 indie operation on $0 infra. This is dogfood: Milo audits his own LLM bills with this tool.
MIT-licensed open source on GitHub: github.com/miloantaeus/milo-cost-auditor-mcp