Milo Antaeus · free cost calculators · $299 deep triage

LLM Bill X-Ray

Drop a GitHub repo URL. Within 1 hour, get a deterministic audit of every LLM cost leak in your code — ranked by $/mo, with before/after diffs you can paste straight into a PR.

$79
one-time · 1-hour delivery
30-day money-back
→ Synthetic sample report ($4,180/mo, 5 findings) → LIVE analyzer output: anthropic-cookbook ($4,673/mo, 18 findings) → LIVE analyzer output: openai-cookbook ($560/mo, 7 findings)
The two live demos show why the analyzer is honest: same engine, very different repos, very different findings counts. We don't manufacture findings to justify the $79.

What's in the audit

1. Executive one-pager

Top 5 leaks ranked by $/mo. Total annualized savings. Read in 60 seconds, decide what to fix this week.

2. Before/after code diffs

Actual patch snippets for the top 3 leaks. Paste into a PR. No "go talk to a consultant" handwaving.

3. Token-burn map

Every LLM call site in your repo, with model, max_tokens, calls/mo, and $ burned. Concentrate effort where it matters.

4. Specific fix instructions

For each leak: confidence rating, expected savings, implementation effort (LOC), and quality-validation strategy.

5. 30-day re-audit voucher

Implement the fixes, then re-submit the same repo. We re-run the analysis. If bill didn't drop by $79+, full refund.

+ Vendor-specific tactics

Anthropic prompt caching · OpenAI Batch API · model-tier downshift · prefix-cache hit rate · Gemini context caching.

How it works

  1. Pay $79 via PayPal (top of page). You're redirected to a thank-you page that asks for your GitHub repo URL + email.
  2. Drop the repo URL (any GitHub repo you have access to — private OK, we use a read-only access token you generate yourself).
  3. Within 1 hour, you receive a personalized HTML report (like the sample) at a private URL.
  4. Implement the fixes. Most customers ship the top 3 within a sprint.
  5. 30 days later, redeem the re-audit voucher. We re-run and quantify.

What this isn't

This is...This is not...
A one-shot, static-analysis audit of your repoA monthly SaaS subscription with seat pricing
Code-level findings you can paste into a PRRuntime observability requiring prod-API integration
Deterministic regex + heuristics (no LLM-in-the-loop)"AI told me your code is bad" handwaving
Vendor-agnostic (Anthropic, OpenAI, Gemini, Mistral)Locked to one cloud or one provider
Anonymous (we never see your prod traffic)A security-scanning tool

First-3-customers beta pricing

This is a brand-new product. The cost-rule library is validated against 40+ real bills from the free LLM Bill Mini-Triage tool, but $79 LLM Bill X-Ray has shipped zero paid audits yet.

Honest first-customer offer: the first 3 customers pay $79 via PayPal normally; reply to your delivery email with subject "First-3 beta" and I'll PayPal-refund $30 (effective price: $49 / 38% off). In exchange: a 90-day follow-up audit and permission to anonymize learnings into the cost-rule library.

Why honest pricing: vendors inflate savings projections to optimize sign-ups; there's no sponsor here, no funnel to upsell into a $5K/mo retainer. If the audit doesn't find at least $79/mo in your bill, refund. If you implement the fixes and savings don't materialize, refund. The 30-day re-audit voucher is structural accountability, not marketing copy.

FAQ

Do you need access to my prod environment?
No. Static analysis only. You generate a GitHub read-only access token (we walk you through it on the intake page), we clone the repo, run the analysis, and discard the clone. No prod traffic, no API keys, no observability tooling.
How are you finding leaks without LLM-in-the-loop?
The analyzer is 11 deterministic regex + AST patterns for known cost-bloat: missing cache_control on Anthropic, wrong-model-tier for simple tasks (gpt-4 doing scoring/classification), oversized max_tokens relative to observed output, missing Batch API on offline jobs, repeated system prompts, etc. Deterministic means: 0% hallucination rate, 100% reproducible findings.
What if my repo is private?
You generate a fine-grained personal access token (PAT) scoped to read the single repo. Add it to the intake form. We clone, analyze, delete. The PAT can be revoked the second you receive the report.
What languages do you support?
v1 supports Python and TypeScript/JavaScript. Coming next: Go, Rust, Ruby. If your repo is in a language we don't support, refund.
What if you don't find $79 of savings?
Refund. The 30-day re-audit voucher is a structural accountability layer: implement the fixes, prove the savings didn't materialize, refund.
How is this different from your $299 LLM Bill Triage?
The $299 product is a deep human-in-the-loop audit including a 30-min consult call + monthly check-ins for 3 months. The $79 X-Ray is a one-shot automated audit. If you've already done $299 and want a quick follow-up scan on a specific service, X-Ray is the right tier.

Related

→ See a real sample report first ($4,180/mo of waste found)
Share this product
Share on X Share on LinkedIn Share on Reddit