TITAN Credit Audit — 2026-05-09

Window: trailing 7 days (2026-05-02 → 2026-05-09)

Generated by: FORGE

Source of truth: ~/.claude/projects/*.jsonl via F:/TITAN/scripts/session_token_audit.py

---

1. Executive Summary

2. By-provider breakdown

| Provider | 7-day spend | Tracking quality | Notes |
|---|---|---|---|
| Anthropic (Claude Code) | $5,625.63 notional | ✅ Excellent (session_token_audit.py) | Almost all Opus (99.2% of spend), one project, 100% cache-hit ratio. Caching works; the burn comes from sheer call volume, cache replays, and output tokens. |
| Perplexity | $1.32 actual | ✅ Good (pplx-cost-log.jsonl, 39 calls, 7d) | Negligible. sonar-pro $0.04/call, sonar $0.01/call. |
| OpenAI (gpt-image-1, GPT-4/5) | Unknown | ❌ Not tracked | No log file found. The 9 vision-app images today (~$0.90) are not in any TITAN log. |
| ElevenLabs (voice) | Unknown | ❌ Not tracked | voice.log exists but contains no cost record. |
| AWS Bedrock (direct) | ~$0 | ✅ Code search | Only 3 scripts reference bedrock-runtime (innerverse_apps_nightly, lambda_innerverse_nightly, titan_bridge). No daily cron uses Bedrock for LLM calls. |
| AWS infra (SES/Lambda/S3/CF) | ~$0–5/mo est. | ⚠️ daily-aws-cost.log exists, not aggregated | Roughly flat. SES sends ~10/day. |
| Resend | $5/mo | ✅ Self-reported (external-spend.jsonl) | Flat. |
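
As a quick sanity check on the Perplexity row, the sketch below asks whether $1.32 across 39 calls is reachable with the two listed per-call prices. It assumes every call was billed at exactly $0.04 or $0.01 (the memo does not confirm this), and `split_calls` is a hypothetical helper name. Amounts are kept in integer cents to avoid floating-point drift.

```python
# Figures from the Perplexity row of the table, expressed in cents.
TOTAL_CENTS = 132      # $1.32 actual
CALLS = 39             # calls in the 7-day window
SONAR_PRO_CENTS = 4    # $0.04/call
SONAR_CENTS = 1        # $0.01/call


def split_calls(total_cents, calls, pro_cents, base_cents):
    """Solve pro*pro_cents + (calls - pro)*base_cents == total_cents.

    Returns (pro_calls, base_calls), or None if no whole-number split exists.
    Assumption: each call costs exactly one of the two listed prices.
    """
    numerator = total_cents - calls * base_cents
    pro, remainder = divmod(numerator, pro_cents - base_cents)
    if remainder != 0 or not 0 <= pro <= calls:
        return None
    return pro, calls - pro


print(split_calls(TOTAL_CENTS, CALLS, SONAR_PRO_CENTS, SONAR_CENTS))  # (31, 8)
```

Under that assumption the row is internally consistent: 31 sonar-pro calls plus 8 sonar calls sum to $1.32.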

Finding F-1 (see red flag #2 in section 6): the external-spend.jsonl and llm-costs.jsonl publishers have been stale for 19–20 days (per nightly report 2026-05-09, lines 36–37). TITAN has no live ingestion of the Anthropic billing dashboard, OpenAI usage, or ElevenLabs character counts. Anything not in the Claude Code session JSONL is dark.
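
A staleness check like the one behind this finding could be sketched as below. This is a minimal illustration, not the nightly report's actual logic: the function name `stale_logs`, the 2-day threshold, and the relative file locations are all assumptions (the memo names the files but not their directory).

```python
import time
from pathlib import Path

STALE_AFTER_DAYS = 2  # hypothetical threshold, not taken from the memo


def stale_logs(paths, max_age_days=STALE_AFTER_DAYS, now=None):
    """Return {path: age_in_days} for logs older than max_age_days.

    A missing file is reported with age None.
    """
    now = time.time() if now is None else now
    report = {}
    for p in map(Path, paths):
        if not p.exists():
            report[str(p)] = None
            continue
        age_days = (now - p.stat().st_mtime) / 86400
        if age_days > max_age_days:
            report[str(p)] = round(age_days, 1)
    return report


if __name__ == "__main__":
    # Hypothetical locations; adjust to wherever the publishers write.
    for path, age in stale_logs(["external-spend.jsonl", "llm-costs.jsonl"]).items():
        print(path, "missing" if age is None else f"stale {age}d")
```

Using file modification time is the simplest signal; checking the timestamp of the last JSONL record would be stricter, but it depends on a record schema the memo does not describe.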

3. Today's 24-hour spike (2026-05-09)

What ran today: 5 MANIFEST forge agents, 5 GAUNTLET, 5 SADHANA (+2 replacements), 5 Innerverse upgrades, 3 newsletter scripts, project registry agent, orphan-distro cleanup, tasks dashboard agent, and this credit audit: ~26 sub-agents in one day. Each sub-agent is its own Opus session at ~$0.30–$0.60/turn × 30–60 turns, which is roughly $9–$36 per agent. 26 agents × $20 average ≈ $520, the same order of magnitude as the $345 measured (caching pulls the actual figure below the estimate).
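
The back-of-envelope estimate above can be reproduced in a few lines. Multiplying the endpoints of the per-turn and turn-count ranges gives $9 and $36 per agent. All inputs are the memo's own figures; `per_agent_range` is a hypothetical helper name.

```python
def per_agent_range(cost_per_turn, turns):
    """Return (low, high) cost per agent from (low, high) cost/turn and turn counts."""
    return cost_per_turn[0] * turns[0], cost_per_turn[1] * turns[1]


low, high = per_agent_range((0.30, 0.60), (30, 60))

agents = 26           # approximate sub-agent count for the day
assumed_avg = 20.0    # the memo's assumed average cost per agent, in dollars
measured = 345.0      # measured notional spend for the day, in dollars
estimate = agents * assumed_avg

print(f"per agent: ${low:.0f} to ${high:.0f}")             # per agent: $9 to $36
print(f"day estimate: ${estimate:.0f}")                    # day estimate: $520
print(f"measured / estimate: {measured / estimate:.2f}")   # measured / estimate: 0.66
```

The measured figure is about two thirds of the naive estimate, which matches the memo's reading that cache hits reduce the real cost.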

The spike vs the trend: May 7 = $1,525 (peak), May 3 = $1,335, May 5 = $1,024, May 4 = $826. Today's $345 is actually below the recent average. The "burn rate problem" Harnoor noticed is that the last 7 days have averaged $804/day notional ($5,625.63 / 7). This is the new normal, not a one-off.

4. Top 5 Recommendations (priority-ordered)

R1. Hard model-tiering for sub-agents — Sonnet by default, Opus on demand.

R2. Fix the publisher gap on external-spend.jsonl / llm-costs.jsonl.

R3. Cap parallel sub-agent fan-out per session.

R4. Throttle / kill scheduled tasks that are noisy or duplicative.

R5. One-prompt-per-agent rule for routine tasks.
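
R1 and R3 could be combined in a small dispatch layer, sketched below under stated assumptions. The tier labels, the task-kind allow-lists, the cap of 3, and the `launch` callback are all hypothetical; nothing here reflects how TITAN currently spawns sub-agents, and real model identifiers are deliberately not guessed.

```python
from concurrent.futures import ThreadPoolExecutor

DEFAULT_TIER = "sonnet"                                  # R1: Sonnet by default
OPUS_TASKS = {"architecture-review", "weekly-report"}   # hypothetical allow-list
HAIKU_TASKS = {"watchdog", "health-check"}              # hypothetical allow-list
MAX_PARALLEL_SUBAGENTS = 3                               # R3: hypothetical cap


def pick_tier(task_kind, force_opus=False):
    """Choose a model tier for a task; Opus only on demand or by allow-list."""
    if force_opus or task_kind in OPUS_TASKS:
        return "opus"
    if task_kind in HAIKU_TASKS:
        return "haiku"
    return DEFAULT_TIER


def run_subagents(tasks, launch, max_parallel=MAX_PARALLEL_SUBAGENTS):
    """Call launch(task_kind, tier) for each task, at most max_parallel at once.

    Results are returned in the same order as tasks.
    """
    with ThreadPoolExecutor(max_workers=max_parallel) as pool:
        futures = [pool.submit(launch, kind, pick_tier(kind)) for kind in tasks]
        return [f.result() for f in futures]


if __name__ == "__main__":
    # Stub launcher that only reports what it would have started.
    print(run_subagents(["watchdog", "inventory", "weekly-report"],
                        lambda kind, tier: (kind, tier)))
```

The thread pool size is what enforces the fan-out cap: extra tasks queue instead of starting immediately, so a burst of 26 requests would run at most three at a time.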

5. Sweep table — every recurring cost surface

| Surface | What it is | Cost class | Verdict | Action |
|---|---|---|---|---|
| titan-daily-pa-email (cron 05:27) | Daily brief email | Free (SES) | ✅ keep | none |
| titan-daily-token-audit (cron 06:30) | This very script | Free (pure Python) | ✅ keep | confirm cron is enabled; running it manually today gave fresh data |
| titan-daily-newsletter | OpenClaw / Agentic AI newsletter | Pplx + SES | ✅ keep | $0.04/run |
| titan-batch-poll-15m | Anthropic batch API poll | Free (poll only) | ✅ keep | none |
| titan-inbox-watch | Inbox queue watcher | Free | ✅ keep | none |
| titan-master-batch-nightly | Nightly batch dispatch | Bedrock batch (cheap) | ✅ keep | already on cheapest tier |
| nightly-report-writer | Builds nightly report | Haiku-only per memo line 3 | ✅ keep | already token-cheap |
| swarm-health-orchestrator | Health checks + respawn | Likely Sonnet/Opus | ⚠️ tune | confirm model = Sonnet, not Opus |
| claude-code-audit-every-6h | Self-audit via claude CLI | Likely Opus | ⚠️ tune | force Sonnet, or convert to pure-Python audit |
| agentic-247-watchdog | Continuous watchdog | Likely Opus | ⚠️ tune | downgrade to Haiku; checks are simple |
| titan-claude-weekly | Weekly report | Opus | ⚠️ tune | weekly Opus is fine, but cap output_tokens |
| titan-agentic-ai-weekly | Weekly newsletter | Pplx + SES | ✅ keep | cheap |
| titan-openclaw-weekly | Weekly newsletter | Pplx + SES | ✅ keep | cheap |
| titan-weekly-reminder | Reminder | Free | ✅ keep | none |
| job-search-daily-digest | Job digest | Pplx | ✅ keep | $0.04/day |
| Sub-agent fan-out (today: 26) | Parent-spawned forge/scout/etc | Opus default | ❌ THROTTLE | apply R1 + R3 immediately |
| pplx.py ad-hoc calls | Research | $0.01–0.04/call | ✅ keep | already cheap and metered |
| vision app gpt-image-1 | Daily image gen (9 today) | $0.10/image | ⚠️ tune | log to llm-costs; cap at 5/day |
| ElevenLabs voice gen | Voice synthesis | unknown | ⚠️ tune | start logging characters/$ |
| AWS SES sends (~10/day) | Email | $0.0001/send | ✅ keep | none |
| AWS Lambda Innerverse nightly | 5 apps × 1 nightly | Bedrock (cheap model) | ✅ keep | already cheap |
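
For the two untracked surfaces in the table (gpt-image-1 and ElevenLabs), logging could start with an append-only JSONL writer like the sketch below. The record fields and the `log_cost` name are assumptions; the actual schema of llm-costs.jsonl is not shown in this memo and should take precedence if it differs.

```python
import json
from datetime import datetime, timezone
from pathlib import Path


def log_cost(log_path, provider, units, unit_name, usd=None, note=""):
    """Append one spend record as a JSON line and return it.

    usd may be None when the price is not yet known (e.g. voice characters).
    Field names are hypothetical, not the existing llm-costs.jsonl schema.
    """
    record = {
        "ts": datetime.now(timezone.utc).isoformat(),
        "provider": provider,
        "units": units,
        "unit_name": unit_name,
        "usd": usd,
        "note": note,
    }
    path = Path(log_path)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(record) + "\n")
    return record
```

Today's nine images would be recorded as `log_cost("llm-costs.jsonl", "openai/gpt-image-1", 9, "images", usd=round(9 * 0.10, 2))`. A voice call would pass its character count with `usd=None` until a per-character rate is confirmed.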

6. Three Biggest Red Flags

1. No model tiering. Opus = 99.2% of 7-day burn ($5,581 of $5,626). Even cron-scheduled audits and routine inventory agents are running on Opus. This is the single biggest fixable lever — switching the default for sub-agents to Sonnet would cut weekly notional burn by ~70%.

2. Spend visibility is broken outside Claude Code. external-spend.jsonl and llm-costs.jsonl are 19–20 days stale. OpenAI image gen, ElevenLabs voice, and any direct Bedrock calls are completely untracked. We are flying blind on every provider except Anthropic-via-Claude-Code and Perplexity.

3. The spike isn't a spike — it's the new baseline. 7-day daily average = $804 notional. Today's $345 is below average. May 7 hit $1,525 in a single day. The pattern is "Harnoor opens claude → spawns 5+ sub-agents → repeat 3x" and it has been running this way for at least a week. Without R1+R3 above, this trajectory continues.
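
The ~70% figure in red flag 1 depends on two inputs the memo does not state: how much of the Opus spend actually moves to Sonnet, and the Sonnet-to-Opus price ratio. The sketch below makes that dependency explicit. The 0.2 ratio and the moved fractions are illustrative assumptions, not quoted prices, and the model assumes token volume stays the same after the switch.

```python
def projected_cut(opus_share, fraction_moved, sonnet_to_opus_price_ratio):
    """Fraction of total notional burn removed by moving part of Opus spend to Sonnet."""
    return opus_share * fraction_moved * (1.0 - sonnet_to_opus_price_ratio)


opus_share = 5581 / 5626          # from the memo: about 99.2% of 7-day burn

for moved in (0.5, 0.75, 0.9):    # hypothetical fractions of Opus spend moved
    cut = projected_cut(opus_share, moved, sonnet_to_opus_price_ratio=0.2)  # assumed ratio
    print(f"moved {moved:.0%}: cut {cut:.0%}")
```

Under these assumptions, a cut near 70% requires moving roughly nine tenths of the current Opus spend; a less favourable price ratio or a smaller moved share lowers the saving proportionally.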

---

Memo author: FORGE

Source script: F:/TITAN/scripts/session_token_audit.py --window 7d

Path: F:/TITAN/plans/TITAN-CREDIT-AUDIT-2026-05-09.md