Declared by: Harnoor
Declared at: 2026-04-25T13:40 UTC
Priority: S-tier
Status: QUEUED — research begins after current ship queue clears
Bookmark: YES (Harnoor explicitly asked for visible tracking)
---
> "android an iPhone have the features where people can transcribe locally try to use that intelligently so we don't have to process the text on the server side. People could send text from their side right so can the chat be integrated in a way where when they speak it uses their native CPU on their iPhone or whatever software and then the ability to transfer on their side and then send it over that can save us some time some CPU cost. I don't know if it's possible to intelligently included into the chat somehow, but you had that feature regardless I would love to see some innovation where you can actually include that to save us 50% of the cost and it will also reduce latency since it's being processed locally thank you so much so yeah this is something that you need to research the pros and cons of look at Claude code delete code as well as what's in the interface and see what they're doing while they're advocate thank you."
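Use on-device speech-to-text (the Web Speech API in browsers, native dictation on mobile) so the user's voice is transcribed on their own device, not by a Lambda → Deepgram/Whisper round-trip. One caveat for the research: some browser implementations of the Web Speech API, Chrome among them, send audio to the browser vendor's recognition service rather than transcribing on the device, so "local" needs to be verified per platform.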
Two things get better:
1. Cost: every voice turn currently spends Deepgram or Whisper minutes. Harnoor estimates "50% off." Plausible only if STT is a large share of total cost and most voice turns can move to the client.
2. Latency: eliminates the upload + STT round-trip. Text appears in the compose box the moment the user finishes speaking.
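To sanity-check the "50% off" estimate, the saving can be treated as the product of two fractions: the share of total cost that currently goes to server STT, and the share of voice turns that can move to the client. The sketch below is illustrative arithmetic only; the function name, field names, and any figures passed in are hypothetical, not measured values.

```typescript
// Hypothetical back-of-envelope model; none of these inputs are measured values.
interface CostInputs {
  sttShareOfCost: number;      // fraction of total cost currently spent on server STT (0..1)
  clientEligibleShare: number; // fraction of voice turns that could be transcribed client-side (0..1)
}

// Fraction of total cost saved if the eligible voice turns skip server STT.
function estimatedSavings({ sttShareOfCost, clientEligibleShare }: CostInputs): number {
  for (const v of [sttShareOfCost, clientEligibleShare]) {
    // The negated range check also rejects NaN.
    if (!(v >= 0 && v <= 1)) throw new RangeError("shares must be between 0 and 1");
  }
  return sttShareOfCost * clientEligibleShare;
}
```

Because the result can never exceed `sttShareOfCost`, a 50% saving is only reachable if STT accounts for at least half of total cost, even with every voice turn moved to the client.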
The constraint: must not regress emotion-aware experiences (Hume EVI prosody, voice tone, frustration detection). That work currently leans on server-side voice.
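One way to honour this constraint is to make the transcription path an explicit per-turn decision, so emotion-aware sessions keep sending audio to the server. The following is a minimal sketch under that assumption; `chooseSttPath`, the `needsProsody` flag, and the path names are hypothetical and not taken from the current codebase.

```typescript
// Hypothetical routing decision; all names here are illustrative, not existing code.
type SttPath = "client" | "server";

interface TurnContext {
  clientSttAvailable: boolean; // outcome of capability detection on the user's device
  needsProsody: boolean;       // true when the experience depends on voice tone (e.g. Hume EVI)
}

function chooseSttPath(ctx: TurnContext): SttPath {
  // Tone and prosody cannot be recovered from text, so these turns stay server-side.
  if (ctx.needsProsody) return "server";
  // Otherwise prefer the device, falling back to the server when it cannot transcribe.
  return ctx.clientSttAvailable ? "client" : "server";
}
```

Keeping the decision in one pure function makes the trade-off auditable: the cost saving applies only to turns where `needsProsody` is false.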
---
SpeechRecognition): interim + final results, single-shot
---
1. Memo at F:/TITAN/plans/advisors/CLIENT-SIDE-STT-RESEARCH-2026-04-25.md (3500-5000 words)
2. Implementation spec with capability detection JS snippet + fallback contract
3. R-number queue — recommend next 1-3 R-numbers to ship the feature
4. SES email to Harnoor per AGENT-REPORTS-EMAIL-DAILY directive — top findings + recommendation + memo path
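As a starting point for deliverable 2, browser capability detection can check for the `SpeechRecognition` constructor and its `webkit`-prefixed variant. The sketch below pairs that check with a hypothetical `VoiceTurnPayload` fallback contract in which the client sends either transcribed text or raw audio; the payload shape and function names are assumptions for illustration. The global object is passed in as a parameter so the logic can be exercised outside a browser.

```typescript
// Hypothetical fallback contract: send transcribed text when possible, audio otherwise.
type VoiceTurnPayload =
  | { kind: "text"; text: string; source: "client-stt" }
  | { kind: "audio"; audio: Uint8Array; source: "server-stt" };

// Minimal view of the global object; browsers may expose one or both constructors.
interface SpeechGlobals {
  SpeechRecognition?: unknown;
  webkitSpeechRecognition?: unknown;
}

// Returns the recognition constructor if present, or null when the server fallback is needed.
function detectSpeechRecognition(g: SpeechGlobals): unknown {
  return g.SpeechRecognition ?? g.webkitSpeechRecognition ?? null;
}

function buildPayload(
  g: SpeechGlobals,
  transcript: string | null,
  audio: Uint8Array
): VoiceTurnPayload {
  const text = transcript === null ? "" : transcript.trim();
  // Use the text path only when the device can transcribe and actually produced text.
  if (detectSpeechRecognition(g) !== null && text !== "") {
    return { kind: "text", text, source: "client-stt" };
  }
  // Otherwise fall back to sending audio for server-side STT.
  return { kind: "audio", audio, source: "server-stt" };
}
```

In a browser the caller would pass `window` as the first argument. An empty transcript deliberately falls back to audio, so a failed or silent on-device recognition does not drop the user's turn.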
---
Per Harnoor's own words: "this should be the last thing you do for other things."
Current queue ahead of A013:
1. CloudWatch alarm on upstream_error EMF metric (in flight)
2. Migrate titan-daily-pa-email + titan-daily-newsletter from Gmail draft to SES send
3. BUILD livegroweveryday dashboard (after Harnoor sign-off on plan)
A013 SCOUT spawn: after item 3 completes, or when Harnoor explicitly says "go on the STT research."
This brief lives at F:/TITAN/plans/advisors/research-queue/. Once built, the dashboard's /research/queue page will surface it prominently as the next scheduled research item. Ask ledger entry A013 also points back to this brief.
— TITAN · 2026-04-25