Feedback: Cache AI-generated assets on S3, never regenerate

Date: 2026-05-09

Source: Harnoor (explicit, durable rule)

Confidence: explicit · durable

Rule

ANY file generated by a paid AI service (ElevenLabs audio, gpt-image-1 images, gpt-5/Bedrock long-form text, Veo 3 video, future PDF/Word generators, etc.) is a one-shot expense. Once it's saved to S3, never pay to regenerate it.

How to apply (every generator must follow this pattern)


import boto3
from botocore.exceptions import ClientError

# BUCKET and REGION are assumed to be module-level constants defined elsewhere.

def gen_or_fetch(s3_key: str, generator_fn, mime: str):
    """Generate-once, cache-on-S3 pattern. Use everywhere.

    mime is the Content-Type of the generated file, e.g. "audio/mpeg".
    """
    s3 = boto3.client("s3")
    url = f"https://{BUCKET}.s3.{REGION}.amazonaws.com/{s3_key}"
    try:
        s3.head_object(Bucket=BUCKET, Key=s3_key)
        return url  # already exists, skip the paid call
    except ClientError as e:
        # head_object reports a missing key as "404"; anything else is a real error
        if e.response["Error"]["Code"] not in ("404", "NoSuchKey"):
            raise
    # Doesn't exist: generate, upload, return URL
    bytes_blob = generator_fn()
    s3.put_object(Bucket=BUCKET, Key=s3_key, Body=bytes_blob, ContentType=mime, ACL="public-read")
    return url
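
To check the generate-once behaviour without touching AWS, one option is a dict-backed stand-in for the bucket. The sketch below is illustrative only: `FakeStore`, `gen_or_fetch_local`, and the `fake://` URL scheme are hypothetical names, not part of the existing codebase. It shows that the generator runs only on the first call for a given key.

```python
from typing import Callable, Dict, List


class FakeStore:
    """Hypothetical in-memory stand-in for an S3 bucket (illustration only)."""

    def __init__(self) -> None:
        self.objects: Dict[str, bytes] = {}

    def exists(self, key: str) -> bool:
        return key in self.objects

    def put(self, key: str, body: bytes) -> None:
        self.objects[key] = body


def gen_or_fetch_local(store: FakeStore, key: str,
                       generator_fn: Callable[[], bytes]) -> str:
    """Same check-then-generate flow, but against the in-memory store."""
    if not store.exists(key):
        # The only place the (paid) generator is invoked.
        store.put(key, generator_fn())
    return f"fake://{key}"


calls: List[int] = []


def fake_generator() -> bytes:
    calls.append(1)  # count how many times we "paid"
    return b"audio-bytes"


store = FakeStore()
key = "manifest/summon/harnoor/2026-05-09/episode.mp3"
first = gen_or_fetch_local(store, key, fake_generator)
second = gen_or_fetch_local(store, key, fake_generator)
```

After both calls, `calls` holds a single entry and `first == second`, which is the property every real generator should preserve.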

S3 key naming convention to enable cache hits:

Date-stamped: manifest/summon/harnoor/2026-05-09/episode.mp3 ← deterministic per user per day
Fixed-name: manifest/echo/img/oracle.png ← stable name, generated once forever
Prompt-hash for one-off: gen-cache/{sha256(prompt)}.png ← any generator, dedupe identical prompts
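
The key conventions above only produce cache hits if the key is built the same way every time. A minimal sketch of two key builders, assuming hypothetical helper names `dated_key` and `prompt_hash_key` (neither exists in the current scripts) and the path layouts shown in the examples:

```python
import hashlib
from datetime import date


def dated_key(app: str, user: str, day: date, filename: str) -> str:
    """Deterministic per-user, per-day key, e.g. for a daily episode."""
    return f"manifest/{app}/{user}/{day.isoformat()}/{filename}"


def prompt_hash_key(prompt: str, ext: str = "png") -> str:
    """Key derived from the prompt text, so identical prompts dedupe."""
    digest = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
    return f"gen-cache/{digest}.{ext}"


episode_key = dated_key("summon", "harnoor", date(2026, 5, 9), "episode.mp3")
image_key = prompt_hash_key("a calm oracle, cartoon style")
```

Note that the prompt hash is sensitive to whitespace and casing: two prompts that differ by a trailing space produce different keys. Normalizing the prompt before hashing would be a design choice to make per generator.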

Asset types this covers

| Asset type | Provider | Approx. cost | Cache? |
|---|---|---|---|
| Cartoon images | OpenAI gpt-image-1 | $0.10–0.20 | YES |
| AI video clips | Veo 3 / Sora | $0.10–1.00/sec | YES |
| Long-form text essays | Claude Sonnet/Opus | $0.01–0.10 | YES |
| Word/.docx | local Python | $0 | YES |

When NOT to cache

Skip the cache only for content that is genuinely personalized per request, where the user's input IS the variable (e.g. a live chat reply). Daily-scheduled content (Childhood letter, Summon episode, daily darshan, today's intent) is always cached.

Existing assets already following this rule

Childhood letter MP3 (Lambda Q14)
Summon episode MP3 (Lambda Q22)
38 cartoon PNGs across Manifest studio (Q-MANIFEST-UPGRADES)
Vision board 9 cartoon tiles
Childhood note.mp3 + letter.json sidecar

Existing generators that need patching to add this pattern

lambda_innerverse_nightly.py already does this for Childhood + Summon; confirm that Dreams/Oracle/Childhood/Timelines all check S3 before regenerating
Any future PDF generator (job-application-generator, advisor memo PDF) must check S3 first
Future Word doc generator (advisor memos in .docx) — same

Provider preference order (image) — updated 2026-05-10 16:00

All three working. Pick by use-case:

Default routing: OpenAI for daily/volume (Tier A) since it has auto-recharge and works. Switch to Imagen 4 for any "this must be amazing" hero shot. Bedrock only as fallback.

Routing rule:

If app is Harnoor-facing daily ritual content → Tier B (Bedrock when re-enabled, fallback to A)
If app is share-card hero / launch asset → Tier A (OpenAI)
If marketing material at scale / PH launch → Tier C (Gemini)
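
The three routing rules listed above can be sketched as a small lookup. This is an illustration under stated assumptions: the `Tier` enum, the `pick_image_tier` function, the use-case strings, and the `bedrock_enabled` flag are all hypothetical, and the sketch encodes only the rules as written in this list.

```python
from enum import Enum


class Tier(Enum):
    A = "openai"
    B = "bedrock"
    C = "gemini"


def pick_image_tier(use_case: str, bedrock_enabled: bool = False) -> Tier:
    """Map a use case to a provider tier, per the routing rule list."""
    if use_case == "daily_ritual":
        # Tier B once Bedrock access is re-enabled, otherwise fall back to A.
        return Tier.B if bedrock_enabled else Tier.A
    if use_case == "hero_asset":
        return Tier.A
    if use_case == "marketing_at_scale":
        return Tier.C
    raise ValueError(f"unknown use case: {use_case!r}")


today = pick_image_tier("daily_ritual")
later = pick_image_tier("daily_ritual", bedrock_enabled=True)
```

Raising on an unknown use case, rather than silently defaulting to a tier, keeps a typo from sending volume traffic to the wrong provider.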

CLI wrappers:

Tier A: python F:/TITAN/scripts/gen_glyphs_inline.py (script ready) or direct openai.images.generate(model="gpt-image-1")
Tier B: F:/TITAN/scripts/bedrock_image.py (built, awaits Bedrock model-access re-enable)
Tier C: F:/TITAN/scripts/gemini_image.py (built, awaits Gemini billing)

Provider preference order (video) — updated 2026-05-10 16:00

| Tier | Provider | Use case | Cost per clip | Quality | Status |
|---|---|---|---|---|---|
| A (default) | Google Veo 3 (veo-3.0-generate-001, :predictLongRunning) | Hero clips for MIRROR / VISION / SUMMON / OMEN / ECHO daily content + marketing | ~$0.50–1.50 (4–8 sec) | ★★★★★ (with synced audio, cinematic) | ✅ live (paid tier active 2026-05-10) |

Daily-content video budget: at ~$1 per Veo 3 clip, one clip per app per day × 5 apps × 30 days = ~$150/mo for the full Manifest+Mirror video pipeline. Decide per app whether the cinematic upgrade is worth it.
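
The budget figure can be reproduced, and bounded using the ~$0.50–1.50 per-clip range from the table, with a few lines of arithmetic. The function name `monthly_video_cost` and the one-clip-per-app-per-day default are assumptions made for this sketch.

```python
def monthly_video_cost(cost_per_clip: float, apps: int,
                       days: int = 30, clips_per_app_per_day: int = 1) -> float:
    """Monthly spend if every app generates its clips once and caches them."""
    return cost_per_clip * apps * days * clips_per_app_per_day


estimate = monthly_video_cost(1.00, apps=5)  # midpoint used in the note
low = monthly_video_cost(0.50, apps=5)       # bottom of the per-clip range
high = monthly_video_cost(1.50, apps=5)      # top of the per-clip range
```

With these inputs the estimate is $150/mo, with a range of $75 to $225 depending on clip length. Any regeneration of an already-cached clip adds to this directly, which is why the cache check matters most for video.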

CLI wrapper: F:/TITAN/scripts/gemini_video.py (built; needs model name confirmed as veo-3.0-generate-001 and method :predictLongRunning).

Smoke test results 2026-05-10 16:00


Imagen 4 (1024x1024)    → 1080 KB PNG saved ✅
Veo 3 (4 sec, 16:9)     → 1172 KB MP4 in ~30 sec ✅
Nano Banana             → returns no image (likely content-filter quirk on test prompt; secondary)
gemini-2.5-flash text   → free tier, working