Fable 5 Deadline Rush, GLM 5.2 Rigs, Seedance Ads — AI Daily Jul 04
633 messages · 63 active members
Overview
Topics
With Fable 5 leaving Max plans on July 7, builders are stacking 3-6 subscriptions, using gmail dot/plus tricks, and burning weekly resets. The settled pattern: Fable for planning, skill audits, and orchestrating sub-agents (Hermes, OpenClaw, Codex), then hand execution to Sonnet/Opus/Codex. Plan mode and explicit 'answer only' prompts are the fix for Fable silently executing code when only asked questions.
Mid-session auto-downgrades from Fable 5 to Opus 4.8 are torching expensive multi-agent runs. @c_1media's fix: `/config` and set 'Switch models when a message is flagged' to false. Separately, Fable 5's copy guardrails are stricter than Opus 4.8, silently rewriting or blocking marketing text — pushing performance marketers toward Opus + orchestrator patterns and Nano Banana for text-in-image work.
@jcartu argued a ~$150k EUR 8x 6000-card node (or ~$30k/mo B200 rental) runs FP8 GLM 5.2 at 120+ tps for 8 concurrent devs, with GLM 5.5 in August rumored at Opus 4.8 quality plus vision. Draft models (dflash) roughly double throughput — DeepSeek v4 flash jumped from ~180 to 360-400 tps. @robinroy proposed splitting hardware costs across a small operator group.
@arielletolome shared a full ad-creative stack: Higgsfield CLI for Seedance generation, Hetzner RTX 6000 for 480p→4K upscaling, reusable body+CTA clips with swappable hooks, and OpenClaw agents triggered via Slack. Wan 2.2 was ruled out for weak lip sync and 8s duration. @artmalk opened a parallel thread on ingesting 3 months of ad videos, transcripts, and $1.5M+ spend data into a searchable insight system — TwelveLabs vs vector DB vs Obsidian.
Twitter chatter pegs July 7 for a possible new GPT drop (GPT 5.6 rumored) and July 17 for Gemini 3.5 Pro. GPT-5.5 Extra High is the practical default; Cyber's KYC requirement is friction. @thewildzeno pushed back on AI-designs-side-effect-free-drugs hype — side effects are mechanism-linked — while granting big prototyping gains. Hermes v0.18.0 shipped with /journey timelines, MoA swarms, Vertex AI, per-channel Telegram overrides, and an Unbroker data-broker opt-out skill.
Key Takeaways
- Fable 5 leaves Max plans July 7 — stack plans now and use Fable for planning while offloading execution to Sonnet/Opus/Codex to conserve tokens.
- If Claude Code auto-downgrades to Opus 4.8 mid-run, `/config` and disable 'Switch models when a message is flagged' to stop expensive tangents.
- A ~$150k EUR 8x 6000-card rig runs FP8 GLM 5.2 at 120+ tps for 8 devs; draft models (dflash) are roughly doubling local inference throughput.
- Higgsfield CLI + Seedance + a Hetzner RTX 6000 for upscaling is a viable production ad-video stack; Wan 2.2 is not yet competitive on lip sync.
- Watch July 7 for a rumored new GPT release and July 17 for Gemini 3.5 Pro — builders are timing experiments and switching with zero loyalty.
Hot Threads
Fable 5 productivity gains, Max plan stacking, and telephony/dialer build
Local B200/6000 rigs running GLM 5.2 unlimited — ROI vs subscriptions and Opus fallback rants
Seedance ad pipeline with GPU-server upscaling and OpenClaw agents