Opus 4.8 Token Burn, Hermes Agents, Claude Code Scaling — AI Daily May 29

386 messages · 69 active members

Top contributors: @Wootbro, @bofu2u, @arielletolome

Overview

Claude Opus 4.8 dominated the day, with builders reporting eye-watering token consumption: @GuruTime hit 15M tokens across 182 agents in 27 minutes, and @c_1media burned 40% of a weekly $200 Claude account in one ultracode session (~$850 API equivalent). Reactions were mixed: @sibunting praised smoother multi-repo orchestration and context handling, while @samtome and @mb29266 found 4.6 still better for ad copy. @filiuser argued Anthropic is deliberately designing for token bloat to drive subscription revenue. On the workflow side, @rstmaur and @Wootbro shared OMO team-mode orchestrator templates covering preflight, lane planning, parallel swarm waves, fan-in verification, and proof audits.

Hermes voice agents and orchestration patterns drove a major parallel thread. @rmktg shared his workflow for spinning up new bot profiles in minutes via botfather + orchestrator hand-off, @kennyaronson detailed a Klaviyo manager agent building 14-day customer sequences plus calendar-driven scheduling, and @navuud demoed 'Migi', a Telegram userbot on pyrofork + GPT-realtime that lets her prompt agentic coding sessions by voice from the beach.

Meanwhile, @danfeldman's question on scaling 40+ Claude Code sessions across teammates surfaced SSH+tmux session scripts (@geilt) and Dropbox-shared project folders with Telegram bridges (@weslindquist).

Side threads covered AWS launching Claude on Bedrock plus Claude Cowork (with @geilt clarifying only true Bedrock keeps data in your AWS account), video generation economics (kie.ai at $1.16/15s vs Seedance pricing), Oura vs Whoop vs Fitbit for sleep tracking, and @jcartu's continued evangelism of GLM 5.1 as an Opus substitute. Codex computer use landed on Windows, and Hermes had another outage hitting traveling users.

Topics

Claude Opus 4.8: Token Burn, Ultracode & Dynamic Workflows · 57 msgs

Builders reported massive token consumption on Opus 4.8 — @GuruTime hit 15M tokens / 182 agents / 27 min, @c_1media burned $850-equivalent in one ultracode run. @sibunting praised smoother context handling while @samtome and @mb29266 preferred 4.6 for ad copy. @Wootbro shared a full OMO team-mode orchestrator template (preflight → lane planning → swarm waves → fan-in → proof audit) that several members said mirrored what they were independently building.
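To put @GuruTime's run in perspective, here is a quick back-of-the-envelope calculation using only the three figures reported in the thread. The per-agent and per-minute numbers are simple averages; how tokens were actually distributed across agents was not reported.

```python
# Figures as reported in the thread; everything derived below is an average.
total_tokens = 15_000_000
agents = 182
minutes = 27

tokens_per_agent = total_tokens / agents
tokens_per_minute = total_tokens / minutes

print(f"{tokens_per_agent:,.0f} tokens per agent")    # 82,418 tokens per agent
print(f"{tokens_per_minute:,.0f} tokens per minute")  # 555,556 tokens per minute
```

That works out to roughly 82k tokens per agent and over half a million tokens per minute on average.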

Hermes Voice Agents & Multi-Bot Orchestration · 33 msgs

@rmktg detailed spinning up new Hermes bot profiles in minutes via botfather + orchestrator hand-off, with bots auto-reading prior notes. @kennyaronson shared a Klaviyo manager agent building 14-day sequences plus a calendar-driven scheduling system. @navuud demoed 'Migi', a Telegram userbot on pyrofork + GPT-realtime for voice-prompted coding sessions. Hermes had another outage hitting traveling users.

Scaling Claude Code Across Teams & AWS Bedrock Launch · 26 msgs

@danfeldman runs ~40 Claude Code sessions on a dedicated machine and is hitting concurrency limits sharing via AnyDesk. Solutions: SSH + tmux per-user session scripts (@geilt), Dropbox-shared project folders with Telegram bridge (@weslindquist). AWS rolled out Claude on Bedrock plus Claude Cowork — @geilt clarified only true Bedrock deployment keeps prompts inside your AWS account; the Anthropic Platform route still ships data to Anthropic.
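The SSH + tmux approach could look something like the sketch below. This is not @geilt's actual script, which was not reproduced in the digest; the `claude-<user>` session naming and the project path are hypothetical. The function only builds the shell commands a teammate would run after SSHing into the shared machine, so each person gets their own persistent tmux session instead of competing for one remote-desktop seat.

```python
import shlex


def tmux_session_commands(user: str, project_dir: str) -> list[str]:
    """Build shell commands that create (if missing) and attach a per-user tmux session."""
    session = f"claude-{user}"  # hypothetical naming scheme, one session per teammate
    target = shlex.quote(f"={session}")  # leading "=" asks tmux for an exact name match
    name = shlex.quote(session)
    workdir = shlex.quote(project_dir)
    return [
        # Create a detached session in the project directory only if none exists yet.
        f"tmux has-session -t {target} 2>/dev/null || "
        f"tmux new-session -d -s {name} -c {workdir}",
        # Attach to the (new or existing) session.
        f"tmux attach-session -t {target}",
    ]


for command in tmux_session_commands("alice", "/srv/projects/demo"):
    print(command)
```

Because the session is created detached and survives disconnects, whatever is running inside it keeps going when the SSH connection drops.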

AI Video Generation Stack: Seedance, Kie, MoneyPrinterTurbo · 20 msgs

@tidemid and @arielletolome hunted for cheapest Seedance 2 API ($2.41/15s deemed too expensive). @jlang123 recommended kie.ai at $1.16 for 15s at 480p with upscaling. @kingofgrowth surfaced MoneyPrinterTurbo (69k stars) as a candidate auto-video pipeline. @fmill1 tried Gemini for IG/TT video with no luck.
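For comparison, the two quoted prices can be normalized to a per-minute cost. This sketch uses only the prices from the thread and assumes both apply to a 15-second clip; the resolution behind the $2.41 Seedance quote was not stated, so the two may not be like-for-like.

```python
def cost_per_minute(price_per_clip: float, clip_seconds: float = 15.0) -> float:
    """Scale a per-clip price to the cost of 60 seconds of footage."""
    return price_per_clip / clip_seconds * 60


kie = cost_per_minute(1.16)       # kie.ai, 480p, as quoted by @jlang123
seedance = cost_per_minute(2.41)  # Seedance 2 API quote deemed too expensive
savings = 1 - kie / seedance

print(f"kie.ai:   ${kie:.2f} per minute")       # kie.ai:   $4.64 per minute
print(f"Seedance: ${seedance:.2f} per minute")  # Seedance: $9.64 per minute
print(f"kie.ai is about {savings:.0%} cheaper")  # kie.ai is about 52% cheaper
```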

Local AI, GLM 5.1, Codex on Windows & Wearables · 37 msgs

@jcartu kept evangelizing GLM 5.1 as a cheap Opus substitute, flagging Fireworks hosting GLM 5.1 and Kimi 2.6 at 250-350 tps. @jrizzolo announced Codex remote control and computer use on Windows. @mikeconner shared a case study on a company going 100% local with Codex + Ollama. In parallel, @pqbd1, @tounano, @bofu2u and others debated Oura vs Whoop vs Fitbit Air for sleep and recovery tracking.

Key Takeaways

Opus 4.8 ultracode can burn 40% of a $200 weekly Claude quota in one run — reserve max effort for tasks that need it; harness + prompting usually beats raw effort.
Dynamic workflows are converging on a standard pattern: preflight → lane planning → parallel swarm → fan-in verification → proof audit (see @Wootbro's OMO template).
For Hermes-style multi-bot setups, point new bots at prior bot notes via the orchestrator — @rmktg gets new profiles cranking in minutes instead of hours.
Claude via AWS Anthropic Platform still ships data to Anthropic — only true Bedrock deployment keeps prompts inside your AWS account.
Cheapest viable AI video pipeline right now: kie.ai at ~$1.16 per 15s at 480p, then upscale to 720p only on winners.
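The five-stage workflow pattern from the takeaways can be sketched as a small asyncio pipeline. This is an illustration of the stage order only, not @Wootbro's OMO template: every function name, the fixed three-lane split, and the stubbed agent are hypothetical stand-ins, and a real lane would call a model rather than return a canned result.

```python
import asyncio


def preflight(task: str) -> None:
    """Stage 1: reject obviously bad input before spending any tokens."""
    if not task.strip():
        raise ValueError("empty task")


def plan_lanes(task: str) -> list[str]:
    """Stage 2: split the task into independent lanes (hypothetical fixed split)."""
    return [f"{task}:{part}" for part in ("backend", "frontend", "tests")]


async def run_lane(lane: str) -> dict:
    """Stage 3 worker: stand-in for one agent working a single lane."""
    await asyncio.sleep(0)  # placeholder for real agent work
    return {"lane": lane, "ok": True, "output": f"result for {lane}"}


def fan_in(results: list[dict]) -> list[dict]:
    """Stage 4: collect lane results and fail loudly if any lane did not pass."""
    failed = [r["lane"] for r in results if not r["ok"]]
    if failed:
        raise RuntimeError(f"lanes failed verification: {failed}")
    return results


def proof_audit(results: list[dict]) -> dict:
    """Stage 5: produce a per-lane record of what was delivered."""
    return {r["lane"]: r["output"] for r in results}


async def orchestrate(task: str) -> dict:
    preflight(task)
    lanes = plan_lanes(task)
    # One swarm wave: all lanes run concurrently.
    results = await asyncio.gather(*(run_lane(lane) for lane in lanes))
    return proof_audit(fan_in(list(results)))


report = asyncio.run(orchestrate("checkout-flow"))
for lane, output in report.items():
    print(lane, "->", output)
```

Keeping verification (fan-in) separate from the final audit means a failed lane stops the run before any report is produced.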

Hot Threads

@GuruTime started

Opus 4.8 token consumption insanity (182 agents, 15M tokens, 27 min runs)

18 replies · 8 participants

@KtargetMedia started

Oura ring vs Apple Watch vs Whoop for sleep and recovery tracking

22 replies · 9 participants

@danfeldman started

How to share Claude Code across a team with 40+ active sessions

14 replies · 5 participants

Linked Items

Developer Uses Claude Opus 4.8 Just to Rename a Single File

Claude's Hidden Price Hikes & Crashes: A Timeline of AI Gone Wrong

Claude's Opus 4.8 CLI Update: New Effort Level Feature Revealed