Kimi K2.6 Orchestration, Opus 4.8 Burn, Claude Skills — AI Daily May 30
363 messages · 64 active members
Overview
Topics
@jcartu shared a tiered orchestration pattern using Kimi K2.6 Turbo on Fireworks (200+ tps, 256k ctx) as the default orchestrator, Opus 4.8 for high-stakes reasoning, and DeepSeek V4 Flash for bulk work, with two ASCII diagrams and ~3x cost savings vs gpt-5.5. @rmktg and @samb69 are migrating; @yangthegoat confirmed Anthropic killed Claude sub auth inside Hermes, forcing API providers from day one.
@zippi101 sees 30-min waits on single Opus 4.8 prompts, @kennetele and @mb29266 say 4.6 still wins for structured ad copy and burns fewer tokens, and @samtome argued 4.8 is 4.7 with workflows. Builders also mapped Claude's $20/$200 Max tiers to 1:1 API credits for claude -p but found burn rates steep, eyeing GPT Pro and Grok, and sharing claude.md/memory.md persistence tips.
@BowTiedRabbit shared github.com/mattpocock/skills and called grill-with-docs a game changer for new features. @WhiskeyATX dropped it into a live project and extended /improve-codebase-architecture as a sub-agent that reviews PRDs from an architectural lens. @jonmacofficial built a custom skill from the workflow; @iannagy installed immediately.
Coolify emerged as a self-hostable open-source alternative to Vercel/Heroku/Railway with 280+ one-click services and GitHub webhook auto-deploy, validated by members on Oracle free tier and VPS. Paulo Hernandez detailed Sky House — a 52-agent platform organized like a company with a top-level brain delegating to department agents, plus a PRD-to-subagent flow requiring a 7-agent review team to pass two clean rounds before shipping.
@arielletolome scraped 9.9k Newsbreak ads (50k FB ads incoming, ~90 hrs) for a creative brief system that auto-generates 20 ad angles and image briefs at a time. @jonmacofficial launches ViralView Monday (BYO Kie.ai, Kling 3.0 over Seedance). Grok Imagine 1.5 hit X API (~$0.08/sec) with strong visuals but weak audio/VO; Seedance remains the community pick. @expadz shared a Claude Code video editing workflow that trims silences and rerolls inconsistent clips.
Key Takeaways
- Don't drive Hermes with Opus directly — route through Kimi K2.6 Turbo on Fireworks (~200 tps, ~4x cheaper) and escalate to Opus 4.8 only for red-lane reasoning; Anthropic killed Claude sub auth inside Hermes.
- Opus 4.8 burns more tokens and runs slower than 4.6 for many real workflows; 4.6 still wins for structured ad copy, and Claude's $20/$200 Max API credits exhaust fast under agent loads.
- Install Matt Pocock's .claude skills repo — grill-with-docs for new features and /improve-codebase-architecture (runnable as a sub-agent to review PRDs) are immediate wins; after feedback, tell Claude to 'update the learnings to github' to auto-persist claude.md and memory.md.
- Coolify gives you GitHub-webhook auto-deploy on your own VPS as a Vercel/Railway replacement, and multi-agent PRD review (7-agent team, two clean rounds) catches weaknesses before code is written.
- Grok Imagine 1.5 has strong visuals but weak audio at ~$0.08/sec; Seedance still wins for video, and ViralView (BYO Kie.ai, Kling 3.0) launches Monday for cloning viral TikToks/Meta ads.
Hot Threads
Kimi K2.6 Turbo orchestrator stack with ASCII architecture diagrams replacing Opus for Hermes
Grok Imagine 1.5 video testing, Seedance comparison, and Coolify self-hosting discovery
Scraping 50k FB ads to power an auto-angle creative brief system