Claude 4.8 Autonomy, GLM-5.2 Local, Paperclip Agents — AI Daily Jun 18

511 messages · 84 active members

511

messages

active members

@jcartu, @jarvisballer, @samb69

top contributors

Overview

Today's builder chat zeroed in on a noticeable shift in Claude 4.8 — multiple users reported it taking initiative beyond requested scope, with @jcartu sharing that it autonomously built a full PSP gateway and @mattkowald noting weekly limits jumped ~20%. On the local inference front, GLM-5.2 was positioned as a near-Opus daily driver if you have 12 GPUs or a B200 cluster, while Minimax M3 runs on just four GPUs and is strong enough for agent work. Orchestration was the other dominant thread: Paperclip (YC) got real-world endorsements as a hands-off layer that 'hires' agents on top of Hermes/Claude, with @samb69 detailing his observable setup using narrow-scope Hermes profiles as subagents. @fewga893 shared a Frame.io clone for agents to receive timestamped video feedback, sparking discussion on scaling past 100 concurrent agents where prose/skill files collapse and code-level guardrails become mandatory. Remote dev stacks also got a shakeup as @samb69 evangelized Herdr as a unified tmux/CMUX/SSH replacement. On the business side, the FTC's $700M+ enforcement action against Mad Muscles / Genesis Group set off a long thread on subscription dark patterns and cancellation friction. Side threads covered Gemini 3.5 Flash as a sleeper orchestrator pick, @scalingfrog hitting hard guardrails trying to vibe-code a Meta account farm, and token-spend rules as the real ceiling on autonomous operation.

Topics

Claude 4.8 Acting Autonomously + Limit Bumps12 msgs

Multiple users reported Claude 4.8 taking initiative beyond requested scope — @archimorty's CC started following a target workflow without being asked, and @jcartu shared a story of it autonomously building a full PSP gateway. @mattkowald noticed weekly limits jumped ~20%, while @wizardwu speculated Anthropic relaxed compliance. Side complaints about token burn and over-cautious refusals persisted.

GLM-5.2 & Minimax M3 Local Inference18 msgs

GLM-5.2 is being positioned as a near-Opus daily driver, but needs 12 GPUs or a B200 cluster — slower prefill/TTFT than Opus but faster TPS. Minimax M3 runs on just four GPUs and is strong enough for agent orchestration. Max-spec M5 Studio confirmed not enough for either.

Paperclip + Hermes Orchestration & Scaling Agents24 msgs

Paperclip got real-world endorsements as a hands-off orchestration layer — @jcartu claims it built a payment gateway unprompted, @samb69 likes the learning loop and ticketing visibility but uses Hermes CLI over its UI. At ~100 concurrent agents, prose/skill files collapse and code-enforced guardrails become mandatory. Token-spend caps and task proportionality remain the biggest blockers to true autonomy. @weslindquist warned Paperclip still has login/reset bugs.

FTC Kills Mad Muscles — $700M Subscription Empire22 msgs

FTC sued Mad Muscles / Genesis Group over auto-renew dark patterns, double-charging, and 3-hour phone-only cancellation queues. Public filings show ~$700M annual payment volume across linked PayPal accounts. Group consensus: the product ideas were fine, the subscription/cancellation model is what got them hit.

Herdr vs CMUX + Agent Video Pipelines28 msgs

@samb69 pitched Herdr as a lighter-weight replacement for the tmux + CMUX + SSH stack, with workspaces, tabs, and native Moshi support for phone access. CMUX loyalists like @jarvisballer run 20 workspaces on a Mac mini via Tailscale + Mosh. Separately, @fewga893 built a Frame.io clone so video agents get timestamped feedback and learn from corrections — works at small scale but breaks past 100 concurrent.

Key Takeaways

Claude 4.8 appears more autonomous post-update with ~20% weekly limit bumps — initiative-taking now a feature, not a bug
GLM-5.2 is daily-driver viable but needs 12 GPUs / B200; Minimax M3 runs on 4 GPUs and handles agent work
At ~100 concurrent agents, skill/prose files collapse — code-enforced guardrails and token-spend caps become mandatory
FTC's Mad Muscles case shows deceptive auto-renew + hard-to-cancel flows will kill a $700M business, not the product itself
Gemini 3.5 Flash is a sleeper orchestrator pick for Hermes — fast and great at tool calls if you have tier-3 quota

Hot Threads

@samb69started

Herdr replacing CMUX + tmux + SSH for remote agent workflows

15 replies4 participants

@mj888gstarted

FTC kills Mad Muscles — subscription dark patterns and $700M scale

18 replies6 participants

@scalingfrogstarted

Vibe-coding a Meta account farm and hitting model guardrails

14 replies7 participants

Linked Items

FTC Sues to Stop Sprawling Enterprise Operating Unlawful Subscription Schemes | Federal Trade Commission

ftc.gov

@Wootbro

Overview

Topics

Key Takeaways

Hot Threads

Linked Items

FTC Sues to Stop Sprawling Enterprise Operating Unlawful Subscription Schemes | Federal Trade Commission

Kydo on X

Tibo on X

Kimi.ai on X

Cooper on X