GLM 5.2 Self-Hosted Inference on 4x RTX 6000 Pros
Jun 21, 2026 · 435 messages · 62 active members
@jcartu shared a HuggingFace release of GLM-5.2-504B-FullKD and made a case for buying 4-8 RTX 6000 Pros to self-host: ~0 KLD loss, 8 concurrent sessions at 60-80tps, near Opus 4.8 coding quality. Card prices jumped 750k…
Read full digest →