GLM 5.2 Self-Hosted Inference on 4x RTX 6000 Pros discussions in AI Builders Community

Jun 21, 2026 · 435 messages · 62 active members

@jcartu shared a Hugging Face release of GLM-5.2-504B-FullKD and made the case for buying 4-8 RTX 6000 Pros to self-host it: near-zero KLD loss, 8 concurrent sessions at 60-80 tps, and coding quality close to Opus 4.8. Card prices jumped 750k…