Differences
This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
| models:glm-5 [2026/04/11 14:36] – created kat | models:glm-5 [2026/04/19 22:16] (current) – Updated GLM-5 page: split from GLM-5.1, added deprecation notice gwyntel | ||
|---|---|---|---|
| Line 3: | Line 3: | ||
| '' | '' | ||
| - | **Price**: $1.00/mtok in, $3.00/mtok out | + | **Price**: $1.00/mtok in, $3.00/mtok out (beta pricing) |
| - | Currently the <wrap hi>smartest/ | + | <WRAP center round alert 60%> |
| + | GLM-5 is in **beta** | ||
| + | </WRAP> | ||
| - | Also, barring | + | GLM-5 was launched in beta on March 30, 2026, after SGLang made progress stabilizing GLM-5' |
| - | Consequently, | + | * **Pros:** Excellent at backend coding and long-horizon agentic work. Fewer total parameters |
| + | * **Cons:** Not as strong at UI/frontend work. Beta pricing is set high (impacts rate limits). Will be replaced by GLM-5.1. | ||
| - | * **Pros:** Excels at almost all coding (see below) and long-horizon agentic work. Widely considered to be exceptionally good at code review. | + | === Architecture Notes === |
| - | * **Cons:** Quite a bit worse at user interface work. Overkill for basic assistant work, such as for OpenClaw. Worse at lateral thinking than other frontier | + | GLM-5 has fewer total parameters than Kimi K2.5, making it more efficient to serve on B200 hardware |
| + | |||
| + | The model runs on SGLang (which is faster for the GLM series) and uses NVFP4 quantization on B200 GPUs. Each replica requires 4 B200 GPUs (tp4). | ||
| + | |||
| + | === Deprecation Path === | ||
| + | |||
| + | Based on public statements from Synthetic staff, the plan is: | ||
| + | |||
| + | 1. Take GLM-5 out of beta, stop self-hosting GLM-4.7 | ||
| + | 2. Put [[:models: | ||
| + | 3. Once GLM-5.1 is out of beta, retire/ | ||
| + | |||
| + | Old models are typically proxied to Fireworks or TogetherAI, although proxy duration depends on load since proxies are expensive. | ||
| + | |||
| + | See also: [[: | ||