Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| models:glm-5 [2026/04/12 21:36] – xenolandscapes | models:glm-5 [2026/04/19 22:16] (current) – Updated GLM-5 page: split from GLM-5.1, added deprecation notice gwyntel | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| - | ==== GLM-5.1 ==== | + | ==== GLM-5 ==== |
| - | '' | + | '' |
| - | **Price**: $1.00/mtok in, $3.00/mtok out | + | **Price**: $1.00/mtok in, $3.00/mtok out (beta pricing) |
| - | GLM 5.1 is currently the <wrap hi> | + | <WRAP center round alert 60%> |
| + | GLM-5 is in **beta** and is slated to be retired/ | ||
| + | </WRAP> | ||
| - | Also the most capable open weight model period, trading | + | GLM-5 was launched in beta on March 30, 2026, after SGLang made progress stabilizing GLM-5' |
| - | Consequently, | + | * **Pros:** Excellent at backend coding and long-horizon agentic work. Fewer total parameters |
| + | * **Cons:** Not as strong at UI/frontend work. Beta pricing is set high (impacts rate limits). Will be replaced by GLM-5.1. | ||
| - | * **Pros:** Excels at almost all coding (see below) and long-horizon agentic work. Widely considered to be exceptionally good at code review. | + | === Architecture Notes === |
| - | * **Cons:** Quite a bit worse at user interface work. Overkill for basic assistant work, such as for OpenClaw. Worse at lateral thinking than other frontier models (needs more express guidance). | + | GLM-5 has fewer total parameters than Kimi K2.5, making it more efficient to serve on B200 hardware |
| - | Note: GLM-5 is also currently hosted, but due to be replaced. | + | The model runs on SGLang (which is faster for the GLM series) and uses NVFP4 quantization on B200 GPUs. Each replica requires 4 B200 GPUs (tp4). |
| + | |||
| + | === Deprecation Path === | ||
| + | |||
| + | Based on public statements from Synthetic staff, the plan is: | ||
| + | |||
| + | 1. Take GLM-5 out of beta, stop self-hosting GLM-4.7 | ||
| + | 2. Put [[: | ||
| + | 3. Once GLM-5.1 | ||
| + | |||
| + | Old models are typically proxied | ||
| + | |||
| + | See also: [[: | ||