Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| models [2026/04/09 17:15] – [GLM-5] xenolandscapes | models [2026/04/20 06:52] (current) – kat | ||
|---|---|---|---|
| Line 3: | Line 3: | ||
| ===== Selection Criteria ===== | ===== Selection Criteria ===== | ||
| - | ==== GLM-5 ==== | + | {{page> |
| + | {{page> | ||
| + | {{page> | ||
| + | {{page> | ||
| + | {{page> | ||
| + | {{page> | ||
| + | {{page> | ||
| + | {{page> | ||
| + | {{page> | ||
| - | '' | + | ===== Embedding Models ===== |
| - | **Price**: $1.00/mtok in, $3.00/mtok out | + | {{page>models:nomic-embed-text-15}} |
| - | + | ||
| - | Currently the <wrap hi>smartest/ | + | |
| - | + | ||
| - | Also, barring GLM 5.1, the most capable open weight model period, trading blows with SOTA proprietary | + | |
| - | + | ||
| - | Consequently, | + | |
| - | + | ||
| - | * **Pros:** Excels at almost all coding (see below) and long-horizon agentic work. Widely considered to be exceptionally good at code review. | + | |
| - | + | ||
| - | * **Cons:** Quite a bit worse at user interface work. Overkill for basic assistant work, such as for OpenClaw. Worse at lateral thinking than other frontier models (needs more express guidance). | + | |
| - | + | ||
| - | ==== Kimi K2.5 ==== | + | |
| - | + | ||
| - | '' | + | |
| - | + | ||
| - | **Price**: $0.45/mtok in, $3.40/mtok out | + | |
| - | + | ||
| - | <WRAP center round info 60%> | + | |
| - | '' | + | |
| - | </ | + | |
| - | + | ||
| - | A powerful agentic model with above-average lateral thinking/ | + | |
| - | + | ||
| - | * **Pros:** Solid code. Amazing at orchestrating other agents due to special "agent swarm" reinforcement learning ([[https:// | + | |
| - | + | ||
| - | * **Cons:** Prone to outright laziness (keeping code for " | + | |
| - | + | ||
| - | ==== MiniMax M2.5 ==== | + | |
| - | + | ||
| - | '' | + | |
| - | + | ||
| - | **Price**: $0.40/mtok in, $2.00/mtok out | + | |
| - | + | ||
| - | Currently the most capable middle-tier model on Synthetic for general agentic and coding tasks. <wrap hi>Best used as a fast subagent orchestrated by a more powerful model</ | + | |
| - | + | ||
| - | * **Pros:** Very fast due to a very low active parameter count (10b). Pretty good at straightforward agentic tool use, agentic terminal use, and writing working, adequate code, as well as thoroughly exploring and writing reports on codebases or document collections. | + | |
| - | + | ||
| - | * **Cons:** Will very easily get stuck in loops if it isn't able to quickly debug an issue with its code — or its tools — in 1-2 turns. Requires //detailed and thorough// instructions to correctly execute the desired task (otherwise it will misinterpret what you mean, leave crucial things out, or just not understand the assignment). | + | |
| - | + | ||
| - | ==== Kimi K2-Thinking ==== | + | |
| - | + | ||
| - | '' | + | |
| - | + | ||
| - | **Price**: $0.60/mtok, $2.50/ | + | |
| - | + | ||
| - | The previous most capable model on Synthetic before GLM 5 and Kimi K2.5 came around. <wrap hi>Still by far the best writing model</ | + | |
| - | + | ||
| - | * **Pros:** Mostly just very good at writing, especially in a way that doesn' | + | |
| - | + | ||
| - | * **Cons:** Writing isn't always great at conveying coherent physical spaces or motions; can have continuity issues sometimes. Shouldn' | + | |
| - | + | ||
| - | ==== Nemotron 3 Super ==== | + | |
| - | + | ||
| - | '' | + | |
| - | + | ||
| - | **Price**: $0.30/mtok, $1.00/ | + | |
| - | + | ||
| - | The <wrap hi>most powerful budget model</ | + | |
| - | + | ||
| - | * **Pros:** Very long context for such a cheap/small model (double the context of GPT-OSS 120b, which is the same size). Extremely, almost unnervingly fast. Does not really slow down over long contexts at all. Which is all thanks to the hybrid state space model architecture. Most powerful and capable //fully// open source model ([[https:// | + | |
| - | + | ||
| - | * **Cons:** Not really very flexible at problem solving. Can lose the plot pretty hard if set loose on a difficult problem for a very long time without feedback, although it doesn' | + | |
| - | + | ||
| - | ==== GLM 4.7 Flash ==== | + | |
| - | + | ||
| - | '' | + | |
| - | + | ||
| - | **Price**: $0.10/mtok, $0.50/ | + | |
| - | + | ||
| - | By far the <wrap hi> | + | |
| - | + | ||
| - | * **Pros**: Cheapest. Very fast. | + | |
| - | + | ||
| - | * **Cons**: Only for basic usage. | + | |