Organic

Nemotron 3 Super

hf:nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4

Price: $0.30/mtok, $1.00/mtok

The most powerful budget model on Synthetic. Definitely worth using for agentic web search and report gathering, basic agentic terminal automation, thread summary and title generation, and other basic housekeeping tasks that don’t need a frontier model. It should not be allowed to touch code, even with a ten-foot pole.

  • Pros: Very long context for such a cheap, small model (double the context of GPT-OSS 120b, which is the same size). Extremely, almost unnervingly fast, and it barely slows down over long contexts, all of which is thanks to the hybrid state space model architecture. The most powerful and capable fully open source model (source).
  • Cons: Not very flexible at problem solving. Can lose the plot pretty hard if set loose on a difficult problem for a very long time without feedback, although it doesn’t really suffer from context rot and is very tenacious, so it has that going for it. Probably shouldn’t be allowed to write code. Not that smart.
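
For a rough sense of what the listed prices mean for a long-context housekeeping job, here is a minimal sketch. It assumes the two figures are the input and output prices per million tokens, in that order (the listing does not label them), and the function name and example workload are hypothetical.

```python
# Prices copied from the listing, in USD per million tokens.
# ASSUMPTION: the first figure is the input price and the second is the
# output price; the listing itself does not say which is which.
INPUT_USD_PER_MTOK = 0.30
OUTPUT_USD_PER_MTOK = 1.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the approximate cost of one request in USD."""
    total = (input_tokens * INPUT_USD_PER_MTOK
             + output_tokens * OUTPUT_USD_PER_MTOK)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: summarize a 200k-token thread into 5k tokens.
    print(f"${estimate_cost_usd(200_000, 5_000):.4f}")
```

Under that assumption, most of the cost of a summarization-style task comes from the input side, which is where the cheap input price and long context matter.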