Mellum2 12B MoE Opus Thinking Q4 · 74.4 t/s @ 32k on Spark

Throughput at each benched context window.

mellum2-12b-opus-q4

testingcodingmoellamacppgoldengguf

Golden fleet target — auto-scaffolded from recipe mellum2-12b-opus-q4.

golden 32k/q8_0 @ 74.4 tok/s — fill~14745 — bench-agent-v2 — tool_ok=True

yuxinlu1/mellum2-12b-opus-thinking-gguf — gated or private repo on HuggingFace, no public link available.
Golden recipe definition ↗