antirez/deepseek-v4-flash
DeepSeek V4 Flash (DwarfStar)
Throughput
17.3t/s
Engine
ds4
Parameters
13B active / 180B total
Released
2026-04-26
Benchmarked
2026-06-27
Context ladder
Throughput at each benched context window.
| Context | KV | Throughput |
|---|---|---|
| @ 128k peak | — | 17.3t/s |
| @ 1M golden | q8_0 | 1.7t/s |
Golden profile
antirez-deepseek-v4-flash-ds4
Capabilities
reasoningcodingagenticmoeds4experimentallong-context
Why we run it
Queued for DwarfStar engine bake-off on GB10 — antirez Q2-imatrix GGUF.
Bench notes
golden 1024k/q8_0 @ 1.7 tok/s — fill~50000 — bench-agent-v2 — tool_ok=True