Small model champion; 14B rivals 70B+ models on reasoning; MIT license; strongest quality-per-parameter; on-device/edge optimized; data-quality-focused training
Benchmarks
4 full, 0 partial of 4
Knowledge (MMLU/GPQA)
Performance on knowledge benchmarks — MMLU, GPQA, ARC. Breadth and depth of world knowledge vs frontier closed-source models.