VibeThinker: 3B-parameter model that beats Opus 4.5 on reasoning with novel SFT+GRPO

2026-06-23
