Skip to content
GoKawiil
Tech News
← Back to articles
VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO
2026-06-23 |
original
read original
more articles
Comments
Explore topics:
vibethinker
3b model
opus 4.5
sft+grpo
reasoning
Related:
AI demands more engineering discipline. Not less
GLM-5.2 is the new leading open weights model on Artificial Analysis
Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again
Researchers say they trained a foundation model from scratch for about $1,500
Microsoft's first reasoning model is one of 7 AIs just released at Build - what we know so far
Get alerts for these topics
vibethinker
3b model
opus 4.5
sft+grpo
reasoning
Subscribe
We'll send a verification email. No spam.