Hacker News
VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO
(arxiv.org)
337 points by timhigins 16 hours ago | 175 comments