Microsoft launches Phi-4-Reasoning-Plus, a small, powerful, open weights reasoning model!
Published on: 2025-07-30 22:41:29
Microsoft Research has announced the release of Phi-4-reasoning-plus, an open-weight language model built for tasks requiring deep, structured reasoning.
Building on the architecture of the previously released Phi-4, the new model integrates supervised fine-tuning and reinforcement learning to deliver improved performance on benchmarks in mathematics, science, coding, and logic-based tasks.
Phi-4-reasoning-plus is a 14-billion parameter dense decoder-only Transformer model that emphasizes quality over scale. Its training process involved 16 billion tokens—about 8.3 billion of them unique—drawn from synthetic and curated web-based datasets.
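To put those token counts in perspective, the short sketch below works out what share of the training tokens were unique and how often, on average, the unique data was repeated. It uses only the two approximate figures quoted above; the function names are illustrative, and the assumption that repetition is spread evenly across the unique data is ours, not Microsoft's.

```python
# Approximate figures as quoted in the article.
TOTAL_TOKENS = 16e9    # tokens seen during training
UNIQUE_TOKENS = 8.3e9  # of which unique


def unique_share(total: float, unique: float) -> float:
    """Fraction of the training tokens that are unique."""
    return unique / total


def mean_exposures(total: float, unique: float) -> float:
    """Average number of passes over the unique data.

    Assumes repetition is spread evenly, which the article does not state.
    """
    return total / unique


share = unique_share(TOTAL_TOKENS, UNIQUE_TOKENS)
exposures = mean_exposures(TOTAL_TOKENS, UNIQUE_TOKENS)

print(f"unique share: {share:.0%}")               # roughly 52%
print(f"mean passes over unique data: {exposures:.1f}")  # roughly 1.9
```

In other words, a little over half of the training tokens were unique, which is consistent with the unique data being seen just under twice on average.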
A reinforcement learning (RL) phase, using only about 6,400 math-focused problems, further refined the model’s reasoning capabilities.
The model has been released under a permissive MIT license, enabling its use for bro...