Phi-3 is a household of open supply small language fashions developed and made accessible by Microsoft.
“Small language fashions are designed to carry out properly for less complicated duties, are extra accessible and simpler to make use of for organizations with restricted sources, and they are often extra simply fine-tuned to fulfill particular wants. They’re properly suited to purposes that have to run regionally on a tool, the place a process doesn’t require in depth reasoning and a fast response is required,” Misha Bilenko, company vp for Microsoft GenAI, wrote in a weblog submit.
The concept behind creating a mannequin so small was impressed by Microsoft researcher Ronan Elden studying a bedtime story to his daughter, which led him to suppose “how did she study this phrase? How does she know the right way to join these phrases?”
Making use of this to AI, Elden puzzled what would occur if an AI mannequin was skilled simply on phrases that might be understood by a 4-year-old.
Phi-3 is available in a wide range of choices:
- Phi-3-vision is a 4.2B parameter mannequin that able to understanding each textual content and imaginative and prescient
- Phi-3-mini is a 3.8B parameter mannequin, accessible in 128K and 4K context size choices
- Phi-3-small is a 7B parameter mannequin, accessible in 128K and 4K context size choices
- Phi-3-medium is a 14B parameter mannequin, accessible in 128K and 4K context size choices
Phi-3-vision is the primary multimodal mannequin within the household, and may generate insights from charts and diagrams. “Phi-3-vision builds on the language capabilities of the Phi-3-mini, persevering with to pack robust language and picture reasoning high quality in a small mannequin,” Bilenko wrote.
Based on Microsoft, in comparison with different fashions, Phi-3 performs properly. For instance, Phi-3-small beats GPT-3.5T throughout a wide range of language, reasoning, coding, and math benchmarks, whereas Phi-3-medium beats out Gemini 1.0 Professional. Moreover, Phi-3-vision outperforms Claude-3 Haiku and Gemini 1.0 Professional V typically visible reasoning duties, OCR, desk, and chart understanding duties.
All the Phi-3 fashions are at the moment accessible on Azure AI and Hugging Face.