
Mistral AI
Open and portable generative AI for devs and businesses
5.0•34 reviews•3.6K followers
Open and portable generative AI for devs and businesses
5.0•34 reviews•3.6K followers
- We’re committed to empower the AI community with open technology. Our open models sets the bar for efficiency, and are available for free, with fully permissive license.
- Our optimized commercial models are designed for performance and are available via our flexible deployment options.
This is the 21st launch from Mistral AI. View more
Mistral Small 4
Launching today
Mistral Small 4 is a unified open-weight AI model from Mistral AI that combines fast chat, deep reasoning, coding, and multimodal capabilities in one system. Built with a Mixture-of-Experts architecture and a 256k context window, it delivers high performance with strong efficiency. Developers can adjust reasoning depth depending on the task, making it ideal for building assistants, coding agents, and AI applications at scale.



Free Options
Launch Team




Just discovered Mistral Small 4 from Mistral AI, and it’s an interesting step toward more unified open AI models.
What it is:
Mistral Small 4 is an open-weight AI model that combines fast instruction following, deep reasoning, coding abilities, and multimodal understanding (text + images) into one system.
The problem:
Most AI workflows require switching between different models for chat, reasoning, coding, or multimodal tasks.
The solution:
Small 4 brings these capabilities together in a single model with configurable reasoning effort—so you can choose between fast responses or deeper step-by-step reasoning depending on the task.
Key features:
Mixture-of-Experts architecture (128 experts, 4 active per token)
119B total parameters with efficient active compute
256k context window for long documents
Native multimodality (text + image inputs)
Configurable reasoning effort
Open-weight under Apache 2.0
Benefits:
Better efficiency, shorter outputs, lower latency, and easier deployment for teams building AI products.
Who it’s for:
Developers, enterprises, and researchers building assistants, coding workflows, document analysis tools, or AI applications requiring reasoning and multimodal capabilities.