The company is named after the mistral, a powerful, cold wind in southern France.[6]
History
Mistral AI was established in April 2023 by three French AI researchers, Arthur Mensch, Guillaume Lample and Timothée Lacroix.[7]
Mensch, an expert in advanced AI systems, is a former employee of Google DeepMind; Lample and Lacroix are specialists in large-scale AI models who had worked for Meta Platforms.[8]
Example of an image generated with Le Chat. The prompt is: "Generate an image you feel represents yourself, Mistral AI."
Screenshot of Le Chat, Mistral AI's chatbot, saving memories of a user's favorite things.
On 10 December 2023, Mistral AI announced that it had raised €385 million ($428 million) in its second fundraising round. Investors in this round included the Californian fund Andreessen Horowitz, BNP Paribas and the software publisher Salesforce.[9][10]
By December 2023, it was valued at over $2 billion.[11]
On 16 April 2024, reporting revealed that Mistral was in talks to raise €500 million, a deal that would more than double its current valuation to at least €5 billion.[12]
In June 2024, Mistral AI secured a €600 million ($645 million) funding round, raising its valuation to €5.8 billion ($6.2 billion).[13] The round was led by venture capital firm General Catalyst,[14] with additional contributions from existing investors. The funds are intended to support the company's expansion. Based on valuation, as of June 2024, the company was ranked fourth globally in the AI industry, and first outside the San Francisco Bay Area.[15]
In early August 2025, the Financial Times reported that Mistral was in talks to raise $1 billion at a $10 billion valuation.[16] Shortly after, in September 2025, Bloomberg reported that Mistral AI had secured a €2 billion investment valuing it at €12 billion ($14 billion), more than doubling its June 2024 valuation.[17] This came after a $1.5 billion investment from Dutch company ASML along with other investors. ASML now owns 11% of Mistral and will work closely with the company. Mistral expects to make more than $100 million in yearly revenue.[18]
Partnerships
On 26 February 2024, Microsoft announced that Mistral's language models would be made available on Microsoft's Azure cloud, while the multilingual conversational assistant Le Chat would be launched in the style of ChatGPT.[19] The partnership also included a financial investment of $16 million by Microsoft in Mistral AI.[20]
In April 2025, Mistral AI announced a €100 million partnership with the shipping company CMA CGM.[21][22]
Additional partnerships include:
Partnership with Inria (France’s national research institute for digital science and technology): joint work on optimization of transformer architectures for energy efficiency and reduced carbon footprint.[23]
Collaboration with Hugging Face: co-maintenance of the Mistral-open model series and integration into the Hugging Face Hub, enabling broader community access and fine-tuning.[24]
On 19 November 2024, the company announced updates for Le Chat (pronounced /lətʃat/ in French).
It added the ability to create images, using Black Forest Labs' Flux Pro model.[26]
On 6 February 2025, Mistral AI released Le Chat on iOS and Android mobile devices.[27]
Mistral AI also introduced a Pro subscription tier, priced at $14.99 per month, which provides access to more advanced models, unlimited messaging, and web browsing.[28]
On 1 September 2025, Mistral AI introduced a memories feature that remembers user preferences and context across conversations, along with integrations with services such as Atlassian, Databricks, GitHub, Snowflake, and Stripe. These features are available to all users, including those on the free plan.[29]
Models
The following table lists the main model versions of Mistral, describing the significant changes included with each version:[30]
On November 19, 2024, the company introduced Pixtral Large, which integrates a 1-billion-parameter visual encoder coupled with Mistral Large 2.[36][34]
Mistral Large 2 was announced on July 24, 2024, and released on Hugging Face. It is available for free under the Mistral Research License, and under a commercial license for commercial purposes. Mistral AI claims that it is fluent in dozens of languages, including many programming languages. Unlike the previous Mistral Large, this version was released with open weights. The model has 123 billion parameters and a context length of 128,000 tokens.[34]
Codestral Mamba is based on the Mamba 2 architecture, which allows it to handle longer inputs when generating responses.[37] Unlike Codestral, it was released under the Apache 2.0 license. While previous releases often included both the base model and the instruct version, only the instruct version of Codestral Mamba was released.[38][34]
Mathstral 7B is a model with 7 billion parameters released by Mistral AI on July 16, 2024, focusing on STEM subjects.[39] The model was produced in collaboration with Project Numina,[37] and was released under the Apache 2.0 License with a context length of 32k tokens.[39][34]
Codestral 22B: released May 2024; 22 billion parameters; Mistral Non-Production License.
Codestral is Mistral's first code-focused open-weight model, launched on May 29, 2024. Mistral claims Codestral is fluent in more than 80 programming languages.[40] Codestral has its own license, which forbids the use of Codestral for commercial purposes.[41][34]
Similar to Mistral's previous open models, Mixtral 8x22B was released via a BitTorrent link on Twitter on April 10, 2024,[42] with a release on Hugging Face soon after.[43] The model uses an architecture similar to that of Mixtral 8x7B, but with each expert having 22 billion parameters instead of 7, offering higher performance. In total, the model contains 141 billion parameters, fewer than 8 × 22 billion, as some parameters are shared among the experts.[43][44][34]
Mistral Large was launched on February 26, 2024. It outputs in English, French, Spanish, German, and Italian, and provides coding capabilities.[45] It is available on Microsoft Azure.[46][34]
Mistral Medium is trained in various languages including English, French, Italian, German, and Spanish.[47] The number of parameters and the architecture of Mistral Medium are not known, as Mistral has not published information about them.[34]
Much like Mistral's first model, Mixtral 8x7B was released via a BitTorrent link posted on Twitter on December 9, 2023,[2] followed two days later by a release on Hugging Face and a blog post.[48] Unlike the previous Mistral model, Mixtral 8x7B uses a sparse mixture of experts architecture. The model has 8 distinct groups of "experts", giving the model a total of 46.7B usable parameters.[49][50] Each token uses only 12.9B of these parameters, giving the model the speed and cost of a 12.9B-parameter model.[48] A version trained to follow instructions called “Mixtral 8x7B Instruct” is also offered.[48][34]
Mistral 7B is a 7.3B parameter language model using the transformer architecture. It was officially released on September 27, 2023, via a BitTorrent magnet link,[51] and Hugging Face[52] under the Apache 2.0 license. Both a base model and an "instruct" model were released, with the latter receiving additional tuning to follow chat-style prompts. The fine-tuned model is only intended for demonstration purposes, and does not have guardrails or moderation built-in.[53][34]
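The Mixtral 8x7B figures quoted above (46.7B parameters in total, 12.9B used per token) can be made more concrete with a small sketch. The Python below is illustrative only and is not Mistral's code. It assumes that each token is routed to 2 of the 8 experts and that every non-expert parameter is used by every token; neither assumption is stated in the text above. The function names `split_parameters` and `route_token` are hypothetical.

```python
import math


def split_parameters(total_b, active_b, n_experts, experts_per_token):
    """Estimate (shared, per_expert) parameter counts in billions.

    Solves the two-equation system
        total  = shared + n_experts * per_expert
        active = shared + experts_per_token * per_expert
    under the assumption that all non-expert parameters are shared.
    """
    if n_experts <= experts_per_token:
        raise ValueError("need more experts than are active per token")
    per_expert = (total_b - active_b) / (n_experts - experts_per_token)
    shared = active_b - experts_per_token * per_expert
    return shared, per_expert


def route_token(router_logits, k=2):
    """Generic top-k gating: pick the k highest-scoring experts for one
    token and softmax-normalise their scores into mixing weights."""
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    peak = max(router_logits[i] for i in top)  # subtract max for stability
    exps = [math.exp(router_logits[i] - peak) for i in top]
    norm = sum(exps)
    return [(i, e / norm) for i, e in zip(top, exps)]


if __name__ == "__main__":
    # Figures from the text: 46.7B total, 12.9B per token, 8 experts.
    # experts_per_token=2 is an assumption, not taken from the text.
    shared_b, expert_b = split_parameters(46.7, 12.9,
                                          n_experts=8, experts_per_token=2)
    print(f"per-expert: {expert_b:.2f}B, shared: {shared_b:.2f}B")

    # Made-up router scores for a single token over 8 experts.
    for expert_id, weight in route_token(
            [0.1, 2.0, -1.0, 0.5, 1.5, 0.0, -0.3, 0.2]):
        print(f"expert {expert_id}: weight {weight:.3f}")
```

Under these assumptions each expert holds roughly 5.6B parameters and about 1.6B are shared, which is why the total is well below a naive 8 × 7B = 56B and why per-token cost tracks the 12.9B figure rather than the 46.7B total.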
Mistral 7B
Mistral AI claimed in the Mistral 7B release blog post that the model outperforms LLaMA 2 13B on all benchmarks tested, and is on par with LLaMA 34B on many benchmarks tested,[53] despite having only 7 billion parameters, a small size compared to its competitors.
Mixtral 8x7B
Mistral AI's testing in 2023 showed the model beating both LLaMA 70B and GPT-3.5 in most benchmarks.[54]
In March 2024, a study conducted by Patronus AI compared the performance of LLMs on a 100-question test with prompts to generate text from books protected under U.S. copyright law. It found that OpenAI's GPT-4, Mixtral, Meta AI's LLaMA-2, and Anthropic's Claude 2 generated copyrighted text verbatim in 44%, 22%, 10%, and 8% of responses respectively.[55][56]
On 10 June 2025, Mistral AI released their first AI reasoning models: Magistral Small (open-source), and Magistral Medium, models which are purported to have chain-of-thought capabilities.[60][61]