Mistral AI logo
FreemiumBy Mistral AI

Mistral AI

French startup €14B. Open-weight MoE: Large 3 (675B/41B active), Ministral 3 edge (3B-14B offline), Devstral 2 coding (72.2% SWE-bench). Apache 2.0.

APIOpen Source
0
0
0

Description

Mistral AI

What is Mistral?

Mistral AI is a French startup founded in 2023 by former DeepMind and Meta researchers, specializing in open-weight language models with Mixture-of-Experts (MoE) architecture. Valued at $14 billion with €2B in funding, Mistral competes with OpenAI/Google by offering multimodal, multilingual, and edge-optimized models.
Mistral 3 (Dec 2025) includes Large 3 (675B params, 41B active) and Ministral 3 (3B/8B/14B) for edge devices. Free under Apache 2.0 or custom licenses.

Main Models

Mistral Large 3 (December 2025)

  • Params: 675B total, 41B active (granular MoE)
  • Context: 256K tokens
  • Multimodal: text + image + video + audio
  • Multilingual: Broad European language support
  • Performance: On par with GPT-4o, Gemini 2.0 Flash
  • Best for: Document analysis, coding, agentic workflows, enterprise

Ministral 3 (December 2025)

Family of 9 small, efficient models:
  • Sizes: 3B, 8B, 14B params (dense, not MoE)
  • Variants: Base, Instruct, Reasoning
  • Deploy: Drones, robots, cars, phones, laptops (offline)
  • License: Apache 2.0
  • Best for: Edge AI, robotics, on-device without internet

Devstral 2 (December 2025)

Specialized coding models:
  • Devstral 2: 123B params, 72.2% SWE-bench Verified (SOTA open)
  • Devstral Small 2: 24B params, 68% SWE-bench, runs on laptop
  • Context: 256K tokens both
  • License: Modified MIT (Devstral 2), Apache 2.0 (Small 2)
  • Mistral Vibe CLI: CLI agent for code automation

Magistral (June 2025)

First reasoning models:
  • Small: Open-source
  • Medium: Proprietary
  • Capabilities: Chain-of-thought reasoning

Other Models

Mistral Medium 3 (May 2025): Balanced performance/cost
Mistral Small 3.1 (March 2025): Efficient and fast
Pixtral: Multimodal vision pioneer
Voxtral: Speech-to-text (audio models)
Mistral Embed: Semantic search and embeddings
Mistral Moderation: Content safety (9 categories)

Key Features

Open-weight:
  • Publicly downloadable weights
  • Full fine-tuning and customization
  • Deploy on-premise or cloud
MoE Architecture:
  • Only activates necessary params per token
  • Efficiency without sacrificing performance
  • Mistral Large 3: 41B active of 675B total
Edge-optimized:
  • Ministral 3 runs offline on devices
  • Robotics, drones, vehicles
  • 4GB VRAM sufficient (Small models)
Multilingual:
  • Special focus on European languages
  • Maintains performance across languages
Multimodal:
  • Text, image, video, audio
  • Native early integration

Le Chat

Mistral's conversational assistant (ChatGPT-style):
  • Available: Web, iOS, Android
  • Free tier: Basic access
  • Pro: $14.99/month (advanced models, unlimited messages, web browsing)
  • Image generation: Integrates Flux Pro (Black Forest Labs)

Pricing

Open-source models (Apache 2.0):
  • Ministral 3 (all sizes): FREE
  • Devstral Small 2: FREE
  • Magistral Small: FREE
Proprietary/Custom license:
  • Mistral Large 3: Via API (competitive pricing)
  • Devstral 2: FREE via API (modified MIT)
  • Mistral Medium/Small: Via API
Cloud Providers:
  • Azure AI Studio
  • AWS Bedrock
  • Google Cloud Model Garden
  • IBM Watsonx
  • Snowflake
  • NVIDIA
  • Use your cloud credits
Self-hosted:
  • Deploy on your infra (VPC, on-prem, edge)
  • Full control and customization

Use Cases

Enterprise:
  • Document analysis (256K context)
  • Agentic workflows and AI assistants
  • RAG systems
  • Custom fine-tuning for domain-specific tasks
Development:
  • Agentic coding with Devstral 2
  • Code automation via Mistral Vibe CLI
  • SWE tasks (72.2% SWE-bench)
Edge AI:
  • Robotics without WiFi (on-site diagnostics)
  • Autonomous drones in dead zones
  • In-car AI assistants (Stellantis partnership)
  • On-device apps without internet
Physical AI:
  • Singapore HTX: robots, cybersecurity, fire safety
  • Helsing (Germany): vision-language-action for drones
  • Stellantis: in-car AI assistant
Enterprise Workflows:
  • Multilingual content moderation
  • Semantic search and organization
  • Transcription (Voxtral)
  • Scientific workloads

Partners and Deployments

Physical AI:
  • HTX Singapore (robots, security)
  • Helsing (defense drones)
  • Stellantis (automotive)
Cloud:
  • Microsoft Azure ($16M investment 2024)
  • NVIDIA (optimized GB200 NVL72: 10x performance gain)
  • AWS, Google Cloud, IBM, Snowflake
Commercial:
  • CMA CGM: €100M partnership (shipping)
  • HSBC: Major commercial contract

Advantages

Open-weight: Download, modify, fine-tune freely
Apache 2.0: Truly open for many models
Efficient MoE: 41B active vs 675B total
Edge-ready: Ministral runs offline on devices
256K context: Long-document processing
Strong multilingual: Especially European languages
Native multimodal: text + image + video + audio
Competitive pricing: Cheaper than OpenAI/Anthropic
No vendor lock-in: Self-host or multi-cloud
NVIDIA optimized: GB200 NVL72 10x gain
European champion: €14B valuation, €2B funding

Limitations

Less mature: 2 years vs OpenAI/Anthropic (9+ years)
Smaller ecosystem: Fewer integrations than competitors
Benchmark controversies: Questions about methodology
Copyright concerns: 22% copyrighted text verbatim (Mixtral)
Mixed licenses: Not everything Apache 2.0 (confusion)
Hardware requirements: Large models need powerful GPUs
Less documentation: Compared to OpenAI/Anthropic
Fewer features: No advanced voice, canvas, etc.

Key Features

Mistral Large 3: MoE 675B params, 41B active, 256K context

Ministral 3: 3B/8B/14B models for edge (offline on devices)

Devstral 2: 72.2% SWE-bench, SOTA open coding model

Devstral Small 2: 24B params, runs on laptop, Apache 2.0

Mistral Vibe CLI: CLI agent for code automation

Native multimodal: text + image + video + audio

Multilingual: special focus on European languages

Apache 2.0: Ministral, Devstral Small, Magistral Small free

Edge AI: robotics, drones, cars without WiFi

Granular MoE: efficiency without sacrificing performance

Le Chat: ChatGPT-style assistant (web, iOS, Android)

Cloud deployment: Azure, AWS, GCP, IBM, Snowflake

Self-hosted: VPC, on-premise, edge options

NVIDIA optimized: GB200 NVL72 10x performance gain

Physical AI: HTX Singapore, Helsing drones, Stellantis cars

Magistral: Reasoning models with chain-of-thought

Voxtral: SOTA speech-to-text

Mistral Embed: Semantic search and embeddings

Mistral Moderation: 9 multilingual safety categories

Fine-tuning services: Custom pre-training and model distillation

Use Cases

Agentic coding workflows with Devstral 2 (SWE tasks)

Edge robotics without WiFi (on-site diagnostics)

Autonomous drones in dead zones

In-car AI assistants (automotive)

Enterprise document analysis (256K context)

RAG systems with Mistral Embed

Code automation via Mistral Vibe CLI

Multilingual content moderation

On-device AI apps (phones, laptops offline)

Scientific workloads and research

Custom fine-tuning for specific domains

Semantic search and organization

Speech-to-text transcription (Voxtral)

Vision-language-action for military drones

Fire safety AI systems

Cybersecurity automation

Shipping logistics optimization

Banking workflows (HSBC)

Long-context document processing

Agentic enterprise assistants

User Reviews

Related AIs

Freemium
ChatGPT logo

ChatGPT

OpenAI

API

ChatGPT by OpenAI is a versatile AI assistant that excels at natural conversation, content creation, and complex problem-solving. With advanced multimodal capabilities, it processes text, voice, and images to streamline productivity and creativity.

Text Generation#Translation#Freemium#Code Generation#GPT-4#Copywriting#Summarization#Mobile App
Freemium
Perplexity AI logo

Perplexity AI

Perplexity AI

APIOpen Source

AI-powered search engine providing direct answers with verifiable citations, automated deep research, and access to multiple LLM models like GPT-5, Claude, and Gemini.

Text Generation#Research#Summarization#Browser Extension#API#Citation#Mobile App
Freemium
Copy.ai logo

Copy.ai

Copy.ai (Fullcast)

API

First GTM AI platform for marketing and sales with 15M+ users. Workflow automation, 90+ templates, multiple LLMs (GPT-4, Claude 3) and free plan available.

Text Generation#Translation#Freemium#Email Assistant#GPT-4#Copywriting#Claude#SEO#Trial#E-commerce#No-Code#Free#API#Slack Bot