MiniMax-M2 Goes Open Source, Surpasses Claude Opus 4.1 on New AI Intelligence Benchmark

2025-10-28

MiniMax has open-sourced its new flagship AI model, MiniMax-M2, positioning it as one of the most efficient AI systems for coding and agent-based tasks available today.

Designed as an “agent- and code-native” model, MiniMax-M2 is built specifically for end-to-end developer workflows and agent reasoning.

Despite having 230 billion total parameters, the model activates only about 10 billion per token via a sparse mixture-of-experts design, delivering near-state-of-the-art performance in a more compact and cost-effective form.
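The idea behind that ratio can be sketched with a toy mixture-of-experts layer: a router picks a small subset of experts per token, so only a fraction of the layer's weights ever run. The sizes below (23 experts, 1 active, tiny dimensions) are purely illustrative choices that mirror the ~10B-of-230B ratio, not MiniMax-M2's actual architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes only: 1 of 23 experts active per token mirrors
# the ~10B-of-230B active-parameter ratio. Not the real M2 design.
n_experts, d_model, top_k = 23, 8, 1
experts = rng.standard_normal((n_experts, d_model, d_model))  # expert weights
router = rng.standard_normal((d_model, n_experts))            # routing matrix

def moe_forward(x):
    """Route each token to its top_k highest-scoring experts and
    run only those experts' weight matrices."""
    scores = x @ router                              # (tokens, n_experts)
    chosen = np.argsort(scores, axis=1)[:, -top_k:]  # active expert indices
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        w = np.exp(scores[t, chosen[t]])
        w /= w.sum()                                 # softmax mixing weights
        for j, e in enumerate(chosen[t]):
            out[t] += w[j] * (x[t] @ experts[e])
    return out, chosen

tokens = rng.standard_normal((4, d_model))
y, chosen = moe_forward(tokens)

# Only top_k / n_experts of the expert parameters run per token.
print(f"active parameter fraction: {top_k / n_experts:.3f}")  # ~0.043
```

The compute saving comes from the routing step: every token still passes through the full model depth, but at each MoE layer only the selected experts' matrices are multiplied, which is how a 230B-parameter model can cost roughly what a 10B dense model does per token.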

MiniMax claims that MiniMax-M2 delivers results at roughly 8% of the cost of Claude Sonnet and runs nearly twice as fast. On the Artificial Analysis Intelligence Index v3.0, MiniMax-M2 scored 61 points, ranking eighth overall—outperforming Anthropic’s Claude Opus 4.1, which scored 59.

The Artificial Analysis index aggregates results from ten key evaluations, including MMLU-Pro, GPQA Diamond, AIME 2025, SciCode, and Terminal-Bench Hard, to assess general reasoning and tool-use capabilities.

MiniMax-M2 now ranks among the strongest open-source models, surpassing Qwen 3 72B (58 points) and DeepSeek-V3.2 (57 points). While it still trails the top proprietary models overall, it leads open-weight models on this particular benchmark.

Benchmark comparisons highlight its highly competitive coding performance: it scored 46.3 on Terminal-Bench, beating both Claude Sonnet 4.5 and Gemini 2.5 Pro, and scored 44 on BrowseComp—far above Claude Sonnet 4.5’s 19.6.

MiniMax is offering limited-time free access to MiniMax-M2 through its Agent and API platform and has released the model weights on Hugging Face and GitHub for local deployment.

With benchmark results that place it ahead of Claude Opus 4.1, MiniMax-M2 underscores the growing strength of open-source AI models engineered to balance affordability, speed, and advanced reasoning for real-world coding and agent applications.