
MiniMax Sets New AI Benchmark with Record 4M Token Context Models

MiniMax, a Singapore-based AI startup backed by Alibaba and Tencent, has unveiled a new series of AI models featuring record-breaking 4 million token context windows.

The release of MiniMax-Text-01 and MiniMax-VL-01 positions the company as a serious competitor to established players like OpenAI and Google, offering advanced capabilities for applications requiring sustained memory and extensive input handling.

The models, designed to handle tasks involving long documents, complex reasoning, and multimodal inputs, mark a leap forward in AI scalability and affordability. MiniMax’s announcement highlights its focus on AI agent development, addressing the growing demand for systems capable of extended context processing.

The MiniMax-Text-01 model features a total of 456 billion parameters, with 45.9 billion activated per token during inference. Designed for efficient long-context processing, it employs a hybrid attention mechanism that combines linear and softmax attention layers to optimize scalability. The model supports a context window of up to 1 million tokens during training, extending to 4 million tokens during inference.
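MiniMax has not published its exact layer arrangement in this announcement, but the idea behind a linear/softmax hybrid can be sketched in NumPy: linear (kernelized) attention scales O(n) with sequence length, while standard softmax attention scales O(n²), so interleaving them trades some expressiveness for long-context efficiency. The feature map, the 7:1 layer ratio, and the projection-free single-head setup below are illustrative assumptions, not MiniMax's implementation.

```python
import numpy as np

def softmax_attention(q, k, v):
    """Standard scaled dot-product attention: O(n^2) in sequence length n."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ v

def linear_attention(q, k, v, eps=1e-6):
    """Kernelized attention: O(n) in sequence length, via an elu(x)+1 feature map."""
    phi = lambda x: np.where(x > 0, x + 1.0, np.exp(x))
    q, k = phi(q), phi(k)
    kv = k.T @ v                    # (d, d_v): fixed-size key/value summary
    z = q @ k.sum(axis=0) + eps     # per-query normalizer
    return (q @ kv) / z[:, None]

# Toy hybrid stack: mostly linear layers with an occasional softmax layer.
rng = np.random.default_rng(0)
n, d = 8, 4
x = rng.normal(size=(n, d))
for kind in ["linear"] * 7 + ["softmax"]:
    attn = linear_attention if kind == "linear" else softmax_attention
    x = x + attn(x, x, x)           # self-attention with residual, no projections
print(x.shape)  # (8, 4)
```

The key property the linear layers buy is that `kv` and `z` have sizes independent of the sequence length, which is what makes very long context windows tractable.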

Equipped with a lightweight Vision Transformer (ViT) module, the MiniMax-VL-01 model is tailored for multimodal applications. It was trained on 512 billion vision-language tokens through a four-stage training pipeline, ensuring robust performance in tasks requiring the integration of visual and textual data.

What 4 Million Tokens Mean for AI Development

The context window in AI models determines how much information they can process simultaneously, with each token representing a fragment of data such as a word or punctuation mark.

MiniMax-Text-01’s 4 million token capacity significantly surpasses industry standards, including OpenAI’s GPT-4 (32,000 tokens) and Google’s Gemini 1.5 Pro (2 million tokens).

According to MiniMax, this extended capacity allows their models to process volumes of data equivalent to several books in a single exchange.
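To make the scale concrete, a rough back-of-envelope conversion is possible. The words-per-token and words-per-book figures below are common rules of thumb for English text, not numbers from MiniMax:

```python
# Rough estimate of how much text fits in a 4M-token window.
tokens = 4_000_000
words = tokens * 0.75        # ~0.75 words per token (English rule of thumb)
books = words / 90_000       # ~90,000 words per novel-length book (assumption)
print(f"{words:,.0f} words, roughly {books:.0f} novel-length books")
```

Even with generous margins on both assumptions, the window comfortably holds multiple full-length books in a single exchange.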

The company stated on its X account, “MiniMax-01 efficiently processes up to 4M tokens—20 to 32 times the capacity of other leading models. We believe MiniMax-01 is poised to support the anticipated surge in agent-related applications in the coming year, as agents increasingly require extended context handling capabilities and sustained memory.”