WED, 03 JUN 2026 · 18:34:20 UTC

Qwen 2.5-Max

by Alibaba Cloud·China·Released

Alibaba's frontier MoE — closed-weights, competitive with Claude 3.5 Sonnet on key benchmarks.

textvisioncodechatreasoningtoolslong-context
Vendor site
· 0 reviews

About this model

Qwen 2.5-Max (January 2025) is Alibaba's frontier closed-weights MoE — the response to DeepSeek V3's industry-shaking December 2024 release. On Alibaba's own evaluation Qwen 2.5-Max scores competitively with Claude 3.5 Sonnet and GPT-4o on most benchmarks, particularly excelling on Chinese-language tasks.

Unlike the open Qwen 2.5 family (which Alibaba aggressively open-sourced under Apache 2.0), Qwen 2.5-Max is closed-weights and served only via Alibaba Cloud. This is Alibaba's bet that frontier capability justifies a closed model for their flagship tier.

Strengths

  • Frontier-competitive on most benchmarks at launch
  • Best Chinese-language performance in the Qwen family
  • Aggressive Alibaba Cloud pricing
  • Backed by Alibaba's significant cloud and compute infrastructure

Limitations

  • Closed weights — diverges from the Qwen family's open ethos
  • 32K context — much smaller than top US frontier models
  • US/EU enterprise procurement friction (Chinese-origin model)
  • Less mature international developer ecosystem than Western labs

When to use it

  • Chinese-market enterprise deployments needing top-tier quality
  • Bilingual Chinese-English customer support
  • Workloads where Chinese-language quality matters more than English
  • Cost-sensitive frontier-class chat applications

Architecture & training

Alibaba has confirmed Qwen 2.5-Max is a sparse Mixture-of-Experts model but has not disclosed total or active parameter counts. The pretraining corpus is described as 'over 20 trillion tokens' with explicit balance between Chinese and English. Post-training follows the same RLHF + DPO pipeline as the rest of the Qwen 2.5 family.

Benchmarks

BenchmarkScoreBar
MATH68.5
MMLU-Pro76.1
HumanEval73.2

Reviews · 0

Sign in to leave a rating.

Compare against

All models →