Gemini 2.0 Flash

by Google DeepMind·USA·Released Dec 11, 2024

Fast, cheap, multimodal — 1M context at the lowest tier-1 pricing.

textvisionaudiovideochattoolslong-context

Vendor site

— · 0 reviews

About this model

Gemini 2.0 Flash (December 2024) is Google's fast, cheap, multimodal tier — $0.10/M input, $0.40/M output, with 1M-token context as a default rather than a premium feature. The combination of price and context length is genuinely unique: no other frontier-class model offers 1M tokens at this price point.

Flash isn't quite at Claude Sonnet 4 quality on hard reasoning, but it's competitive enough for most production use cases — and the cost-per-token gap is enormous. For high-volume RAG pipelines, Workspace copilots, or batch processing, Flash is often the right choice.

Strengths

•Cheapest 1M-context model on the market
•Multimodal: vision, audio, video input — not text-only
•Sub-second latency for short prompts
•Generous free tier via Google AI Studio

Limitations

•Quality gap vs Gemini 2.5 Pro on hard reasoning tasks
•Tool-call format is Google-specific
•Audio/video input quotas can be restrictive on the free tier

When to use it

→High-volume RAG pipelines
→Batch document processing at low cost
→Real-time chat where latency budget is tight
→Multimodal classification (image / video / audio tagging)

Architecture & training

Same Gemini 2.x architectural family as 2.5 Pro — sparse MoE with native multimodal training — but with a smaller activated parameter count. Optimised aggressively for tokens-per-second on Google's TPU serving infrastructure.

Benchmarks

Benchmark	Score	Bar
MATH	75.6
MMLU	78.3

Reviews · 0

Stories about Gemini 2.0 Flash

Google DeepMindJul 8, 2026

Google DeepMind Blog Lists May–June 2026 Updates on Models, Safety, and Research Partnerships

The Google DeepMind blog index features recent posts covering new Gemini and Gemma model releases, AI safety and responsibility initiatives, scientific research applications, and partnerships with A24 and Singapore.

Google DeepMindJul 2, 2026

Google DeepMind Blog Lists New Gemini and Gemma Models, AI Safety Initiatives, and Research Updates

The Google DeepMind blog features multiple announcements covering new Gemini and Gemma model releases, AI safety and responsibility programs, and scientific research tools. Updates span faster text generation, voice translation, computer-use capabilities, and international partnerships for housing, education, and robotics.

Google DeepMindJun 3, 2026

Google DeepMind Blog Lists New Gemini Models, Scientific AI Tools, and Global Partnerships

The Google DeepMind blog highlights recent posts announcing updates to the Gemini and Gemma model families, AI systems for scientific discovery and weather prediction, and new international partnerships focused on safety and research.

Google DeepMindMay 28, 2026

Google DeepMind Blog: New AI Models, Science Tools, and Global Partnerships

Google DeepMind's blog features multiple announcements including the Gemini Omni and Gemini 3.5 models, a new national AI partnership with Singapore, scientific research tools like Co-Scientist and AlphaEvolve, and the Google DeepMind Accelerator program for environmental risks in Asia Pacific.

Compare against

All models →

Gemini 2.5 Pro

USA

Google's deep-thinking flagship with a 1M-token context window.

Google DeepMind1M ctx$1.25 / $10.00

GLM-4.5

China

Zhipu's flagship — agentic-first MoE with strong coding + tool-use benchmarks.

Zhipu AI128K ctx$0.60 / $2.20Open

Qwen3-Coder

China

Open-weights coding specialist — 480B MoE, agentic by design.

Alibaba Cloud256K ctx$0.60 / $2.40Open

Kimi K2

China

Open-weights 1T-parameter MoE — agentic, long-context, the model behind Kimi Chat.

Moonshot AI128K ctx$0.60 / $2.50Open

About this model

✓ Strengths

× Limitations

When to use it

Architecture & training

Benchmarks

Reviews · 0

Stories about Gemini 2.0 Flash

Google DeepMind Blog Lists May–June 2026 Updates on Models, Safety, and Research Partnerships

Google DeepMind Blog Lists New Gemini and Gemma Models, AI Safety Initiatives, and Research Updates

Google DeepMind Blog Lists New Gemini Models, Scientific AI Tools, and Global Partnerships

Google DeepMind Blog: New AI Models, Science Tools, and Global Partnerships

Compare against

Gemini 2.5 Pro

GLM-4.5

Qwen3-Coder

Kimi K2

Strengths

Limitations