---
title: "Gemini 3 Flash"
type: model
id: "gemini-3-flash"
provider: "Google"
model_type: "proprietary"
api_model_id: "gemini-3-flash-preview"
release_date: "2025-12"
description: "Google's fast, cost-efficient model replacing Gemini 2.5 Flash. Designed for high-throughput applications with up to 1M token context at an extremely low price."
last_updated: "2026-04-30"
last_verified: "2026-04-30"
availability_status: "preview"
deprecated: false
tool_schema_format: "gemini"
pricing_confidence: "high"
model_listing_confidence: "high"
benchmark_confidence: "medium"
context_window: "1M tokens"
website: "https://deepmind.google"
license: "Proprietary"
modality:
- "text"
- "image"
- "video"
- "audio"
tags:
- "google"
- "proprietary"
- "text"
- "image"
- "video"
- "audio"
pricing:
  input: "$0.50 / 1M tokens"
  output: "$3.00 / 1M tokens"
  input_per_1m: 0.5
  output_per_1m: 3
  currency: "USD"
  cache_read_per_1m: 0.05
  batch_input_per_1m: 0.25
  batch_output_per_1m: 1.5
  note: "Free tier available; audio input is priced separately."
benchmarks:
  reasoning: 82
  coding: 80
  math: 79
  writing: 83
  multilingual: 85
  speed: 95
capabilities:
- "function_calling"
- "vision"
- "audio_input"
- "video"
- "web_search"
- "file_search"
- "code_execution"
- "structured_output"
- "streaming"
- "prompt_caching"
- "long_context"
- "reasoning"
sources:
- title: "Google Gemini API pricing"
  url: "https://ai.google.dev/gemini-api/docs/pricing"
- title: "Google Gemini 3 API documentation"
  url: "https://ai.google.dev/gemini-api/docs/models/gemini-3"
benchmark_sources:
- title: "AI Future Ready benchmark methodology"
  url: "https://ai-future-ready.com/guides/benchmark-methodology"
best_for:
- "High-volume tasks"
- "Cost-sensitive applications"
- "Quick queries"
- "Prototyping"
---

# Gemini 3 Flash

The cheapest way to run a competent model at scale. At $0.50/$3.00 per million tokens for standard text, image, and video prompts with a free tier on top, Gemini 3 Flash is still materially cheaper than Claude Haiku -- and it comes with a 1M token context window that Haiku's 200K can't touch.

Flash is not a frontier model and doesn't pretend to be. An 82/100 reasoning score and 80/100 coding score are adequate for classification, summarization, extraction, and simple Q&A. The 95/100 speed score means responses come back almost instantly. For high-throughput pipelines where you're processing thousands of documents, Flash is the model that makes the economics work.

The multilingual score of 85/100 is surprisingly good for a budget model, beating Haiku (80) and several more expensive options. Native multimodal support across text, image, video, and audio -- inherited from the Gemini architecture -- means Flash punches above its weight on media processing tasks.

**When to pick something else:** The moment you need reliable reasoning on complex prompts, step up to Gemini 3.1 Pro or Claude Sonnet 4.6. Flash's coding score of 80 means it'll struggle with non-trivial programming tasks. For anything mission-critical, the savings aren't worth the quality gap.

Gemini 3 Flash

The cheapest way to run a competent model at scale. At $0.50/$3.00 per million tokens for standard text, image, and video prompts with a free tier on top, Gemini 3 Flash is still materially cheaper than Claude Haiku -- and it comes with a 1M token context window that Haiku's 200K can't touch.

Flash is not a frontier model and doesn't pretend to be. An 82/100 reasoning score and 80/100 coding score are adequate for classification, summarization, extraction, and simple Q&A. The 95/100 speed score means responses come back almost instantly. For high-throughput pipelines where you're processing thousands of documents, Flash is the model that makes the economics work.

The multilingual score of 85/100 is surprisingly good for a budget model, beating Haiku (80) and several more expensive options. Native multimodal support across text, image, video, and audio -- inherited from the Gemini architecture -- means Flash punches above its weight on media processing tasks.

When to pick something else: The moment you need reliable reasoning on complex prompts, step up to Gemini 3.1 Pro or Claude Sonnet 4.6. Flash's coding score of 80 means it'll struggle with non-trivial programming tasks. For anything mission-critical, the savings aren't worth the quality gap.