Gemini 3.5 Flash vs Gemini 3 Flash vs Gemini 3.1 Flash-Lite: speed and cost benchmark

By Joe @ SimpleMetrics
Published 19 May, 2026
Updated 19 May, 2026
Gemini 3.5 Flash vs Gemini 3 Flash vs Gemini 3.1 Flash-Lite: speed and cost benchmark

This is a short benchmark note. I tested three Gemini models with three copywriting tasks and recorded only the simple numbers: output tokens, total latency, visible output tokens per second, and API price.

The low-cost model tested here is Gemini 3.1 Flash-Lite with model ID gemini-3.1-flash-lite. The middle model is Gemini 3 Flash Preview with model ID gemini-3-flash-preview. The new model is Gemini 3.5 Flash with model ID gemini-3.5-flash.

Benchmark setup

Setting Value
Models gemini-3.1-flash-lite, gemini-3-flash-preview, gemini-3.5-flash
Tasks 3 copywriting tasks
API tier Gemini Developer API standard tier
Generation config temperature: 0.2, maxOutputTokens: 700, thinkingBudget: 0
Speed metric Visible output tokens Ă· end-to-end request latency
Run date May 19, 2026

Speed results

Bar chart showing Gemini 3.1 Flash-Lite at 156.1 tokens per second, Gemini 3 Flash Preview at 110.7 tokens per second, and Gemini 3.5 Flash at 155.1 tokens per second
Visible output tokens per second across the three copywriting tasks.
Model Total output tokens Total latency Weighted speed Average latency
gemini-3.1-flash-lite 1,040 6.66s 156.1 tok/s 2.22s
gemini-3-flash-preview 1,074 9.70s 110.7 tok/s 3.24s
gemini-3.5-flash 1,271 8.20s 155.1 tok/s 2.73s

API price

Cost cards showing Gemini 3.1 Flash-Lite at $0.25 input and $1.50 output per million tokens, Gemini 3 Flash Preview at $0.50 input and $3.00 output, and Gemini 3.5 Flash at $1.50 input and $9.00 output
Standard Gemini API token price for the three tested models.
Model Input price Output price
gemini-3.1-flash-lite $0.25 / 1M text, image, or video tokens $1.50 / 1M output tokens
gemini-3-flash-preview $0.50 / 1M text, image, or video tokens $3.00 / 1M output tokens
gemini-3.5-flash $1.50 / 1M input tokens $9.00 / 1M output tokens

Speed vs output price

Scatter chart comparing visible output tokens per second against output price per million tokens for Gemini 3.1 Flash-Lite, Gemini 3 Flash Preview, and Gemini 3.5 Flash
Same benchmark data plotted against output token price.

Raw data summary

  • Fastest in this run: Gemini 3.1 Flash-Lite at 156.1 tok/s.
  • Very close second: Gemini 3.5 Flash at 155.1 tok/s.
  • Slowest in this run: Gemini 3 Flash Preview at 110.7 tok/s.
  • Lowest output price: Gemini 3.1 Flash-Lite at $1.50 / 1M output tokens.
  • Highest output price: Gemini 3.5 Flash at $9.00 / 1M output tokens.

Sources

Found this useful? Share it!

If this helped you, I'd appreciate you sharing it with colleagues.

Was this page helpful?

Your feedback helps improve this content.

Related Posts