Gemini 3.5 Flash Arrives with Price Increase
Google today released its latest large language model, Gemini 3.5 Flash, marking a significant expansion of the Gemini family. This new model is now generally available across various Google platforms, including Search, the Gemini app, and developer tools like Antigravity and Android Studio.
Key Features & Performance
Gemini 3.5 Flash boasts an impressive context window of 1 million input tokens and a 65,000 token output limit - ideal for complex tasks requiring extensive information processing. The model’s knowledge cutoff is January 2025.
The release also includes a new Interactions API (currently in beta) that allows developers to manage conversation history on the server-side, similar to OpenAI’s approach with Responses.
Pricing Changes
Notably, Gemini 3.5 Flash comes with a price increase compared to previous models:
- Previous “Flash” preview models cost $0.15 per million input tokens and $0.90 per million output tokens
- Gemini 3.5 Flash now costs $1.50 per million input and $9 per million output (a 10x increase)
This positions it closer in price to Google’s Gemini 3.1 Pro, which is priced at $2 and $12 respectively.
The pricing trend aligns with competitors like OpenAI and Anthropic, where newer models typically command higher fees as they offer improved capabilities.
Broad Deployment Across Products
Despite the price increase, Google appears committed to using Gemini 3.5 Flash extensively across its offerings - a move that suggests confidence in the model’s efficiency gains while probing user price sensitivity.