What's New in Google Gemini 3

Oliver Xiao
Jan 20

Released in November 2025, Google Gemini 3 is a significant upgrade that adds "agentic" capabilities, deeper reasoning, and interactive output. It succeeds the Gemini 2.5 series and delivers higher performance in coding, mathematics, and multimodal understanding.

Core New Features

Generative UI (Interactive Outputs): Gemini 3 can create dynamic, clickable layouts, interactive charts, and simulations tailored to a request.
Agentic Capabilities: This model can plan and execute multi-step workflows autonomously, such as triaging emails or managing complex tool-use tasks.
Deep Think Mode: Available to Ultra subscribers, this mode lets the model "think longer" on complex problems, which improves its scores on benchmarks such as ARC-AGI-2.
Nano Banana Pro: A new dedicated image generation and editing model integrated with Gemini 3, allowing for precise image edits.

Model Variants & Availability

Gemini 3 Pro: The primary high-intelligence model for complex reasoning and advanced coding. It holds a top position on the LMArena Leaderboard with a score of 1501.
Gemini 3 Flash: The new default model for the Gemini app and Google Search, offering "Pro-grade" reasoning at much higher speeds and lower costs.
Gemini 3 Deep Think: A specialized reasoning mode for difficult academic and scientific problems.

Developer & Professional Tools

Google Antigravity: A new agentic development platform where Gemini 3 can write code in an editor, run commands in a terminal, and browse the web to debug or validate its work.
"Vibe Coding": Gemini 3 lets users build entire apps, games, or high-fidelity UI prototypes from a single natural-language prompt.
Expanded Context: Gemini 3 supports a 1-million-token context window, which lets it process entire codebases or long documents without losing coherence.
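
To get a feel for what a 1 million token window means in practice, the sketch below estimates whether a set of documents would fit. It assumes roughly 4 characters per token, which is a common rule of thumb for English text and not Gemini's actual tokenizer. The names estimate_tokens and fits_in_context are hypothetical helpers written for this illustration, so treat the result as a ballpark only.

```python
import math

CONTEXT_LIMIT = 1_000_000  # context window size stated in the article
CHARS_PER_TOKEN = 4.0      # assumption: rough heuristic, not Gemini's real tokenizer


def estimate_tokens(text, chars_per_token=CHARS_PER_TOKEN):
    """Roughly estimate the token count of a string (hypothetical helper)."""
    return math.ceil(len(text) / chars_per_token)


def fits_in_context(documents, limit=CONTEXT_LIMIT, reserve=0):
    """Return (estimated_total_tokens, fits) for a list of strings.

    `reserve` is the number of tokens to keep free for the prompt and reply.
    """
    total = sum(estimate_tokens(doc) for doc in documents)
    return total, total + reserve <= limit


if __name__ == "__main__":
    docs = ["x" * 2_000_000, "y" * 1_000_000]  # stand-ins for real file contents
    total, ok = fits_in_context(docs, reserve=50_000)
    print(f"Estimated tokens: {total}, fits in window: {ok}")
```

For real workloads, an exact count from the provider's own token-counting tool is a better guide than this heuristic, since code and non-English text often tokenize less efficiently.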

Benchmark Improvements (vs. Gemini 2.5 Pro)

Benchmark	Gemini 2.5 Pro	Gemini 3 Pro
Humanity's Last Exam (Advanced Reasoning)	21.6%	37.5%
GPQA Diamond (PhD-level Knowledge)	88.3%	91.9%
SWE-bench Verified (Agentic Coding)	59.6%	76.2%
MMMU-Pro (Multimodal Reasoning)	68.0%	81.0%
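
The size of each improvement is easier to compare once the gains are computed. The short sketch below takes the scores from the table above and reports both the absolute gain in percentage points and the relative gain over Gemini 2.5 Pro. The function name gains is a hypothetical helper for this illustration.

```python
# Scores copied from the table above, in percent: (Gemini 2.5 Pro, Gemini 3 Pro).
SCORES = {
    "Humanity's Last Exam": (21.6, 37.5),
    "GPQA Diamond": (88.3, 91.9),
    "SWE-bench Verified": (59.6, 76.2),
    "MMMU-Pro": (68.0, 81.0),
}


def gains(scores):
    """Return {benchmark: (gain in percentage points, relative gain in percent)}."""
    result = {}
    for name, (old, new) in scores.items():
        absolute = round(new - old, 1)
        relative = round((new - old) / old * 100, 1)
        result[name] = (absolute, relative)
    return result


if __name__ == "__main__":
    for name, (absolute, relative) in gains(SCORES).items():
        print(f"{name}: +{absolute} points ({relative}% relative gain)")
```

The largest absolute jumps are on SWE-bench Verified and Humanity's Last Exam, while GPQA Diamond moves least because the earlier model was already close to 90%.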
