Gemini
Start new thread
trending
Zac Zuo

5d ago

Gemini 3.1 Flash Live - Making audio AI more natural and reliable

Gemini 3.1 Flash Live is Google’s new state-of-the-art native audio model. Built for low-latency, real-time dialogue, it excels at complex reasoning and function calling. It is the exact engine currently powering Gemini Live and Google Search Live.
Bunty Shah

3d ago

What is the most critical missing feature for Gemini's chat interface?

Hey everyone,

As much as I use Gemini day-to-day, the chat interface is starting to feel a bit cluttered as my history grows. I've been thinking about the quality-of-life improvements that would make the experience much smoother, but I'm curious to know what the community thinks is the highest priority.

If you could only pick one of these features for Google to implement next, which would it be?

fmerian

1mo ago

Gemini 3.1 Pro - A smarter model for your most complex tasks

3.1 Pro is designed for tasks where a simple answer isn’t enough. Building on the Gemini 3 series, 3.1 Pro represents a step forward in core reasoning. 3.1 Pro is a smarter, more capable baseline for complex problem-solving.
Ankit Sharma

4mo ago

Gemini 3 - Bring any idea to life with multimodal capabilities

Our most intelligent model that helps you bring any idea to life.
Zac Zuo

28d ago

Gemini 3.1 Flash-Lite - Best-in-class intelligence for your high-volume workloads

Gemini 3.1 Flash-Lite is the fastest and most cost-efficient model in the Gemini 3 series. At only $0.25 input and $1.50 output per million tokens, it beats 2.5 Flash with 2.5X faster first token and 45% higher output speed while matching or beating quality.
Zac Zuo

4mo ago

Gemini Deep Research Agent - The best research agent is open for devs

The Gemini Deep Research Agent is now available to developers via the Interactions API. Powered by Gemini 3.0 Pro, it autonomously plans, executes, and synthesizes multi-step research tasks.
Zac Zuo

2mo ago

Agentic Vision in Gemini - Agentic visual reasoning with code execution

Agentic Vision, a new capability introduced in Gemini 3 Flash, converts image understanding from a static act into an agentic process
Chris Messina

2yr ago

Gemini - Google's answer to GPT-4

Google's largest and most capable AI model. Built from the ground up to be multimodal, Gemini can generalize and seamlessly understand, operate across and combine different types of information, including text, images, audio, video and code.
Zac Zuo

8mo ago

Guided Learning in Gemini - From answers to understanding

Guided Learning is a new mode in Gemini that helps you learn and study by building a deep understanding instead of just getting answers.
Zac Zuo

10mo ago

Jules - Your asynchronous AI coding partner

Jules by Google Labs is an asynchronous AI coding agent that works on your GitHub repos. Understands project context, fixes bugs, builds features & offers audio changelogs. Public beta now open.
12
Next
Last