Google
Google

Gemini 2.0 Flash Thinking Mode

google/gemini-2.0-flash-thinking

Gemini 2.0 Flash Thinking Mode is an experimental model that's trained to generate the "thinking process" the model goes through as part of its response. As a result, Thinking Mode is capable of stronger reasoning capabilities in its responses than the base Gemini 2.0 Flash model.

Capability

Vision Support

Tools

Function Calling

Context Window

32,000

Max Output Tokens

8,000

Using Gemini 2.0 Flash Thinking Mode with Python API

Using Gemini 2.0 Flash Thinking Mode with OpenAI compatible API

import openai

client = openai.Client(
  api_key= '{your_api_key}',
  base_url="https://api.model.box/v1",
)
response = client.chat.completions.create(
model="google/gemini-2.0-flash-thinking",
messages: [
  {
    role: 'user',
    content:
      'introduce your self',
    },
  ]
)
print(response)