Google
Gemini 2.0 Flash Thinking Mode
google/gemini-2.0-flash-thinking
Gemini 2.0 Flash Thinking Mode is an experimental model that's trained to generate the "thinking process" the model goes through as part of its response. As a result, Thinking Mode is capable of stronger reasoning capabilities in its responses than the base Gemini 2.0 Flash model.
Capability
Vision Support
Tools
Function Calling
Context Window
32,000
Max Output Tokens
8,000
Using Gemini 2.0 Flash Thinking Mode with Python API
Using Gemini 2.0 Flash Thinking Mode with OpenAI compatible API
import openai
client = openai.Client(
api_key= '{your_api_key}',
base_url="https://api.model.box/v1",
)
response = client.chat.completions.create(
model="google/gemini-2.0-flash-thinking",
messages: [
{
role: 'user',
content:
'introduce your self',
},
]
)
print(response)