Google
Gemini 2.5 Flash Preview
google/gemini-2.5-flash-preview
Gemini 2.5 models are thinking models, capable of reasoning through their thoughts before responding. Instead of immediately generating an output, the model can perform a "thinking" process to better understand the prompt, break down complex tasks, and plan a response. On complex tasks that require multiple steps of reasoning (like solving math problems or analyzing research questions), the thinking process allows the model to arrive at more accurate and comprehensive answers. In fact, Gemini 2.5 Flash performs strongly on Hard Prompts in LMArena, second only to 2.5 Pro.
Capability
Vision Support
Tools
Function Calling
Context Window
1,048,576
Max Output Tokens
65,536
Using Gemini 2.5 Flash Preview with Python API
Using Gemini 2.5 Flash Preview with OpenAI compatible API
import openai
client = openai.Client(
api_key= '{your_api_key}',
base_url="https://api.model.box/v1",
)
response = client.chat.completions.create(
model="google/gemini-2.5-flash-preview",
messages: [
{
role: 'user',
content:
'introduce your self',
},
]
)
print(response)