Google
Google

Gemini 2.5 Flash Preview

google/gemini-2.5-flash-preview

Gemini 2.5 models are thinking models, capable of reasoning through their thoughts before responding. Instead of immediately generating an output, the model can perform a "thinking" process to better understand the prompt, break down complex tasks, and plan a response. On complex tasks that require multiple steps of reasoning (like solving math problems or analyzing research questions), the thinking process allows the model to arrive at more accurate and comprehensive answers. In fact, Gemini 2.5 Flash performs strongly on Hard Prompts in LMArena, second only to 2.5 Pro.

Capability

Vision Support

Tools

Function Calling

Context Window

1,048,576

Max Output Tokens

65,536

Using Gemini 2.5 Flash Preview with Python API

Using Gemini 2.5 Flash Preview with OpenAI compatible API

import openai

client = openai.Client(
  api_key= '{your_api_key}',
  base_url="https://api.model.box/v1",
)
response = client.chat.completions.create(
model="google/gemini-2.5-flash-preview",
messages: [
  {
    role: 'user',
    content:
      'introduce your self',
    },
  ]
)
print(response)