Skip to content

Model Catalog

ai& currently serves the open-weight models below. The list is dynamic — each organization sees only what it has access to, so the source of truth is always GET /v1/models.

Free to prototype

google/gemma-3-27b-it and qwen/qwen3.5-9b — no charges, ideal for evals.

Best price-to-performance

openai/gpt-oss-120b at $0.15 / $0.60 per 1M tokens — reasoning + tool calling.

Multimodal

google/gemma-4-31b-it for image + video, moonshotai/kimi-k2.6 for image + document.

Largest context

Every Qwen3.5 variant, google/gemma-4-31b-it, and moonshotai/kimi-k2.6 — 262K tokens.

Model IDCapabilitiesContextInput / 1MOutput / 1M
google/gemma-3-27b-it131KFreeFree
qwen/qwen3.5-9breasoning, tool_calling262KFreeFree
openai/gpt-oss-120breasoning, tool_calling131K$0.15$0.60
google/gemma-4-31b-itreasoning, tool_calling, vision, video, document262K$0.20$0.50
qwen/qwen3.5-27breasoning, tool_calling262K$0.25$1.80
deepseek-ai/deepseek-v3.2reasoning, tool_calling164K$0.50$1.50
qwen/qwen3.5-397b-a17breasoning, tool_calling262K$0.55$3.50
moonshotai/kimi-k2.6reasoning, tool_calling, vision, document262K$0.85$3.50
zai-org/glm-5reasoning, tool_calling203K$0.95$2.90
zai-org/glm-5.1reasoning, tool_calling203K$1.40$4.40

Sorted by output price, low to high. Prices are USD per million tokens — see Pricing for how the formula computes a per-request cost.

CapabilityMeaning
reasoningSupports reasoning_effort and emits internal reasoning tokens (charged as output).
tool_callingAccepts tools / tool_choice. See Tool Calling.
visionAccepts image inputs (image_url or file_id). See Vision.
videoAccepts video inputs by file_id. See Video Understanding.
documentAccepts PDF / document inputs.

A capability absent from the list means requests using that feature will be rejected at validation time.

Terminal window
curl https://api.aiand.com/v1/models \
-H "Authorization: Bearer sk-your-api-key"
{
"object": "list",
"data": [
{
"id": "openai/gpt-oss-120b",
"object": "model",
"name": "openai/gpt-oss-120b",
"owned_by": "ai&",
"provider": "openai",
"context_window": 131072,
"capabilities": ["reasoning", "tool_calling"],
"description": "OpenAI GPT OSS 120B",
"input_per_1m": "0.150000",
"output_per_1m": "0.600000",
"created": 1775474514
}
]
}
FieldTypeMeaning
idstringThe value to pass as "model" in chat / responses / messages.
providerstringThe lab that released the open weights (openai, google, qwen, …). All models are hosted on ai& infrastructure regardless of provider.
context_windowintMax combined input + output tokens.
capabilitiesstring[]Supported features — see the table above.
input_per_1mstringUSD per 1 million input tokens. Numeric stored as a string for precision.
output_per_1mstringUSD per 1 million output tokens.

GET /v1/models with an anthropic-version header returns the Anthropic shape — data[].display_name, created_at, has_more, first_id, last_id. Pricing and capabilities are only on the OpenAI surface.