Model Catalog

ai& currently serves the open-weight models below. The list is dynamic — each organization sees only what it has access to, so the source of truth is always GET /v1/models.

Quick picks

Free to prototype

qwen/qwen3.6-27b — no charges, ideal for evals.

Best price-to-performance

deepseek-ai/deepseek-v4-flash at $0.15 input / $0.25 output per 1M tokens, with a 1M-token context window.

Multimodal

google/gemma-4-31b-it for image, video, and document input; moonshotai/kimi-k2.7-code for image and document input.

Largest context

deepseek-ai/deepseek-v4-flash, deepseek-ai/deepseek-v4-pro, and zai-org/glm-5.2 — 1M tokens.

Available models

Model ID	Capabilities	Context	Input / 1M	Output / 1M
`qwen/qwen3.6-27b`	reasoning, tool_calling	262K	Free	Free
`deepseek-ai/deepseek-v4-flash`	reasoning, tool_calling	1M	$0.15	$0.25
`google/gemma-4-31b-it`	reasoning, tool_calling, vision, video, document	262K	$0.20	$0.50
`openai/gpt-oss-120b`	reasoning, tool_calling	131K	$0.15	$0.60
`deepseek-ai/deepseek-v4-pro`	reasoning, tool_calling	1M	$1.00	$2.50
`moonshotai/kimi-k2.7-code`	reasoning, tool_calling, vision, document	262K	$0.75	$3.50
`moonshotai/kimi-k2.6`	reasoning, tool_calling, vision, document	262K	$0.85	$3.50
`zai-org/glm-5.2`	reasoning, tool_calling	1M	$1.00	$4.00
`zai-org/glm-5.1`	reasoning, tool_calling	203K	$1.40	$4.40

Sorted by output price, low to high. Prices are in USD per million tokens. See Pricing for the formula that computes per-request cost.
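As a rough illustration of how per-million prices translate into a request cost, the sketch below assumes a simple linear formula: tokens divided by one million, times the listed price, summed over input and output. Both that formula and the `estimate_cost` helper are assumptions made for illustration; the Pricing page remains the authority. Reasoning tokens are charged as output, so they belong in `output_tokens`.

```python
from decimal import Decimal

def estimate_cost(input_tokens: int, output_tokens: int,
                  input_per_1m: str, output_per_1m: str) -> Decimal:
    """Linear estimate (assumed formula): tokens / 1M * price, input plus output."""
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) / million * Decimal(input_per_1m)
    output_cost = Decimal(output_tokens) / million * Decimal(output_per_1m)
    return input_cost + output_cost

# openai/gpt-oss-120b from the table: $0.15 input, $0.60 output per 1M tokens.
# 12,000 input tokens and 3,000 output tokens (reasoning tokens included).
cost = estimate_cost(12_000, 3_000, "0.15", "0.60")
print(f"estimated cost: ${cost}")
```

Prices are passed as strings and handled with `Decimal` so that small per-request amounts do not pick up floating-point rounding noise.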

Capability values

Capability	Meaning
`reasoning`	Emits internal reasoning tokens (charged as output) and accepts `reasoning_effort`. The accepted values are listed in that model’s `reasoning_efforts` field, because models don’t share one vocabulary.
`tool_calling`	Accepts `tools` / `tool_choice`. See Tool Calling.
`vision`	Accepts image inputs (`image_url` or `file_id`). See Vision.
`video`	Accepts video inputs by `file_id`. See Video Understanding.
`document`	Accepts PDF / document inputs.

If a capability is absent from a model’s `capabilities` list, requests that use that feature are rejected at validation time.
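Since unsupported features are rejected at validation time, it can help to check a model's capabilities before building a request. The following is a minimal sketch, assuming model entries shaped like the GET /v1/models response. `models_with` is a hypothetical helper, and the `catalog` list is hand-copied from the table above rather than fetched live.

```python
def models_with(models, *required):
    """Return the IDs of models whose capabilities include every required value."""
    needed = set(required)
    return [m["id"] for m in models if needed <= set(m.get("capabilities") or [])]

# Hand-copied sample entries; in practice, use the data from GET /v1/models.
catalog = [
    {"id": "openai/gpt-oss-120b",
     "capabilities": ["reasoning", "tool_calling"]},
    {"id": "google/gemma-4-31b-it",
     "capabilities": ["reasoning", "tool_calling", "vision", "video", "document"]},
    {"id": "moonshotai/kimi-k2.6",
     "capabilities": ["reasoning", "tool_calling", "vision", "document"]},
]

print(models_with(catalog, "vision", "video"))
print(models_with(catalog, "tool_calling"))
```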

Listing your models programmatically

curl https://api.aiand.com/v1/models \
  -H "Authorization: Bearer sk-your-api-key"

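from openai import OpenAI

# Point the OpenAI SDK at the ai& endpoint.
client = OpenAI(base_url="https://api.aiand.com/v1", api_key="sk-...")
# context_window and capabilities are fields beyond the standard OpenAI model schema.
for m in client.models.list().data:
    print(m.id, m.context_window, m.capabilities)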

import OpenAI from "openai";

const client = new OpenAI({ baseURL: "https://api.aiand.com/v1", apiKey: "sk-..." });
const { data } = await client.models.list();
for (const m of data) console.log(m.id, m.context_window, m.capabilities);

Response shape (OpenAI surface)

{
  "object": "list",
  "data": [
    {
      "id": "openai/gpt-oss-120b",
      "object": "model",
      "name": "openai/gpt-oss-120b",
      "owned_by": "ai&",
      "provider": "openai",
      "context_window": 131072,
      "capabilities": ["reasoning", "tool_calling"],
      "reasoning_efforts": ["low", "medium", "high"],
      "reasoning_effort_default": "medium",
      "description": "OpenAI GPT OSS 120B",
      "currency": "usd",
      "input_per_1m": "0.150000",
      "output_per_1m": "0.600000",
      "created": 1775474514
    }
  ]
}

Field	Type	Meaning
`id`	string	The value to pass as `"model"` in chat / responses / messages.
`provider`	string	The lab that released the open weights (`openai`, `google`, `qwen`, …). All models are hosted on ai& infrastructure regardless of provider.
`context_window`	int	Max combined input + output tokens.
`capabilities`	string[]	Supported features — see the table above.
`reasoning_efforts`	string[] \| null	The `reasoning_effort` values this model accepts; any other value is rejected with a 400. `null` on models that take no effort parameter.
`reasoning_effort_default`	string \| null	The level a UI should preselect for this model. Not applied server-side: omitting `reasoning_effort` leaves the upstream’s own default in place.
`currency`	string	Your organization’s billing currency (`usd` or `jpy`). Set at org creation.
`input_per_1m`	string	Price per 1 million input tokens, in your billing currency. A decimal value serialized as a string to preserve precision.
`output_per_1m`	string	Price per 1 million output tokens, in your billing currency.
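The field semantics above suggest two client-side habits: validate `reasoning_effort` against the model's own `reasoning_efforts` before sending, and parse prices as decimals rather than floats. The sketch below shows one way to do both. `effort_kwargs` is a hypothetical helper, and silently dropping the parameter for models whose `reasoning_efforts` is `null` is a design choice made here, not documented API behavior.

```python
from decimal import Decimal
from typing import Optional

# Trimmed copy of the sample entry from the response shown above.
model = {
    "id": "openai/gpt-oss-120b",
    "reasoning_efforts": ["low", "medium", "high"],
    "reasoning_effort_default": "medium",
    "input_per_1m": "0.150000",
    "output_per_1m": "0.600000",
}

def effort_kwargs(model: dict, requested: Optional[str]) -> dict:
    """Build the optional reasoning_effort request argument (hypothetical helper)."""
    allowed = model.get("reasoning_efforts")
    if requested is None or not allowed:
        # Nothing requested, or the model takes no effort parameter: send nothing.
        return {}
    if requested not in allowed:
        # The API would reject this with a 400; fail early with a clearer message.
        raise ValueError(f"{model['id']} accepts {allowed}, got {requested!r}")
    return {"reasoning_effort": requested}

# Prices arrive as strings; Decimal keeps them exact.
input_price = Decimal(model["input_per_1m"])
output_price = Decimal(model["output_per_1m"])

print(effort_kwargs(model, model["reasoning_effort_default"]), input_price, output_price)
```

Because `reasoning_effort_default` is not applied server-side, a client that wants that level has to send it explicitly, as the last line does.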

Anthropic surface

GET /v1/models with an anthropic-version header returns the Anthropic shape: data[].display_name and data[].created_at, plus top-level has_more, first_id, and last_id. Pricing and capabilities are available only on the OpenAI surface.
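To make the header requirement concrete, the sketch below builds such a request with the Python standard library without sending it. Several details are assumptions: the `anthropic_models_request` helper is hypothetical, the version string `2023-06-01` is the value Anthropic's own API uses and may not be the only one ai& accepts, and Bearer authentication is carried over from the curl example above.

```python
import urllib.request

def anthropic_models_request(api_key: str, version: str = "2023-06-01") -> urllib.request.Request:
    """Build (but do not send) a GET /v1/models request for the Anthropic surface."""
    return urllib.request.Request(
        "https://api.aiand.com/v1/models",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed: same auth as the OpenAI surface
            "anthropic-version": version,          # presence of this header selects the Anthropic shape
        },
        method="GET",
    )

req = anthropic_models_request("sk-your-api-key")
print(req.get_method(), req.full_url)

# To send it and read the Anthropic-shaped fields:
#   import json
#   with urllib.request.urlopen(req) as resp:
#       page = json.load(resp)
#   print([m["display_name"] for m in page["data"]], page["has_more"])
```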