Infery.ai

145 models, one API

Switch between frontier and open-source models with a single credit balance. All models are OpenAI-compatible through our gateway.

Group by:

Chat & Text

87 models

OpenAI·46

Babbage 002

OpenAI

babbage-002

16K ctx

Computer Use Preview

OpenAI

computer-use-preview

200K ctx

ChatStreamingVision

Davinci 002

OpenAI

davinci-002

16K ctx

GPT-3.5 Turbo

OpenAI

gpt-3.5-turbo

16K ctx

ChatStreamingToolsJSON

GPT-4.1

OpenAI

gpt-4.1

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

GPT-4.1 Mini

OpenAI

gpt-4.1-mini

200K ctx

ChatStreamingVisionToolsJSON

GPT-4.1 Nano

OpenAI

gpt-4.1-nano

200K ctx

ChatStreamingToolsJSON

GPT-4o

OpenAI

gpt-4o

128K ctx

ChatStreamingVisionPDFToolsJSONImage output

GPT-4o (2024-05-13)

OpenAI

gpt-4o-2024-05-13

128K ctx

ChatStreamingVisionTools

GPT-4o Mini

OpenAI

gpt-4o-mini

128K ctx

ChatStreamingToolsJSON

GPT-4o Mini Search Preview

OpenAI

gpt-4o-mini-search-preview

128K ctx

ChatStreaming

GPT-4o Search Preview

OpenAI

gpt-4o-search-preview

128K ctx

ChatStreaming

GPT-4 Turbo

OpenAI

gpt-4-turbo

128K ctx

ChatStreamingVisionTools

GPT-5

OpenAI

gpt-5

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

GPT-5.1

OpenAI

gpt-5.1

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

GPT-5.1 Codex

OpenAI

gpt-5.1-codex

200K ctx

StreamingTools

GPT-5.1 Codex Max

OpenAI

gpt-5.1-codex-max

200K ctx

Streaming

GPT-5.1 Codex Mini

OpenAI

gpt-5.1-codex-mini

200K ctx

Streaming

GPT-5.2

OpenAI

gpt-5.2

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

GPT-5.2 Codex

OpenAI

gpt-5.2-codex

200K ctx

StreamingTools

GPT-5.2 Pro

OpenAI

gpt-5.2-pro

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

GPT-5.3 Codex

OpenAI

gpt-5.3-codex

200K ctx

StreamingTools

GPT-5.4

OpenAI

gpt-5.4

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

GPT-5.4 Mini

OpenAI

gpt-5.4-mini

200K ctx

ChatStreamingVisionToolsJSONImage output

GPT-5.4 Nano

OpenAI

gpt-5.4-nano

200K ctx

ChatStreamingToolsJSON

GPT-5.4 Pro

OpenAI

gpt-5.4-pro

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

GPT-5.5

OpenAI

gpt-5.5

272K ctx

ChatStreamingVisionPDFToolsJSONImage output

GPT-5.5 Pro

OpenAI

gpt-5.5-pro

272K ctx

ChatStreamingVisionPDFToolsJSONImage output

GPT-5 Codex

OpenAI

gpt-5-codex

200K ctx

StreamingTools

GPT-5 Mini

OpenAI

gpt-5-mini

200K ctx

ChatStreamingToolsJSON

GPT-5 Nano

OpenAI

gpt-5-nano

200K ctx

ChatStreamingToolsJSON

GPT-5 Pro

OpenAI

gpt-5-pro

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

GPT-5 Search API

OpenAI

gpt-5-search-api

200K ctx

ChatStreaming

GPT Realtime 1.5

OpenAI

gpt-realtime-1.5

200K ctx

ChatStreamingVisionImage output

GPT Realtime Mini

OpenAI

gpt-realtime-mini

200K ctx

ChatStreamingVisionImage output

o1

OpenAI

o1

200K ctx

ChatStreaming

o1-mini

OpenAI

o1-mini

200K ctx

ChatStreaming

o1-pro

OpenAI

o1-pro

200K ctx

ChatStreaming

o3

OpenAI

o3

200K ctx

ChatStreamingTools

o3 Deep Research

OpenAI

o3-deep-research

200K ctx

ChatStreaming

o3-mini

OpenAI

o3-mini

200K ctx

ChatStreamingTools

o3-pro

OpenAI

o3-pro

200K ctx

ChatStreamingTools

o4-mini

OpenAI

o4-mini

200K ctx

ChatStreamingTools

o4-mini Deep Research

OpenAI

o4-mini-deep-research

200K ctx

ChatStreaming

Omni Moderation

OpenAI

omni-moderation-latest

33K ctx

Vision

Text Moderation

OpenAI

text-moderation-latest

33K ctx

Anthropic·11

Claude 3.5 Haiku

Anthropic

claude-haiku-3-5

200K ctx

ChatStreamingTools

Claude Haiku 3

Anthropic

claude-haiku-3

200K ctx

ChatStreamingTools

Claude Haiku 4.5

Anthropic

claude-haiku-4.5

200K ctx

ChatStreamingVisionToolsJSONImage output

Claude Opus 4

Anthropic

claude-opus-4

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

Claude Opus 4.1

Anthropic

claude-opus-4.1

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

Claude Opus 4.5

Anthropic

claude-opus-4.5

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Claude Opus 4.6

Anthropic

claude-opus-4.6

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Claude Opus 4.7

Anthropic

claude-opus-4.7

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Claude Sonnet 4

Anthropic

claude-sonnet-4

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

Claude Sonnet 4.5

Anthropic

claude-sonnet-4.5

200K ctx

ChatStreamingVisionPDFToolsJSONImage output

Claude Sonnet 4.6

Anthropic

claude-sonnet-4.6

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Google·10

Gemini 2.5 Computer Use

Google

gemini-2-5-computer-use

1M ctx

ChatStreamingVisionImage output

Gemini 2.5 Flash

Google

gemini-2-5-flash

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Gemini 2.5 Flash-Lite

Google

gemini-2-5-flash-lite

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Gemini 2.5 Flash Native Audio

Google

gemini-2-5-flash-native-audio

1M ctx

ChatStreamingVisionImage output

Gemini 2.5 Pro

Google

gemini-2-5-pro

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Gemini 3.1 Flash-Lite Preview

Google

gemini-3.1-flash-lite-preview

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Gemini 3.1 Flash Live

Google

gemini-3.1-flash-live-preview

131K ctx

Gemini 3.1 Pro Preview

Google

gemini-3.1-pro-preview

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Gemini 3 Flash Preview

Google

gemini-3-flash-preview

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Gemini Robotics-ER 1.5

Google

gemini-robotics-er

1M ctx

ChatStreamingVisionImage output

xAI·5

Grok 4-1 Fast

xAI

grok-4-1-fast

2M ctx

ChatStreamingVisionToolsJSON

Grok 4-1 Fast Reasoning

xAI

grok-4-1-fast-reasoning

2M ctx

ChatStreamingVisionToolsJSON

Grok 4.20

xAI

grok-4.20

2M ctx

ChatStreamingVisionToolsJSON

Grok 4.20 Multi-Agent

xAI

grok-4.20-multi-agent

2M ctx

ChatStreamingVisionToolsJSON

Grok 4.20 Reasoning

xAI

grok-4.20-reasoning

2M ctx

ChatStreamingVisionToolsJSON

DeepSeek·2

DeepSeek V3.2

DeepSeek

deepseek-chat

128K ctx

ChatStreamingToolsJSON

DeepSeek V3.2 Reasoner

DeepSeek

deepseek-reasoner

128K ctx

ChatStreamingTools

Alibaba·13

Qwen3.5 Flash

Alibaba

qwen3-5-flash

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Qwen3.5 Omni Plus

Alibaba

qwen3-5-omni-plus

262K ctx

ChatStreamingVisionPDFImage output

Qwen3.5 Plus

Alibaba

qwen3-5-plus

1M ctx

ChatStreamingVisionPDFToolsJSONImage output

Qwen3 Coder Plus

Alibaba

qwen3-coder-plus

1M ctx

ChatStreamingTools

Qwen3 Max

Alibaba

qwen3-max

262K ctx

ChatStreamingVisionPDFToolsJSONImage output

Qwen3 Omni Flash

Alibaba

qwen3-omni-flash

66K ctx

ChatStreamingVisionPDFImage output

Qwen Flash

Alibaba

qwen-flash

1M ctx

ChatStreamingToolsJSON

Qwen Long

Alibaba

qwen-long

10M ctx

ChatStreaming

Qwen Plus

Alibaba

qwen-plus

1M ctx

ChatStreamingToolsJSON

Qwen Turbo

Alibaba

qwen-turbo

131K ctx

ChatStreamingToolsJSON

Qwen VL Max

Alibaba

qwen-vl-max

131K ctx

ChatStreamingVisionPDFImage output

Qwen VL Plus

Alibaba

qwen-vl-plus

131K ctx

ChatStreamingVisionPDFImage output

QwQ Plus

Alibaba

qwq-plus

131K ctx

ChatStreaming

Image generation

17 models

OpenAI·6

DALL-E 2

OpenAI

dall-e-2

Image output

DALL-E 3

OpenAI

dall-e-3

Image output

GPT Image 1

OpenAI

gpt-image-1

Image output

GPT Image 1.5

OpenAI

gpt-image-1.5

Image output

GPT Image 1 Mini

OpenAI

gpt-image-1-mini

Image output

GPT Image 2

OpenAI

gpt-image-2

Image output

Google·6

Imagen 4

Google

imagen-4

Image output

Imagen 4 Fast

Google

imagen-4-fast

Image output

Imagen 4 Ultra

Google

imagen-4-ultra

Image output

Nano Banana

Google

gemini-2-5-flash-image

Image output

Nano Banana 2

Google

gemini-3-1-flash-image

VisionImage output

Nano Banana Pro

Google

gemini-3-pro-image

VisionImage output

xAI·2

Grok Imagine Image

xAI

grok-imagine-image

Image output

Grok Imagine Image Pro

xAI

grok-imagine-image-pro

Image output

Alibaba·3

Qwen Image 2.0 Pro

Alibaba

qwen-image-2-0-pro

Image output

Wan 2.6 Image

Alibaba

wan2-7-image

Image output

Z-Image Turbo

Alibaba

z-image-turbo

Image output

Voice (TTS/STT)

16 models

OpenAI·7

GPT-4o Mini Transcribe

OpenAI

gpt-4o-mini-transcribe

GPT-4o Mini TTS

OpenAI

gpt-4o-mini-tts

GPT-4o Transcribe

OpenAI

gpt-4o-transcribe

GPT-4o Transcribe + Diarization

OpenAI

gpt-4o-transcribe-diarize

TTS-1

OpenAI

tts-1

TTS-1 HD

OpenAI

tts-1-hd

Whisper-1

OpenAI

whisper-1

Google·4

Gemini 2.5 Flash STT

Google

gemini-2-5-flash-stt

Gemini 2.5 Flash TTS

Google

gemini-2-5-flash-tts

Gemini 2.5 Pro TTS

Google

gemini-2-5-pro-tts

Google Cloud TTS

Google

google-cloud-tts

xAI·1

Grok TTS

xAI

grok-tts

Alibaba·4

CosyVoice V2

Alibaba

cosyvoice-v2

CosyVoice V3 Plus

Alibaba

cosyvoice-v3-plus

Paraformer V2

Alibaba

paraformer-v2

Qwen3 ASR Flash

Alibaba

qwen3-asr-flash

Video generation

11 models

OpenAI·2

Sora 2

OpenAI

sora-2

Sora 2 Pro

OpenAI

sora-2-pro

Google·6

Veo 2

Google

veo-2

Veo 3

Google

veo-3

Veo 3.1

Google

veo-3-1

Veo 3.1 Fast

Google

veo-3-1-fast

Veo 3.1 Lite

Google

veo-3-1-lite

Veo 3 Fast

Google

veo-3-fast

xAI·1

Grok Imagine Video

xAI

grok-imagine-video

Alibaba·2

Wan 2.7 Image-to-Video

Alibaba

wan2-7-i2v

Wan 2.7 Text-to-Video

Alibaba

wan2-7-t2v

Music

7 models

Google·2

Lyria 3 Clip

Google

lyria-3-clip

Lyria 3 Pro

Google

lyria-3-pro

Suno·5

Suno V4

Suno

suno-v4

Suno V4.5

Suno

suno-v4-5

Suno V4.5 Plus

Suno

suno-v4-5-plus

Suno V5

Suno

suno-v5

Suno V5.5

Suno

suno-v5-5

Embeddings

6 models

OpenAI·3

Text Embedding 3 Large

OpenAI

text-embedding-3-large

8K ctx

Text Embedding 3 Small

OpenAI

text-embedding-3-small

8K ctx

Text Embedding Ada 002

OpenAI

text-embedding-ada-002

8K ctx

Google·2

Gemini Embedding

Google

gemini-embedding-001

8K ctx

Gemini Embedding 2

Google

gemini-embedding-2

8K ctx

VisionPDFImage output

Alibaba·1

Qwen Text Embedding v3

Alibaba

qwen-text-embedding-v3

8K ctx

Rerank

1 models

Alibaba·1

Qwen3 Rerank

Alibaba

qwen3-rerank

33K ctx