With Model Router, you can access the most popular models with a single endpoint and bill. Experiment with new models and scale your app without worrying about the underlying infrastructure.

Setup

Getting started with Model Router is simple. Generate an API key and drop it into your favorite framework.

Generate API key

API keys for Model Router are generated within your workspace. Generate a key by logging into the console and navigating to Model routerAPI keys.

Connect via framework

Model Router integrates easily into the most popular frameworks.

Model Router is a drop-in replacement for OpenAI’s API.

import openai

# Configure with your Hypermode Workspace API key and Hypermode Model Router base url
client = openai.OpenAI(
    api_key="<YOUR_HYP_WKS_KEY>",
    base_url="https://models.hypermode.host/v1",
)

# Set up the request
response = client.chat.completions.create(
    model="meta-llama/llama-4-scout-17b-16e-instruct",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is Modus?"},
    ],
    max_tokens=150,
    temperature=0.7,
)

# Print the response
print(response.choices[0].message.content)

Connect directly via API

You can also access the API directly.

curl -X POST \
  https://models.hypermode.host/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $YOUR_HYP_WKS_KEY" \
  -d '{
    "model": "meta-llama/llama-4-scout-17b-16e-instruct",
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is Dgraph?"}
    ],
    "max_tokens": 150,
    "temperature": 0.7
  }'

Available models

Hypermode provides a variety of the most popular open source and commercial models.

We’re constantly evaluating model usage in determining new models to add to our catalog. Interested in using a model not listed here? Let us know at [email protected].

Generation

Large language models provide text generation and reasoning capabilities.

ProviderModelSlug
AnthropicClaude 3.7 Sonnetclaude-3-7-sonnet-20250219
AnthropicClaude 3.5 Sonnet (latest)claude-3-5-sonnet-20241022
AnthropicClaude 3.5 Sonnetclaude-3-5-sonnet-20240620
AnthropicClaude 3.5 Sonnetclaude-3-5-sonnet-20240229
AnthropicClaude 3.5 Haiku (latest)claude-3-5-haiku-20241022
AnthropicClaude 3.5 Haikuclaude-3-5-haiku-20240307
AnthropicClaude 3.5 Opusclaude-3-5-opus-20240229
AnthropicClaude 2.1claude-2.1
AnthropicClaude 2.0claude-2.0
DeepSeekDeepSeek-R1-Distill-Llamadeepseek-ai/deepseek-r1-distill-llama-8b
GoogleGemini 2.5 Progemini-2.5-pro-exp-03-25
GoogleGemini 2.5 Pro Previewgemini-2.5-pro-preview-03-25
GoogleGemini 2.5 Flash Previewgemini-2.5-flash-preview-04-17
GoogleGemini 2.0 Progemini-2.0-pro-exp-02-05
GoogleGemini 2.0 Flash Lite (latest)gemini-2.0-flash-lite
GoogleGemini 2.0 Flash Litegemini-2.0-flash-lite-001
GoogleGemini 2.0 Flash Lite Preview (latest)gemini-2.0-flash-lite-preview-02-05
GoogleGemini 2.0 Flash Lite Previewgemini-2.0-flash-lite-preview
GoogleGemini 2.0 Flash Image Generationgemini-2.0-flash-exp-image-generation
GoogleGemini 2.0 Flash Thinking (latest)gemini-2.0-flash-thinking-exp
GoogleGemini 2.0 Flash Thinkinggemini-2.0-flash-thinking-exp-1219
GoogleGemini 2.0 Flash Thinkinggemini-2.0-flash-thinking-exp-01-21
GoogleGemini 2.0 Flash Livegemini-2.0-flash-live-001
GoogleGemini 2.0 Flash (latest)gemini-2.0-flash
GoogleGemini 2.0 Flashgemini-2.0-flash-001
GoogleGemini 2.0 Flashgemini-2.0-flash-exp
GoogleGemini 1.5 Pro (latest)gemini-1.5-pro-latest
GoogleGemini 1.5 Progemini-1.5-pro
GoogleGemini 1.5 Progemini-1.5-pro-002
GoogleGemini 1.5 Progemini-1.5-pro-001
GoogleGemini 1.5 Flash (latest)gemini-1.5-flash-latest
GoogleGemini 1.5 Flashgemini-1.5-flash
GoogleGemini 1.5 Flashgemini-1.5-flash-002
GoogleGemini 1.5 Flashgemini-1.5-flash-001
GoogleGemini 1.5 Flash 8B (latest)gemini-1.5-flash-8b-latest
GoogleGemini 1.5 Flash 8Bgemini-1.5-flash-8b
GoogleGemini 1.5 Flash 8Bgemini-1.5-flash-8b-exp-0924
GoogleGemini 1.5 Flash 8Bgemini-1.5-flash-8b-exp-0827
GoogleGemini 1.5 Flash 8Bgemini-1.5-flash-8b-001
GoogleGemini Pro Visiongemini-pro-vision
MetaLlama 4 Scoutmeta-llama/llama-4-scout-17b-16e-instruct
MetaLlama 3.2meta-llama/llama-3.2-3b-instruct
OpenAIGPT 4.5 Preview (latest)gpt-4.5-preview
OpenAIGPT 4.5 Previewgpt-4.5-preview-2025-02-27
OpenAIO3 Mini (latest)o3-mini
OpenAIO3 Minio3-mini-2025-01-31
OpenAIO1 (latest)o1
OpenAIO1o1-2024-12-17
OpenAIO1 Preview (latest)o1-preview
OpenAIO1 Previewo1-preview-2024-09-12
OpenAIO1 Mini (latest)o1-mini
OpenAIO1 Minio1-mini-2024-09-12
OpenAIGPT 4.1 (latest)gpt-4.1
OpenAIGPT 4.1gpt-4.1-2025-04-14
OpenAIGPT 4.1 Mini (latest)gpt-4.1-mini
OpenAIGPT 4.1 Minigpt-4.1-mini-2025-04-14
OpenAIGPT 4.1 Nano (latest)gpt-4.1-nano
OpenAIGPT 4.1 Nanogpt-4.1-nano-2025-04-14
OpenAIGPT 4o (latest)gpt-4o
OpenAIGPT 4ogpt-4o-2024-11-20
OpenAIGPT 4ogpt-4o-2024-08-06
OpenAIGPT 4ogpt-4o-2024-05-13
OpenAIGPT 4o Mini (latest)gpt-4o-mini
OpenAIGPT 4o Minigpt-4o-mini-2024-07-18
OpenAIGPT 4o Audio Preview (latest)gpt-4o-audio-preview
OpenAIGPT 4o Audio Previewgpt-4o-audio-preview-2024-12-17
OpenAIGPT 4o Audio Previewgpt-4o-audio-preview-2024-10-01
OpenAIGPT 4o Search Preview (latest)gpt-4o-search-preview
OpenAIGPT 4o Search Previewgpt-4o-search-preview-2025-03-11
OpenAIGPT 4o Search Previewgpt-4o-search-preview-2025-03-11
OpenAIChatGPT 4ochatgpt-4o-latest
OpenAIGPT 4 (latest)gpt-4
OpenAIGPT 4gpt-4-0613
OpenAIGPT 4 Turbogpt-4-turbo-2024-04-09
OpenAIGPT 4 Turbo Previewgpt-4-turbo-preview
OpenAIGPT 4 Preview (latest)gpt-4-1106-preview
OpenAIGPT 4 Previewgpt-4-0125-preview
OpenAIGPT 3.5 Turbo (latest)gpt-3.5-turbo
OpenAIGPT 3.5 Turbogpt-3.5-turbo-1106
OpenAIGPT 3.5 Turbogpt-3.5-turbo-0125

Embedding

Embedding models provide vector representations of text for similarity matching and other applications.

ProviderModelSlug
Hugging FaceMiniLM-L6-v2sentence-transformers/all-MiniLM-L6-v2
Nomic AIEmbed Text V1.5nomic-ai/nomic-embed-text-v1.5
OpenAIEmbedding 3 Largetext-embedding-3-large
OpenAIEmbedding 3 Smalltext-embedding-3-small
OpenAIADA Embeddingtext-embedding-ada-002

Logging

By default, all model invocations are logged for future display in the console. If you’d like to opt out of model logging, please contact us.