Nous.Providers.OpenAICompatible (nous v0.15.5)
View SourceGeneric OpenAI-compatible provider implementation.
Note: This module implements the underlying HTTP API for OpenAI-compatible endpoints. For normal usage, prefer the
custom:model prefix which routes throughNous.Providers.Customwith better configuration support.
Works with any server that implements the OpenAI Chat Completions API:
- Groq (api.groq.com)
- Together (api.together.xyz)
- OpenRouter (openrouter.ai)
- LM Studio (localhost)
- Ollama (localhost)
- vLLM (localhost)
- SGLang (localhost)
- Mistral (api.mistral.ai)
- Any other OpenAI-compatible endpoint
For OpenAI-specific features (Responses API, Assistants, etc.), use Nous.Providers.OpenAI.
Configuration
Configuration is looked up in the following precedence (highest to lowest):
Direct options passed to functions or
Nous.new/2:Nous.new("custom:my-model", base_url: "https://api.example.com/v1", api_key: "sk-..." )Environment variables:
export CUSTOM_BASE_URL="https://api.example.com/v1" export CUSTOM_API_KEY="sk-..."Application config (in
config/config.exs):config :nous, :custom, base_url: "https://api.example.com/v1", api_key: "sk-..."Defaults (used by this module directly):
base_url:"https://api.openai.com/v1"api_key: FromOPENAI_API_KEYenv var
Options
| Option | Type | Default | Description |
|---|---|---|---|
base_url | String.t() | "https://api.openai.com/v1" | API endpoint URL |
api_key | String.t() | nil | Authentication token |
organization | String.t() | nil | OpenAI organization ID |
timeout | non_neg_integer() | 60000 | Request timeout (ms) |
Usage
With the custom: prefix (Recommended)
Use this for any OpenAI-compatible endpoint:
# Groq
agent = Nous.new("custom:llama-3.1-70b",
base_url: "https://api.groq.com/openai/v1",
api_key: System.get_env("GROQ_API_KEY")
)
# Together AI
agent = Nous.new("custom:meta-llama/Llama-3-70b",
base_url: "https://api.together.xyz/v1",
api_key: System.get_env("TOGETHER_API_KEY")
)
# OpenRouter
agent = Nous.new("custom:anthropic/claude-3.5-sonnet",
base_url: "https://openrouter.ai/api/v1",
api_key: System.get_env("OPENROUTER_API_KEY")
)
# LM Studio (local)
agent = Nous.new("custom:qwen3",
base_url: "http://localhost:1234/v1"
)
# Using environment variables
# export CUSTOM_BASE_URL="http://localhost:1234/v1"
# export CUSTOM_API_KEY="not-needed"
agent = Nous.new("custom:my-model")Direct Provider Usage (Low-level)
For direct API access without the agent:
# Using with Groq
{:ok, response} = Nous.Providers.OpenAICompatible.chat(
%{model: "llama-3.1-70b", messages: messages},
base_url: "https://api.groq.com/openai/v1",
api_key: System.get_env("GROQ_API_KEY")
)
# Using with local LM Studio
{:ok, response} = Nous.Providers.OpenAICompatible.chat(
%{model: "qwen2", messages: messages},
base_url: "http://localhost:1234/v1"
)
# Streaming
{:ok, stream} = Nous.Providers.OpenAICompatible.chat_stream(params, opts)
Enum.each(stream, fn event -> IO.inspect(event) end)Backward Compatibility
The legacy openai_compatible: prefix still works and is equivalent to custom::
# Legacy (still works)
agent = Nous.new("openai_compatible:my-model", base_url: "...")
# Recommended
agent = Nous.new("custom:my-model", base_url: "...")See Nous.Providers.Custom for the dedicated custom provider implementation.
Summary
Functions
Get the API key from options, environment, or application config.
Get the base URL from options, application config, or default.
Count tokens in messages (rough estimate).
High-level request with message conversion, telemetry, and error wrapping.
High-level streaming request with message conversion and telemetry.
Functions
Get the API key from options, environment, or application config.
Lookup order:
:api_keyoption passed directly- Environment variable (OPENAI_API_KEY)
- Application config:
config :nous, openai_compatible, api_key: "..."
Get the base URL from options, application config, or default.
Lookup order:
:base_urloption passed directly- Application config:
config :nous, openai_compatible, base_url: "..." - Default: https://api.openai.com/v1
Count tokens in messages (rough estimate).
Override this in your provider for more accurate counting.
High-level request with message conversion, telemetry, and error wrapping.
Default implementation that:
- Converts messages to provider format
- Builds request params
- Calls chat/2
- Parses response
- Emits telemetry events
- Wraps errors
High-level streaming request with message conversion and telemetry.