LLM Settings
Configure language model providers, manage API keys, set up fallback models, and connect custom endpoints for your AI agents.

Supported Providers
8bit-ai supports multiple LLM providers, giving you the flexibility to choose the best model for each use case. You can configure multiple providers and assign different models to different agents.
| Provider | Models Available | Strengths | Pricing Model |
|---|---|---|---|
| OpenAI | GPT-4 Turbo, GPT-4o, GPT-4o Mini | Strong reasoning, broad ecosystem | Pay-per-token |
| Anthropic | Claude Sonnet 4, Claude Haiku | Long context, nuanced output | Pay-per-token |
| Gemini Pro, Gemini Flash | Multimodal, cost-effective | Pay-per-token | |
| Open Source | Llama 3, Mistral, Mixtral | Self-hosted, no data leakage | Hosting cost only |
Provider Comparison
Provider Configuration
Configure each provider with your API keys and preferred default models. API keys are encrypted at rest and never exposed in logs or API responses.
OpenAI Configuration
Anthropic Configuration
Google Configuration
API Key Security
Fallback Models
Configure fallback models to ensure high availability. If the primary model is unavailable or rate-limited, the system automatically routes to the fallback model.
Automatic Fallback Chain
Define an ordered list of fallback models. The system attempts the primary model first, then falls through the chain based on configurable conditions.
Fallback Conditions
High Availability
Custom Endpoints
Connect self-hosted or third-party LLM endpoints that are compatible with the OpenAI API format. This includes local deployments of Llama, Mistral, vLLM, and other open-source models.
Self-Hosted Model Requirements
- Endpoint must expose an OpenAI-compatible chat completions API
- HTTPS is required for production endpoints
- Minimum throughput of 10 tokens/second recommended
- Support for streaming responses is optional but recommended
Supported Formats
Latency Considerations
Usage Monitoring
Track token usage and costs across all configured providers to optimize spending.