AI Models & Providers
dassi supports multiple AI models. You can use managed credits (default, no setup required), bring your own API key for a named provider, or connect any OpenAI-compatible endpoint.
Managed credits (default)
Section titled “Managed credits (default)”When you sign in, dassi automatically provisions managed credits. No API key is needed — just pick a model and start working.
Available models depend on your plan:
| Model | Free Trial | Paid plans |
|---|---|---|
| Gemini 3.5 Flash | yes | yes |
| Gemini 3 Pro | yes | yes |
| Claude Haiku 4.5 | — | yes |
| Claude Sonnet 4.5 | — | yes |
| Claude Opus 4.6 | — | yes |
| Gemini 3 Flash | — | yes |
| DeepSeek V3.2 | — | yes |
| GLM 4.7 | — | yes |
The default model is Gemini 3.5 Flash. You can switch models in Settings at any time.
Choosing a model
Section titled “Choosing a model”- Gemini 3.5 Flash — fast and cost-efficient. Good all-around default choice
- Gemini 3 Pro — strong general-purpose model, good for complex reasoning
- Claude Sonnet 4.5 — excellent for writing, analysis, and complex reasoning
- Claude Haiku 4.5 — fastest response times, good for simple tasks
- Claude Opus 4.6 — most capable, best for complex multi-step tasks
- Gemini 3 Flash — very fast, good for quick questions
Bring your own key (BYOK)
Section titled “Bring your own key (BYOK)”If you prefer to use your own OpenAI API key:
- Open Settings (gear icon in the sidebar toolbar)
- Select BYOK as the billing mode
- Enter your OpenAI API key
- Choose a model:
| Model | Description |
|---|---|
| GPT-5 | Most capable OpenAI model |
| GPT-5 Mini | Balanced performance and cost |
| GPT-5 Nano | Fastest, most affordable |
Getting an OpenAI API key
Section titled “Getting an OpenAI API key”- Sign up at platform.openai.com
- Navigate to API Keys in your account settings
- Click Create new secret key
- Copy the key and paste it in dassi’s settings
Custom (OpenAI-compatible) endpoint
Section titled “Custom (OpenAI-compatible) endpoint”You can connect any server that speaks the OpenAI Chat Completions API, including:
- Azure OpenAI — a model deployed in your Azure subscription
- Ollama — local models running on your machine
- LiteLLM — a proxy that routes to many providers under one endpoint
- vLLM and other self-hosted inference servers
Connecting
Section titled “Connecting”- Open Settings (gear icon in the sidebar toolbar) and select the Connection tab
- Find the Custom (OpenAI-compatible) card and click Add endpoint
- Enter the Base URL of your endpoint — for example,
https://my-resource.openai.azure.com/openai/v1for Azure orhttp://localhost:11434for a local Ollama server - Enter the Model ID — the exact identifier your server expects, e.g.
gpt-4o,llama3.2, ormistral-7b - Enter your API key. For Ollama running locally, enter any non-empty string (e.g.
ollama) - Click Connect — Chrome will ask to allow access to that endpoint, then dassi verifies the connection
Supported model families
Section titled “Supported model families”Dassi accepts model IDs from these families: GPT, Claude, Gemini/Gemma, Grok, Kimi/Moonshot, DeepSeek, Llama, Qwen, Mistral/Mixtral, Phi, Command-R, GLM, Nemotron, and Yi. If your model ID doesn’t match a supported family, the connection will be blocked with a message telling you which families are accepted.
- Local servers using plain HTTP (e.g.
http://localhost:11434) are allowed. Remote endpoints must use HTTPS. - Dassi normalizes your base URL to end with
/v1, so enteringhttp://localhost:11434orhttp://localhost:11434/v1connects to the same endpoint.
How dassi uses models
Section titled “How dassi uses models”dassi uses your selected model as the main reasoning engine. For certain visual tasks (like clicking at specific coordinates on a page), dassi automatically uses a faster secondary model to keep things quick and cost-efficient. You don’t need to configure this — it happens automatically.
Switching models
Section titled “Switching models”You can switch models at any time from the Settings page. The change takes effect for your next message. Your conversation history is preserved when switching models.