Model Routing

arc routes prompts to two backends: Claude (via acpx) and Ollama (via httpx). The backend is selected automatically based on the model string.

Two dispatch paths

Claude via acpx

Any model string that does not start with ollama/ is dispatched via acpx, the Claude Code session manager.

acpx uses its own short model aliases, not Anthropic model IDs. When you run acpx --help, it lists the models the agent advertises:

Available models: default, sonnet, sonnet[1m], haiku

These are the values you put in your agent YAML and use in --model flags. Do not use full Anthropic model IDs like claude-sonnet-4-6 — acpx will reject them with:

acpx exited 1: Cannot apply --model "claude-sonnet-4-6": the ACP agent did not
advertise that model. Available models: default, sonnet, sonnet[1m], haiku.

The dispatcher calls:

acpx --format quiet \
  --cwd <workspace> \
  --model sonnet \
  --approve-all \
  --system-prompt "<concatenated system prompt>" \
  claude exec \
  --file <tmpfile>

--format quiet suppresses acpx's progress output. The prompt is written to a temp file to avoid shell quoting issues with multi-line prompts.

Ollama via httpx

Any model string starting with ollama/ is dispatched via httpx to an Ollama-compatible REST API. arc calls the /v1/chat/completions endpoint directly (no Ollama client library).

Model string format

Format	Backend	Example
acpx alias	acpx / Claude Code	`sonnet`, `haiku`, `default`
`ollama/<model>`	httpx / Ollama (local endpoint)	`ollama/qwen3:8b`
`ollama/<endpoint>/<model>`	httpx / named Ollama endpoint	`ollama/remote/qwen3:32b`

acpx model aliases

acpx advertises model aliases, not full Anthropic model IDs. Use these in your agent YAML and everywhere you specify a Claude model:

Alias	Maps to
`sonnet`	Claude Sonnet (current)
`haiku`	Claude Haiku (current)
`sonnet[1m]`	Claude Sonnet with 1M context
`default`	acpx's default model

To see which aliases your installed version of acpx supports, run:

acpx claude exec --help
# or check the error message after a dispatch attempt

Model resolution priority

When a prompt arrives, the effective model is resolved in this order:

--model CLI flag (or model field in the IPC request)
/model Discord command (sticky per channel, stored in daemon memory)
Agent config model field (the default)

The first non-null value wins. If the result is in allowed_models (or allowed_models is empty), dispatch proceeds. Otherwise, a DispatchError is returned.

allowed_models gatekeeping

allowed_models is a list of permitted models for an agent. If the list is empty, any model is accepted.

allowed_models:
  - sonnet
  - haiku
  - ollama/qwen3:8b

With this config, requests for opus or ollama/unknown:model will be rejected:

Error: Model 'opus' is not allowed for agent 'coach'.
Allowed: sonnet, haiku, ollama/qwen3:8b

This is enforced in the dispatcher before any backend call is made.

Ollama endpoint configuration

Ollama endpoints are named in config.yaml:

ollama:
  endpoints:
    local:
      url: http://localhost:11434/v1
    remote:
      url: http://192.168.1.100:11434/v1
    kyle:
      url: http://kyle-nuc.tailnet:11434/v1

The local endpoint is the default for ollama/<model> strings. Named endpoints are accessed with ollama/<endpoint>/<model>.

Using named endpoints

# Local Ollama (uses the 'local' endpoint)
arc ask --agent coach --model ollama/qwen3:8b "Hello"

# Named endpoint (kyle-nuc on Tailscale)
arc ask --agent coach --model ollama/kyle/qwen3:32b "Hello"

In an agent YAML:

model: ollama/remote/llama3.2:latest
allowed_models:
  - ollama/local/qwen3:8b
  - ollama/remote/llama3.2:latest

The endpoint name after ollama/ is looked up in config.ollama.endpoints. An unknown endpoint name raises: Unknown Ollama endpoint 'nonexistent'. Configured: local, remote, kyle

Per-cron-job model override

Individual cron jobs can override the agent's model:

jobs:
  heartbeat:
    schedule: "*/30 * * * *"
    agent: coach
    model: haiku   # cheaper for frequent runs
    prompt: "Read HEARTBEAT.md and follow it."

The model override follows the same allowed_models validation as any other override.

Routing the same agent through different models

# Sonnet for complex analysis
arc ask --agent coach --model sonnet \
  "Review my last month of training and identify patterns."

# Haiku for quick questions
arc ask --agent coach --model haiku \
  "What's today's workout?"

# Local Ollama model (free, private)
arc ask --agent coach --model ollama/qwen3:8b \
  "Summarize today's metrics."

In cron jobs, use cheaper models (haiku) for frequent tasks and more capable models (sonnet) for complex weekly or monthly tasks.

Local context files for Ollama

Ollama agents cannot read the filesystem via tools. Use local_context_files to inject file contents into the request:

name: trainer
workspace: /workspace/fitness-coach
model: ollama/qwen3:8b
local_context_files:
  - programs/current.md
  - weeks/current.md

The files are injected as a system message before the user prompt. This gives the Ollama model the necessary context without needing filesystem access.

Error handling

Error	Cause	Resolution
`Cannot apply --model "..."`	Full Anthropic model ID used instead of acpx alias	Use `sonnet`, `haiku`, etc. instead of `claude-sonnet-4-6`
`Model 'X' is not allowed`	Model not in `allowed_models`	Add to `allowed_models` or remove the list
`Unknown Ollama endpoint`	Endpoint name not in `config.ollama.endpoints`	Add the endpoint to config
`Cannot connect to Ollama`	Ollama not running or wrong URL	Start Ollama or fix `url` in config
`acpx timed out`	acpx took longer than `timeouts.acpx_request`	Increase the timeout or check acpx
`Ollama timed out`	Ollama took longer than `timeouts.ollama_request`	Increase the timeout or check Ollama

Two dispatch paths​

Claude via acpx​

Ollama via httpx​

Model string format​

acpx model aliases​

Model resolution priority​

allowed_models gatekeeping​

Ollama endpoint configuration​

Using named endpoints​

Per-cron-job model override​

Routing the same agent through different models​

Local context files for Ollama​

Error handling​