External golden path (minimal first success)
Goal: From a cold clone, get inference + one surface + a health check without Discord, fleet, or chump-brain/. Time target: under 30 minutes on a fast connection (Rust + model pull dominate).
Discord: Optional. This path uses the web PWA as the default first surface; add Discord later if you want. Fleet (Pixel/Mabel) is a natural next step after first success.
Not in this path: Mabel/Pixel, provider cascade, ship heartbeat, launchd roles. See FLEET_ROLES.md and OPERATIONS.md for the full stack.
Prerequisites
| Requirement | Notes |
|---|---|
| Rust | Stable toolchain (rustup, cargo). Edition 2021 per Cargo.toml. |
| Git | Clone this repository. |
| Ollama | ollama.com — local OpenAI-compatible API on http://localhost:11434. |
| OS | macOS or Linux primary; Windows may work via WSL (not regularly tested here). |
Daily driver profile (recommended first stack)
Keep one inference profile until you intentionally switch (see .env.example header):
| Variable | Typical value |
|---|---|
| `OPENAI_API_BASE` | `http://localhost:11434/v1` (Ollama) |
| `OPENAI_API_KEY` | `ollama` |
| `OPENAI_MODEL` | e.g. `qwen2.5:14b` (must be pulled: `ollama pull …`) |
After `./run-web.sh` or `chump --web` is listening, run `./scripts/chump-preflight.sh` (or `chump --preflight`) to verify `/api/health`, `/api/stack-status`, `tool_policy`, and local `/v1/models` reachability. See OPERATIONS.md Preflight.
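Under the hood a preflight is just a series of HTTP checks. A minimal sketch, assuming the default ports in this doc (web on 3000, Ollama on 11434) — the real `chump-preflight.sh` may check more:

```shell
# Minimal preflight sketch: each endpoint passed in should answer with a 2xx.
preflight() {
  local rc=0 url
  for url in "$@"; do
    if curl -fsS -o /dev/null "$url"; then
      echo "ok   $url"
    else
      echo "FAIL $url"
      rc=1
    fi
  done
  return $rc
}

# Usage (with the defaults from this doc):
# preflight \
#   http://127.0.0.1:3000/api/health \
#   http://127.0.0.1:3000/api/stack-status \
#   http://127.0.0.1:11434/v1/models
```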
Steps
1. Clone and enter the repo
```
git clone <your-fork-or-upstream-url> chump
cd chump
```
2. Create a minimal .env
```
./scripts/setup-local.sh
```
Then edit .env:
- For web or CLI only, comment out `DISCORD_TOKEN` or set it empty so the config summary does not treat Discord as configured.
- You do not need `TAVILY_API_KEY`, `GITHUB_TOKEN`, or cascade keys for this path.
Minimal variables for Ollama (can also rely on run-local.sh defaults):
```
OPENAI_API_BASE=http://localhost:11434/v1
OPENAI_API_KEY=ollama
OPENAI_MODEL=qwen2.5:14b
```
Keep your real .env aligned with one stack: if you also set Hugging Face model ids, vLLM bases, or `CHUMP_INFERENCE_BACKEND=mistralrs`, Chump may still talk to Ollama with the wrong model name. For Week 1–2, use only the three lines above for `OPENAI_*` and leave mistral / MLX / cascade lines commented until you need them (see INFERENCE_PROFILES.md). `.env.example` starts with the same Ollama block.
One-shot overrides (optional): If .env still points at another profile but you want to force this path for a single command:
| Variable | When to use |
|---|---|
| `CHUMP_GOLDEN_PATH_OLLAMA=1` | After sourcing `.env`, forces `OPENAI_API_BASE`, `OPENAI_API_KEY`, and `OPENAI_MODEL` to the Ollama values above for that process only. |
| `CHUMP_USE_RELEASE=1` | Makes `./run-local.sh` run `cargo run --release --bin chump` (after `cargo build --release --bin chump`). |
Example:
```
CHUMP_USE_RELEASE=1 CHUMP_GOLDEN_PATH_OLLAMA=1 ./run-local.sh -- --check-config
```
3. Start Ollama and pull a model
Recommended on macOS (Homebrew Ollama): run the daemon under launchd so it survives crashes and restarts quickly:
```
brew services start ollama
ollama pull qwen2.5:14b
```
After `killall ollama`, `GET http://127.0.0.1:11434/api/tags` should return 200 again within about 10 seconds (typical respawn is a few seconds). Repeat anytime with `scripts/verify-ollama-respawn.sh`. Alternative: ChumpMenu can start/stop Ollama from the menu bar if you use the menu app daily. Avoid relying on a one-off `nohup ollama serve` in a shell profile unless you accept restarts when that shell exits.
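The respawn check boils down to polling one URL until it answers or a deadline passes. A sketch of that idea, assuming the endpoint and timing described above (the real `verify-ollama-respawn.sh` may differ):

```shell
# Poll a URL until it answers with a success status or the deadline passes.
wait_for_http_ok() {
  # $1 = URL, $2 = timeout in seconds
  local url="$1" deadline=$(( $(date +%s) + $2 ))
  while [ "$(date +%s)" -lt "$deadline" ]; do
    if curl -fsS -o /dev/null "$url"; then
      return 0
    fi
    sleep 1
  done
  return 1
}

# Usage after killing the daemon (launchd should respawn it):
# killall ollama
# wait_for_http_ok http://127.0.0.1:11434/api/tags 15 && echo "respawned"
```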
Manual / dev: ollama serve in a terminal is fine for a session; use another terminal for ollama pull ….
4. Build (first time)
```
cargo build
```
Release is optional for trying the app: `cargo build --release` for production-like latency.
5. Verify health (web path — recommended for external users)
Start the web server (PWA + API):
```
./run-web.sh
# or: ./run-local.sh -- --web --port 3000
```
Check JSON health:
```
curl -s http://127.0.0.1:3000/api/health | head -c 500
```
You should see JSON with status fields (model, version, etc.). Note: this is `GET /api/health` on the web port (default 3000). A separate sidecar `GET /health` exists only when `CHUMP_HEALTH_PORT` is set (typically with Discord); do not confuse the two.
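To script against the health JSON, you can pull out a single field without needing jq. The field name `model` is taken from the example above and may differ in your build:

```shell
# Extract one field from a JSON document using python3 (avoids a jq dependency).
health_field() {
  # $1 = JSON document, $2 = field name
  printf '%s' "$1" | python3 -c '
import json, sys
doc = json.load(sys.stdin)
print(doc.get(sys.argv[1], "unknown"))
' "$2"
}

# Live usage (web server must be running):
# health_field "$(curl -s http://127.0.0.1:3000/api/health)" model
```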
Open the UI: http://127.0.0.1:3000 — use the PWA chat if the model is up.
6. Optional: CLI one-shot (no browser)
```
./run-local.sh -- --chump "Reply in one sentence: what is 2+2?"
```
Expect a short model reply on stdout. Uses the same Ollama env defaults as `run-local.sh` (and strips a stray `--` before `cargo run` so `--check-config` / `--chump` are parsed correctly).
Latency: the first `--chump` run after Ollama starts may take minutes on a 14B model (load into GPU/RAM). A second run with the same model is usually much faster but may still be tens of seconds on 14B Apple Silicon depending on load and keep-alive. If warm runs stay very slow, treat it as a performance follow-up (model size, `OLLAMA_KEEP_ALIVE`, MLX/vLLM profile, etc.).
7. Optional: Discord
Requires a real bot token and intents — DISCORD_CONFIG.md, ./scripts/check-discord-preflight.sh, then ./run-discord-ollama.sh or ./run-discord.sh.
Advanced (defer until golden path works)
| Topic | Doc |
|---|---|
| vLLM-MLX on port 8000 | INFERENCE_PROFILES.md, STEADY_RUN.md |
| Brain wiki + memory_brain | CHUMP_BRAIN.md |
| Fleet / Mabel / Pixel | FLEET_ROLES.md, OPERATIONS.md |
| Provider cascade + privacy | PROVIDER_CASCADE.md |
| Tool approval / risk | TOOL_APPROVAL.md |
| Disk / archives | STORAGE_AND_ARCHIVE.md |
Troubleshooting (common)
| Symptom | Check |
|---|---|
| `connection refused` on chat | Ollama running? `curl -s http://127.0.0.1:11434/api/tags` |
| Web serves blank or 404 static | `CHUMP_HOME` / repo root so `web/` exists; see run-web.sh |
| `cargo` errors | `rustc --version`; run `rustup update` |
| Config warnings on stderr | Expected if Discord/brain/tavily unset; see config_validation.rs |
Next: autonomy and fleet
After §5–6 succeed, the natural progressions are:
- Task API: Try `POST /api/tasks` to create a task and watch it process in the next heartbeat round. See WEB_API_REFERENCE.md for the full API surface.
- Discord: Add the Discord bot for ambient interaction — set `DISCORD_TOKEN` and run `./run-discord.sh`. See DISCORD_CONFIG.md.
- Fleet / Mabel: For multi-node operation (Mac + Pixel), see FLEET_ROLES.md and the "Keeping the stack running" section in OPERATIONS.md.
Automated smoke (CI / maintainers)
From repo root (does not start Ollama or the web server):
```
./scripts/verify-external-golden-path.sh
```
Runs cargo build and checks that golden-path files exist. Used in GitHub Actions after cargo test.
Timing regression
To record how long cargo build (and optionally GET /api/health) take for cold-start tracking:
```
./scripts/golden-path-timing.sh
GOLDEN_TIMING_HIT_HEALTH=1 ./scripts/golden-path-timing.sh  # web must already be up
```
Logs append to `logs/golden-path-timing-YYYY-MM-DD.jsonl`. If `cargo build` exceeds `GOLDEN_MAX_CARGO_BUILD_SEC` (default 900), the script exits 1.
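To eyeball the trend, you can summarize a timing log from the shell. The field name `cargo_build_sec` is an assumption about the JSONL schema — adjust to whatever the script actually writes:

```shell
# Print the slowest cargo build recorded in a JSONL timing log.
# Assumes one JSON object per line with a numeric "cargo_build_sec" field.
max_build_sec() {
  python3 -c '
import json, sys
vals = [json.loads(line).get("cargo_build_sec", 0)
        for line in sys.stdin if line.strip()]
print(max(vals) if vals else 0)
' < "$1"
}

# max_build_sec logs/golden-path-timing-2026-01-30.jsonl
```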
CI: GitHub Actions runs this after verify-external-golden-path.sh with GOLDEN_MAX_CARGO_BUILD_SEC=1800 and uploads logs/golden-path-timing-*.jsonl as a workflow artifact (see .github/workflows/ci.yml).
Related
- OPERATIONS.md — run modes, env vars, heartbeats, roles
- INFERENCE_PROFILES.md — Ollama, vLLM-MLX, mistral.rs configuration
- DISCORD_CONFIG.md — Discord bot setup
- CHUMP_PROJECT_BRIEF.md — project focus, conventions, and agent guidance