Chat Direct to your local model — each send is a curl on your Mac
Server checking…
Server lifecycle runs on your Mac. Loading a model holds it in RAM so replies are fast.
Pull a model
Downloads from the Ollama registry onto your Mac. Big models take a while.
Installed models —
MODEL SETTINGS
Defaults are Ollama's. Recommended is NVIDIA's guidance for Nemotron — what this panel uses. Leave a custom field blank to fall back to the Ollama default.
| SETTING | DEFAULT | RECOMMENDED | YOUR VALUE |
|---|
Saved in this browser. The system prompt also ships as the built-in default, so every device starts with it.
AVAILABILITY SCHEDULE
Keeps this model loaded during the window via a cron job on your Mac. Runs even when this app is closed. Times use your Mac's local clock.
No schedule set for this model.
Connect from anywhere
All traffic flows: your client → this Netlify edge function → Cloudflare tunnel → Ollama on your Mac. The access key is checked at the edge; OLLAMA_ORIGIN and the key never reach the browser bundle.
x-app-key: ••••••••List models Chat (streaming)
Server Fleet Macs serving models, reached via reverse tunnels
Add a server
Sessions Every user session — logged to S3 (location fills in once CloudFront fronts it)
Model Console Fixed commands only — list · ps · load · unload · pull · run
Type "help" and press Enter.