Hutfin AI

Server checking…

—

Server lifecycle runs on your Mac. Loading a model holds it in RAM so replies are fast.

Pull a model

Downloads from the Ollama registry onto your Mac. Big models take a while.

Installed models —

Loading…

MODEL SETTINGS

Defaults are Ollama's. Recommended is NVIDIA's guidance for Nemotron — what this panel uses. Leave a custom field blank to fall back to the Ollama default.

SETTING	DEFAULT	RECOMMENDED	YOUR VALUE

System prompt — shared by all models (sets the assistant's role/persona)

Saved in this browser. The system prompt also ships as the built-in default, so every device starts with it.

AVAILABILITY SCHEDULE

enabled

Keeps this model loaded during the window via a cron job on your Mac. Runs even when this app is closed. Times use your Mac's local clock.

Available from to

No schedule set for this model.

Connect from anywhere

All traffic flows: your client → this Netlify edge function → Cloudflare tunnel → Ollama on your Mac. The access key is checked at the edge; OLLAMA_ORIGIN and the key never reach the browser bundle.

API base

Auth headerx-app-key: ••••••••

List models

Chat (streaming)

Server Fleet Macs serving models, reached via reverse tunnels

Loading…

Add a server

Sessions Every user session — logged to S3 (location fills in once CloudFront fronts it)

Loading…

Model Console Fixed commands only — list · ps · load · unload · pull · run

Type "help" and press Enter.

Hutfin AI

Chat Direct to your local model — each send is a curl on your Mac

Server checking…

Pull a model

Installed models —

MODEL SETTINGS

AVAILABILITY SCHEDULE

Connect from anywhere

Server Fleet Macs serving models, reached via reverse tunnels

Add a server

Sessions Every user session — logged to S3 (location fills in once CloudFront fronts it)

Model Console Fixed commands only — list · ps · load · unload · pull · run