Can it easily run as a server process in the background? To me, not having to load the LLM into memory for every single interaction is a big win of Ollama.
I wouldn't consider that a given at all, but apparently there is indeed `llama-server`, which looks promising!
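For what it's worth, once the server is up, the interaction model is exactly the thing you're after: one long-lived process holds the model, and each client call is just an HTTP round trip. Here's a minimal sketch of a client call, assuming the OpenAI-compatible chat endpoint that `llama-server` exposes and its default port of 8080 (adjust if you start it with different flags):

```python
import json
import urllib.request

# Assumed base URL: llama-server binds to port 8080 by default,
# but change this to match however you actually launch it.
BASE_URL = "http://127.0.0.1:8080"

def ask(prompt: str) -> str:
    # The server keeps the model resident, so each call here is just an
    # HTTP round trip rather than a fresh model load.
    payload = {"messages": [{"role": "user", "content": prompt}]}
    req = urllib.request.Request(
        f"{BASE_URL}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]

if __name__ == "__main__":
    print(ask("Say hello in one short sentence."))
```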
Then the only thing that's missing seems to be a canonical way for clients to instantiate that, ideally in some OS-native way (systemd, launchd, etc.), and a canonical port that they can connect to.
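Rolling your own isn't hard, at least on the systemd side. Here's a rough sketch of what a unit could look like; the binary path, model path, and the dedicated `llama` user are placeholders you'd have to adapt to your install (`-m`, `--host`, and `--port` are the basic llama-server flags):

```ini
# /etc/systemd/system/llama-server.service -- sketch only; paths, port,
# and the service user are placeholders.
[Unit]
Description=llama.cpp server (llama-server)
After=network.target

[Service]
# One long-lived process that keeps the model loaded in memory.
ExecStart=/usr/local/bin/llama-server \
    -m /var/lib/llama/models/model.gguf \
    --host 127.0.0.1 \
    --port 8080
Restart=on-failure
# Run as an unprivileged user rather than root.
User=llama
Group=llama

[Install]
WantedBy=multi-user.target
```

After a `systemctl daemon-reload` and `systemctl enable --now llama-server`, every client has a fixed 127.0.0.1:8080 to talk to. The "canonical" part is really just everyone agreeing on the same port, which is the piece that's genuinely missing.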
The original llama.cpp does that too. And you won't have to deal with mislabeled models and insane defaults out of the box.