On the desk, not in the cloud

5/28/2026
0 Comments

A few months ago I had an idea for a small tool — nothing fancy, just a script to summarize my notes at the end of each week. My first thought was to wire it up to a cloud model's API. Then I caught myself and asked: why?

The idea was mine, the notes were personal, the volume was tiny. I didn't need a model that knows quantum physics, Chinese poetry, and the accounting of the third-largest German company. I needed something that could read short text and return a sensible summary. Just that.

That's how I came back to Ollama. One command pulls a model, another runs it locally. A REST API on localhost — the same kind you've written against a thousand times. For a backend person it's quietly satisfying: you treat the model like any other service in your stack, except it never leaves the machine.

That solves several small problems at once. Personal data stays personal — it never travels to anyone's server. There's no bill growing with every iteration, so you experiment freely. No rate limits to remind you that your idea isn't somebody else's priority. And no internet as a precondition — the model works on a train, in a café with bad Wi-Fi, on a long flight.

Not every idea deserves the biggest model. Some just want a little attention.

There are limits, of course. A small local model isn't GPT-5 — it won't write you a market report on semiconductors. But for text classification, short summaries, simple transformations, generating test data, or a personal helper that reads your own notes — it's often plenty. Sometimes more than plenty.

For real products, the cloud is still the obvious answer for plenty of reasons. But for the small ideas, I'd never even start if I had to open an account somewhere and watch a meter tick — Ollama changed something. The threshold for experimenting dropped to almost zero. And when the threshold drops, things start to happen.

On the desk, not in the cloud

Comments (0)

Leave a Comment