Local LLM - rf.blog

// Local LLM

Ollama Local LLM Mac Studio Production AI Self-Hosted Privacy llama qwen Apple Silicon AI Infrastructure

Ollama in Production: Running 70B Locally

Mac Studio M4 Pro with 48GB unified memory runs llama3.3:70b for reasoning tasks. Real latency numbers, model selection logic, and where local inference actually beats cloud.

Rene Fichtmueller / 2026-05-22 / ~2 min read min read

AI Software Engineering Local LLM Build in Public LLM CostEfficiency RapidDevelopment

What I Built in 30 Days With My Local LLM Stack

In 30 days, I built over 20 production projects using my local AI model stack, achieving what would have taken months with traditional methods.

Rene Fichtmueller / 2026-04-10 / ~2 min read min read