Writing

Notes from the workbench

AI, local LLMs, cloud architecture, and developer tools — benchmarked, built, and broken so you don't have to.

LATEST Apr 5, 2026

Gemma 4 26B Won't Fit on My 24GB MacBook — Until I Did This

Ollama gives 2 tok/s with broken tool calling. I got 49 tok/s with perfect tool calling using Unsloth Q3_K_XL + llama.cpp. Then I built a Claude Code clone on top of it.

gemma 4, llama.cpp, unsloth, local ai, apple silicon

Apr 3, 2026

I Tested Every Gemma 4 Model Locally on My MacBook — What Actually Works

Audio ASR in 3 languages, image understanding, full-stack app generation, coding, and agentic behavior -- all on a MacBook M4 Pro 24GB.

gemma 4, local llm, macbook

Jan 11, 2026

The Ralph Wiggum Technique: Autonomous AI Development with Claude Code

Learn how to use the Ralph Wiggum technique for autonomous AI-powered coding. Install the Ralph plugin for Claude Code and let your AI write code while you sleep.

claude code, ai coding, autonomous development

Mar 2, 2025

7 Best AI Coding Tools Compared: Windsurf, Cursor, Bolt.new, Cline, Roo Cline, GitHub Copilot, and Replit

A comprehensive comparison of Windsurf, Cursor, Bolt.new, Cline, Roo Cline, GitHub Copilot, and Replit. Learn which AI coding tool best fits your needs in 2025.

ai coding tools, cursor, github copilot