Progressive Subagents: Score the PR Before You Spawn Eight Agents
Subagents are token guzzlers. Eight in parallel on a PR feels clever and bills like a freelance crew. The fix is a signalling layer that decides how many to spawn and in what order. Part of the token-saving series.
The Underappreciation and Rebirth of Warp
Why I keep coming back to Warp, why the tech nerds gave it bad rep, and why open-sourcing it has just made it the best agentic-era terminal on the market. A walk through Warp Drive, blocks, the new vertical tabs, agent primitives and the notification inbox.
Opus vs GPT on Real Ops: Same Brain Food, Different Brains
Opus 4.7, GPT-5.5 and Hermes go head-to-head on a real shotclubhouse incident. Same prompt, same knowledge graph, same MCPs. Causal vs predictive incident response, why the model is the variable not the harness, and what zero-touch ops actually needs.
"Use Claude Code for FREE" is a Trap
Why free AI coding via Nvidia NIM and OpenRouter is a trap. The Cheap-Intelligent-Fast trilemma, 40 RPM rate limits, Opus 4.7 vs GPT-5.5 vs MiniMax M2.7 benchmarks, and why your first AI coding experience should not be the free one.
Your Coding Agent's Best Feature Isn't the Code
Why Claude Code beats Codex, Copilot, and every other coding agent in 2026. The developer experience of terminal AI coding agents matters more than the model. Statusline, /insights, hooks, and the features that make the 8th hour feel like the 1st.
Claude Is Setting Up Hermes Which Is Setting Up NanoClaw and Nobody Is Writing Features
Anthropic Pulled the Plug on Third-Party Harnesses. Here's What I'm Running Now.
Token Optimisation 101: Stop Burning Money on AI Coding Agents
How to stop getting rate-limited after an hour on Claude Code, Codex, or Copilot. Context window mechanics, the /effort command, model routing, kicking new conversations, and the silent token drains most people miss.
The AI Design Stack: Three Skills and a Workflow That Stops the Slop
Three Claude Code skills for AI-generated UI that doesn't look like slop. UI/UX Pro Max for styles, Impeccable for anti-patterns, Google Stitch DESIGN.md for design tokens. Full workflow for brand-consistent agent output.
Anthropic Just Killed Third-Party Harnesses and I'm Properly Gutted
Can Your Agents Speak Caveman? (And Should They?)
AI Alignment Just Got a Psychological Dimension and It's Properly Unsettling
Gemma 4 Is Running On Phones Now and I Don't Think People Realise How Mental That Is
expect-cli: The Validate Step My Agent Loop Was Missing
expect-cli reads your git diff, generates a test plan via AI, and executes it in a real browser with Playwright. Extracts cookies from your local Chrome/Firefox for authenticated testing. The validate step for agent loops.
Browser Tools for AI Agents Part 1: Playwright, Puppeteer, and Why Your Agent Picked Playwright
Playwright for AI agents explained 2026. Why Playwright beat Puppeteer for browser automation, how accessibility trees slash token costs, dev-browser for coding agents, Patchright and Scrapling for anti-bot bypass.
Browser Tools for AI Agents Part 2: The Framework Wars (browser-use, Stagehand, Skyvern)
AI browser frameworks compared 2026. browser-use vs Stagehand vs Skyvern: DOM-first vs vision-first architecture, LLM token costs per step, caching strategies, and the expect testing tool for coding agent validation loops.
Browser Tools for AI Agents Part 3: Managed Infrastructure and When DIY Stops Making Sense
Managed browser infrastructure for AI agents 2026. Firecrawl vs Browserbase vs Steel vs Bright Data vs Browserless pricing and features compared. Self-hosted vs managed cost analysis and when DIY stops making sense.
Browser Tools for AI Agents Part 4: Skip the Browser, Save 80% on Tokens
Save 80% on LLM tokens with content extraction 2026. markdown.new, Jina Reader, Trafilatura compared. Why feeding raw HTML to AI agents wastes tokens and how HTML-to-markdown conversion fixes your context window budget.
qmd + nano-graphrag: You Do Not Need Pinecone for This
qmd is a CLI semantic search engine for markdown by Tobias Lutke (Shopify founder). Hybrid BM25 + vector + LLM reranking. Local SQLite index, no cloud, no Docker. Pair with nano-graphrag for local graph RAG.
Google Stitch: AI-Native UI Design That Actually Understands Your Design System
Google Stitch is an AI-native design tool from Google Labs that generates high-fidelity UI from prompts and exports real code. DESIGN.md encodes your design system as portable markdown any AI coding tool can read.
Your Agents Are Talking Behind Your Back
Claude Code Source Code Leaked: 512K Lines of TypeScript and What Actually Matters
2-2 Factor for AI Agents: Multi-Agent Reliability
Human reliability maths applied to AI agents. Consensus protocols, identity checks, and why your agent swarm needs the same safeguards humans built centuries ago.
PinchTab: 12MB Binary That Replaces Playwright for AI Agents
PinchTab is a 12MB Go binary that gives AI agents browser control via REST API. Serves the Accessibility Tree instead of raw HTML, cutting tokens by up to 90%. Stable element refs, stealth mode, zero Node/Python dependencies.