Blog

28 articles on AI, productivity, and software engineering

Series

AI-Augmented Development (5) | Browser Tools for AI Agents (4) | Model Context Protocol (MCP) Series (4) | Token Saving (1) | The Complete Software Engineer's Productivity Stack (1)

The Token Optimisation Playbook

The full playbook for cheap, sharp agent work. Get observability in first (statusline, burndown, OTEL), compress what reaches the model (rtk, headroom), route who does the work (premium delegates, cheap executes, advisor escalation, model-per-task), then work the long tail: caveman, the right browser validator, scoped scans, fewer MCPs, handover, and a memory system. The capstone of the Token Saving series.

July 6, 2026|20 min read|Token Saving

token optimisation model routing cost aware agents claude code opus sonnet haiku context compression rtk headroom agent delegation ai coding agent cost

Opus vs GPT on Real Ops, Part 2: One Drove, One Was Driven

Opus 4.8 and GPT-5.5 investigate the same anonymous signup failure. Zero human nudges versus three, and a root cause one character wide. Summary post with the full interactive side-by-side linked.

June 4, 2026|3 min read

opus 4.8 gpt 5.5 claude code codex cli ai incident response autonomous ops zero touch ops posthog session replay rrweb decoding human in the loop agent autonomy production incident supabase auth password reset debugging gmail dot trick

The Underappreciation and Rebirth of Warp

Why I keep coming back to Warp, why the tech nerds gave it a bad rep, and why open-sourcing it has just made it the best agentic-era terminal on the market. A walk through Warp Drive, blocks, the new vertical tabs, agent primitives and the notification inbox.

May 11, 2026|15 min read

Opus vs GPT on Real Ops: Same Brain Food, Different Brains

Opus 4.7, GPT-5.5 and Hermes go head-to-head on a real shotclubhouse incident. Same prompt, same knowledge graph, same MCPs. Causal vs predictive incident response, why the model is the variable, not the harness, and what zero-touch ops actually needs.

May 7, 2026|10 min read

opus 4.7 gpt 5.5 claude code codex cli hermes agent ai incident response autonomous ops zero touch ops sre causal analysis predictive analysis multi agent systems agentic engineering ai ops comparison model selection production incident

"Use Claude Code for FREE" is a Trap

Why free AI coding via Nvidia NIM and OpenRouter is a trap. The Cheap-Intelligent-Fast trilemma, 40 RPM rate limits, Opus 4.7 vs GPT-5.5 vs MiniMax M2.7 benchmarks, and why your first AI coding experience should not be the free one.

April 26, 2026|18 min read|AI-Augmented Development

claude code free nvidia nim ai coding agent comparison 2026 opus 4.7 vs gpt 5.5 free ai coding tools nvidia nim rate limit 429 ai model trilemma cheap intelligent fast minimax m2.7 review coding agent benchmarks agentic

Your Coding Agent's Best Feature Isn't the Code

Why Claude Code beats Codex, Copilot, and every other coding agent in 2026. The developer experience of terminal AI coding agents matters more than the model. Statusline, /insights, hooks, and the features that make the 8th hour feel like the 1st.

April 19, 2026|15 min read|AI-Augmented Development

claude code ai coding agent developer experience codex copilot terminal coding agent devx ai coding tools 2026

Browser Tools for AI Agents Part 1: Playwright, Puppeteer, and Why Your Agent Picked Playwright

Playwright for AI agents explained 2026. Why Playwright beat Puppeteer for browser automation, how accessibility trees slash token costs, dev-browser for coding agents, Patchright and Scrapling for anti-bot bypass.

April 2, 2026|29 min read|Browser Tools for AI Agents

playwright ai agents browser automation ai playwright vs puppeteer ai browser tools dev-browser patchright stealth scrapling anti-bot accessibility tree ai 2026

Browser Tools for AI Agents Part 2: The Framework Wars (browser-use, Stagehand, Skyvern)

AI browser frameworks compared 2026. browser-use vs Stagehand vs Skyvern: DOM-first vs vision-first architecture, LLM token costs per step, caching strategies, and the expect testing tool for coding agent validation loops.

April 2, 2026|15 min read|Browser Tools for AI Agents

browser-use ai stagehand browser skyvern automation ai browser framework comparison dom vs vision ai ai agent browser tools expect testing llm browser automation 2026

Browser Tools for AI Agents Part 3: Managed Infrastructure and When DIY Stops Making Sense

Managed browser infrastructure for AI agents 2026. Firecrawl vs Browserbase vs Steel vs Bright Data vs Browserless pricing and features compared. Self-hosted vs managed cost analysis and when DIY stops making sense.

April 2, 2026|16 min read|Browser Tools for AI Agents

firecrawl ai browserbase steel browser bright data scraping browserless pricing managed browser infrastructure ai web scraping saas headless chrome cloud 2026

Browser Tools for AI Agents Part 4: Skip the Browser, Save 80% on Tokens

Save 80% on LLM tokens with content extraction 2026. markdown.new, Jina Reader, Trafilatura compared. Why feeding raw HTML to AI agents wastes tokens and how HTML-to-markdown conversion fixes your context window budget.

April 2, 2026|14 min read|Browser Tools for AI Agents

llm token optimization markdown.new cloudflare jina reader ai trafilatura extraction html to markdown ai content extraction reduce token costs web scraping for llm 2026

2-2 Factor for AI Agents: Multi-Agent Reliability

Human reliability maths applied to AI agents. Consensus protocols, identity checks, and why your agent swarm needs the same safeguards humans built centuries ago.

March 25, 2026|9 min read

ai agents security architecture reliability

The Death of MCP: Context Rot, Token Waste, and Why Class Files Win

Model Context Protocol was supposed to be the USB-C of AI integrations. Instead it is eating 50% of your context window before your agent even starts working.

December 15, 2025|13 min read

ai mcp agents architecture developer-tools

MCP Security Risks: Tool Poisoning, Shadowing Attacks and How AI Gets Exploited

MCP security vulnerabilities explained 2025. Hands-on demos of tool poisoning, cross-server shadowing attacks, token theft, and data exfiltration via Model Context Protocol. Practical defences and security best practices for AI agents.

May 19, 2025|20 min read|Model Context Protocol (MCP) Series

mcp security model context protocol vulnerabilities ai agent security risks tool poisoning attack mcp shadowing ai data exfiltration mcp best practices ai cybersecurity 2025

2025's Best AI Coding Tools: Real Cost, Geeky Value & Honest Comparison

2025 AI coding tools pricing compared. Gemini 2.5 Pro free tier, GitHub Copilot, Cursor, Claude Code, Aider, Roocode, Cline costs and value. Honest review with comparison table for developers on a budget.

May 16, 2025|35 min read|AI-Augmented Development

ai coding tools pricing 2025 github copilot cost cursor pricing claude code review gemini ai free aider cli roocode cline vs copilot ai developer tools comparison

Build Your Own MCP Prompt Server: A Dev-Centric Registry with STDIO

Build your own MCP prompt server tutorial 2025. Step-by-step TypeScript guide to a layered prompt registry with STDIO transport, Zod validation, file-based storage. Works with Claude Desktop, Cursor, and any MCP client.

May 13, 2025|14 min read|Model Context Protocol (MCP) Series

mcp prompt server build mcp server tutorial model context protocol typescript mcp stdio transport zod validation prompt management claude desktop mcp cursor mcp server 2025

MCP Architecture Explained: STDIO, SSE Transport and What Makes It Tick

MCP architecture deep dive 2025. How Model Context Protocol works under the hood: function calling, JSON-RPC, STDIO vs SSE transport, streamable HTTP, OAuth 2.1 auth, and the enterprise security gaps nobody talks about.

May 5, 2025|20 min read|Model Context Protocol (MCP) Series

mcp architecture model context protocol transport mcp stdio vs sse mcp oauth json-rpc ai mcp function calling streamable http mcp mcp enterprise security 2025

Introduction to Model Context Protocol (MCP): The USB-C of AI Integrations

Model Context Protocol (MCP) explained for developers 2025. How Anthropic MCP standardises AI tool integrations, replaces LangChain connector chaos with M+N simplicity. MCP servers, clients, and the USB-C analogy.

May 5, 2025|8 min read|Model Context Protocol (MCP) Series

model context protocol mcp server mcp explained anthropic mcp ai tool integration mcp vs langchain ai standards 2025 mcp architecture

Finding the Best AI Coding Assistant: From Pure Vibe to Practical Power

Best AI coding assistant comparison 2025. GitHub Copilot vs Cursor vs Claude vs ChatGPT for developers. Performance benchmarks, pricing, and how to pick the right AI pair programmer for your workflow.

April 29, 2025|11 min read|AI-Augmented Development

ai coding assistant github copilot cursor ai claude code chatgpt coding ai pair programming best ai tools for developers 2025

Software Engineer Productivity Stack: Desktop, Obsidian, AI Tools

Build your productivity stack as a software engineer. Desktop engineering, Obsidian for devs, AI coding assistants, and MCP tooling.

April 29, 2025|6 min read|The Complete Software Engineer's Productivity Stack

productivity developer ai workflow

Beyond the Hype: What Truly Makes an AI a Great Coding Partner?

What makes an AI coding assistant actually good in 2025. Beyond HumanEval benchmarks to real-world capabilities: architecture understanding, debugging, security awareness, code generation vs code comprehension.

April 29, 2025|11 min read|AI-Augmented Development

ai coding benchmarks ai code generation ai debugging tools humaneval swebench ai architecture understanding llm coding capabilities 2025

Tyranny of Small Decisions: AI Agents and Codebase Drift

Alfred Kahn warned that rational small decisions aggregate into irrational outcomes. Now AI agents make thousands of those decisions daily in your codebase.

March 5, 2025|9 min read

ai agents architecture engineering-culture economics

Make Your Own Luck: Probability, Chaos Theory and Fortune

Why lucky people are better at setting initial conditions. Probability, chaos theory, and quantum mechanics explain how to manufacture your own luck.

February 20, 2025|4 min read

philosophy science productivity mindset

Entropy in Software Engineering: Why Everything You Build Rots

Why your code decays, your plans fall apart, and your kid's room is always a tip. Entropy through physics, life, and software engineering.

January 15, 2025|4 min read

entropy software engineering philosophy productivity

War Heroes vs The Meticulous Engineer

Why orgs promote firefighters and ignore the people who prevent fires. Time preference, effort paradox, and the cultural rot that follows.

October 1, 2024|7 min read

leadership engineering-culture organisations management

The 2-2 Factor: Why Two Pairs of Eyes Is a Statistical Necessity

The maths behind why no single human should act alone on anything that matters. Error rates, independent review, and why four eyes beat two every time.

January 15, 2023|7 min read

engineering reliability process security

Manage Dev Secrets and Dotenv Files with Bitwarden CLI

Use Bitwarden CLI as a password manager for developer secrets, dotenv files, and local environment variables. Full shell setup included.

July 26, 2022|7 min read

bitwarden environment passwordmanager dotenv

How to Calculate Composite Availability SLA for Your Cloud Stack

Step-by-step guide to calculating composite SLA availability for Azure, AWS, or any multi-region cloud stack using probability maths.

July 6, 2022|7 min read

sla azure cloud reliability

GitHub Actions OIDC with Terraform and Azure

Set up passwordless GitHub Actions using OIDC federated credentials with Terraform on Azure. Full working example with GitOps vending machine.

July 4, 2022|10 min read

azure github terraform tutorial