AI Security Blog | Brooks McMillin

This blog is where I work out, in public, what it actually takes to ship AI systems that don't betray the people relying on them. The focus is narrow on purpose: agentic systems, prompt injection, MCP and tool-use security, OAuth in the LLM era, and the engineering patterns that keep autonomous code safe enough to delegate real work to.

Posts lean toward concrete trade-offs and reproducible findings rather than threat-of-the-week commentary. You'll find research write-ups with code where the experiment is reproducible, infrastructure deep-dives where I walk through a design and what I'd change in hindsight, and field notes from running real defenses against real attacks. Use the category filter below to narrow in on a specific thread — or browse chronologically if you want the full arc. New writing usually lands every few weeks, and you can subscribe at the bottom of the page if you'd like it in your inbox rather than chasing the RSS feed.

A Coding Agent Read a File That Didn't Exist Five Times, Then Blamed the Tools

June 3, 2026 | 10 min read

A Claude Code session confabulated a nonexistent Python file, persisted through five truthful "does not exist" errors, then diagnosed the problem as corrupted tool output. This post covers a reconstruction from the raw transcript, a corpus scan across 3,001 sessions testing whether the failure is worse in Opus 4.8, and a model-independent mitigation.

#ai-security #agents #claude-code #llm-failure-modes

Wiring capability warrants into autonomous agents

May 21, 2026 | 14 min read

Why OAuth scopes aren't enough for autonomous LLM agents calling MCP tools, and how we wired Tenuo capability warrants end-to-end. Scope-gated rollout, two real bugs, multi-hop delegation, and an attack the warrant catches.

#tenuo #mcp #security #agents #oauth #capabilities

Poisoning the Safety Net: Attacking AI Code Review Pipelines

May 19, 2026 | 24 min read

Four months after writing about defense in depth for LLM-assisted development, I went back and tried to attack every layer of my own stack. The obvious attacks are caught by 2026 models. The class isn't closed; the cover stories got better.

#security #AI #LLM #code-review #prompt-injection #ci-cd #agents-md #MCP

mcp-authflow: OAuth 2.0 for Production MCP Servers

April 30, 2026 | 12 min read

Open-sourcing mcp-authflow and mcp-authflow-resource: an RFC-compliant OAuth 2.0 framework for MCP servers, plus a one-command example server. Why MCP deployments need real auth, what the two packages do, and three non-obvious gotchas from production.

#mcp #oauth #security #open-source #starlette #python

The MCP stdio Problem: Why I Rebuilt My Auth Proxy as a Persistent HTTP Service

April 9, 2026 | 6 min read

Claude Code silently kills stdio MCP servers during idle periods, forcing manual reconnection. How I converted a fragile stdio bridge into a persistent Starlette HTTP reverse proxy — and the obscure SDK crash that followed.

#mcp #claude-code #oauth #starlette #systemd #devtools

Building Secure Agentic Systems: The Six Layers

March 24, 2026 | 19 min read

Six layers of security architecture for running LLM agents as daily drivers, with production stats and companion code for every design decision.

#security #AI #agents #MCP #prompt-injection #SSRF #observability

A Beginner's Guide to Safe LLM-Assisted Development

March 11, 2026 | 20 min read

A complete beginner's guide to setting up every safety layer from the "Coding Safer with LLMs" post, starting from scratch: pre-commit hooks, local review agents, CI workflows, and CLAUDE.md.

#security #AI #LLM #ci-cd #pre-commit #code-review #claude-code #tutorial

Does Your System Prompt Actually Stop Prompt Injection? We Tested 10,000 Times to Find Out

February 26, 2026 | 13 min read

An empirical study of 10,080 prompt injection attempts across 8 models, 6 defense strategies, and 7 attack types. The results challenge common assumptions about prompt-level defenses.

#security #AI #LLM #prompt-injection #ai-security #benchmark

Defense in Depth for AI-Assisted Development: Pre-commit Hooks, Review Agents, and CI That Catch LLM Mistakes

January 28, 2026 | 14 min read

Practical strategies for safer AI-assisted development: automated review agents, layered security checks, and context management that prevents catastrophic mistakes.

#security #AI #LLM #ci-cd #pre-commit #code-review #MCP
