#agents | Brooks McMillin

A Coding Agent Read a File That Didn't Exist Five Times, Then Blamed the Tools

June 3, 2026 10 min read

A Claude Code session confabulated a nonexistent Python file, persisted against five truthful "does not exist" errors, then self-diagnosed as corrupted tool output. A reconstruction from the raw transcript, a corpus scan across 3,001 sessions on whether the failure is worse in Opus 4.8, and a model-independent mitigation.

#ai-security #agents #claude-code #llm-failure-modes

Read article →

Wiring capability warrants into autonomous agents

May 21, 2026 14 min read

Why OAuth scopes aren't enough for autonomous LLM agents calling MCP tools, and how we wired Tenuo capability warrants end-to-end. Scope-gated rollout, two real bugs, multi-hop delegation, and an attack the warrant catches.

#tenuo #mcp #security #agents #oauth #capabilities

Read article →

Building Secure Agentic Systems: The Six Layers

March 24, 2026 19 min read

Six layers of security architecture for running LLM agents as daily drivers — every design decision with production stats and companion code.

#security #AI #agents #MCP #prompt-injection #SSRF #observability

Read article →