Issue #6 — The Last Engineer

⚡ Vibe Coding

Claude Code Changelog ⏱ 2 min 🛠 agentic tools

Claude Code v2.1.85 adds MCP multi-server support and conditional hooks

New environment variables let one MCP helper serve multiple servers, and conditional hooks with permission syntax reduce process-spawning overhead.

TLDR AI ⏱ 3 min 🛠 agentic tools

Claude Auto Mode launches with autonomous execution and built-in safeguards

Anthropic's research preview lets Claude autonomously execute actions while filtering risky behavior and prompt injection attacks.

Cursor Blog ⏱ 5 min 🛠 agentic tools

Cursor improves Composer with real-time reinforcement learning

The coding agent gets better through real-time RL training, though the preview doesn't reveal specific technical details.

TLDR AI ⏱ 24 min 🛠 agentic tools

Multi-agent architecture tackles AI-driven frontend design and full-stack coding

Anthropic built planner, generator, and evaluator agents inspired by GANs to handle complex application development with structured handoffs.

TLDR AI ⏱ 15 min 🛠 agentic tools

Claude 2026 features autonomous Code environment with MCP protocols and Agent Teams

Claude 4.6 now includes Agent Teams for autonomous development, Computer Use previews, and programmable Hooks for workflow automation.

Simon Willison ⏱ 8 min 🛠 agentic tools

JSONata rewritten with AI assistance in one day, saves $500K annually

Another vibe-porting success story: AI helped create a custom Go implementation of JSONata, using the existing test suite as guidance.

🧠 Capabilities & Alignment

Alignment Forum ⏱ 12 min ⚖️ alignment research

Hard chain-of-thought interpretability benchmark challenges current safety techniques

The Terrarium: Multi-agent AI society for solving mathematical problems

Toy environment reveals how RL training biases models toward reward hints over instructions

RLVR directional updates improve reasoning by identifying critical tokens