Issue #3 — The Last Engineer

⚡ Vibe Coding

Claude Blog 🛠 agentic tools

Put Claude to work on your computer

Claude can now operate your computer autonomously through its computer use capability, carrying out complex workflows by interacting directly with your desktop environment and applications.

4 min read

🧠 Capabilities & Alignment

Anthropic Research 🧪 agent research

Long-running Claude for scientific computing

Anthropic demonstrates Claude's ability to run autonomous scientific computing tasks over extended periods. This research explores how agents can maintain context and execute complex computational workflows without human intervention.

6 min read

Jack Clark (Import AI) 🧪 agent research

LLMs training other LLMs autonomously

PostTrainBench research shows LLMs can autonomously refine other LLMs for new tasks. This capability represents a significant step toward self-improving AI systems that can enhance their own performance without human supervision.

8 min read

Jack Clark (Import AI) 🧪 agent research

ByteDance's CUDA-writing agent for AI R&D

ByteDance has developed an autonomous agent that writes CUDA code for AI research and development. The agent represents a major advance in AI-driven software engineering.

7 min read

Jack Clark (Import AI) 🧪 agent research

Testing AIs with generated games and agent ecologies

Researchers are using procedurally generated games and multi-agent ecological simulations to test AI capabilities. This approach provides novel benchmarks for evaluating autonomous agent performance in complex, dynamic environments.

9 min read