THE LAST ENGINEER
Issue #3March 24, 20265 stories

โšก Vibe Coding

Claude Blog๐Ÿ›  agentic tools

Put Claude to work on your computer

Claude can now autonomously operate your computer through its computer use capability. This enables Claude to perform complex workflows by directly interacting with your desktop environment and applications.

schedule 4 min read

๐Ÿง  Capabilities & Alignment

Anthropic Research๐Ÿงช agent research

Long-running Claude for scientific computing

Anthropic demonstrates Claude's ability to run autonomous scientific computing tasks over extended periods. This research explores how agents can maintain context and execute complex computational workflows without human intervention.

schedule 6 min read
Jack Clark (Import AI)๐Ÿงช agent research

LLMs training other LLMs autonomously

PostTrainBench research shows LLMs can autonomously refine other LLMs for new tasks. This capability represents a significant step toward self-improving AI systems that can enhance their own performance without human supervision.

schedule 8 min read
Jack Clark (Import AI)๐Ÿงช agent research

ByteDance's CUDA-writing agent for AI R&D

ByteDance has developed an autonomous agent that can write CUDA code for AI research and development. This represents a major advancement in AI-driven software engineering capabilities.

schedule 7 min read
Jack Clark (Import AI)๐Ÿงช agent research

Testing AIs with generated games and agent ecologies

Researchers are using procedurally generated games and multi-agent ecological simulations to test AI capabilities. This approach provides novel benchmarks for evaluating autonomous agent performance in complex, dynamic environments.

schedule 9 min read