What 48,000 AI Agent Trajectories Reveal

February 2025

I downloaded every publicly available trajectory from the SWE-bench Verified leaderboard (48,580 runs across 134 different AI agent systems) and extracted 40+ behavioral features from each one. Here is what the data shows.