AI TRIED TO RUN A COMPANY… AND FAILED SPECTACULARLY
A team of researchers built a fake tech company, staffed entirely by AI workers from Google, OpenAI, Meta, and Anthropic.
The result? Total chaos.
The best AI agent barely finished 24% of its tasks, needed nearly 30 steps, and cost $6 per assignment just to get simple jobs done.
Google’s AI needed 40 steps to complete anything and still got it right only 11% of the time.
Amazon’s AI barely managed 1.7% success – making it the worst “employee” ever hired.
The bots made up fake coworkers, got lost trying to find files, and couldn’t handle basic tasks like writing performance reviews without crashing.
For now, AI isn’t stealing your job – it can’t even survive a normal day at work without causing a disaster.
Source: Hiring Squad
🚨 AI TRIED TO RUN A COMPANY… AND FAILED SPECTACULARLY
A team of researchers built a fake tech company, staffed entirely by AI workers from Google, OpenAI, Meta, and Anthropic.
The result? Total chaos.
The best AI agent barely finished 24% of its tasks, needed nearly 30… pic.twitter.com/zSY5ZA6oFo
— Mario Nawfal (@MarioNawfal) April 27, 2025
AI agents from Google, OpenAI, Meta, Anthropic, and Amazon struggled with adapting to new situations, linking short tasks into a coherent strategy, and handling errors—key signs that true workplace autonomy is still out of reach.
Coordination broke down fast: agents invented…
— Alva (@AlvaApp) April 27, 2025