George Hotz—the hacker who first cracked the iPhone at age 17 and reverse-engineered the PlayStation 3 before Sony sued him for it—published a blog post Sunday arguing that mass adoption of AI coding agents will end in disaster, or at least close to it.
“I’m calling it now, the adoption of AI agents into software development will be one of the most costly mistakes in the field’s history,” Hotz wrote. “Agents cannot program, and it’s taking longer and longer to realize that they can’t.”
“The output is broken, but in a way that’s getting harder and harder to detect. Which is exactly what you’d expect from an increasingly accurate statistical model.”
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
It never quite does.
Not about egoHotz anticipates the obvious pushback: a programmer who defines part of his identity through his craft would naturally resist tools that threaten to replace him. He takes the objection seriously and dismisses it on the merits.
So, his concern isn't about being replaced. It's about what happens to code quality when everyone is using these tools at once, especially when Big Tech and Wall Street are constantly pushing for the mass use of these tools.
“I almost think this is some kind of psyop to sell agents,” Hotz argues. “Fear of loss is one of the only ways to make big companies move. Though I think in that fear they are making a big mistake.”
His central argument is organizational. High performers have tight enough feedback loops to catch agent-generated problems before they ship. They read the code, spot the errors, and calibrate when to trust the tool. "The bottom performers won't have that self check," he writes—and they're the ones using agents to produce 10 times their previous output. At a large company, that math produces something specific: faster degradation of average code quality, masked by sheer volume.
The outcome, he argues, will be "a golden era for buckets and buckets of slop, and a dark age for gems of quality." As a concrete example, he points to reports that Apple is pushing AI coding tools across its entire engineering organization, then asks simply: "Do you think macOS will get better or worse in the next 2 years?"
Where the camps standHotz now places himself in what he calls the "LeCun/Marcus camp"—referring to Yann LeCun, Meta's chief AI scientist, and Gary Marcus, a longtime LLM skeptic. Both have argued that language models are fundamentally sophisticated pattern-matchers: They can imitate the distribution of existing code, but can't reason through genuinely new problems from first principles.
The pushback to Hotz's position isn't abstract. Karpathy, who had been skeptical of agents earlier in 2025, reversed his position after new model releases and joined Anthropic's pre-training team on May 19—five days before Hotz published. He described the next few years at the frontier as "especially formative."
Anthropic CEO Dario Amodei said in Davos that some Anthropic engineers have already stopped writing code themselves, letting models handle it while they review the output. Hotz, for his part, says he tried to do the same thing and found himself reaching for the manual fix every time.



















