”Those claiming we’re mere months away from AI agents replacing most programmers” should think again
AI agents will change the world. But not this year.
“This isn't the first time we've seen outcomes that suggest some of the ambitious ideas about AI agents directly replacing developers are pretty far from reality. There have been numerous studies already showing that even though an AI tool can sometimes create an application that seems acceptable to the user for a narrow task, the models tend to produce code laden with bugs and security vulnerabilities, and they aren't generally capable of fixing those problems…. most researchers agree it remains likely that the best outcome is an agent that saves a human developer a substantial amount of time, not one that can do everything they can do.”
“those claiming we're mere months away from AI agents replacing most programmers should adjust their expectations because models aren't good enough at the debugging part, and debugging occupies most of a developer's time”
These quotes are from a new essay at arstechnica, looking at a new Microsoft study on troubles in getting AI to debug with reliability confirming one of the core claims in my recent critique of Hard Fork’s Kevin Roose on vibe coding: debugging is hard, and a big part of what coders do, and not about to be replaced.
Another recent quote that is quite relevant is from Sir Demis Hassabis, on agents in general, but applicable to fantasies about vibe coding agents, too, “If your Al model has a 1% error rate and you plan over 5,000 steps, that 1% compounds like compound interest.”
The only way we are going to get past this kind of 80:20, sometimes it works, sometimes it doesn’t kind of AI is to change the paradigm.
Gary Marcus is sorry to have to repeat himself. But the big change we need still hasn’t come.


We used to call indeterminate behavior bugs and paid people large sums of money to both fix them and attempt to build systems without them.
Now, they are trying to convince us indeterminate behavior is simply the new way of doing things. Just vibe code it. This is ludicrous.
I would respectfully submit that the problem is more basic than you describe.
I think the problem isn't that AI agents can't debug, the problem is that they can't read code and understand what it means.
The next step, comparing the meaning of the code to some statement or notion of intent (the spec) is beyond them because the first step is beyond them.
But boy, they can sure cosplay reading for meaning by doing pattern recognition on variable names, quoted strings, and comments.