Codemand for AI tools sparks industry pushback
In February 2026, researchers at the AI lab METR uncovered a startling dependency within the software development community: developers are no longer willing to work without AI assistance. When the team attempted to replicate a 2025 study comparing manual coding versus AI-assisted productivity, they could not recruit participants. Developers refused to perform tasks without AI, even for the duration of a limited research experiment. While developers self-reported in a May survey that AI made them twice as valuable to their organizations, emerging evidence suggests this perception may be misleading and could lead to significant long-term issues. The trend of 2026, known as tokenmaxxing, involved using the volume of AI tokens as a proxy for productivity. However, this metric has proven flawed. Amazon recently dismantled its internal Kirorank leaderboard after employees manipulated it by excessively using AI agents, inflating costs without improving output. Similarly, Uber reported depleting its entire 2026 AI budget within the first four months, with COO Andrew Macdonald acknowledging that this spending yielded no measurable increase in project delivery or productivity. Beyond cost inefficiencies, the quality of AI-generated code remains a major concern. Although AI accelerates code generation, it often shifts the burden to later stages of the development lifecycle. Programmer and author James Shore argues that writing code twice as fast is futile if maintenance costs are not halved, warning that teams are trading temporary speed for permanent technical debt. Data supports this view; Aiswarya Sankar, CEO of Entelligence AI, noted that 44% of company tokens are now spent on fixing bugs introduced by AI. Additionally, Code Rabbit analyzed open source pull requests and found that AI produces 1.7 times more problems than human-written code. Independent research from Singapore Management University in April reinforced these findings, warning that AI-generated code can introduce long-term maintenance challenges into real-world projects. Proposed solutions vary, but consensus suggests that total automation is not yet viable. Scott Wu, CEO of Cognition, which developed the AI agent Devin, admits that while his tool can work independently, its skill level currently ranges between a junior and mid-level programmer depending on the task. He does not recommend a hands-off approach. Instead, Singapore Management University researchers advocate for a hybrid strategy where developers deeply understand the specific strengths and limitations of AI. They emphasize the need for robust quality assurance systems tailored to AI outputs, urging humans to review AI code with the same scrutiny applied to junior developers. Furthermore, experts agree that critical responsibilities such as software architecture and security design must remain firmly in human hands. As the industry grapples with this new reality, the focus is shifting from blindly adopting AI to managing its integration responsibly to avoid costly pitfalls.
