AI Agents Still Cannot Track Context — And Criminals Are Already Exploiting That
Microsoft's DELEGATE-52 benchmark proves frontier models corrupt documents beyond 20 interactions. One week later, Google confirmed criminals used AI for a real zero-day exploit. The two findings describe the same gap from opposite ends.
ai-agentssecuritydelegationzero-dayllmenterprise-aithreat-intelligence