Washington Gains a Window Into Big Tech's Most Powerful AI
Google DeepMind, Microsoft, and Elon Musk's xAI have signed agreements with the US government granting federal evaluators access to their AI models before public release. The Commerce Department's Center for AI Standards and Innovation announced the partnerships, which represent the most substantive AI oversight mechanism to emerge from Washington to date.
Under the agreements, CAISI evaluators can assess frontier AI systems before deployment, testing for hacking capabilities, potential for military misuse, and unexpected or dangerous behaviors. The framework builds on earlier agreements with OpenAI and Anthropic from 2024, extending pre-deployment review to cover virtually all major frontier AI developers operating in the United States.
Why the Trump Administration Is Pursuing AI Oversight
The move is notable given the Trump administration's generally deregulatory stance toward technology. The CAISI agreements signal a more nuanced position: targeted oversight focused on national security implications, even while resisting broader regulatory frameworks. The national security framing is key — a powerful AI model capable of assisting with cyberattacks on critical infrastructure represents a national security risk regardless of whether its developer is domestic or foreign.
What CAISI Tests For
Assessments cover cybersecurity (can models generate sophisticated malware or identify vulnerabilities in critical infrastructure?), military misuse (could models accelerate development of biological, chemical, or radiological weapons?), and behavioral testing for unexpected capabilities. The evaluations are not a regulatory gate — CAISI cannot block a release. But the agreements create a reputational mechanism: releasing a model found to have serious risks by CAISI would be politically and commercially damaging.
Anthropic's Project Glasswing
In parallel, Anthropic launched Project Glasswing, giving select organizations — including AWS, Apple, Cisco, Google, JPMorgan Chase, and Microsoft — early access to Claude Mythos Preview to find and fix critical software vulnerabilities before they can be exploited. Together, the CAISI agreements and Project Glasswing sketch the outlines of an emerging AI governance ecosystem blending government oversight with industry self-policing.
What Comes Next
The CAISI framework is likely to expand. The UK's AI Safety Institute and Japan's AI governance bodies have expressed interest in coordinating pre-deployment evaluations with the US framework. Congressional momentum for formal AI legislation has picked up in the wake of high-profile AI security incidents in 2026. The question is no longer whether AI oversight will exist, but what form it will ultimately take.