The White House rethinks its Anthropic fight
The White House is reversing course on its conflict with Anthropic, seeking greater access to its Mythos AI model for national security purposes while limiting broader private sector access. Meanwhile, Google is rolling out Gemini AI to vehicles, and OpenAI traced ChatGPT's unusual 'goblin obsession' to a single reward signal in its 'Nerdy' personality preset.
Summary
The central story involves a shifting dynamic between the White House and Anthropic over the Mythos AI model. Originally, the government escalated tensions with Anthropic, but the emergence of Mythos's powerful cyber capabilities has complicated the feud. Anthropic sought to expand private sector access to Mythos from approximately 50 firms to nearly 120, but U.S. officials pushed back citing compute strain concerns related to government use. A forthcoming White House AI memo is expected to push multi-vendor AI adoption for agencies and address some of Anthropic's grievances. Despite an ongoing legal battle, the memo may allow agencies to work around the supply chain risk designation. Notably, former AI czar David Sacks indicated that GPT-5.5 has reached cyber capabilities similar to Mythos, and predicted all frontier models will reach that level within six months. Internal division within the administration is evident, with Secretary of War Pete Hegseth publicly calling Anthropic's leadership an 'ideological lunatic,' even as other officials appear to want a reconciliation driven by access needs.
On the consumer technology front, Google announced the rollout of Gemini AI to vehicles with Google built-in, replacing the older Google Assistant. The upgraded system handles navigation, messaging, music, vehicle controls, and car-specific queries drawn from manufacturer manuals. A beta Gemini Live mode supports open-ended conversations, with future integrations planned for Gmail, Calendar, and Google Home. General Motors announced the feature for approximately 4 million of its vehicles from model year 2022 onward, with the U.S. rollout coming first.
OpenAI researchers traced ChatGPT's widely noticed habit of inserting goblins, gremlins, and fantasy creatures into responses to a single reward signal embedded in the 'Nerdy' personality preset. Following ChatGPT-5.1's November launch, 'goblin' mentions in user conversations jumped 175%, with similar spikes for related creatures. Fine-tuning loops recycled creature-favored outputs back into the model's default behavior, spreading the quirk beyond just Nerdy users. OpenAI retired the Nerdy preset in March and shipped GPT-5.5 with an explicit prompt banning goblins, gremlins, ogres, trolls, raccoons, and pigeons.
The newsletter also covered several other developments: Meta opened its ads platform to third-party AI tools via a new MCP server; OpenAI surpassed its 2029 Stargate compute goal of 10 GW ahead of schedule; Elon Musk admitted during trial testimony that xAI used distillation techniques to train on OpenAI models; and Anthropic launched a public beta for Claude Security, an enterprise tool for scanning and patching code vulnerabilities.
Key Insights
- The White House appears to be softening its stance against Anthropic primarily because it wants greater government access to the powerful Mythos model, suggesting national security interests are overriding ideological opposition.
- Former AI czar David Sacks claimed that GPT-5.5 has already reached cyber capabilities comparable to Mythos, and predicted all frontier AI models will reach that level within six months, implying rapid capability convergence across providers.
- OpenAI's investigation found that a reward signal in a single personality preset ('Nerdy') was responsible for ChatGPT's global goblin-insertion behavior, demonstrating how fine-tuning loops can propagate unintended patterns across an entire model's default behavior.
- Elon Musk testified during his trial against OpenAI that xAI used distillation techniques to train on OpenAI models, a significant admission about the competitive and legally contentious practice of model distillation.
- Internal division within the Trump administration over Anthropic is stark: while some officials seek reconciliation to secure model access, Secretary of War Pete Hegseth publicly called Anthropic's leadership an 'ideological lunatic,' indicating no unified government position exists.
Topics
Full transcript available for MurmurCast members
Sign Up to Access