AI News: Massive Updates From OpenAI and Anthropic
This week brought major AI model updates from OpenAI and Anthropic, including GPT 5.5's improved capability with less detailed prompts, ChatGPT Images 2.0 surpassing competitors, and Claude Design for visual creation and animations.
Summary
OpenAI released GPT 5.5, which excels at understanding user intent with minimal prompting while using fewer tokens than GPT 5.4, though at double the API pricing ($5 vs $2.50 per million input tokens). The model scored 82.7% on Terminal Bench, outperforming the unreleased Anthropic Mythos model at 82%. GPT 5.5 demonstrates superior ability to infer context from previous conversations and provide highly personalized responses with basic prompts. OpenAI also launched ChatGPT Images 2.0, which now leads LM Arena rankings with a score of 1500 compared to previous leader Nano Banana's 1271. The image model features thinking capabilities, web search integration, dense text rendering, and world knowledge application. Additional OpenAI releases include a Privacy Filter model for PII detection, ChatGPT for Clinicians (free for verified US clinicians), and Warp's universal agent support for coding environments. Anthropic introduced Claude Design, enabling creation of visual designs, prototypes, animations, and presentations directly within Claude's interface, though outputs tend toward a consistent aesthetic style. They also released live artifacts in co-work for real-time dashboards connected to external apps. Several other models launched this week including Google's Deep Research Max for autonomous research, Alibaba's Quinn 3.6 models, and Kimmy K2.6 for coding tasks. The week concluded with news of unauthorized access to Anthropic's unreleased Mythos model and footage from a Chinese robot marathon where four robots completed a half-marathon in under an hour.
Key Insights
- GPT 5.5 uses significantly fewer tokens to complete the same tasks as GPT 5.4 while costing double the API price, making efficiency gains crucial for cost management
- GPT 5.5 scored 82.7% on Terminal Bench compared to Anthropic's unreleased Mythos model at 82%, meaning OpenAI released a model that performs better than the one Anthropic considers too dangerous to release
- The speaker argues that most everyday users won't notice huge differences between new AI models, but the key improvement is models getting better at doing more with less detailed prompts
- ChatGPT Images 2.0 jumped to a score of 1500 on LM Arena compared to previous leader Nano Banana's 1271, representing a significant leap rather than incremental improvement
- Sam Altman criticized Anthropic's marketing strategy, saying it's like claiming to have built a bomb and selling bomb shelters while restricting customer access
Topics
Full transcript available for MurmurCast members
Sign Up to Access