OpenAI's superapp hiding inside Codex
OpenAI released a major Codex update that transforms it from a coding agent into a comprehensive superapp, with features like background computer use, parallel agents, and an in-app browser. The newsletter also covers Anthropic's new Claude Opus 4.7 model and OpenAI's first domain-specific model for life sciences.
Summary
OpenAI announced a significant update to its Codex platform, evolving it from a simple coding agent into a comprehensive superapp that integrates ChatGPT, Atlas, and Codex functionality. The new features include background computer use that allows Codex to operate Mac apps independently, parallel agent capabilities, an Atlas-powered in-app browser, memory retention across sessions, and inline image generation. Codex has reached 3 million weekly users with 70% month-over-month growth, and OpenAI explicitly says it is 'building the super app out in the open.' The move positions OpenAI to compete directly with Anthropic's Claude Code and Cowork offerings.
Separately, Anthropic released Claude Opus 4.7, which outperforms GPT-5.4 and Gemini 3.1 Pro on coding benchmarks, scoring 64.3% on SWE-bench Pro versus its predecessor's 53.4%. It still trails Anthropic's unreleased Mythos Preview model, which scores 77.8%.
The newsletter also covers OpenAI's launch of GPT-Rosalind, its first domain-specific model for life sciences and drug discovery, which scored better than 95% of human scientists on RNA prediction tasks in blind tests. Additional updates include new AI tools like Windsurf 2.0 and HY-World 2.0, plus a practical guide for running AI models locally using Ollama.
Key Insights
- OpenAI is explicitly building a superapp 'out in the open' with Codex as the foundation, transforming it from a coding agent into a comprehensive platform with computer use, parallel agents, and integrated browsing capabilities
- Anthropic is running two parallel development tracks: a fast, two-month public release cadence for models like Opus 4.7, and a gated frontier line, Mythos, accessible only to exclusive partners
- OpenAI released two domain-specific models (GPT-5.4-Cyber and GPT-Rosalind) within three days, indicating a strategic shift toward purpose-built models for specific industries rather than relying solely on general-purpose flagship models
- GPT-Rosalind scored better than 95% of human scientists on RNA prediction tasks in blind testing by gene therapy lab Dyno Therapeutics, demonstrating significant capability in specialized scientific domains
- Codex achieved 3 million weekly users with 70% month-over-month growth, suggesting strong market adoption of AI-powered development tools