AI News: Microsoft Finally Reveals They're Plan!
The video covers major announcements from Microsoft Build and Nvidia's Computex event, including Microsoft's seven new in-house AI models, the Microsoft Scout autonomous agent, and Nvidia's RTX Spark chip. The host also recaps new image generation models, voice models, and updates from OpenAI, and includes an interview with Microsoft AI CEO Mustafa Suleyman about AI self-sufficiency, healthcare AI, and the philosophy behind building superintelligence.
Summary
The video opens with the host returning from Microsoft Build in San Francisco, where Microsoft announced seven new in-house AI models. These include a new reasoning model (MAI Thinking), a coding model (MAI Code One Flash) compared favorably to Claude Haiku 4.5, an image model (MAI Image 2.5) ranked second in image editing globally, a transcription model (MAI Transcribe 1.5) claimed to be the world's most accurate and five times faster than competitors, and a speech generation model (MAI Voice 2) supporting 15 languages.
The host conducted a sit-down interview with Microsoft AI CEO Mustafa Suleyman, who emphasized Microsoft's goal of AI self-sufficiency — reducing reliance on external providers like OpenAI — and highlighted their commitment to ethically licensed, clean training data. Suleyman also discussed Microsoft Scout, a new autonomous personal agent (powered by OpenClaw's open-source technology) that operates at the OS level across Windows, Teams, Outlook, OneDrive, and SharePoint.
Microsoft also unveiled a new GitHub Copilot app that mirrors the Codex IDE experience but allows users to choose from any available AI model provider — though the host found signups were temporarily paused at the time of recording. Project Solara was introduced as a platform for embedding AI agents in physical devices, including a desktop hub and a smart badge with a camera and microphone for workplace use. Microsoft also announced a collaboration with the Mayo Clinic to develop a frontier healthcare AI model, with Suleyman predicting 'medical superintelligence' within 2-3 years.
From Nvidia's Computex event, the host highlighted the RTX Spark chip — a combined GPU/CPU with up to 128GB of unified compute — enabling powerful local LLM inference and gaming performance. This chip is being integrated into the new Microsoft Surface Laptop Ultra, though pricing is expected to be high, potentially starting around $4,000.
The rapid-fire section covered: OpenAI's Codex gaining Windows computer-use support and new role-based plugins; Noose Research launching a desktop app for their Hermes agent; new image models including Ideogram 4.0 (open weights, strong text rendering), Reeve 2.0 (now ranked #2 in image generation arenas), Crea 2 Turbo (2-second generation speed), and Grok Imagine 1.5 (video with dialogue). Runway ALF 2.0 was briefly demoed for video editing. The host also highlighted Miso One, an open-source voice model claimed to be the most emotive in the world.
The video closes with extended interview footage of Mustafa Suleyman discussing humanist superintelligence, AI safety, the importance of subjecting AI to the test of whether it makes human lives healthier and happier, and advice for students entering the field — recommending software engineering, science, mathematics, and increasingly philosophy and ethics.
Key Insights
- Mustafa Suleyman states that Microsoft's goal with its new in-house models is 'true self-sufficiency in AI,' explicitly framing it as a move to reduce dependence on external providers like OpenAI and demonstrate the capacity to reach the absolute frontier independently.
- Suleyman claims Microsoft deliberately avoided open-source datasets when training its models, citing concerns about unknown security bugs and lack of rigor, and instead paid to license data carefully — positioning this as a trust-building measure for enterprise customers.
- MAI Transcribe 1.5 is described as the current world state-of-the-art in transcription accuracy and five times faster than competing models, standing largely alone on speed benchmarks according to Microsoft's own data presented at Build.
- Mustafa Suleyman predicts that within 2 to 3 years, AI will enable access to the absolute best healthcare in the world — quality currently only available at institutions like the Mayo Clinic — making it 'abundant and available to everybody' through a personal health assistant.
- Suleyman argues that the only valid test for superintelligent technology is a simple one: does it make collective human lives healthier and happier — the same standard applied to drugs and scientific innovation — and frames this as the core humanist motivation behind Microsoft AI's work.
Topics
Full transcript available for MurmurCast members
Sign Up to Access