Anthropic launches Claude Science and a new model
Major AI companies released significant updates including Anthropic's Claude Science workbench and Claude Sonnet 5, Base44's proprietary Base1 model, X's new MCP server integration, and Google's Gemini Nano and Omni Flash models with improved pricing and capabilities.
Summary
Anthropic launched Claude Science, a specialized workbench for researchers connecting 60 scientific databases with pre-built tools for genomics, protein structures, and chemistry. Running on Claude Opus 4.8, it offers $30,000 in free credits to 50 academic projects. The platform enables multi-agent architectures with parallel subassistants for sequence analysis, structural prediction, and fact-checking while maintaining full reproducibility. Anthropic is betting that workflow optimization will drive researcher adoption over raw model capacity.
Base44, owned by Wix and generating $100 million in annual recurring revenue, launched Base1, their proprietary LLM trained on tens of millions of real user interactions. By leveraging user conversation data from their platform, they can fine-tune the model to predict what developers need when building websites and apps, allowing them to cut costs, reduce reliance on third-party APIs, and maintain exclusive training data. This positions Base44 similarly to Cursor's strategy of intercepting code interactions to build proprietary models.
X launched a hosted MCP server opening its API to Claude, Cursor, and Grok, reversing their earlier API restrictions. The MCP provides search, post retrieval, user lookup, and trend analysis capabilities with pricing at $0.015 per published post and $0.20 per post with links. This allows regular users and developers to integrate X data into their AI workflows for applications like sentiment analysis and content curation.
Anthropic released Claude Sonnet 5 at $2 per million input tokens (promotional pricing through August 31st, then $3), undercutting Claude Opus 4.8. Despite being positioned as a lower-tier model, Sonnet 5 performs slightly better than Opus 4.8 on knowledge work benchmarks and shows fewer mid-task stalls in multi-step workflows, suggesting cost-per-task efficiency is now more important than raw reasoning capacity.
Google shipped Nano Banana 2 Light, replacing the original with improved speed and cost optimization while maintaining prompt compatibility, and Gemini Omni Flash, a video generation model at $0.10 per second of output with conversational editing and multimodal input capabilities. Both models will be integrated across nine Google products including AI Studio, Search, Photos, and Google Ads.
About this episode
<span>Claude Sonnet 5 hits 63% agentic coding at $2/M tokens; Google ships Nano Banana 2.<br /><br /><br /></span> <span><div><b>Show Links</b></div><ul><li><p><span>Get the top 80+ AI Models for $8.99 at AI Box: </span><span><a href="https://aibox.ai"></a><a href="https://aibox.ai/builder">https://aibox.ai</a></span><a href="https://aibox.ai/builder">/builder</a></p></li><li><p><span>How I Grow and Scale My Business with AI: </span><a href="https://www.skool.com/aihustle"><span>https://www.skool.com/aihustle</span></a></p></li><li><p><span>Get the AI Chat Daily Newsletter: </span><a href="https://www.aichatdaily.com/newsletter">https://www.aichatdaily.com/newsletter</a><br /></p></li></ul></span> See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
Key Insights
- Base44 is using tens of millions of user interactions from their platform to train Base1, allowing them to predict developer needs before they're explicitly requested and optimize costs by routing tasks to their proprietary model instead of third-party APIs.
- Claude Sonnet 5 slightly outperforms Claude Opus 4.8 on knowledge work benchmarks despite being positioned as a lower-tier model, indicating that cost-per-task efficiency has become more strategically important than raw reasoning capacity.
- Anthropic's inability to upgrade Claude Opus 4.8 has resulted in their newer, lower-tier Sonnet 5 model becoming objectively better on certain benchmarks, creating an unusual situation where the company's worse-positioned model is now superior to their previous flagship.
- X's shift from restricting API access to launching an open MCP server represents a strategic reversal, now allowing regular users and developers to integrate X data into Claude and other AI tools for applications like sentiment analysis and content discovery.
- Google's Gemini Omni Flash video generation at $0.10 per second represents a significant cost reduction in AI video generation, made possible by aggressive optimization, after OpenAI discontinued Sora due to prohibitive expenses.
Topics
Transcript
Anthropic is launching Claude Science. This is going to be a workbench for researchers. Base44 is launching their very own AI model. They're trying to protect their $100 million in annual recurring revenue for their vibe coding business. And X has just launched a hosted MCP server. So it's going to open its API to Claude, Cursor, Grok, Build, which if you know anything about the backstory of X and their API, this is quite quite a big deal. Anthropic is shipping Claude sonnet five, it's going to be $2 per million input tokens. And Google is shipping nano banana to light and Gemini Omni Flash to developers. So tons coming out from basically every top AI model today, we're…
Full transcript available for MurmurCast members
Sign Up to AccessMore from Hard Fork AI
Arena AI hits $100M run-rate in 8 Months
Arena AI has reached a $100M annual run rate in just 8 months after launching paid evaluations, competing with Scale AI and Merkle by selling analytics on model comparisons. The episode covers major AI industry developments including Palantir's adoption of NVIDIA's open-source Nemotron models for US government, proposed health data protection legislation, Flexion Robotics' autonomous humanoid robot software, and China's CXMT securing a $3B memory supply deal with Tencent.
Lovable HIts $500M in ARR, Apple Announces AI Siri
The podcast covers major AI and tech news including Lovable hitting $500M ARR, Apple's WWDC 2026 announcements featuring a redesigned Siri and natural language automation tools, and Perplexity's 2028 IPO target. The host also shares a personal story about rebuilding his meditation app 'Self Pause' using vibe coding tools, highlighting how AI has dramatically lowered the cost and complexity of app development.
The Impact of S&P Regulations on IPOs
This podcast episode covers the S&P 500's refusal to bend its rules for SpaceX's IPO inclusion, which will also affect future OpenAI and Anthropic IPOs. Additional topics include Google's $920M/month compute deal with SpaceX's XAI, enterprises massively overshooting AI token budgets, Airbnb CEO Brian Chesky launching an AI lab, and the unusual pattern of ~90 investors holding stakes in both OpenAI and Anthropic.
Funding the Future: Alphabet's $80 Billion Goal
This podcast episode covers major AI industry news including Alphabet's $80 billion stock raise to fund AI infrastructure, Trump's revised AI executive order, GitHub Copilot's controversial usage-based pricing backlash, Opal's pivot to AI hardware with OpenAI backing, and Uber capping employee AI spending after burning through annual budgets in four months.
The Fork in the Road: Anthropic’s IPO
This transcript covers a tech news roundup including Anthropic's confidential IPO filing at a $965 billion valuation, Microsoft's first proprietary reasoning model MAI Thinking One, NVIDIA's Cosmos 3 robotics AI model, Intel's comeback with their Crescent Island AI chip, Strava's new API paywall, and Windborne's AI weather forecasting model beating the ECMWF.