Baiting AI [LIVE]
Matthew Berman hosts a casual live stream covering several AI industry topics including app store scams targeting his mother, the jagged nature of AI intelligence, Anthropic's controversial user policies, Meta's employee keystroke monitoring program, and a new mystery model called Owl Alpha. The stream also features live AI model testing and discussion of content creator economics on YouTube vs. X.
Summary
The stream opens with extended technical difficulties involving Matthew's newly recabled XLR microphone setup, which was accidentally set to phantom power despite the mic not requiring it. After resolving the audio issues, Matthew transitions into the main content.
Matthew shares a story about his mother being scammed by a counterfeit ChatGPT app on the Apple App Store. When searching for ChatGPT on the App Store, the first several results are deliberately designed lookalike apps with nearly identical logos, some charging around $40/year while likely serving free-tier API access. Matthew argues Apple bears responsibility for allowing these deceptive apps and demonstrates live how none of the top search results are the actual ChatGPT app. He notes his mother had purchased two fake AI apps before he intervened.
Matthew previews an upcoming video breaking down Andrej Karpathy's talk at Sequoia's AI Ascent event, focusing on the 'jagged intelligence' phenomenon, in which AI excels at some tasks yet fails surprisingly at simple ones. He explains this stems from two factors: the verifiability of certain domains (like coding, where you can test outputs and get clear feedback) and the revenue incentives that drive AI labs to optimize for coding and math. He demonstrates live testing across GPT-5.3, GPT-5.5 thinking mode, and Gemini on a classic logic puzzle about whether to walk or drive to a car wash 50 meters away, showing that non-thinking models fail while thinking models and Gemini succeed.
The stream then covers controversy around Anthropic, citing content creator Theo's viral open letter criticizing the company. Key criticisms include opaque and arbitrary token quota manipulation, preventing subscribers from using tokens in third-party tools like OpenClaw, and a cult-like corporate culture where employees reportedly fear being fired for a single bad tweet. Matthew also notes Anthropic's strong anti-open-source lobbying stance. A data analyst named Powell challenged Theo's claim that his anti-Anthropic content costs him money, showing that anti-Anthropic posts get 2.8x more views on X. Matthew and Theo both counter that X view counts are heavily inflated and do not reflect meaningful conversion compared to YouTube.
Matthew briefly tests a new mystery model called 'Owl Alpha' on OpenRouter, described as a high-performance model for agentic workloads compatible with Claude Code and OpenClaw. The model initially fails the car wash logic puzzle but succeeds after being prompted to reason as a logical expert.
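The prompt comparison from the live tests can be sketched in a few lines. This is a minimal illustration, not what was run on stream: the puzzle wording is a hypothetical paraphrase (the summary describes the puzzle but does not quote it), `build_prompt` is a made-up helper, and no model is actually called. Only the persona sentence comes from the stream.

```python
# Hypothetical paraphrase of the puzzle; the stream's exact wording is not quoted in the summary.
PUZZLE = (
    "The car wash is 50 meters away and I want to get my car washed. "
    "Should I walk or drive there?"
)

# Persona sentence cited in the stream.
PERSONA = "You are an expert in logical thinking."


def build_prompt(question, persona=None):
    """Return the question, optionally prefaced with a persona sentence."""
    if persona:
        return f"{persona} {question}"
    return question


# The two variants compared on stream: plain question vs. persona-prefaced question.
baseline = build_prompt(PUZZLE)
with_persona = build_prompt(PUZZLE, persona=PERSONA)

for label, prompt in (("baseline", baseline), ("with persona", with_persona)):
    print(f"{label}: {prompt}")
```

Sending each string to the same model and comparing the answers reproduces the shape of the test; which models pass depends on the model and mode, as described above.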
The final segment covers Meta CEO Mark Zuckerberg's announcement at a company-wide meeting that Meta would install a monitoring tool called the 'Model Capability Initiative' to track employee keystrokes and mouse movements to train AI models. Zuckerberg argued Meta employees have higher average intelligence than typical data labeling contractors, making their activity more valuable training data. Matthew notes this practice of employee monitoring already exists at many companies for productivity and security reasons, but the AI training application is new. He speculates this could spread to other AI labs and eventually to non-tech industries if successful.
About this episode
Download The 25 OpenClaw Use Cases eBook 👇🏼 https://bit.ly/4aBQwo1
Download The Subtle Art of Not Being Replaced 👇🏼 http://bit.ly/3WLNzdV
Download Humanities Last Prompt Engineering Guide 👇🏼 https://bit.ly/4kFhajz
Join My Newsletter for Regular AI Updates 👇🏼 https://forwardfuture.ai
Discover The Best AI Tools 👇🏼 https://tools.forwardfuture.ai
My Links 🔗
👉🏻 X: https://x.com/matthewberman
👉🏻 Forward Future X: https://x.com/forwardfuture
👉🏻 Instagram: https://www.instagram.com/matthewberman_ai
👉🏻 TikTok: https://www.tiktok.com/@matthewberman_ai
👉🏻 Spotify: https://open.spotify.com/show/6dBxDwxtHl1hpqHhfoXmy8
Media/Sponsorship Inquiries ✅ https://bit.ly/44TC45V
Key Insights
- Matthew argues that Apple bears direct responsibility for allowing deliberately deceptive AI app clones on the App Store, noting all top ChatGPT search results are counterfeit apps designed to confuse non-technical users.
- Matthew explains the jagged nature of AI intelligence stems primarily from two factors: the verifiability of domains like coding (short feedback loops) and revenue incentives that push labs to optimize for coding and math over creative tasks.
- Matthew demonstrated live that GPT-5.3 in instant (non-thinking) mode fails the car wash logic puzzle but can be corrected by prefacing with 'you are an expert in logical thinking,' which he argues is a pointless workaround that should be unnecessary.
- Matthew argues that Anthropic's practice of segmenting tokens by product (e.g., Claude Design having separate quotas from other models) is arbitrary and frustrating since they are technically the same tokens the user paid for.
- Matthew contends that Theo's criticism of Anthropic appears genuine because Theo has stated it costs him sponsors and money, and Matthew finds Theo to be a consistently genuine person in his interactions.
- Matthew characterizes Anthropic's CEO Dario as 'completely AGI-pilled' rather than explicitly disrespecting engineers, arguing Anthropic's singular focus on AGI drives their seemingly user-hostile decisions.
- Matthew observes that unlike other AI labs which have experienced significant founder and employee departures, all of Anthropic's original seven founders are reportedly still at the company, which contributes to its cult-like perception.
- Matthew argues that X view counts are heavily gamed and inflated — a view counts even if the post merely appears on screen without being clicked — making X metrics far less meaningful for sponsor conversion compared to YouTube.
- Matthew describes the information flow hierarchy for AI ideas as: X first, then YouTube, then Instagram, then TikTok, then Facebook, then LinkedIn weeks later, positioning X as the place where influential people use each other as a sounding board for ideas before they reach broader audiences.
- Matthew argues that employees at big tech companies, Meta included, are already subject to significant computer monitoring for security and productivity reasons, so keystroke logging for AI training is not as novel as it might seem.
- Matthew speculates that if Meta's employee monitoring for AI training succeeds without significant pushback, other AI labs and eventually non-tech companies may be approached by third-party data aggregators to monetize employee activity data.
- Matthew notes that Anthropic is the least open-source AI company he has seen and is actively lobbying against open-source AI, contrasting it with OpenAI, which at least open-sourced its Codex evaluation harness.
Transcript
[3:39] All right. Hello. Can you all hear me? All right, welcome. Welcome.
[4:12] We're going to do a casual today. I have just a few things I wanted to talk about. Hey everybody. Uh, tell me where you're from. Drop it in the comments or the message, the replies. And, uh, yeah. All right.
[4:48] We have Thailand, the Netherlands, Bosnia, Tunisia, California. Brian, you're from America. All right.
[5:21] SoCal. Ryan. SoCal. Do you have a Discord community? Yes. Yes, we do. Brian Fu uh Forward Future Brian will drop a link there and you all should join. We just uh revamped it. Thanks to Brian. Um it was unwieldy and he brought it into focus and there's…
More from Matthew Berman
What is an AI Agent?
This video explains what an AI agent is, describing it as an AI system equipped with tools, memory, and a harness that enables it to perform real-world tasks autonomously. It covers how agents can collaborate, learn over time, and how popular AI systems like ChatGPT, Claude, and Gemini have effectively become agents.
Reacting to "Why AI is so smart but also so dumb?"
Andrej Karpathy discusses the evolution of AI from Software 1.0 to Software 3.0, explaining why LLMs excel in verifiable domains like code and math while struggling in others. He introduces the concepts of vibe coding versus agentic engineering, and argues that the entire internet needs to be rebuilt with agents in mind.
Deepseek is a Problem
The video argues that US open-source AI is effectively doomed due to a broken business model, while China exploits government subsidies to dominate the space. The speaker warns that if US enterprises adopt Chinese open-source models, China could gain dangerous influence over AI standards, chip manufacturing, and even cultural narratives. Several potential solutions are proposed, including federal grants, Nvidia's role as a potential savior, and vertical industry-specific models.
Worst AI Reddit Take
A YouTuber reacts to a viral anti-AI Reddit post about a parent banning their 9-year-old from using Google AI, agreeing with concerns about sycophancy and children's mental health while pushing back on the environmental impact argument. The creator shares his own approach of supervised AI use for his children, emphasizing education about hallucinations and AI's non-human nature. A sponsored segment on Med-OS and a guest expert discussion on AI's environmental trade-offs are also included.