NewsTechnical

OpenClaw 4.24: New AI Voice + Browser Updates

Julian Goldie SEO

OpenClaw 4.24 introduces Google Meet integration, allowing AI agents to join meetings with full tool access, alongside upgraded voice call capabilities via an 'agent console' feature. DeepSeek V4 Flash and Pro are now built-in, with Flash set as the default model for cost-effective use. Browser automation, channel integrations (Telegram, Slack, WhatsApp), and memory systems also received significant improvements.

Summary

OpenClaw 4.24 is a major release featuring over 139 contributors — the largest contributor count on a single release according to the presenter. The update spans meeting integration, voice upgrades, new AI models, browser automation improvements, channel fixes, and memory system enhancements.

The headline feature is Google Meet integration. AI agents can now join Google Meet calls via Chrome or Twilio, bringing their full toolset and context into live meetings. Agents can export recordings, transcripts, smart notes, and attendance lists as clean files, and can scan meeting history. A recovery system handles disconnects and permission blockers gracefully, reporting issues rather than failing silently.

Voice call functionality received a major upgrade through a new 'agent console' feature. Previously, agents on phone calls ran a stripped-down version of themselves without access to full tools or knowledge. Now, mid-call, the agent can consult the full OpenClaw system to retrieve data, check files, or run tools before responding. A new Gemini Live voice provider joins OpenAI Realtime and Twilio as options, and a 'smoke test' command lets users verify their Twilio setup before going live with real customers.

DeepSeek V4 Flash and V4 Pro are now natively integrated. V4 Flash is set as the default DeepSeek model, prioritizing speed and low cost for new users. A bug where switching to a DeepSeek V4 model mid-conversation broke history replay due to missing expected data has been fixed. The presenter suggested a model selection framework: DeepSeek V4 Flash for speed and cost, GPT-5.5 for power, and Claude for personality.

Browser automation saw three notable upgrades: coordinate-based clicking (pixel-level precision for unreliable websites), an increased default timeout of 60 seconds for slow-loading pages with customizable per-profile timeouts, and improved tab recovery that automatically clears stale browser locks after crashes. A new 'browser doctor' diagnostic command identifies specific issues like missing Chrome installs or bad profile configurations.

Channel-specific fixes addressed Telegram's phantom error messages and markdown formatting bugs, Slack's out-of-order message delivery and thread leakage issues, and WhatsApp's new voice note transcription capability (converting audio messages to text before passing them to the agent) and improved media delivery from tools.

On the memory side, search results now show the breakdown between text-based and meaning-based (semantic) matches, helping users tune agent memory accuracy. The local embedding system no longer requires a heavy AI package by default, speeding up startup. The memory dreaming system was decoupled from the heartbeat system, and a bug where heartbeat prompts leaked into normal conversations — causing agents to respond with silent acknowledgements instead of actual replies — was resolved. MCP connections now properly shut down after use, idle sessions have timeouts, and the conversation compaction system was fixed to rebuild summaries from original conversation content rather than stacking summaries on top of each other.

Key Insights

  • The presenter argues that the new 'agent console' fundamentally changes voice call capability — previously agents on phone calls ran a lighter, tool-limited version of themselves, but now they can pause mid-call to query the full OpenClaw agent with all tools and context before responding.
  • The presenter notes that DeepSeek V4 Flash is now the default model for new DeepSeek setups, framing this as a deliberate accessibility decision so users learning the platform aren't defaulted into expensive API calls.
  • WhatsApp voice note transcription is described by the presenter as 'huge for customer-facing WhatsApp agents' because people commonly send voice messages, and the agent previously had no way to understand raw audio — now it receives a text version automatically.
  • The presenter identifies a compaction flaw that was silently degrading agent memory: the summary system was building summaries on top of old summaries like a 'game of telephone,' causing increasing distortion — OpenClaw 4.24 fixes this by regenerating summaries from the actual original conversation.
  • The presenter highlights that OpenClaw 4.24 had over 139 contributors, calling it the largest contributor count he has seen on a single release, and uses this to frame the project as a rapidly accelerating open community effort.

Topics

Google Meet AI agent integrationVoice call agent console upgradeDeepSeek V4 Flash and Pro integrationBrowser automation improvementsChannel fixes for Telegram, Slack, and WhatsAppMemory and session system improvements

Full transcript available for MurmurCast members

Sign Up to Access

Get AI summaries like this delivered to your inbox daily

Get AI summaries delivered to your inbox

MurmurCast summarizes your YouTube channels, podcasts, and newsletters into one daily email digest.