This NEW AI AGENT is INSANE! 🤯
Ornith 1.0, a new open-source AI agent from Deep Reinforce, has achieved a score of 82.4 on SWE-Bench, surpassing Claude Opus 4.7. The model introduces self-scaffolding reinforcement learning, allowing it to build its own problem-solving framework without human-built instructions, and is available in four versions ranging from 9B to 397B parameters.
Summary
A new open-source AI coding agent called Ornith 1.0 has been released by Deep Reinforce under an MIT license. According to the transcript, it has achieved a verified SWE-Bench score of 82.4, beating Claude Opus 4.7's performance. The most significant innovation highlighted is its self-scaffolding capability—unlike most existing AI coding agents that require human-built scaffolds (instructions for tool usage, failure retry logic, and memory organization), Ornith 1.0 generates its own scaffold dynamically as it solves problems, with its process evolving in real-time.
The model is available in four different sizes to accommodate various computational resources. The 9B parameter version is lightweight enough to run on a laptop, the 35B version operates on hardware that most development teams already possess, and the 397B flagship model is the version achieving the benchmark-beating performance. All versions are released as free, open-source software under the MIT license, allowing users to deploy, fine-tune, and build commercial products on top of the technology without licensing restrictions.
Key Insights
- Ornith 1.0 achieves a SWE-Bench score of 82.4, surpassing Claude Opus 4.7's performance
- Ornith 1.0 introduces self-scaffolding reinforcement learning, enabling the model to build its own scaffold while solving problems rather than relying on human-built instructions
- Most AI coding agents require human-built scaffolds that specify how to use tools, handle retry failures, and organize memory
- Ornith 1.0 is available in four versions: a 9B model for laptops, a 35B model for existing team hardware, and a 397B flagship model
- Ornith 1.0 is released as free, MIT-licensed open-source software that users can deploy, fine-tune, and commercialize
Topics
Transcript
[0:00] This new AI agent is insane. A free open-source AI just beat Claude Opus 4.7. Ornith 1.0 just dropped from Deep Reinforce. Free open-source, MIT licensed, on SWE-Bench verified. It scores 82.4, beating Claude Opus 4.7. And it does something no other open-source coding AI has done before. Most AI coding agents need a human-built scaffold, instructions that tell the model how to use tools, retry failures, and organize memory. Ornith 1.0 builds its own scaffold while solving your problem. Process evolves as it works. [0:31] That's called self-scaffolding reinforcement learning. Four versions available. The 9B runs on a laptop. The 35B runs on hardware most teams already own. The 397B flagship is the one beating Claude. All free,…
Full transcript available for MurmurCast members
Sign Up to AccessMore from Julian Goldie SEO
Gamma Just Got Better With ChatGPT
Gamma, an AI design tool used by nearly 100 million people, is now integrated into ChatGPT as a native app, allowing users to create professional presentations, documents, and web pages without leaving the chat. The integration enables users to transform rough notes, training documents, and ideas into polished decks by simply conversing with ChatGPT, which handles the writing while Gamma handles the design.
GLM 5.2 + Claude Code is INSANE!
The speaker demonstrates how to integrate GLM 5.2 into Claude Code using Ollama to create a cost-effective alternative AI development setup. This system combines Claude Code's agentic capabilities with GLM 5.2's brain, syncs with Obsidian for memory management, and enables building apps, games, and websites while maintaining a fraction of the cost of standard Claude subscriptions.
Agent OS: Obsidian + Multi-Agent Orchestration + GLM 5.2!
This video answers community questions about building an Agent Operating System (Agent OS) that combines multiple AI agents like Claude and Hermes with tools like Obsidian for memory management. The speaker covers practical implementations including multi-agent orchestration, video/animation agents, lead generation automation, and GLM 5.2 model optimization.
Hermes Agent Kanban Swarms are INSANE!
Hermes Agent released a major update enabling concurrent multi-agent Kanban operations without freezing. The update allows teams of agents to work in parallel on tasks across multiple boards, exemplified through a content factory that auto-deploys SEO articles to websites.
Hermes Agent Is INSANE (FREE!)
This video addresses common FAQs about Hermes, a free open-source AI agent, covering how to manage multiple tasks, how it compares to Claude Co-work, and strategies for reducing API token costs. The presenter demonstrates features like the Kanban board, multiple agent profiles, memory systems via Obsidian, and OAuth-based model integrations. The content is aimed at users of the 'AR Profit Boardroom' community.