NewsTechnical

This NEW AI AGENT is INSANE! 🤯

Julian Goldie SEO

Ornith 1.0, a new open-source AI agent from Deep Reinforce, has achieved a score of 82.4 on SWE-Bench, surpassing Claude Opus 4.7. The model introduces self-scaffolding reinforcement learning, allowing it to build its own problem-solving framework without human-built instructions, and is available in four versions ranging from 9B to 397B parameters.

Summary

A new open-source AI coding agent called Ornith 1.0 has been released by Deep Reinforce under an MIT license. According to the transcript, it has achieved a verified SWE-Bench score of 82.4, beating Claude Opus 4.7's performance. The most significant innovation highlighted is its self-scaffolding capability—unlike most existing AI coding agents that require human-built scaffolds (instructions for tool usage, failure retry logic, and memory organization), Ornith 1.0 generates its own scaffold dynamically as it solves problems, with its process evolving in real-time.

The model is available in four different sizes to accommodate various computational resources. The 9B parameter version is lightweight enough to run on a laptop, the 35B version operates on hardware that most development teams already possess, and the 397B flagship model is the version achieving the benchmark-beating performance. All versions are released as free, open-source software under the MIT license, allowing users to deploy, fine-tune, and build commercial products on top of the technology without licensing restrictions.

Key Insights

  • Ornith 1.0 achieves a SWE-Bench score of 82.4, surpassing Claude Opus 4.7's performance
  • Ornith 1.0 introduces self-scaffolding reinforcement learning, enabling the model to build its own scaffold while solving problems rather than relying on human-built instructions
  • Most AI coding agents require human-built scaffolds that specify how to use tools, handle retry failures, and organize memory
  • Ornith 1.0 is available in four versions: a 9B model for laptops, a 35B model for existing team hardware, and a 397B flagship model
  • Ornith 1.0 is released as free, MIT-licensed open-source software that users can deploy, fine-tune, and commercialize

Topics

Ornith 1.0 AI agent releaseSelf-scaffolding reinforcement learningSWE-Bench benchmark performanceOpen-source AI coding modelsModel size variants and accessibility

Transcript

[0:00] This new AI agent is insane. A free open-source AI just beat Claude Opus 4.7. Ornith 1.0 just dropped from Deep Reinforce. Free open-source, MIT licensed, on SWE-Bench verified. It scores 82.4, beating Claude Opus 4.7. And it does something no other open-source coding AI has done before. Most AI coding agents need a human-built scaffold, instructions that tell the model how to use tools, retry failures, and organize memory. Ornith 1.0 builds its own scaffold while solving your problem. Process evolves as it works. [0:31] That's called self-scaffolding reinforcement learning. Four versions available. The 9B runs on a laptop. The 35B runs on hardware most teams already own. The 397B flagship is the one beating Claude. All free,…

Full transcript available for MurmurCast members

Sign Up to Access

More from Julian Goldie SEO

Get AI summaries like this delivered to your inbox daily

Get AI summaries delivered to your inbox

MurmurCast summarizes your YouTube channels, podcasts, and newsletters into one daily email digest.