This NEW Chinese AI is INSANE! (FREE + Open Source!)
Long Cap 2.0 is a new open-source Chinese AI model from a food delivery app company. It offers a 1-million-token context window at no cost, scores higher than GPT-4.5 on SWE-Bench Pro, and activates only the parameters each task needs, which cuts computational overhead.
Summary
Long Cap 2.0 is a newly released Chinese AI model that has drawn attention partly because of its origin: it was developed by a food delivery app company rather than a major technology corporation. The model first launched anonymously on OpenRouter under the code name Alpha, where it ran for about two months and climbed the rankings before its creator was publicly identified.

Long Cap 2.0 has 1.6 trillion parameters in total, but it activates only the parameters needed for each task. Compared with a model that activates every parameter for every input, this reduces power consumption and increases processing speed. Its most distinctive feature is a context window of 1 million tokens, large enough to load an entire knowledge base and keep all of it in context at once.

On the SWE-Bench Pro benchmark, Long Cap 2.0 scored 59.5, ahead of GPT-4.5's 58.6, a margin of 0.9 points. The model is released under the MIT open-source license, so anyone can use, modify, and build on it, including commercially.
Key Insights
- Long Cap 2.0 was developed by a food delivery app company rather than a traditional tech giant, and operated anonymously on OpenRouter under the code name Alpha for two months before being identified
- The model contains 1.6 trillion parameters but only activates the ones needed for each specific task, which reduces power consumption and increases processing speed
- Long Cap 2.0 offers 1 million tokens of context memory, enabling users to feed entire knowledge bases into the model and have all information held in memory simultaneously
- Long Cap 2.0 scored 59.5 on SWE-Bench Pro, outperforming GPT-4.5, which scored 58.6
- The model is released under the MIT open-source license, making it completely free and available for anyone to build upon
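The video does not say how Long Cap 2.0 decides which parameters to activate. One common way to get this behavior is mixture-of-experts style top-k routing, and the sketch below illustrates that general idea only. It is an assumption, not the model's documented architecture, and every name in it (`route_top_k`, `sparse_forward`, the toy experts and gate scores) is hypothetical.

```python
import math


def route_top_k(gate_scores, k):
    """Pick the k highest-scoring experts and softmax-normalize their scores."""
    ranked = sorted(range(len(gate_scores)),
                    key=lambda i: gate_scores[i], reverse=True)
    top = ranked[:k]
    # Subtract the max score before exponentiating for numerical stability.
    peak = max(gate_scores[i] for i in top)
    exps = [math.exp(gate_scores[i] - peak) for i in top]
    total = sum(exps)
    return top, [e / total for e in exps]


def sparse_forward(x, experts, gate_scores, k=2):
    """Run only the selected experts and mix their outputs by routing weight."""
    top, weights = route_top_k(gate_scores, k)
    output = sum(w * experts[i](x) for i, w in zip(top, weights))
    return output, top


# Toy setup (hypothetical): four "experts" that simply scale their input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
# Hypothetical gate scores; a real model would compute these from the input.
gate_scores = [0.1, 2.0, -1.0, 2.0]

output, used = sparse_forward(10.0, experts, gate_scores, k=2)
print(f"experts used: {used} of {len(experts)}, output: {output}")
```

In this toy run only two of the four experts execute, so half of the expert parameters are never touched for this input. That is the kind of saving the summary attributes to selective activation.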
Transcript
[0:00] New free, plus open-source Chinese AI is insane. This new Chinese AI has 1 million tokens of memory for free. This new Chinese AI is called Long Cap 2.0, and it just came from a food delivery app, not a tech giant. It ran anonymously on OpenRouter for 2 months under the code name Alpha, climbing the charts with nobody knowing who built it. It has 1.6 trillion parameters, but only activates the ones it needs for each task, saving power and running faster. The real game-changer is 1 million tokens of context. Feed it an entire [0:30] knowledge base, and it holds all of it in memory at once. On SWE-Bench Pro, it scored 59.5, beating…