Claude Code is now FREE: Here’s how… Summary — Julian Goldie SEO

Summary

The video demonstrates how to use Claude Code for free indefinitely using Google's newly updated Gemma 4 model with Ollama, which achieves a 90% speed improvement on Apple Silicon devices. The presenter outlines three straightforward setup steps: download Ollama (free), download Gemma 4 (free), and install into Claude Code (free), creating a complete free agentic coding system that can run autonomously 24/7. For users without Apple Silicon, the same capability is available via OpenRouter's free API endpoint, routing through the open-source free Claude Code project. The presenter demonstrates practical applications including building a to-do list app and Space Invaders game, showing that while Gemma 4 isn't suitable for highly complex tasks, it performs well for basic projects like blog posts, landing pages, and background scheduling tasks. The key advantage over previous local model approaches is the significant speed improvement, making local models viable as a cost-free alternative to cloud-based APIs that consume tokens. The presenter advocates for using open-source, locally-runnable models to avoid dependency on proprietary closed systems that can be removed or changed at any time, citing the example of previous model disruptions. The transcript mentions integration into an Agent Operating System with additional features like memory systems, and references Goldiebench, a local leaderboard testing models across 42 different tasks for comparative benchmarking.

Key Insights

Gemma 4 from Google is now 90% faster on Apple Silicon with Ollama using MLX, making free local models viable as a speed-competitive alternative to traditional cloud APIs

Claude Code remains the same CLI tool regardless of which model backs it; users simply point it at different models like Gemma 4, making it not a watered-down version but the full product with a different inference backend

Gemma 4 is suitable for basic tasks like writing blog posts or creating landing pages, but frontier models should be used for complex projects, creating a tiered approach to model selection based on task complexity

Local models running on schedules or in the background don't require speed optimization since results can be reviewed hours later, making Hermes and other slower agents viable for asynchronous workflows

Using open-source local models provides system ownership and protection against disruption, unlike closed proprietary models that can be taken down or removed, breaking dependent workflows

Topics

Gemma 4 model and 90% speed improvement on Apple SiliconFree Claude Code setup with local modelsOllama installation and configurationOpenRouter API as alternative for non-Mac usersComparison of local vs frontier models for different use casesAgent Operating System integrationOpen-source model advocacy vs proprietary closed systemsPractical demonstrations and use cases

Transcript

[0:00] Today I'm going to show you how to use clawed code free forever with a brand new update that just came out from Google that allows you to use free clawed code with a free local model and then also it's 90% faster which is usually the main downfall of local models. So I'm going to run you through exactly how we're running this system right here, what you can build with it, how it works, etc. You can see a bunch of things that we built with it over here. So we've already plugged this into our Asian operating system and this works in three simple steps. So first of [0:32] all the update and why this is…

Full transcript available for MurmurCast members

Claude Code is now FREE: Here’s how…

Summary

Key Insights

Topics

Transcript

More from Julian Goldie SEO

This NEW Chinese AI is INSANE! (FREE + Open Source!)

X AI MCP Server Just Changed AI Agents

New NotebookLM Update is INSANE!

How to Rank #1 with Claude Fable 5 AI SEO!

NEW Hermes + Paperclip AI Agent Update Is INSANE

Get AI summaries delivered to your inbox