Authority Hacker Podcast – AI & Automation for Small biz & Marketers

Your Claude Code Tokens Are Disappearing

40 snips

Apr 3, 2026

They unpack a massive Claude Code leak that exposed system prompts, hidden features, and a new Mythos model tier. They flag a sudden token limit cut and share ways to shrink token use. They demo EmDash, Cloudflare’s serverless WordPress alternative. They cover OpenAI’s self-serve ads roll out and Gemini’s one-click ChatGPT import move.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

ADVICE

Use Lower Tier Models For Execution Tasks

Assign cheaper models per skill: use Sonnet or Haiku for execution and simple reads, Opus only where high reasoning is required.
Gael highlights a hidden slash command to use Opus for planning and Sonnet for execution to save tokens.

ADVICE

Trim Chat Histories And Cloud.md To Cut Tokens

Keep chats and cloud.md lean; long chat histories and heavy cloud.md increase token costs because they're sent with each message.
Gael recounts refactoring a 700-line skill.md down to 60 lines to drastically cut tokens.

ADVICE

Replace Token Heavy Workflows With Scripts

Build local scripts (Python) for repeated tasks so the model calls fewer tokens and runs logic off-chain.
Gael says many skills should use scripts to automate work without burning large token volumes.

Get the Snipd Podcast app to discover more snips from this episode

Get the app