Authority Hacker Podcast – AI & Automation for Small biz & Marketers

Your Claude Code Tokens Are Disappearing

40 snips
Apr 3, 2026
They unpack a massive Claude Code leak that exposed system prompts, hidden features, and a new Mythos model tier. They flag a sudden token limit cut and share ways to shrink token use. They demo EmDash, Cloudflare’s serverless WordPress alternative. They cover OpenAI’s self-serve ads roll out and Gemini’s one-click ChatGPT import move.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ADVICE

Use Lower Tier Models For Execution Tasks

  • Assign cheaper models per skill: use Sonnet or Haiku for execution and simple reads, Opus only where high reasoning is required.
  • Gael highlights a hidden slash command to use Opus for planning and Sonnet for execution to save tokens.
ADVICE

Trim Chat Histories And Cloud.md To Cut Tokens

  • Keep chats and cloud.md lean; long chat histories and heavy cloud.md increase token costs because they're sent with each message.
  • Gael recounts refactoring a 700-line skill.md down to 60 lines to drastically cut tokens.
ADVICE

Replace Token Heavy Workflows With Scripts

  • Build local scripts (Python) for repeated tasks so the model calls fewer tokens and runs logic off-chain.
  • Gael says many skills should use scripts to automate work without burning large token volumes.
Get the Snipd Podcast app to discover more snips from this episode
Get the app