The Orthogonal Bet

Rohit Krishnan on Training AI to Write Better

8 snips
Oct 15, 2025
Join Rohit Krishnan, a creative force behind the Strange Loop Canon newsletter and an AI practitioner, as he dives into the world of artificial intelligence and writing. He reveals the intriguing flaws of large language models in generating prose and discusses the role of reinforcement learning in training AI like his project, Walter. Rohit also explores how AI can reshape our future work dynamics, likening it to managing playful video games. Above all, he emphasizes the importance of experimentation and play in unlocking AI's full potential.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
INSIGHT

Great Writing Hides Trainable Patterns

  • High-quality writing has deeper, subtler patterns than surface token statistics.
  • Rohit proposes extracting surprisal and sampling patterns from great literature to create training gradients.
ADVICE

Craft Reward Proxies For Hard Tasks

  • Try to design reward functions for non-verifiable human tasks like writing by finding proxy signals.
  • Rohit argues improving writing is a key step toward models that reason deeper across domains.
INSIGHT

LLMs As Fuzzy Processors

  • 'Fuzzy processor' remains a useful metaphor for LLMs as probabilistic compute substrates.
  • Rohit sees current systems as chips combined with tool use and multi-loop internal reasoning.
Get the Snipd Podcast app to discover more snips from this episode
Get the app