Asianometry

VLIW: The “Impossible” Computer

10 snips
Apr 5, 2026
A tale of a seemingly impossible computer promising massive speedups using ordinary silicon. CPU basics are explained with a playful kitchen analogy. The story dives into instruction-level parallelism, trace scheduling, and the risky tricks compilers used to pack and backtrack code. It follows a daring academic startup building VLIW hardware, their initial wins, market struggles, and the lasting influence of their ideas.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
INSIGHT

Trace Scheduling Breaks Block Boundaries

  • Trace scheduling treats multiple basic blocks as one predicted long block and aggressively moves independent instructions up for parallel execution.
  • If the predicted path is correct, this can yield 10–30x speedups compared with conservative block-level scheduling.
INSIGHT

VLIW Moves Complexity To The Compiler

  • VLIW shifts complexity from hardware to compiler, avoiding dynamic hardware mechanisms that resolve runtime mispredictions.
  • The compiler inserts compensating code to backtrack when a predicted trace goes wrong, trading code size for simpler silicon.
ANECDOTE

Josh Fisher's ELI 512 Paper Sparked VLIW

  • Josh Fisher developed the ELI 512 VLIW concept and published Very Long Instruction Word Architectures with the ELI 512 design.
  • His paper paired the hardware idea with the Bulldog trace-scheduling compiler to realize massive parallelism.
Get the Snipd Podcast app to discover more snips from this episode
Get the app