
Feb 19, 2026 - InferenceX (Cam Quilici, Bryan Shan, Doug O'Laughlin, Jordan Nanos)
SemiAnalysis Weekly
Multi-Token Prediction (MTP) Benefits
Bryan explains MTP and speculative decoding, how DeepSeek uses MTP heads, and how these techniques deliver speedups while preserving accuracy.
Transcript


