ThursdAI - The top AI news from the past week

April 16 - Codex uses your mac in the background, Opus 4.7 release not quite Mythos + 3 interviews

122 snips
Apr 16, 2026
Theodore, product manager at Cognition working on Windsurf 2.0 and agent command centers. Trevor Mons, founding engineer at Marimo building reactive Python notebooks for agent-driven data workflows. Quinn Lahodman-Kramer, co-founder and creator of Gradient-Bang, a voice-driven multiplayer LLM game. They discuss Windsurf 2.0 and Devin integration. Pairing agents with stateful Marimo notebooks. Voice multi-agent game design and stack choices.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Opus 4.7 Is A Different Kind Of Upgrade

  • Anthropic's Claude Opus 4.7 shows targeted improvements (multimodality, coding) but mixed benchmark shifts versus 4.6.
  • The team may have trained a new base with a different tokenizer, causing wins (ScreenSpot up ~20%) and regressions (MRCR long-context drop to 32%).
INSIGHT

QN 3.6 Returns Alibaba To Open Source Competitiveness

  • Alibaba released Qwen 3.6 as open source (Apache 2) and achieved strong mid-size performance usable locally.
  • Qwen 3.6 is a 35B model with ~3B active, scoring ~51% on TerminalBench and beating larger dense models on some agentic tests.
ADVICE

Match Code Review Intensity To Task Criticality

  • Treat code-review habits as task-dependent: read critical code but spot-check diffs for one-off or low-risk tasks.
  • Use tests, harnesses and automated reviews to shift review effort from manual line-by-line checks to system verification.
Get the Snipd Podcast app to discover more snips from this episode
Get the app