ThursdAI - The top AI news from the past week

đź“… ThursdAI - Feb 19 - Gemini 3.1 Pro Drops LIVE, Sonnet 4.6 Closes Gap, OpenClaw Goes to OpenAI

119 snips
Feb 20, 2026
Ryan Carson, startup founder and writer, shares his CodeFactory workflow and production tips. Nisten Tahiraj, AI practitioner, runs vibe checks on models like Qwen 3.5 and demos front-end experiments. Wolfram Ravenwolf, engineer and evaluator, breaks down TerminalBench and benchmarking quirks. They debate Gemini 3.1 Pro, Sonnet 4.6, model harness effects, and agent-driven code pipelines in short, lively segments.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Gemini 3.1 Pro: Power Meets Practical Limits

  • Gemini 3.1 Pro is a major release with top-tier scores on several benchmarks and a 1M token context option.
  • Its speed impressed the panel, but real-world vibe checks showed mixed practical results.
INSIGHT

1M-Token Benchmark Discrepancy

  • Long-context retrieval scores showed a major discrepancy between Gemini and Opus on MRCR v2 1M tests.
  • LDJ and others flagged this as likely a benchmarking or reporting inconsistency, not pure model ability.
ADVICE

Pick One Model And Harness To Maximize Productivity

  • Choose the model and harness you’ll stick with to avoid constant context switching and productivity loss.
  • Prefer the vendor harness that best matches your model for highest real-world performance.
Get the Snipd Podcast app to discover more snips from this episode
Get the app