ThursdAI - The top AI news from the past week

šŸ“… ThursdAI - Feb 26 - The Pentagon wants War Claude, every benchmark collapsed, and a solo founder hit $700K ARR with AI agents

94 snips
Feb 27, 2026
Nader Dabit, a developer-relations/product practitioner at Cognition who worked on Devin, and Ben Broca, engineer and founder of Polsia, a platform for autonomous AI companies. They discuss agentic tooling, Devin 2.2 features for engineering automation, and how Polsia runs and scales autonomous businesses. Rapid autonomy, tool ecosystems, and the race to run AI companies 24/7 are the main themes.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
ANECDOTE

PCIe Card Demo Shows 15,000 Tokens Per Second Inference

  • A Canadian startup demoed a PCIe card that generates ~15,000 tokens/sec with baked-in LLM weights.
  • Panel noted the card resembles a sound card and is targeted at instantaneous, low-latency inference tasks.
INSIGHT

METR Shows Rapid Doubling Of Agent Time Horizons

  • METR shows model autonomous time-horizon doubling extremely fast, indicating agentic stamina improvements.
  • Opus and GPT 5.3/Codex drove exponential increases in hours of autonomous task completion versus earlier models.
ADVICE

Use Time Horizon Benchmarks As Relative Signals

  • Treat METR-style benchmarks as relative signals, not absolute proof; account for prompt and harness differences.
  • Panelists warned Opus can be prompt-tuned to loop work and inflate runtime scores, so use benchmarks comparatively.
Get the Snipd Podcast app to discover more snips from this episode
Get the app