

Agents Hour
Mastra
The AI Agents show that discusses hot topics in the world of AI, talks with guests building AI agents and applications, and shows the actual code of how AI applications are being built today. Hosted by Shane Thomas and Abhi Aiyer from Mastra.Watch the livestream on Youtube and X on Monday at 12PM pacific time. Watch the video versions on Spotify or YouTube.
Episodes
Mentioned books

Mar 28, 2026 • 28min
Claude Uses Your Computer, Openai Buys Python Tools & The Cursor/Kimi Plot Twist (This Week In AI)
Shane and Abhi kick off with a viral quote: if your $500K engineer isn't burning $250K in tokens, something is wrong.
OpenAI is acquiring Astral — the team behind uv and Ruff — joining the Codex team. OpenAI bets on Python; Anthropic bet on TypeScript with Bun. Then Cursor drama: someone found Composer 2 is powered by Kimi K2.5, Kimi confirmed it, and raised another $1B at an $18B valuation — three rounds in 90 days.
Anthropic shipped Claude Code Channels (Telegram/Discord control), Cowork Dispatch (persistent agent, message from phone), and a deep dive on how they use Skills. Matt Pocock found quality drops past 100K on the 1M context window. And 52 million views on enabling Claude to use your computer — Mac only.
Stripe launched MPP for agent-to-agent payments. Better Auth launched the Agent Auth Protocol. Cloudflare shipped Dynamic Workers for AI-generated code in isolates. LangChain open-sourced Deep Agents, Composio shipped 30-parallel-agent orchestration, OpenCode lost its Claude Max plugin after Anthropic sent lawyers, and Netlify and Google Stitch entered vibe coding and design.
EsoLang-Bench: LLMs score 85–95% on standard benchmarks but collapse to 0–11% on esoteric languages — memorization, not reasoning. Quick hits: GPT-5.4 mini/nano, Minimax M2.7, Morph FlashCompact, AI CMO, Letta pivots to coding agents, GLM-OCR, LiteLLM supply chain attack.
AI Agents Hour is a weekly livestream by Mastra CPO Shane Thomas and CTO Abhi Aiyer. Mondays 12PM Pacific.
📚 READ MORE
$500K engineers: https://x.com/sundeep/status/2034829022082080846
OpenAI acquires Astral: https://openai.com/index/openai-to-acquire-astral/
Cursor Composer 2: https://x.com/cursor_ai/status/2034668943676244133
Composer 2 is Kimi K2.5: https://x.com/fynnso/status/2034706304875602030
Kimi confirms: https://x.com/kimi_moonshot/status/2035074972943831491
Kimi raises $1B: https://x.com/CodeByPoonam/status/2034940587942846665
Claude Code Channels: https://x.com/trq212/status/2034761016320696565
Cowork Dispatch: https://x.com/felixrieseberg/status/2034005731457044577
Anthropic Skills post: https://x.com/trq212/status/2033949937936085378
1M context quality: https://x.com/mattpocockuk/status/2034572011175907474
Claude computer use: https://x.com/claudeai/status/2034991044109184388
Stripe MPP: https://stripe.com/blog/machine-payments-protocol
Agent Auth Protocol: https://github.com/better-auth/agent-auth-protocol
Cloudflare Dynamic Workers: https://x.com/CloudflareDev/status/2034510221044736342
LangChain Deep Agents: https://x.com/hasantoxr/status/2033213054859792859
Composio Orchestrator: https://x.com/hasantoxr/status/2033999352008741376
OpenCode/Anthropic: https://x.com/thdxr/status/2034730036759339100
Netlify: https://x.com/Netlify/status/2034303709832773711
Google Stitch: https://stitch.withgoogle.com
EsoLang-Bench: https://arxiv.org/abs/2603.09678
GPT-5.4 mini: https://x.com/openai/status/2033953592424731072
Morph FlashCompact: https://x.com/morphllm/status/2033968877345116200
📚 MASTRA RESOURCES
Mastra: https://mastra.ai
Mastra on X: https://x.com/mastra_ai
Mastra Discord: https://mastra.ai/community/discord
Mastra GitHub: https://github.com/mastra-ai
Learn Mastra in the world's first MCP-Based Course: https://mastra.ai/course
Principles of Building AI Agents (Book): https://mastra.ai/books/principles-of-building-ai-agents
Patterns for Building AI Agents (New Book): https://mastra.ai/books/patterns-of-building-ai-agents
WHAT IS MASTRA?
Mastra is an open-source TypeScript framework designed for building and shipping AI-powered applications and agents with minimal friction. It supports the full lifecycle of agent development—from prototype to production. You can integrate it with frontend and backend stacks (e.g., React, Next.js, Node) or run agents as standalone services. If you're a JavaScript or TypeScript developer looking to build an agentic or AI-powered product without starting from first principles, Mastra provides the scaffolding, tools, and integrations to accelerate that process.
00:00 — If your $500K engineer isn't burning $250K in tokens, something is wrong
01:36 — OpenAI acquires Astral
02:31 — Cursor's Composer 2 is secretly Kimi K2.5
05:35 — Kimi raises another $1B
05:57 — Anthropic ships
08:00 — Opus 4.6 1M context: quality drops noticeably past 100K tokens
08:46 — Claude can now use your computer (Mac only)
11:02 — Stripe's Machine Payments Protocol
12:28 — Better Auth launches the Agent Auth Protocol
13:12 — Cloudflare Dynamic Workers & the vibe coding platform wave
14:08 — LangChain Deep Agents, Composio's 30-agent orchestrator & cloud coding agents
17:00 — OpenCode removes the Claude Max plugin
19:26 — Google Stitch & Netlify's prompt-to-project
19:59 — LLMs aren't reasoning, they're memorizing
21:09 — Quick hits: GPT-5.4 mini, Minimax M2.7, Morph FlashCompact, AI CMO
23:55 — Letta goes all-in on coding agents, GLM-OCR
24:46 — LiteLLM supply chain attack

Mar 25, 2026 • 21min
Email Broke Productivity - It's Time To Fix It (with Brett and Naveen from Micro)
Naveen Sreekandan, technical co‑founder and engineer who built Micro's graph-based architecture. Brett Goldstein, co‑founder driving product and demos of the unified productivity platform. They discuss transforming email into a connected workspace. They demo a daily orchestrator, CRM autofill, integrated meeting notes, and the underlying Mastra graph, Prism query layer, and agent/sub‑agent approach.

Mar 21, 2026 • 9min
Two Lines of Code to Lock Down Your Agents - Mastra Studio Auth
Mastra Studio started as a local playground for developers to test agents and workflows without having to spin up a custom UI. But as the feature set grew, teams started asking: how do we share this with non-technical teammates? How do we control what different users can do?
Ryan, an engineer at Mastra, walks through the new Mastra Studio Auth — now baked directly into Studio. Starting with simple token-based auth (two lines of config), you can lock down your Studio from the open internet. From there, RBAC lets you map roles to granular permissions — 80 auto-generated permissions derived directly from Studio's routes and handlers, controllable via wildcard patterns. Out-of-the-box providers include WorkOS, Auth0, Supabase, Firebase, and Clerk, with GitHub and others in open PRs.
The team also discusses what's coming next: audit logs so you can see exactly what an agent did, why it accessed a given tool, and whether it should have. Auth for agents in production isn't magic — your tool files still need to check permissions — but Mastra handles the plumbing so you can focus on building securely.
Read more: https://mastra.ai/blog/announcing-studio-auth
AI Agents Hour is a weekly livestream hosted by Mastra CPO Shane Thomas and CTO Abhi Aiyer. Airing Mondays at 12PM Pacific on YouTube and X, the show covers breaking AI news, agent development techniques, and features interviews with industry experts building AI applications today.
📚 MASTRA RESOURCES
Mastra: https://mastra.ai
Mastra on X: https://x.com/mastra_ai
Mastra Discord: https://mastra.ai/community/discord
Mastra GitHub: https://github.com/mastra-ai
Learn Mastra in the world's first MCP-Based Course: https://mastra.ai/course
Principles of Building AI Agents (Book): https://mastra.ai/books/principles-of-building-ai-agents
Patterns for Building AI Agents (New Book): https://mastra.ai/books/patterns-of-building-ai-agents
MASTRA?
Mastra is an open-source TypeScript framework designed for building and shipping AI-powered applications and agents with minimal friction. It supports the full lifecycle of agent development—from prototype to production. You can integrate it with frontend and backend stacks (e.g., React, Next.js, Node) or run agents as standalone services. If you’re a JavaScript or TypeScript developer looking to build an agentic or AI-powered product without starting from first principles, Mastra provides the scaffolding, tools, and integrations to accelerate that process.
📌 CHAPTERS
00:00 — Why Mastra Studio needed auth
01:22 — Token-based auth: the simplest setup
02:32 — RBAC: roles, permissions & wildcards
05:00 — Auth for agents vs auth for humans
06:41 — Think securely!
07:22 — Supported providers & what's coming next

Mar 18, 2026 • 32min
NVIDIA GTC, The Death of MCP, and AI Agents Are Hiring Humans - This Week in AI
Shane hosts this week's news from his usual studio while Abhi joins remotely from NVIDIA GTC 2026 in San Jose. Jensen Huang's keynote set the tone: NVIDIA is doubling down on AI factories, pushing 100x more token throughput, and helping bring OpenAI onto AWS infrastructure.
RentAHuman is a startup that lets AI agents hire humans for physical tasks they can't do themselves.
Perplexity's CTO said internally they're moving away from MCPs toward APIs and CLIs, but Chrome 146 shipping native MCP support may have undercut that argument immediately.
Anthropic had a strong week: 1M context window is now GA for Opus 4.6 and Sonnet 4.6 with no beta header required, Opus 4.6 1M is now the default model for Claude Code on Max/Team/Enterprise with no long context price premium, and the new /btw command lets you have side conversations while Claude is working.
Vercel and Cloudflare reignited their ongoing drama over the just-bash fork.
Ramp launched credit cards for agents and Perplexity announced Personal Computer, an always-on local agent running on a Mac mini.
Developer stack coverage includes Resend's open-source CLI with 53 commands, pnpm 11's git worktrees support for multi-agent monorepos, and OpenAI pushing a full computer environment behind the Responses API.
Deeper reads from Sunil Pai on generative UI post-WIMP interfaces, Elliot Arledge on the RL environment business, and Jay Scambler's Autocontext harness.
Quick hits: Replit Agent 4, Manus Desktop, NemoClaw from NVIDIA, llmock by CopilotKit, ContextKing raising to kill vector DBs, Google Maps getting Gemini, and Z.ai's GLM-5-Turbo optimized for Claude Code.
AI Agents Hour is a weekly livestream hosted by Mastra CPO Shane Thomas and CTO Abhi Aiyer. Airing Mondays at 12PM Pacific on YouTube and X.
📚 READ MORE
RentAHuman
https://x.com/polymarket/status/2032470045217939723
Perplexity CTO moving away from MCPs
https://x.com/morganlinton/status/2031795683897077965
Chrome 146 native MCP support
https://x.com/xpasky/status/2032252486145253865
Claude 1M context window GA
https://x.com/claudeai/status/2032509548297343196
Opus 4.6 1M default for Claude Code
https://x.com/alexalbert__/status/2032522722551689363
Claude Code /btw command
https://x.com/trq212/status/2031506296697131352
Vercel CTO on just-bash fork
https://x.com/cramforce/status/2033285112478171373
Guillermo Rauch on Cloudflare
https://x.com/rauchg/status/2033291143715455458
Ramp Agent Cards
https://x.com/i/trending/2031832827063648342
Perplexity Personal Computer
https://x.com/perplexity_ai/status/2031790180521427166
Resend CLI
https://x.com/zenorocha/status/2032459310341800314
pnpm 11 git worktrees
https://pnpm.io/11.x/git-worktrees
OpenAI Responses API computer environment
https://openai.com/index/equip-responses-api-computer-environment/
Sunil Pai — After WIMP
https://sunilpai.dev/posts/after-wimp/
Elliot Arledge — The RL Environment Business
https://x.com/elliotarledge/status/2032753593535574433
Autocontext — Jay Scambler
https://x.com/JayScambler/status/2032508829959868690
Kimi Attention Residuals
https://x.com/Kimi_Moonshot/status/2033378587878072424
Replit Agent 4
https://x.com/amasad/status/2031755113694679094
Manus Desktop
https://x.com/ManusAI/status/2033558672152854712
NemoClaw — NVIDIA
https://nemoclaw.so
llmock by CopilotKit
https://llmock.copilotkit.dev
ContextKing — killing vector DBs
https://x.com/contextkingceo/status/2032098309029220456
Google Maps biggest upgrade
https://x.com/google/status/2032079594191261938
Z.ai GLM-5-Turbo
https://x.com/Zai_org/status/2033221428640674015
📚 MASTRA RESOURCES
Mastra: https://mastra.ai
Mastra on X: https://x.com/mastra_ai
Mastra Discord: https://mastra.ai/community/discord
Mastra GitHub: https://github.com/mastra-ai
Learn Mastra in the world's first MCP-Based Course: https://mastra.ai/course
Principles of Building AI Agents (Book): https://mastra.ai/books/principles-of-building-ai-agents
Patterns for Building AI Agents (New Book): https://mastra.ai/books/patterns-of-building-ai-agents
MASTRA?
Mastra is an open-source TypeScript framework designed for building and shipping AI-powered applications and agents with minimal friction. It supports the full lifecycle of agent development—from prototype to production. You can integrate it with frontend and backend stacks (e.g., React, Next.js, Node) or run agents as standalone services. If you’re a JavaScript or TypeScript developer looking to build an agentic or AI-powered product without starting from first principles, Mastra provides the scaffolding, tools, and integrations to accelerate that process.
00:00 — NVIDIA GTC 2026
04:16 — RentAHuman
07:56 — Is MCP dead?
13:08 — Anthropic ships
14:59 — Vercel vs Cloudflare: the just-bash fork drama
18:59 — Ramp Agent Cards & Perplexity Personal Computer
20:52 — Resend CLI, pnpm worktrees, OpenAI Responses API
23:04 — Developer insights

Mar 13, 2026 • 25min
Meta Acquires Moltbook, Openai Releases GPT 5.4, TypeScript Is #1 on GitHub (This Week In AI)
A lot happened in eight days.
Meta acquired Moltbook, a social network built entirely for AI agents, not humans.
OpenAI dropped GPT-5.4 Thinking and GPT-5.4 Pro, Codex got forks for multi-agent workflows and Windows support, and there are rumblings of OpenAI building a GitHub alternative. Anthropic fired back hard — multi-agent PR code review for Claude Code, while loops via /loop, the Claude Marketplace, and a way to pull your context from other AI tools. Plus: voice mode for CLI coding is apparently real, and people are using it.
This episode also covers the explosion of coding agents: Theo's T3 Code, OpenAI's Symphony orchestration layer, OpenCode workspaces, and swyx's thesis that this is the Year of the Subagent.
Donald Knuth is making headlines for being impressed by Opus 4.6, solving a long-standing math conjecture. TypeScript is overtaking Python and JavaScript on GitHub. Gemini 3.1 Flash-Lite drops. AMI raises $1B.
OpenClaw is getting government-backed adoption in China. Stanford's paper on RAG breaking at 10K documents, Karpathy's autoresearch project, Justin Poehnelt on why your CLI needs to be rewritten for agents. Plus: Raycast Glaze, Google Workspace CLI, Copilot Cowork, Exa Deep, Expo Agent, and a discussion on whether humans should be reviewing code at all.
AI Agents Hour is a weekly livestream hosted by Mastra CPO Shane Thomas and CTO Abhi Aiyer. Airing Mondays at 12PM Pacific on YouTube and X, the show covers breaking AI news, agent development techniques, and features interviews with industry experts building AI applications today.
📚 MASTRA RESOURCES
Mastra: https://mastra.ai
Mastra on X: https://x.com/mastra_ai
Mastra Discord: https://mastra.ai/community/discord
Mastra GitHub: https://github.com/mastra-ai
Learn Mastra in the world's first MCP-Based Course: https://mastra.ai/course
Principles of Building AI Agents (Book): https://mastra.ai/books/principles-of-building-ai-agents
Patterns for Building AI Agents (New Book): https://mastra.ai/books/patterns-of-building-ai-agents
MASTRA?
Mastra is an open-source TypeScript framework designed for building and shipping AI-powered applications and agents with minimal friction. It supports the full lifecycle of agent development—from prototype to production. You can integrate it with frontend and backend stacks (e.g., React, Next.js, Node) or run agents as standalone services. If you’re a JavaScript or TypeScript developer looking to build an agentic or AI-powered product without starting from first principles, Mastra provides the scaffolding, tools, and integrations to accelerate that process.
CHAPTERS
00:00 — Meta acquires Moltbook
01:41 — OpenAI updates
05:47 — Claude fights back:
09:05 — The coding agent explosion
11:28 — Donald Knuth
12:15 — TypeScript #1 on GitHub
13:05 — AI products and platforms
15:02 — Model releases and fuding
15:50 —OpenClaw goes mainstream in China
16:43 — Developer insights: RAG collapse, Karpathy's autoresearch & rewriting CLIs for agents
20:29 — Quick hits & is it time to kill the code review?

Mar 11, 2026 • 15min
The Biggest Threat to AI Agents (with Ismail Pelaseyed)
Ismail Pelaseyed from Superagent is back on Agents Hour, and this time he's talking about something most builders aren't thinking about yet — supply chain attacks on AI agents. Guardrails protect against what you tell your agent to do. But what about everything your agent reads, fetches, and installs on its own? That's the gap Brin is built to fill.
Brin is a free, open-source credit score for agent context. Before your agent acts on an external package, MCP server, skill, or web page, Brin scores it — identity, behavior, and content — and returns a verdict in under 10ms. No signup, no auth, one GET request.
Ismail walks through how supply chain attacks actually work in production, the three-tier scoring model behind Brin, how the Cline NPM incident illustrates exactly this problem, and why securing the context — not the agent — is the right mental model.
AI Agents Hour is a weekly livestream hosted by Mastra CPO Shane Thomas and CTO Abhi Aiyer. Airing Mondays at 12PM Pacific on YouTube and X, the show covers breaking AI news, agent development techniques, and features interviews with industry experts building AI applications today.
🔗 CHECK OUT BRIN
Brin: https://brin.sh/
Brin docs: https://brin.sh/docs
Brin GitHub: https://github.com/superagent-ai/brin
Superagent: https://superagent.sh
Superagent on X: https://x.com/superagent_ai
https://x.com/pelaseyed
📚 MASTRA RESOURCES
Mastra: https://mastra.ai
Mastra on X: https://x.com/mastra_ai
Mastra Discord: https://mastra.ai/community/discord
Mastra GitHub: https://github.com/mastra-ai
Learn Mastra in the world's first MCP-Based Course: https://mastra.ai/course
Principles of Building AI Agents (Book): https://mastra.ai/books/principles-of-building-ai-agents
Patterns for Building AI Agents (New Book): https://mastra.ai/books/patterns-of-building-ai-agents
MASTRA?
Mastra is an open-source TypeScript framework designed for building and shipping AI-powered applications and agents with minimal friction. It supports the full lifecycle of agent development—from prototype to production. You can integrate it with frontend and backend stacks (e.g., React, Next.js, Node) or run agents as standalone services. If you’re a JavaScript or TypeScript developer looking to build an agentic or AI-powered product without starting from first principles, Mastra provides the scaffolding, tools, and integrations to accelerate that process.
00:00 — From guardrails to supply chain attacks
03:32 — Introducing Brin: a credit score for agent context
05:14 — How to integrate Brin into your agent
07:17 — The three-tier scoring model
10:50 — What's next for agent security

Mar 4, 2026 • 34min
Missile Strikes Disrupt AWS and Claude, Anthropic Banned from US Government, Cloudflare vs Vercel
This week in AI saw geopolitical turmoil, major funding news, and a shift in software development. Missile strikes in the UAE and Bahrain disrupted AWS and Claude services. Meanwhile, after Anthropic banned its models from autonomous weapons and mass surveillance, the Trump administration banned Anthropic from government contracts—posing a major supply chain risk. On the same day, Sam Altman secured a deal with the Department of War as OpenAI announced a $110 billion funding round, highlighting a sharp contrast in approaches.
AI coding is evolving rapidly. Andrej Karpathy noted that coding agents, ineffective before December, now work well with improved quality and coherence. Yet, perfect accuracy remains elusive. New tools emerged: Cursor demos code, Linear markets itself as an AI coding assistant, and Perplexity Computer offers an all-in-one system for managing AI projects. Smaller models like Qwen 3.5 grow faster and more efficient for edge use. Other highlights include Anthropic acquiring Vercept AI, Claude's remote coding controls, and Stanford confirming major AI firms use user conversations to train models.
AI Agents Hour is a weekly livestream hosted by Mastra CPO Shane Thomas and CTO Abhi Aiyer. Airing Mondays at 12PM Pacific on YouTube and X, the show covers breaking AI news, agent development techniques, and features interviews with industry experts building AI applications today.
📚 MASTRA RESOURCES
Mastra: https://mastra.ai
Learn Mastra in the world's first MCP-Based Course: https://mastra.ai/course
Principles of Building AI Agents (Book): https://mastra.ai/book
Patterns for Building AI Agents (New Book): https://mastra.ai/blog/patterns-book https://docs.google.com/forms/d/e/1FAIpQLSduJjc515f6RZJqtkR2ByqJZrB0iP8B7SUKnjjZE9IajH_I8w/viewform
MASTRA?
Mastra is an open-source TypeScript framework designed for building and shipping AI-powered applications and agents with minimal friction. It supports the full lifecycle of agent development—from prototype to production. You can integrate it with frontend and backend stacks (e.g., React, Next.js, Node) or run agents as standalone services. If you’re a JavaScript or TypeScript developer looking to build an agentic or AI-powered product without starting from first principles, Mastra provides the scaffolding, tools, and integrations to accelerate that process.
CHAPTERS
00:00 – Intro
00:25 – AWS Data Center Strikes & Claude Outages
01:20 – Anthropic Ban
05:30 – Sam Altman's Government Deal
10:05 – Cloudflare/Vercel Drama & NPM Namespaces
11:09 – Block Cuts 40% of Staff (4,000 People)
12:30 – AI & Job Market: Fear vs Reality
18:27 – OpenAI Raises $110B, Codex Growth
18:52 – Claude Releases: Vercept Acquisition, Remote Control, Auto Memory
20:13 – State of AI Coding
26:10 – AI Products and Platforms
28:42 – Open Source Models
31:30 – Quick Hits & GitHub Star Party

Mar 1, 2026 • 17min
How to Build Reliable AI Agents with Datasets, Experiments, and Error Analysis
Yujohn from Mastra explains why datasets and experiments are essential for building production-grade AI agents.
If you're building an agent, you need a way to verify it's working correctly before and after you make changes. Datasets provide that baseline. You create a collection of test cases (ground truth) that represent the scenarios your agent should handle. Then you run experiments: pass each test case through your agent and measure the results. This is error analysis in practice. You start by identifying where your agent fails, then build scorers to quantify those failure modes over time. Smaller teams often ship first and add datasets later, once they have user feedback. Larger teams need them earlier. But eventually, every production agent needs this.
The demo shows how Mastra makes this accessible. You can create datasets through the UI, add items manually or import from CSV, and run experiments with a single click. The results show you exactly what went wrong: which tool calls failed, what the agent output was, and how it compared to ground truth. You can also compare experiments side by side to see if your prompt tweaks actually improved things. And because all the data lives in your own database, you can write your own agents to analyze the results, dig into traces, and iterate. The SDK makes it easy to integrate into CI/CD: run experiments on pull requests, gate deployments on eval scores, or just collect data from production and curate datasets later.
🔗 RESOURCES
Mastra Datasets docs: https://mastra.ai/docs/observability/datasets
Running Experiments: https://mastra.ai/docs/observability/datasets/running-experiments
Mastra GitHub: https://github.com/mastra-ai/mastra
Yujohn on X: https://x.com/YujohnNatt
Mastra Discord: https://discord.gg/mastra
AI Agents Hour is a weekly livestream hosted by Mastra CPO Shane Thomas and CTO Abhi Aiyer. Airing Mondays at 12PM Pacific on YouTube and X, the show covers breaking AI news, agent development techniques, and features interviews with industry experts building AI applications today.
📚 MASTRA RESOURCES
Mastra: https://mastra.ai
Learn Mastra in the world's first MCP-Based Course: https://mastra.ai/course
Principles of Building AI Agents (Book): https://mastra.ai/book
Patterns for Building AI Agents (New Book): https://mastra.ai/books/patterns-of-building-ai-agents
MASTRA?
Mastra is an open-source TypeScript framework designed for building and shipping AI-powered applications and agents with minimal friction. It supports the full lifecycle of agent development—from prototype to production. You can integrate it with frontend and backend stacks (e.g., React, Next.js, Node) or run agents as standalone services. If you’re a JavaScript or TypeScript developer looking to build an agentic or AI-powered product without starting from first principles, Mastra provides the scaffolding, tools, and integrations to accelerate that process.
00:00 – Intro
00:48 – What are Datasets and Experiments
01:55 – Error Analysis
03:35 – When to Use Datasets (Team Size Matters )
05:43 – Demo: Creating a Dataset
07:04 – Demo: Ground Truth
07:53 – Demo: Running Experiments
09:34 – Demo: Comparing Results
11:00 – Your Data, Your Database
12:24 – SDK & CI Integration
14:30 – Collecting Data from Production

Feb 27, 2026 • 15min
A Coding Agent That Never Compacts
Abhi walks through Mastra Code, a new open-source coding agent with observational memory that compresses context without losing it.
When we built Observational Memory, we needed a way to test it in production. Instead of a standard bot, we dogfooded it: we built a coding agent and used it ourselves. Writing code every day quickly revealed whether the memory actually worked.
Eventually, something clicked. Long, multi-day coding sessions ran smoothly, without hitting the compaction limits that trip up other agents. The agent actually seemed to understand the conversation. That gave us confidence to release Observational Memory... and the agent itself also
became something worth sharing.
The core innovation is the Harness primitive. Rather than just a coding agent, Harness can power any type of agentic workflow: customer support, design tools, electrical engineering, and more.
In this demo, you’ll see the Mastra Code CLI, a production-ready coding agent, and how Corbin from Artifact used the Harness to create an in-app agent for electrical engineering. Same primitive, completely different use case.
🔗 RESOURCES
Mastra Code announcement: https://mastra.ai/blog/announcing-mastra-code
Mastra Code GitHub: https://github.com/mastra-ai/mastra
Mastra Code NPM: https://www.npmjs.com/package/mastra-code
Observational Memory: https://mastra.ai/blog/observational-memory
Mastra documentation: https://docs.mastra.ai
AI Agents Hour is a weekly livestream hosted by Mastra CPO Shane Thomas and CTO Abhi Aiyer. Airing Mondays at 12PM Pacific on YouTube and X, the show covers breaking AI news, agent development techniques, and features interviews with industry experts building AI applications today.
📚 MASTRA RESOURCES
Mastra: https://mastra.ai
Learn Mastra in the world's first MCP-Based Course: https://mastra.ai/course
Principles of Building AI Agents (Book): https://mastra.ai/book
Patterns for Building AI Agents (New Book): https://mastra.ai/blog/patterns-book https://docs.google.com/forms/d/e/1FAIpQLSduJjc515f6RZJqtkR2ByqJZrB0iP8B7SUKnjjZE9IajH_I8w/viewform
MASTRA?
Mastra is an open-source TypeScript framework designed for building and shipping AI-powered applications and agents with minimal friction. It supports the full lifecycle of agent development—from prototype to production. You can integrate it with frontend and backend stacks (e.g., React, Next.js, Node) or run agents as standalone services. If you’re a JavaScript or TypeScript developer looking to build an agentic or AI-powered product without starting from first principles, Mastra provides the scaffolding, tools, and integrations to accelerate that process.
CHAPTERS
00:00 – Intro & The Origin Story
02:09 – The Trend of Building Your Own Claude Code
03:11 – Demo
08:28 – No More Compaction
09:35 – Demo: Harness Primitive + Electrical Engineering Tool
14:08 – Getting Started

Feb 26, 2026 • 14min
AI NEWS: Stripe's Minions, Distillation Attacks on Claude, Cloudflare's Code Mode
Shane and Abhi break down the biggest AI news from the past few days.
Anthropic identified industrial-scale distillation attacks on Claude by DeepSeek, Moonshot AI, and MiniMax. Anthropic also released a groundbreaking report analyzing millions of AI agent interactions using Claude. Software engineering makes up ~50% of all agentic tool calls, but there's emerging use across back office automation, marketing, finance, and more. Claude Code Security launched in a limited research preview.
Stripe is shipping 1,300+ AI-generated PRs per week. And they're made by... Minions?
Code Mode for MCP is becoming a standard part of the MCP ecosystem. Amp claims the coding agent is dead. We also cover skills benchmarks (smaller models with high-quality skills beat raw Opus), trajectory explorer for agent traces, Vercel AI Gateway video support, Taalas custom silicon for AI inference, Google's Pomelli Photoshoot feature, and the GitHub Star Party pick: superpowers framework.
🔗 RESOURCES
Anthropic's distillation attack report: https://x.com/AnthropicAI/status/2025997928242811253
Anthropic's agent analysis: https://x.com/anthropicai/status/2024210053369385192
Claude Code Security: https://x.com/claudeai/status/2024907535145468326
Prompt auto-caching: https://x.com/RLanceMartin/status/2024573404888911886
Stripe minions Part 2: https://stripe.dev/blog/minions-stripes-one-shot-end-to-end-coding-agents-part-2
Code Mode for MCP: https://blog.cloudflare.com/code-mode-mcp/
Amp's statement: https://ampcode.com/news/the-coding-agent-is-dead
Skills benchmark: https://x.com/forloopcodes/status/2024819596809949341
Trajectory Explorer: https://x.com/benhylak/status/2024546696211083653
Vercel AI Gateway: https://x.com/rauchg/status/2024629864951927222
Taalas: https://x.com/taalas_inc/status/2024516399251456150
Google Pomelli Photoshoot: https://x.com/GoogleLabs/status/2024529795548102667
Superpowers: https://x.com/socialwithaayan/status/2025609619167670501
AI Agents Hour is a weekly livestream hosted by Mastra CPO Shane Thomas and CTO Abhi Aiyer. Airing Mondays at 12PM Pacific on YouTube and X, the show covers breaking AI news, agent development techniques, and features interviews with industry experts building AI applications today.
📚 MASTRA RESOURCES
Mastra: https://mastra.ai
Learn Mastra in the world's first MCP-Based Course: https://mastra.ai/course
Principles of Building AI Agents (Book): https://mastra.ai/book
Patterns for Building AI Agents (New Book): https://mastra.ai/blog/patterns-book https://docs.google.com/forms/d/e/1FAIpQLSduJjc515f6RZJqtkR2ByqJZrB0iP8B7SUKnjjZE9IajH_I8w/viewform
MASTRA?
Mastra is an open-source TypeScript framework designed for building and shipping AI-powered applications and agents with minimal friction. It supports the full lifecycle of agent development—from prototype to production. You can integrate it with frontend and backend stacks (e.g., React, Next.js, Node) or run agents as standalone services. If you’re a JavaScript or TypeScript developer looking to build an agentic or AI-powered product without starting from first principles, Mastra provides the scaffolding, tools, and integrations to accelerate that process.
CHAPTERS
00:00 – Intro
00:25 – Anthropic: Distillation Attacks by DeepSeek, Moonshot, MiniMax
01:36 – Anthropic's Agent Report: 50% Software Engineering, Other Domains Emerging
02:47 – Claude Code Security in Research Preview
03:34 – Prompt Auto-Caching with Claude
03:53 – Stripe Minions: 1,300 PRs/Week
05:39 – Code Mode for MCP (99.9% Token Reduction)
07:29 – Amp: "The Coding Agent Is Dead"
08:36 – Skills Beat Size: Smaller Models with High-Quality Skills
09:39 – Quick Hits: Trajectory Explorer, Vercel AI Gateway, Taalas, Google Photoshoot
12:00 – GitHub Star Party: Superpowers Framework


