

Beyond The Pilot: Enterprise AI in Action
VentureBeat
AI gets real here. On “Beyond the Pilot,” top business execs share what actually happens after the AI proof of concept — from infrastructure and org design to wins, failures, and ROI. Not theory, but deep dives into how they scaled AI that works.
Episodes
Mentioned books

Apr 1, 2026 • 38min
100M Agents: Scaling the New Execution stack with Intuit
A QuickBooks customer discovered significant fraud by asking their AI assistant follow-up questions about transaction amounts that didn't add up. This isn't a demo — it's one of 3 million customers now using Intuit's AI agents in production, with 80.5% returning to use them again.
Marianna Tessel, EVP and GM of QuickBooks (formerly CTO of Intuit), walks through the architecture decisions behind one of the first enterprise AI deployments at true scale. Intuit's "done-for-you" agents now automate book closing, reconciliation, transaction categorization, and payroll — but the breakthrough came when they realized chatbots alone weren't enough. Businesses wanted human experts integrated directly into AI workflows, creating what Intuit calls the "AI + HI" model (artificial intelligence + human intelligence). The results: invoices paid 5 days faster, 90% more paid in full, 30% reduction in manual work, and 62% of users reporting bookkeeping is easier.
Tessel reveals the technical evolution: moving from monolithic agents to a dynamic orchestration layer that routes queries across multiple LLMs (including Intuit's proprietary FinLM built on open-source), 24,000 bank connections, and 600,000 customer attributes. The system now handles proactive anomaly detection, benchmarking against similar businesses, and even nascent vibe coding — all without requiring users to understand they're essentially programming workflows through natural language. She also addresses the "SaaS apocalypse" narrative head-on, explaining why QuickBooks saw 18% growth last quarter while competitors faced market pressure: durable data advantages and customer trust in financial accuracy matter more than ever when AI enters the mix.
For enterprise builders navigating agent architecture, data grounding, and human-in-the-loop design, this is a rare look inside a working system serving millions.
🎙️ GUEST: Marianna Tessel | EVP & GM, QuickBooks (Intuit)
🎙️ HOSTS: Matt Marshall | VentureBeat, Sam Witteveen | VentureBeat
00:00 Intro — Customer discovers fraud using QuickBooks AI
03:26 Intuit Intelligence: Agents, BI, and human expertise integration
05:20 First-time AI users and going beyond chatbots
08:02 How Intuit decides which workflows to automate
10:16 Sponsor: Outshift by Cisco
10:38 Human-in-the-loop: When to insert experts vs. full automation
13:00 The AI + HI model: Why customers want human verification
15:24 Human expertise as confidence layer, not just AI check
16:14 Proprietary data advantage: 24K bank connections, 600K attributes
18:39 Benchmarking: "Businesses like me" — using aggregate data for competitive insights
19:52 First-party vs. third-party data strategy
21:38 Addressing the "SaaS apocalypse" narrative — why Intuit grew 18% last quarter
24:39 Proactive AI: Anomaly detection for marketing expense spikes
25:20 Builder perspective: Leaning on LLM orchestration, not use-case-by-use-case builds
27:32 Architecture evolution: From monolithic agents to dynamic tools and skills
29:10 Composite UX: Chat side-by-side with traditional workflows
30:35 Multi-model strategy: Genos platform, FinLM, and model routing
31:16 Vibe coding and actions: Letting users automate without realizing they're coding
32:47 Personalization wave: Memory, persistence, and user-defined workflows
35:08 Docker background and primitives that survive disruption
36:00 Open Claw and agent automation: Real revolution or risky experimentation?
#EnterpriseAI #AIAgents #QuickBooks #Intuit #LLMOrchestration #AgenticAI
Presented by Outshift by Cisco Outshift is Cisco’s emerging tech incubation engine and driver of Agentic AI, quantum, and next-gen infrastructure. Learn more at outshift.cisco.com.
About VentureBeat: VentureBeat equips enterprise technology leaders with the clearest, expert guidance on AI – and on the data and security foundations that turn it into working reality.
🔗 CONNECT WITH US
Subscribe to our Newsletters for technical breakdowns: https://venturebeat.com/newsletters
Visit VentureBeat: Venturebeat.com
.
.
.
Subscribe to VentureBeat:
/ @VentureBeat
.
.
Subscribe to the full podcast here:
Apple: https://podcasts.apple.com/us/podcast/venturebeat/id1839285239
Spotify: https://open.spotify.com/show/4Zti73yb4hmiTNa7pEYls4
YouTube: https://www.youtube.com/VentureBeat
Learn more about your ad choices. Visit megaphone.fm/adchoices

14 snips
Mar 18, 2026 • 21min
The AI War For Your Personal Context
They unpack how personalized AI agents are reshaping enterprise software and shifting market power. They explore Microsoft’s Copilot move and the changing build-vs-buy decisions. The conversation highlights Zoom’s journey from templates to user-controlled skills and the race to own personal context. Identity, token budgets, multi-user risks, and the rise of agent-centric architectures are examined.

24 snips
Mar 4, 2026 • 56min
LangChain: What OpenClaw Got Right (And Why Enterprises Can't Have It)
Harrison Chase, Co-founder and CEO of LangChain, leads the team behind the LangChain library and agent frameworks. He explains why OpenClaw succeeded where AutoGPT did not. He breaks down Deep Agents: planning, to-do lists, subagents, file systems, and large system prompts. He outlines LangGraph, LangChain, and harness choices, and why context engineering matters for reliable agent loops.

Feb 18, 2026 • 36min
LexisNexis on Why Standard RAG Fails in Law
Min Chen, Chief AI Officer at LexisNexis and longtime leader in legal ML, explains why standard RAG breaks for law and how GraphRAG and point-of-law graphs provide authoritative grounding. She describes their 8-part Usefulness Score, agentic workflows like Planner and Reflection agents, and deterministic checks for hallucination detection. Practical, execution-focused breakdowns of deploying AI in zero-error legal settings.

19 snips
Feb 4, 2026 • 56min
Mastercard's 160 Billion Transactions: AI's Biggest Test
Chris Mertz, SVP of Data Science at Mastercard who builds large-scale ML systems, and Johan Gerber, EVP of Security Solutions with a law-enforcement background, pull back the curtain on running AI at massive scale. They discuss scoring 160 billion transactions under a 50ms limit, an RNN “inverse recommender,” GenAI honeypots that bait scammers, and org and cloud choices that made it all possible.

Jan 21, 2026 • 41min
Inside LinkedIn’s AI Engineering Playbook
Erran Berger, VP of Product Engineering at LinkedIn who led distilling large LLMs into ultra-efficient production models. He reveals how LinkedIn distilled 7B models down to 600M students, the multi-teacher split for policy vs. clicks, synthetic GPT-4 golden datasets, and the 10x latency savings from pruning, quantization, and context compression. He also explains the org shift to eval-first product design.

Jan 7, 2026 • 1h 3min
Most enterprise AI agents are Slop - here’s why they fail
Amjad Masad, CEO of Replit and a visionary in programming accessibility, unpacks the $10 trillion potential of enterprise AI in a fascinating discussion. He critiques the majority of AI agents as 'Slop,' explaining their generic failures and the need for quality inputs. Masad introduces 'Vibe Coding,' where non-tech experts shape software, and reveals Replit's 'Computer Use' hack, making agent development cheaper and faster. He warns leaders to ditch rigid roadmaps in favor of rapid iteration to thrive in the evolving AI landscape.

5 snips
Dec 17, 2025 • 46min
How JPMorgan Engineered a 30K AI Agent Economy
Inside the 'Agent Economy': How 30,000 AI Assistants Took Over JPMorgan
While most enterprises were scrambling after ChatGPT launched, JPMorgan Chase was already two years ahead. 🚀
In this episode of Beyond the Pilot, we sit down with Derek Waldron, Chief Analytics Officer at JPMorgan Chase, to reveal how the world’s largest bank built an internal AI platform that is now used by 1 in 2 employees daily.
Derek shares the contrarian insight that drove their strategy: AI models are commodities; the real moat is connectivity.
Learn how they scaled from zero to 250,000+ users, why they empowered employees to build 30,000+ of their own "Personal Agents," and how they are solving the data privacy challenge at an enterprise scale.
🔥 IN THIS EPISODE:
The "Super Intelligence" Thought Experiment: Why raw intelligence is useless without enterprise connectivity.
The Agent Economy: How JPM enabled non-technical staff to build 30,000 custom AI assistants.
The Adoption Playbook: How to break through the "30% wall" and get the majority of your workforce using AI.
Build vs. Buy: Why JPM built their own "LLM Suite" instead of waiting for vendors.
⏳ CHAPTERS:
00:00 - Introduction: The JPMorgan AI Story
01:45 - The 3 Core Principles Behind JPM’s Strategy
03:25 - The "Super Intelligence" Thought Experiment
05:00 - Data Privacy: Why JPM Doesn't Train Public Models
06:00 - Viral Adoption: From 0 to 250k Users
09:20 - Evolution of LLM Suite: From RAG to Ecosystem
14:00 - The "Moat" is Connectivity, Not the Model
23:00 - The Agent Economy: 30,000 Employee-Built Assistants
31:00 - Governance & Guardrails for AI Agents
33:00 - Crossing the Chasm: Getting to 60% Adoption
40:00 - The "Product" Mindset: Solving Business Problems First
42:30 - The Future: End-to-End Process Transformation
46:25 - The "Unsolved" Problem Derek Wants to Fix
🙏 SPECIAL THANKS TO OUR SPONSOR:
This episode is presented by Outshift by Cisco.
Learn more about their work on the Internet of Agents and the open-source Linux Foundation project:
🔗 https://www.agentcy.org
🎙️ GUEST:
Derek Waldron | Chief Analytics Officer, JPMorgan Chase
HOSTS:
Matt Marshall | VentureBeat
Sam Witteveen | VentureBeat
#EnterpriseAI #JPMorgan #GenerativeAI #AgenticAI #FinTech #ArtificialIntelligence #Innovation #BeyondThePilot
.
.
.
Subscribe to VentureBeat:
/ @VentureBeat
.
.
Subscribe to the full podcast here:
Apple: https://podcasts.apple.com/us/podcast/venturebeat/id1839285239
Spotify: https://open.spotify.com/show/4Zti73yb4hmiTNa7pEYls4
YouTube: https://www.youtube.com/VentureBeat
https://www.youtube.com/playlist?list=PLMQoSwszBxm5dCv2bdqGnJ0QAL9n7Ds4_
🔗 CONNECT WITH VENTUREBEAT:
Website: https://venturebeat.com
LinkedIn: https://www.linkedin.com/company/venturebeat
X (Twitter): https://twitter.com/VentureBeat
Newsletter: https://venturebeat.com/newsletters
Learn more about your ad choices. Visit megaphone.fm/adchoices

Dec 3, 2025 • 46min
How Booking.com Boosted Agent Accuracy 2x with Mini LLMs with Pranav Pathak
We built AI agents by accident... and it worked. 🤯
In this episode of VentureBeat’s Beyond the Pilot, we go inside the engineering brain of Booking.com with Pranav Pathak (Director of Product Machine Learning). Pranav reveals how they "stumbled" into agentic architectures before the term even existed, how a simple text box revealed a massive missed revenue opportunity (the "Hot Tub" story), and exactly how they stack LLMs, RAG, and Orchestrators to handle millions of travelers without breaking the bank.
If you are building Enterprise AI, this is the blueprint for moving from "cool demo" to production scale.
🚀 In this episode, we cover:
The "Hot Tub" Revelation: How free-text AI search exposed features customers desperately wanted but couldn't find.
Real ROI Metrics: How LLMs drove a 2x increase in topic detection accuracy and freed up 1.5x of agent bandwidth.
The Booking.com AI Stack: A full breakdown of their Orchestrator → Moderation → Agent → RAG architecture.
Latency vs. Intelligence: Why they don't use GPT-5 for everything and how they decide between small models and big brains.
The Memory Problem: How to build AI that remembers user preferences without being "creepy”.
00:00 Introduction to Agentic Architectures
00:30 Meet Pranav Pathak from Booking.com
01:24 Evolution of Travel Recommendations
03:41 Impact of Gen AI on Customer Service
07:29 Building an Effective AI Stack
10:32 Agentic Systems and Best Practices
13:45 Choosing Between Building and Buying AI Solutions
18:51 Evaluating AI Models for Business Use
24:10 Challenges in Human Evaluation
25:06 Recommendation System and Data Utilization
27:26 Innovations in Travel Search
29:04 Journey and Challenges in ML Integration
32:08 Managing Memory and User Data
38:07 Future of Travel Assistance
41:33 Advice for New AI Integrations
43:57 Final Thoughts and Farewell
🔗 LINKS & RESOURCES:
OutShift by Cisco (Sponsor): outshift.cisco.com
VentureBeat: www.venturebeat.com
#ArtificialIntelligence #GenAI #Bookingcom #MachineLearning #AgenticAI #LLM #TechPodcast #EnterpriseAI
.
.
.
Subscribe to VentureBeat:
/ @VentureBeat
.
.
Subscribe to the full podcast here:
Apple: https://podcasts.apple.com/us/podcast/venturebeat/id1839285239
Spotify: https://open.spotify.com/show/4Zti73yb4hmiTNa7pEYls4
YouTube: https://www.youtube.com/VentureBeat
Learn more about your ad choices. Visit megaphone.fm/adchoices

Nov 19, 2025 • 1h 29min
Beyond the Pilot - NOTION - Unpacking Ryan Nystrom's AI Journey: From Challenges to Custom Agents
In our inaugural episode, we sit down with Ryan Nystrom, leader of the AI team at Notion, to pull back the curtain on Notion 3.0. Ryan reveals the journey of integrating powerful AI agents into the productivity platform and draws fascinating parallels between the current AI era and the mobile revolution he witnessed at Instagram. He shares exclusive insights into the development challenges, the critical role of tools, context, and curation, and how custom agents are poised to reshape work. Plus, Ryan offers essential advice for any company diving into the AI space. Learn more about your ad choices. Visit megaphone.fm/adchoices


