
Ioana Apetrei
Senior Product Manager at CAST AI working on AI Enabler, with 12 years of product experience across B2C and B2B products. Focuses on making open-source LLM deployment accessible and cost-effective for customers.
Best podcasts with Ioana Apetrei
Ranked by the Snipd community

44 snips
Feb 19, 2026 • 1h 6min
Serving LLMs in Production: Performance, Cost & Scale // CAST AI Roundtable
Igor Šušić, a founding ML engineer focused on large-scale inference and performance tuning, and Ioana Apetrei, a senior product manager building accessible, cost-effective LLM deployment, debate why deployments fail at scale. They cover model routing and the trade-off between cost and accuracy, and explain time-sharing GPUs, quantization, prefill vs decode separation, and when self-hosting or managed endpoints make sense.


