

Kubernetes Bytes
Ryan Wallner & Bhavin Shah
Kubernetes Bytes is a podcast bringing you the latest from the world of cloud-native data management. Hosts Ryan Wallner and Bhavin Shah come to you from Boston, Massachusetts, with extensive backgrounds in cloud-native tech. They share their thoughts on recent cloud-native news and talk to industry experts about their experiences and challenges managing the wealth of data in today's cloud-native ecosystem.
Episodes

Sep 5, 2024 • 53min
Running Ray on Kubernetes with KubeRay
In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Kai-Hsun Chen, Software Engineer at Anyscale and maintainer of the KubeRay project. The discussion focuses on how the open source Ray project can help organizations use a single tool for data prep, model training, fine-tuning, and model serving workflows, for both their predictive AI and generative AI models. The discussion also dives into the KubeRay project and how it provides three different Kubernetes CRDs for data scientists to deploy Ray clusters on demand.
Check out our website at https://kubernetesbytes.com/
Cloud Native News:
https://azure.github.io/AKS/2024/08/23/fine-tuning-language-models-with-kaito
https://orca.security/resources/blog/kubernetes-testing-environment/
https://www.redhat.com/en/about/press-releases/red-hat-openstack-services-openshift-now-generally-available
Show links:
Kai's LinkedIn: https://www.linkedin.com/in/kaihsun1996/
KubeRay doc: https://docs.ray.io/en/latest/cluster/kubernetes/index.html
Ray Summit registration: https://raysummit.anyscale.com/flow/anyscale/raysummit2024/reg/createaccount (code: KaiHsunC15)
KubeRay repository: https://github.com/ray-project/kuberay
Ray repository: https://github.com/ray-project/ray
Ray Slack workspace: https://docs.google.com/forms/d/e/1FAIpQLSfAcoiLCHOguOm8e7Jnn-JJdZaCxPGjgVCvFijHB5PLaQLeig/viewform
Timestamps:
00:02:40 Cloud Native News
00:07:20 Interview with Kai
00:49:15 Key takeaways

Aug 22, 2024 • 1h 2min
Building scalable data platforms using Data on EKS
In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Alex Lines and Vara Bonthu from AWS to talk about the Data on EKS project. The discussion dives into why AWS decided to build the Data on EKS project and provide patterns that EKS customers can use to deploy data platforms, machine learning, and GenAI tools on EKS clusters. They talk about what's included and what's not included with each of these patterns, and what's coming down the line.
Check out our website at https://kubernetesbytes.com/
Cloud Native News:
https://kubernetes.io/blog/2024/08/16/kubernetes-1-31-prevent-persistentvolume-leaks-when-deleting-out-of-order/
https://kubernetes.io/blog/2024/08/16/matchlabelkeys-podaffinity/
https://kubernetes.io/blog/2024/08/15/kubernetes-1-31-volume-attributes-class/
https://roadmap.vcluster.com/changelog/vcluster-v020-ga
https://www.cloudbees.com/blog/cloudbees-acquires-launchable-to-enable-development-teams-to-iterate-faster?Product=Launchable&Tag=Blog%2CAI%2CLaunchable%20Update
Show links:
https://awslabs.github.io/data-on-eks/
https://www.youtube.com/watch?v=G9aNXEu_a8k
https://github.com/awslabs/data-on-eks
https://www.linkedin.com/in/alex-lines-aws/
https://www.linkedin.com/in/varaprofile/
Timestamps:
00:01:45 Cloud Native News
00:12:15 Interview with Alex and Vara
00:58:21 Key takeaways

Aug 7, 2024 • 44min
Deploy and fine-tune LLM models on Kubernetes using KAITO
Sachi Desai, a Product Manager specializing in AI technologies, and Paul Yu, a Senior Cloud Advocate at Microsoft, dive into the KAITO project for deploying open source LLMs on Kubernetes. They discuss how KAITO simplifies running AI applications alongside LLMs and enables users to bring and fine-tune their own models. The conversation highlights techniques like LoRA and QLoRA for efficient model fine-tuning. They also emphasize the role of community engagement in improving AI model deployment and shaping future capabilities.

Jul 26, 2024 • 54min
The business case for cloud-native and Kubernetes
Danielle Cook, VP of Marketing at appCD and Co-chair of the CNCF Cartografos Working Group, dives into the business case for adopting cloud-native technologies. She discusses how technical contributors can align their strategies with business goals, emphasizing growth and efficiency. The conversation explores the Cloud Native Maturity Model, addressing the challenges organizations face during adoption. Danielle also highlights the importance of effective communication tailored for different executive roles to build a compelling case for modernization.

Jun 28, 2024 • 55min
Building the AI Hyperscaler with Kubernetes
Brandon Jacobs, Infrastructure Architect at CoreWeave, discusses how CoreWeave uses Kubernetes to build an AI hyperscaler. The conversation covers managing Day 0 and Day 2 operations for AI labs, lessons learned, and best practices for a Kubernetes-based cloud. Topics include leveraging bare metal Kubernetes for GPU workloads, storage options for AI labs, observability, monitoring, handling CVEs, and customer cluster support.

Jun 14, 2024 • 1h 5min
Shifting Minds: Exploring OpenShift's AI Landscape
Andy Grimes discusses OpenShift's AI Landscape with insights on MLOps, AI model development, and accelerating model deployment using OpenShift AI. The discussion covers local experimentation, governance tools for AI, and the collaboration between IBM and Red Hat in the AI space.

May 31, 2024 • 55min
Training Machine Learning (ML) models on Kubernetes
Bernie Wu from MemVerge discusses training ML models on Kubernetes, including cost-saving tips with spot instances, efficient model checkpoints, hot restarts, and reclaiming GPU resources. The conversation delves into topics like DAG phases, transparent checkpointing, and GPU snapshotting for AI workloads.

May 17, 2024 • 1h 8min
The evolution of service mesh technologies
Christian Posta, VP and Global Field CTO at Solo.io, discusses the evolution of service mesh technologies from Linkerd to Istio implementations, including connecting application components outside Kubernetes. The conversation explores the shared responsibilities between developers and platform engineers, and the use of internal developer platforms for service mesh. Topics include on-prem and cloud flexibility, challenges in cloud-based development, and the importance of automation and observability in infrastructure architectures.

May 6, 2024 • 1h 3min
What are Vector Databases
In this episode of the Kubernetes Bytes podcast, Ryan and Bhavin talk to Torsten Steinbach, VP, Chief Architect for Analytics & AI at EDB, about all things vector databases, Postgres, and why data is important for building AI platforms. The discussion dives into how vector databases are different from relational databases and why using Postgres extensions helps organizations use their existing data for AI applications.
Check out our website at https://kubernetesbytes.com/
For Kubernetes Community Days (KCD) in New York City on May 22nd, use the promo code “KUBERNETESBYTES” to get a 10% discount on your registration fees!
Episode Sponsor: Nethopper
Learn more about KAOPS: @nethopper.io
For a supported demo: info@nethopper.io
Try the free version of KAOPS now! https://mynethopper.com/auth
Cloud Native News:
https://www.reuters.com/markets/deals/ibm-nearing-buyout-deal-hashicorp-wsj-reports-2024-04-23/
https://www.wiz.io/blog/wiz-acquires-gem-security-to-reinvent-threat-detection-in-the-cloud
https://techcrunch.com/2024/04/18/wiz-is-in-talks-to-buy-lacework-for-150-200m-security-firm-was-last-valued-at-8-3b/
https://www.prnewswire.com/news-releases/coreweave-secures-1-1-billion-in-series-c-funding-to-drive-the-next-generation-of-cloud-computing-for-the-future-of-ai-302133328.html
https://kubernetes.io/blog/2024/04/17/kubernetes-v1-30-release
https://dok.community/blog/become-a-data-on-kubernetes-in-2024-ambassador/
Show Links:
https://www.enterprisedb.com/news/edb-acquires-splitgraph
https://www.enterprisedb.com/resources/events
https://www.enterprisedb.com/
Timestamps:
00:03:22 Cloud Native News
00:17:45 Interview with Torsten
00:57:00 Key takeaways

Apr 16, 2024 • 48min
KubeCon EU Paris News Recap
Join Bhavin Shah and Ryan Wallner for a recap of announcements and news from KubeCon Paris 2024.
For Kubernetes Community Days (KCD) in New York City on May 22nd, use the promo code “KUBERNETESBYTES” to get a 10% discount on your registration fees!
Nethopper
Learn more about KAOPS: @nethopper.io
For a supported demo: info@nethopper.io
Try the free version of KAOPS now! https://mynethopper.com/auth
News:
https://about.gitlab.com/blog/2024/03/20/oxeye-joins-gitlab-to-advance-application-security-capabilities/
https://www.redhat.com/en/blog/unveiling-red-hat-openshift-415
https://developer.nvidia.com/blog/nvidia-nim-offers-optimized-inference-microservices-for-deploying-ai-models-at-scale/
https://www.acorn.io/resources/blog/our-new-focus-developing-an-llm-app-platform-based-on-gpt-script-technology?fromOther=true
https://loft.sh/blog/deliver-secure-kubernetes-multi-tenancy-with-new-vcluster-in-rancher-integration/
https://www.observeinc.com/blog/stepping-on-the-gas/
https://thenewstack.io/kubecost-2-2-covers-carbon-cost-monitoring-and-more/
https://thenewstack.io/ovhcloud-unveils-roadmap-to-take-on-hyperscalers-from-europe/
https://www.suse.com/c/meet-rancher-prime-3-0/
https://www.suse.com/c/suse-releases-edge-3-0-highly-validated-edge-optimized-stack/
https://www.fermyon.com/blog/introducing-spinkube-fermyon-platform-for-k8s
https://www.cncf.io/blog/2024/03/19/announcing-the-ai-working-groups-new-cloud-native-artificial-intelligence-whitepaper/
https://github.com/Azure/kaito
https://azure.microsoft.com/en-us/updates/public-preview-kubernetes-ai-toolchain-operator-kaito-addon-for-aks/
https://cloudnativenow.com/features/solo-io-delivers-on-cilium-support-promise-for-gloo-networks/
https://docs.solo.io/gloo-network/latest/about/overview/
https://github.com/kosmos-io/kosmos
https://gateway.envoyproxy.io/blog/2024/03/14/announcing-envoy-gateways-1.0-release/
https://newrelic.com/press-release/20240319
https://siliconangle.com/2024/03/29/aviatrix-revolutionizes-networking-security-distributed-cloud-firewall-kubernetes-kubeconeu/


