KubeFM

Discover all the great things happening in the world of Kubernetes, learn (controversial) opinions from the experts and explore the successes (and failures) of running Kubernetes at scale.

Episodes

Mar 24, 2026 • 46min

Patroni Backups: when pgBackRest and Argo CD have your back (literally), with Ziv Yatzik

Your database backup strategy shouldn't be the thing that takes your production systems down.

Ziv Yatzik manages 600+ Postgres clusters in a closed network environment with no public cloud. After existing backup solutions proved unreliable, causing downtime when disks filled up, his team built a new architecture using pgBackRest, Argo CD, and Kubernetes CronJobs.

In this episode:

- Why storing WAL files on shared NAS storage prevents backup failures from cascading into database outages
- How GitOps with Argo CD lets them manage backups for hundreds of clusters by adding a single YAML file
- The Ansible + Kubernetes hybrid approach that keeps VM-based Patroni clusters in sync with Kubernetes-orchestrated backups

A practical blueprint for making database backups boring, reliable, and safe.

Sponsor

This episode is sponsored by LearnKube: get started on your Kubernetes journey through comprehensive online, in-person or remote training.

More info

Find all the links and info for this episode here: https://ku.bz/Rg_sQYSmw

Interested in sponsoring an episode? Learn more.

Jan 27, 2026 • 27min

Running a Full Kubernetes Cluster for $2 a Month, with Varnit Goyal

Varnit Goyal, a Senior Software Engineer who experiments with cloud-native tooling and home labs, describes building a full three-node Kubernetes cluster for $2.16/month using Rackspace Spot instances. He covers bidding and instance-choice strategies, multi-region workers and automated replacement, replacing paid load balancers with Tailscale Funnel, and running real workloads like Kafka and Postgres on a budget.

Jan 13, 2026 • 31min

We Broke Our EKS Cluster Autoscaler with the AL2023 Migration, with Dilshan Wijesooriya

Dilshan Wijesooriya, Senior Cloud Engineer, discusses a real incident where migrating EKS nodes to AL2023 caused the cluster autoscaler to silently lose its AWS permissions.

You will learn:

- Why AL2023 blocks pod access to instance metadata by default, breaking components that relied on node IAM roles (like cluster autoscaler, external-DNS, and AWS Load Balancer Controller)
- How to implement IRSA correctly by configuring IAM roles, Kubernetes service accounts, and OIDC trust relationships, and why both AWS IAM and Kubernetes RBAC must be configured independently
- The recommended migration strategy: move critical system components to IRSA before changing AMIs, test aggressively in non-production, and decouple identity changes from OS upgrades
- How to audit which pods currently rely on node roles and clean up legacy IAM permissions to reduce attack surface after migration

Sponsor

This episode is sponsored by LearnKube: get started on your Kubernetes journey through comprehensive online, in-person or remote training.

More info

Find all the links and info for this episode here: https://ku.bz/T_YPfTfDb

Interested in sponsoring an episode? Learn more.

GPU Containers as a Service, with Landon Clipp

How We Cut Build Debugging Time by 75% with AI, with Ron Matsliah

Migrating Kubernetes Off Big Cloud, with Fernando Duran

Migrating to Karpenter: Fun Stories, with Adhi Sutandi

From ECS to Kubernetes: A Real Migration Story, with Radosław Miernik

Faster EKS Node and Pod Startup, with Jan Ludvik

Kubernetes is not just for Black Friday, with Thibault Martin