Episodios

  • Deploy and fine-tune LLM models on Kubernetes using KAITO
    Aug 7 2024

    In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Sachi Desai, Product Manager and Paul Yu, Sr. Cloud Advocate at Microsoft to talk about the open source KAITO project. KAITO is the Kubernetes AI Toolchain Operator that enables AKS users to deploy open source LLM models on their Kubernetes clusters. They discuss how KAITO helps with running AI-enabled applications alongside the LLM models, how it helps users bring their own LLM models and run them as containers, and how KAITO helps them fine-tune open source LLMs on their Kubernetes clusters.

    Check out our website at https://kubernetesbytes.com/

    Cloud Native News:

    • https://azure.github.io/AKS/2024/07/30/azure-container-storage-ga
    • https://github.blog/news-insights/product-news/introducing-github-models/

    Show links:

    • Azure/kaito: Kubernetes AI Toolchain Operator - https://github.com/Azure/kaito/tree/main
    • https://www.youtube.com/watch?v=3cGmHDjR_3I&list=PLc3Ep462vVYtgN4rP1ThTJd2UlsBc2sou&index=2
    • https://aka.ms/cloudnative/learnlive/intelligent-apps-on-aks/episode-2
    • Jumpstart AI Workflows With Kubernetes AI Toolchain Operator - The New Stack - https://thenewstack.io/jumpstart-ai-workflows-with-kubernetes-ai-toolchain-operator
    • https://paulyu.dev/article/soaring-with-kaito/
    • Concepts - Fine-tuning language models for AI and machine learning workflows - Azure Kubernetes Service | Microsoft Learn - https://learn.microsoft.com/en-us/azure/aks/concepts-fine-tune-language-models
    • Keep up to date on the most recent announcements by following some of the KAITO engineers on LinkedIn:
    1. Fei Guo - https://www.linkedin.com/in/fei-guo-a48319a/
    2. Ishaan Sehgal - https://www.linkedin.com/in/ishaan-sehgal/

    Timestamps:

    • 00:02:15 Cloud Native News
    • 00:05:34 Interview with Sachi and Paul
    • 00:42:08 Key takeaways
    Más Menos
    44 m
  • The business case for cloud-native and Kubernetes
    Jul 26 2024

    In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Danielle Cook - VP of Marketing, appCD and Co-chair, CNCF Cartografos Working Group, CNCF. The discussion dives into how technical individual contributors can and should think about a business case for cloud native adoption. They talk about the cloud native maturity model and also discuss the different things business leaders care about.

    Check out our website at https://kubernetesbytes.com/

    Cloud Native News:

    • https://www.chainguard.dev/unchained/chainguard-series-c
    • https://www.cnbc.com/2024/07/23/google-wiz-deal-dead.html
    • https://www.redhat.com/en/blog/what-you-need-to-know-red-hat-openshift-416
    • https://blog.kubeflow.org/kubeflow-1.9-release/
    • https://talks.devopsdays.org/devopsdays-boston-2024/cfp

    Show links:

    • https://www.linkedin.com/in/danielle-cook-/
    • https://maturitymodel.cncf.io/
    • https://community.cncf.io/cncf-cartografos-working-group/
    • https://tag-app-delivery.cncf.io/whitepapers/platform-eng-maturity-model/

    Timestamps:

    • 00:01:36 Cloud Native News
    • 00:12:07 Interview with Danielle
    • 00:51:20 Key takeaways
    Más Menos
    54 m
  • Building the AI Hyperscaler with Kubernetes
    Jun 28 2024

    In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Brandon Jacobs, an Infrastructure architect at Coreweave. They discuss how Coreweave has adopted Kubernetes to build the AI hyperscaler. The discussion dives into details around how Coreweave handles Day 0 and Day 2 operations for AI labs that need access to GPUs. They also talk about lessons learnt and best practices for building a Kubernetes based cloud.

    Check out our website at https://kubernetesbytes.com/

    Episode Sponsor: Nethopper
    Learn more about KAOPS: nethopper.io
    For a supported-demo: info@nethopper.io
    Try the free version of KAOPS now!
    https://mynethopper.com/auth

    Cloud Native News:

    • https://siliconangle.com/2024/06/24/ollama-addresses-remote-execution-flaw-following-wiz-discovery/
    • https://siliconangle.com/2024/06/18/suse-acquires-kubernetes-observability-startup-stackstate/

    Show links:

    • https://www.linkedin.com/in/brandonrjacobs/
    • https://www.coreweave.com/

    Timestamps:

    • 00:01:39 Cloud Native News
    • 00:05:30 Interview with Brandon
    • 00:51:37 Key takeaways
    Más Menos
    55 m
  • Shifting Minds: Exploring OpenShift's AI Landscape
    Jun 14 2024

    Ryan Wallner and Bhavin Shah talk to Andy Grimes about the OpenShift AI Landscape.

    Check out our website at https://kubernetesbytes.com/

    Episode Sponsor: Nethopper

    • - Learn more about KAOPS: @nethopper.io
    • - For a supported-demo: info@nethopper.io
    • - Try the free version of KAOPS now! https://mynethopper.com/auth

    Links

    • - https://youtube.com/watch?v=nAT9U1vJ8x0
    • - https://www.theregister.com/2024/06/12/kubertenes_decade_anniversary/
    • - https://www.businesswire.com/news/home/20240606882860/en/Mirantis-Collaboration-with-Pure-Storage-Simplifies-Data-Management-with-Kubernetes
    • - https://falco.org/blog/falco-0-38-0/
    • - https://au.finance.yahoo.com/news/rancher-government-successfully-using-harvester-121100125.html
    • - https://www.youtube.com/@PlatformEngineering
    • - Video: https://www.youtube.com/watch?v=tZj8j3fdXy4
    • - Virtual Road Shows: https://www.redhat.com/en/north-america-red-hat-aws
    • - AWS Gameday August 22nd: TBS
    • - Boston Childrens Hospital RHOAI: https://www.redhat.com/en/creating-chris
    • - IBM Open Source AI https://www.youtube.com/watch?v=SuGedexBudQ&t=141s
    Más Menos
    1 h y 5 m
  • Training Machine Learning (ML) models on Kubernetes
    May 31 2024

    In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Bernie Wu, VP Strategic Partnerships and AI/CXL/Kubernetes Initiatives at Memverge. They discuss about how Kubernetes is the most popular platform to run AI model training and model inferencing jobs. The discussion dives into model training, talking about different phases of a DAG, and then talk about how Memverge can help users with efficient and cost-effective model checkpoints. The discussion goes into topics like saving costs by using spot instances, hot restart of training jobs, reclaiming unused GPU resources, etc.

    Check out our website at https://kubernetesbytes.com/

    Episode Sponsor: Nethopper

    • Learn more about KAOPS: @nethopper.io
    • For a supported-demo: info@nethopper.io
    • Try the free version of KAOPS now! https://mynethopper.com/auth

    Cloud Native News:

    • https://www.aquasec.com/blog/linguistic-lumberjack-understanding-cve-2024-4323-in-fluent-bit/
    • https://kubernetes.io/blog/2024/05/20/completing-cloud-provider-migration/
    • https://thenewstack.io/introducing-aks-automatic-managed-kubernetes-for-developers/
    • https://www.harness.io/blog/harness-to-acquire-split

    Show Links:

    • https://www.linkedin.com/in/berniewu/
    • https://criu.org/Main_Page
    • https://memverge.com/
    • https://youtu.be/tY8YOMRuqWI?si=yB3hHqLUpYPZ-KWN
    • https://youtu.be/ND4seSKpJHI?si=shh0iuA9qC-dO6eb

    Timestamps:

    • 01:04 Cloud Native News
    • 08:47 Interview with Bernie
    • 51:40 Key takeaways

    Más Menos
    55 m
  • The evolution of service mesh technologies
    May 17 2024

    In this episode of the Kubernetes Bytes podcast, Ryan and Bhavin talk to Christian Posta - VP and Global Field CTO at Solo.io about all things Service Mesh. They discuss how things have evolved from the early Linkerd days to sidecar less istio service mesh implementations. They also talk about how service mesh can help you connect to application components running outside Kubernetes, and how developers and platform engineers have a shared responsibility model when it comes to implementing service mesh using internal developer platforms.

    Check out our website at https://kubernetesbytes.com/

    Episode Sponsor: Nethopper
    Learn more about KAOPS: @nethopper.io
    For a supported-demo: info@nethopper.io
    Try the free version of KAOPS now! https://mynethopper.com/auth

    Cloud Native News:

    • https://loft.sh/blog/our-24m-series-a-led-by-khosla-ventures/
    • https://www.harness.io/blog/celebrating-150m-in-new-financing-to-accelerate-innovation
    • https://www.akamai.com/newsroom/press-release/akamai-announces-intent-to-acquire-api-security-company-noname
    • https://www.linkedin.com/posts/rouvenbesters_its-official-the-otomi-platform-has-activity-7194604616901120000-48g7?utm_source=share&utm_medium=member_desktop
    • https://www.wiz.io/blog/celebrating-our-1-billion-funding-round-and-12-billion-valuation

    Show Links:

    • https://devsummit.infoq.com/conference/boston2024
    • https://www.solo.io/topics/cakes-stack/
    • https://www.solo.io/

    Timestamps:

    • 00:06:10 Cloud Native News
    • 00:15:37 Interview with Torsten
    • 01:01:58 Key takeaways
    Más Menos
    1 h y 8 m
  • What are Vector Databases
    May 6 2024

    In this episode of the Kubernetes Bytes podcast, Ryan and Bhavin talk to Torsten Steinbach - VP, Chief Architect for Analytics & AI at EDB about all things Vector Databases, Postgres, and why Data is important for building AI platforms. The discussion dives into how vector databases are different than relational databases and why using Postgres extensions helps organizations use their existing data for AI applications.

    Check out our website at https://kubernetesbytes.com/

    Kubernetes Community Days (KCD) in New York City on May 22nd, use the promo code “KUBERNETESBYTES” to get a 10% discount on your registration fees!

    Episode Sponsor: Nethopper

    • Learn more about KAOPS: @nethopper.io
    • For a supported-demo: info@nethopper.io
    • Try the free version of KAOPS now!
    • https://mynethopper.com/auth

    Cloud Native News:

    • https://www.reuters.com/markets/deals/ibm-nearing-buyout-deal-hashicorp-wsj-reports-2024-04-23/
    • https://www.wiz.io/blog/wiz-acquires-gem-security-to-reinvent-threat-detection-in-the-cloud
    • https://techcrunch.com/2024/04/18/wiz-is-in-talks-to-buy-lacework-for-150-200m-security-firm-was-last-valued-at-8-3b/
    • https://www.prnewswire.com/news-releases/coreweave-secures-1-1-billion-in-series-c-funding-to-drive-the-next-generation-of-cloud-computing-for-the-future-of-ai-302133328.html
    • https://kubernetes.io/blog/2024/04/17/kubernetes-v1-30-release
    • https://dok.community/blog/become-a-data-on-kubernetes-in-2024-ambassador/

    Show Links:

    • https://www.enterprisedb.com/news/edb-acquires-splitgraph
    • https://www.enterprisedb.com/resources/events
    • https://www.enterprisedb.com/

    Timestamps:

    • 00:03:22 Cloud Native News
    • 00:17:45 Interview with Torsten
    • 00:57:00 Key takeaways
    Más Menos
    1 h y 3 m
  • KubeCon EU Paris News Recap
    Apr 16 2024

    Join Bhavin Shah and Ryan Wallner for a recap of announcments and news from KubeCon Paris 2024.

    Kubernetes Community Days (KCD) in New York City on May 22nd, use the promo code “KUBERNETESBYTES” to get a 10% discount on your registration fees!

    Nethopper

    • Learn more about KAOPS: @nethopper.io
    • For a supported-demo: info@nethopper.io
    • Try the free version of KAOPS now! https://mynethopper.com/auth

    News

    • https://about.gitlab.com/blog/2024/03/20/oxeye-joins-gitlab-to-advance-application-security-capabilities/
    • https://www.redhat.com/en/blog/unveiling-red-hat-openshift-415
    • https://developer.nvidia.com/blog/nvidia-nim-offers-optimized-inference-microservices-for-deploying-ai-models-at-scale/
    • https://www.acorn.io/resources/blog/our-new-focus-developing-an-llm-app-platform-based-on-gpt-script-technology?fromOther=true
    • https://loft.sh/blog/deliver-secure-kubernetes-multi-tenancy-with-new-vcluster-in-rancher-integration/
    • https://www.observeinc.com/blog/stepping-on-the-gas/
    • https://thenewstack.io/kubecost-2-2-covers-carbon-cost-monitoring-and-more/
    • https://thenewstack.io/ovhcloud-unveils-roadmap-to-take-on-hyperscalers-from-europe/
    • https://www.suse.com/c/meet-rancher-prime-3-0/
    • https://www.suse.com/c/suse-releases-edge-3-0-highly-validated-edge-optimized-stack/
    • https://www.fermyon.com/blog/introducing-spinkube-fermyon-platform-for-k8s
    • https://www.cncf.io/blog/2024/03/19/announcing-the-ai-working-groups-new-cloud-native-artificial-intelligence-whitepaper/
    • https://github.com/Azure/kaito
    • https://azure.microsoft.com/en-us/updates/public-preview-kubernetes-ai-toolchain-operator-kaito-addon-for-aks/
    • https://cloudnativenow.com/features/solo-io-delivers-on-cilium-support-promise-for-gloo-networks/
    • https://docs.solo.io/gloo-network/latest/about/overview/
    • https://github.com/kosmos-io/kosmos
    • https://gateway.envoyproxy.io/blog/2024/03/14/announcing-envoy-gateways-1.0-release/
    • https://newrelic.com/press-release/20240319
    • https://siliconangle.com/2024/03/29/aviatrix-revolutionizes-networking-security-distributed-cloud-firewall-kubernetes-kubeconeu/
    Más Menos
    48 m