Scaling AI Workloads with Kubernetes: Sharing GPU Resources Across Multiple Containers - Jack Ong

The Linux Foundation October 4, 2023
Video Thumbnail
The Linux Foundation Logo

The Linux Foundation

@linuxfoundationorg

About

The Linux Foundation is a nonprofit consortium dedicated to fostering the growth of Linux and collaborative software development. Founded in 2000, the organization sponsors the work of Linux creator Linus Torvalds and promotes, protects and advances the Linux operating system and collaborative software development by marshaling the resources of its members and the open source community. The Linux Foundation provides a neutral forum for collaboration and education by hosting Collaborative Projects, Linux conferences including LinuxCon, and generating original research and content that advances the understanding of Linux and collaborative software development. More information can be found at www.linuxfoundation.org.

Video Description

Scaling AI Workloads with Kubernetes: Sharing GPU Resources Across Multiple Containers - Jack Min Ong, Jina AI With the rise of AI and machine learning applications, GPU resources have become a critical bottleneck in scaling infrastructure to efficiently serve AI workloads. Kubernetes, an open-source container orchestration platform, provides a solution to this problem through the NVIDIA device plugin which allows multiple containers to share access to GPU devices. In this talk, we will explore how Kubernetes can be used to efficiently scale AI workloads by sharing GPU resources across multiple containers. We will discuss the challenges of GPU resource management, explore various techniques for optimizing GPU usage and set resource limits to ensure fair and efficient allocation of GPU resources among containers. By the end of this talk, attendees will have a solid understanding of how Kubernetes can be used to share GPU resources across multiple containers, allowing them to make the most of their GPU investments and achieve faster, more accurate results in their AI applications.

You May Also Like