Summary of "GPU‑Accelerated Workloads on KubeVirt: Scaling ML/AI in Kuberne... Amandeep Singh and Shivani Tiwari"

GPU‑Accelerated Workloads on KubeVirt: Scaling ML/AI in Kubernetes

The video titled “GPU‑Accelerated Workloads on KubeVirt: Scaling ML/AI in Kubernetes” features a lightning talk by Amandeep Singh (founder at Wellin and former senior data scientist at PayPal) and Shivani Tiwari (Developer Relations at Wellin). The session focuses on integrating GPU acceleration with KubeVirt to efficiently scale machine learning (ML) and artificial intelligence (AI) workloads in Kubernetes environments.


Key Technological Concepts and Product Features

1. Introduction to KubeVirt

2. Challenges with CPU and Containers for AI/ML Workloads

3. Role of GPUs in AI/ML

4. Enabling GPU Workloads on KubeVirt

5. Additional Tools and Monitoring

6. Challenges and Limitations


Summary of the Process to Enable GPU Acceleration with KubeVirt

  1. Install GPU device plugins on Kubernetes nodes.
  2. Ensure GPU drivers and necessary libraries are installed.
  3. Define GPU resource requests in VM YAML manifests.
  4. Launch VMs that are scheduled on GPU-enabled nodes.
  5. Enable GPU pass-through to VMs for direct hardware access.
  6. Run AI/ML workloads inside VMs leveraging GPU acceleration.

Review/Guide/Tutorial Elements


Main Speakers

The session was cut short but invited further detailed discussion in the KubeVirt community meetings.

Category ?

Technology

Share this summary

Featured Products

Video