Tuning Virtualized GPUs for Optimal Performance on ML/AI Workloads

Presented by

Lan Vu, Senior Member of Technical Staff, VMware; Uday Kurkure, Staff Engineer, VMware; and Hari Sivaraman, Staff Engineer, VMware

About this talk

VMware and NVIDIA have partnered to democratize AI for every enterprise by combining NVIDIA AI software and GPUs with virtualization. We’ll present performance data for the two modes of operation that the Ampere architecture supports, using Multi-Instance GPU (MIG) and vGPU. Learn how to balance load and improve latency and throughput with vGPUs. VMware vSphere and NVIDIA AI Enterprise enable multi-GPU configurations with NVLink in a virtualized environment. We’ll demonstrate how NVLink with NVIDIA A100 GPUs can enhance the performance of machine learning/AI workloads in vSphere, and show how to scale the performance of NVIDIA GPUs with NVLink with the manageability and scalability features of Kubernetes container services and virtual machines.

NVIDIA & VMware Partnership Channel