Asynchrony and CUDA Streams | CUDA C++ Class Part 2
About
No channel description available.
Video Description
Welcome to NVIDIA’s Modern CUDA C++ Programming Class. You will learn how to unlock the GPU’s full potential by using asynchrony and CUDA Streams. This series is for C++ developers who want to use the GPU effectively—whether you’re new to CUDA and want the fastest path from “hello world” to real acceleration, or you’re an experienced CUDA programmer ready to modernize your code with the latest best practices. If you already know C++ and want to write clean, efficient, idiomatic GPU code, this course is for you. This video is part of a broader playlist containing three videos. We advise you to start from the first video. 📝 Part 1: https://youtu.be/Sdjn9FOkhnA 📝 Part 3: https://youtu.be/kTWoGCSugB4 📝 Full Course: https://www.youtube.com/playlist?list=PL5B692fm6--vWLhYPqLcEu6RF3hXjEyJr ➡️ Link to the slides and Google Colab to run the exercise for free on the GPU: https://github.com/NVIDIA/accelerated-computing-hub/tree/main/tutorials/cuda-cpp For the DLI version, please visit: https://learn.nvidia.com/courses/course-detail?course_id=course-v1:DLI+S-AC-04+V2 📥 Link to download Nsight Systems locally: https://developer.nvidia.com/nsight-systems/get-started Chapters: 00:00:00 Introduction 00:00:22 Synchronous vs Asynchronous 00:08:32 Exercise Compute-IO Overlap 00:09:16 Solution Compute-IO Overlap 00:10:43 Nsight Systems 00:11:35 Exercise Nsight Systems 00:14:38 Solution Nsight Systems 00:17:01 NVTX 00:19:50 Exercise NVTX 00:20:22 Solution NVTX 00:21:19 Stream 00:35:42 Exercise Async Copy 00:36:20 Solution Async Copy 00:38:36 Pinned Memory 00:42:50 Exercise Copy Overlap 00:43:23 Solution Copy Overlap 00:44:21 Takeways
Boost CUDA C++ Learning
AI-recommended products based on this video

AocBook 15.6'' FHD Laptop, Intel N95, Nvidia GTX 1060 4GB, 32GB DDR4 RAM, M.2 SSD, Sleek Notebook with Type-C, HDMI, RJ45 Ethernet, Backlit Keyboard, Fingerprint (32GB DDR4 | 1TB SSD)

acer Nitro 50 N50-620-UA91 Gaming Desktop | 11th Gen Intel Core i5-11400F 6-Core Processor | NVIDIA GeForce GTX 1650 | 8GB DDR4 | 512GB NVMe M.2 SSD | Intel Wi-Fi 6 AX201 | Keyboard and Mouse

TEAMGROUP Elite DDR4 16GB Kit (2 x 8GB) 3200MHz PC4-25600 CL22 Unbuffered Non-ECC 1.2V SODIMM 260-Pin Laptop Notebook PC Computer Memory Module Ram Upgrade - TED416G3200C22DC-S01

ASUS TUF Gaming A15 Gaming Laptop, 15.6” 144Hz FHD Display, AMD Ryzen 5 7535HS Processor, GeForce RTX 2050, 8GB DDR5 RAM, 512GB PCIe SSD Gen 4, Wi-Fi 6, Windows 11, FA506NF-AS51-CA

ASUS TUF F16 16" WUXGA 165Hz Gaming Laptop, Intel i7-14650HX, NVIDIA GeForce RTX 5070 8GB, 16GB DDR5, 1TB SSD, Backlit Keyboard, Number Pad, IR Camera, Wi-Fi 6E, Win 11, Gray, 1TB Docking Station Set

Dell UltraSharp 24 Monitor - U2424H

Dell UltraSharp U2723QE 27" 4K UHD WLED LCD Monitor - 16:9 - Black, Silver EPEAT

Lenovo LOQ 15.6" FHD 144Hz Gaming Laptop, Intel Core i5-12450HX, NVIDIA GeForce RTX 2050, 16GB DDR5, 2TB Storage (1TB SSD+1TB Docking Station Set), Number Pad, 720p Camera, Wi-Fi 6, Win 11, Gray

MSI Ultra-Slim Thin 15 VR-Ready High FPS Gaming Laptop, 15.6 FHD 144Hz, Intel Core i5-13420H, NVIDIA GeForce RTX 4060, 32GB RAM, 2TB SSD, Backlit KB, Wi-Fi 6, Bundle with PCO Notebook Fold Radiator

acer Nitro 50 N50-620-UA91 Gaming Desktop | 11th Gen Intel Core i5-11400F 6-Core Processor | NVIDIA GeForce GTX 1650 | 8GB DDR4 | 512GB NVMe M.2 SSD | Intel Wi-Fi 6 AX201 | Keyboard and Mouse



















