GTC DC 2017

DC7112 - CUDA Optimization Tips, Tricks and Techniques

Session Speakers
  • Stephen Jones - Principal Software Engineer, NVIDIA

    Stephen Jones is a principal software engineer in the CUDA group at NVIDIA, working on making the CUDA language and programming model span the needs of parallel programming from high performance computing to artificial intelligence. Previously, Stephen led the simulation and analytics group at SpaceX, where he worked on various projects, including large-scale simulation of combustion processes in rocket engines. His background is in computational fluid mechanics and plasma physics, but he has worked in diverse industries, including networking, CAD/CAM, and scientific computing.

Session Description

Optimizing your code can be one of the most challenging tasks in GPU programming, but also one of the most rewarding: the performance difference between an initial version and well-tuned code can be a factor of 10 or more. Some optimizations are quite straightforward, while others require care and a deep understanding of how the code executes. A particular focus will be on optimizing the CPU part of your code, which is frequently overlooked even though it is often easier to tune and just as effective. Sometimes the biggest obstacle is just knowing what to look for, so we'll cover a range of techniques that everyone from beginners to CUDA ninjas might not have thought of before.

Additional Information
New Developer Tools, HPC and Supercomputing, AI for Accelerated Analytics
Software, General
50 minutes
Session Schedule