Programming Massively Parallel Processors with CUDA

Stanford University

4.0 (1)
Technology

Virtually all semiconductor market domains, including PCs, game consoles, mobile handsets, servers, supercomputers, and networks, are converging to concurrent platforms. There are two important reasons for this trend. First, these concurrent processors can potentially offer more effective use of chip space and power than traditional monolithic microprocessors for many demanding applications. Second, an increasing number of applications that traditionally used Application Specific Integrated Circuits (ASICs) are now implemented with concurrent processors in order to improve functionality and reduce engineering cost. The real challenge is to develop applications software that effectively uses these concurrent processors to achieve efficiency and performance goals. The aim of this course is to provide students with knowledge and hands-on experience in developing applications software for processors with massively parallel computing resources. In general, we refer to a processor as massively parallel if it has the ability to complete more than 64 arithmetic operations per clock cycle. Many commercial offerings from NVIDIA, AMD, and Intel already offer such levels of concurrency. Effectively programming these processors will require in-depth knowledge about parallel programming principles, as well as the parallelism models, communication models, and resource limitations of these processors. The target audiences of the course are students who want to develop exciting applications for these processors, as well as those who want to develop programming tools and future implementations for these processors. Visit the CS193G companion website for course materials.

See All (16)

Virtually all semiconductor market domains, including PCs, game consoles, mobile handsets, servers, supercomputers, and networks, are converging to concurrent platforms. There are two important reasons for this trend. First, these concurrent processors can potentially offer more effective use of chip space and power than traditional monolithic microprocessors for many demanding applications. Second, an increasing number of applications that traditionally used Application Specific Integrated Circuits (ASICs) are now implemented with concurrent processors in order to improve functionality and reduce engineering cost. The real challenge is to develop applications software that effectively uses these concurrent processors to achieve efficiency and performance goals. The aim of this course is to provide students with knowledge and hands-on experience in developing applications software for processors with massively parallel computing resources. In general, we refer to a processor as massively parallel if it has the ability to complete more than 64 arithmetic operations per clock cycle. Many commercial offerings from NVIDIA, AMD, and Intel already offer such levels of concurrency. Effectively programming these processors will require in-depth knowledge about parallel programming principles, as well as the parallelism models, communication models, and resource limitations of these processors. The target audiences of the course are students who want to develop exciting applications for these processors, as well as those who want to develop programming tools and future implementations for these processors. Visit the CS193G companion website for course materials.

Creator

Stanford University
Episodes

16
Show Website

Programming Massively Parallel Processors with CUDA

Science

Science

Updated 06/12/2010
Technology

Technology

Updated 19/06/2015
Podcasts

Podcasts

Updated 29/10/2012
Podcasts

Podcasts

Updated 15/11/2012
Science

Science

Updated 25/04/2012
Technology

Technology

Updated 23/07/2008
Podcasts

Podcasts

Updated 09/04/2012

Programming Massively Parallel Processors with CUDA

16. Parallel Sorting (April 20, 2010)

15. Optimizing Parallel GPU Performance (May 20, 2010)

14. Path Planning System on the GPU (May 18, 2010)

6. Parallel Patterns I (April 15, 2010)

12. NVIDIA OptiX: Ray Tracing on the GPU (May 11, 2010)

13. Future of Throughput (May 13, 2010)

11. The Fermi Architecture (May 6, 2010)

10. Solving Partial Differential Equations with CUDA (May 4, 2010)

About

Information

More From Stanford

Programming Massively Parallel Processors with CUDA

Episodes

16. Parallel Sorting (April 20, 2010)

15. Optimizing Parallel GPU Performance (May 20, 2010)

14. Path Planning System on the GPU (May 18, 2010)

6. Parallel Patterns I (April 15, 2010)

12. NVIDIA OptiX: Ray Tracing on the GPU (May 11, 2010)

13. Future of Throughput (May 13, 2010)

11. The Fermi Architecture (May 6, 2010)

10. Solving Partial Differential Equations with CUDA (May 4, 2010)

About

Information

More From Stanford