11 episodes

MacResearch.org David W. Gohara, Ph.D.

- Science
- 5.0 • 6 Ratings

MacResearch.org is a an online community for scientsts using Apple hardware and software in their research.

- OCT 8, 2009
Episode 6 - Questions and Answers (PDF)

Episode 6 - Questions and Answers (PDF)

In this episode we'll go over an example of real-world code that has been parallelized by porting to the GPU. The use of shared memory to improve performance is covered as well as a discussion of synchronization points for coordinated work within a work-group. Source code is provided.
- OCT 8, 2009
- video
Episode 6 - Shared Memory Kernel Optimization (Video)

Episode 6 - Shared Memory Kernel Optimization (Video)

In this episode we'll go over an example of real-world code that has been parallelized by porting to the GPU. The use of shared memory to improve performance is covered as well as a discussion of synchronization points for coordinated work within a work-group. Source code is provided.
- 49 min
- SEP 25, 2009
Episode 5 - Questions and Answers (PDF)

Episode 5 - Questions and Answers (PDF)

This episode covers questions hthat were generated from the previous podcast. We'll discuss GPU layout/terminology and bank conflicts resulting from shared memory access.
- SEP 25, 2009
- video
Episode 5 - Questions and Answers (Video)

Episode 5 - Questions and Answers (Video)

This episode covers questions hthat were generated from the previous podcast. We'll discuss GPU layout/terminology and bank conflicts resulting from shared memory access.
- 29 min
- SEP 10, 2009
Episode 4 - Memory Layout and Access (PDF)

Episode 4 - Memory Layout and Access (PDF)

In this episode we cover some questions regarding function calls from kernels and the use of clFinish. Also, we'll discuss basic GPU architecture, memory layout, shared memory. Thread blocks, warps and efficient data loading will also be discussed.
- SEP 10, 2009
- video
Episode 4 - Memory Layout and Access (Video)

Episode 4 - Memory Layout and Access (Video)

In this episode we cover some questions regarding function calls from kernels and the use of clFinish. Also, we'll discuss basic GPU architecture, memory layout, shared memory. Thread blocks, warps and efficient data loading will also be discussed.
- 56 min

5.0 out of 5

6 Ratings

The best resource about opencl

Mark's clear understanding and experience with the sparkling new technology is a site to behold. Even though right now the six episodes have yet to clear up some things for me I suggest listening and viewing these expertly created presentation; there is simply nothing better for the opencl enthusiast.

Outstanding quality

The production quality, presentation, and content are all suprisingly excellent. These shows are a pleasure to watch, and extremely helpful for someone coming to OpenCL from a single-core background (or transitioning from CUDA).