11 Folgen

MacResearch.org is a an online community for scientsts using Apple hardware and software in their research.

MacResearch.org David W. Gohara, Ph.D.

    • Wissenschaft
    • 5,0 • 2 Bewertungen

MacResearch.org is a an online community for scientsts using Apple hardware and software in their research.

    Episode 6 - Questions and Answers (PDF)

    Episode 6 - Questions and Answers (PDF)

    In this episode we'll go over an example of real-world code that has been parallelized by porting to the GPU. The use of shared memory to improve performance is covered as well as a discussion of synchronization points for coordinated work within a work-group. Source code is provided.

    • video
    Episode 6 - Shared Memory Kernel Optimization (Video)

    Episode 6 - Shared Memory Kernel Optimization (Video)

    In this episode we'll go over an example of real-world code that has been parallelized by porting to the GPU. The use of shared memory to improve performance is covered as well as a discussion of synchronization points for coordinated work within a work-group. Source code is provided.

    • 49 Min.
    Episode 5 - Questions and Answers (PDF)

    Episode 5 - Questions and Answers (PDF)

    This episode covers questions hthat were generated from the previous podcast. We'll discuss GPU layout/terminology and bank conflicts resulting from shared memory access.

    • video
    Episode 5 - Questions and Answers (Video)

    Episode 5 - Questions and Answers (Video)

    This episode covers questions hthat were generated from the previous podcast. We'll discuss GPU layout/terminology and bank conflicts resulting from shared memory access.

    • 29 Min.
    Episode 4 - Memory Layout and Access (PDF)

    Episode 4 - Memory Layout and Access (PDF)

    In this episode we cover some questions regarding function calls from kernels and the use of clFinish. Also, we'll discuss basic GPU architecture, memory layout, shared memory. Thread blocks, warps and efficient data loading will also be discussed.

    • video
    Episode 4 - Memory Layout and Access (Video)

    Episode 4 - Memory Layout and Access (Video)

    In this episode we cover some questions regarding function calls from kernels and the use of clFinish. Also, we'll discuss basic GPU architecture, memory layout, shared memory. Thread blocks, warps and efficient data loading will also be discussed.

    • 56 Min.

Kundenrezensionen

5,0 von 5
2 Bewertungen

2 Bewertungen

TuncTaylan ,

Fast perfekt

Ein guter Einstieg ins Thema bringt er mit seinen Podcasts.

Einzige Macke; er spricht sehr leise, so dass man sich strengen muss, um ihn von den Lüftern seines Macs zu unterscheiden.

Top‑Podcasts in Wissenschaft

Aha! Zehn Minuten Alltags-Wissen
WELT
radioWissen
Bayerischer Rundfunk
Das Wissen | SWR
SWR
Methodisch inkorrekt!
Methodisch inkorrekt!
Sternengeschichten
Florian Freistetter
Quarks Daily
Quarks