How_to_optimize_in_GPU
Cuda
updated 4y ago
This is a series on GPU optimization that explains in detail how to optimize programs on the GPU. I will introduce several basic kernel optimizations, including elementwise, reduce, sgemv, and sgemm. The performance of these kernels is at or near the theoretical limit.