Implementing High-performance GEMM on GPUs

The posts are available on Zhihu: one for Ampere GPUs (link), one for Hopper GPUs (link), and one on how to calculate the number of stages for a multi-stage pipeline (link).