C++ Software Pipelining Template for overlapping TMA and GEMM operations on the NVIDIA Hopper architecture. Presented at the Colfax Seminar by a team member.
Software Pipelining in the NVIDIA Hopper Architecture
Discover more from Colfax Research
Subscribe to get the latest posts sent to your email.