C++ Software Pipelining Template for overlapping TMA and GEMM operations on the NVIDIA Hopper architecture. Presented at the Colfax Seminar by a team member.
2805 Bowers Ave, Santa Clara, CA 95051 | 408-730-2275
research@colfax-intl.com
C++ Software Pipelining Template for overlapping TMA and GEMM operations on the NVIDIA Hopper architecture. Presented at the Colfax Seminar by a team member.