2805 Bowers Ave, Santa Clara, CA 95051 | 408-730-2275
research@colfax-intl.com

About Us

Our team consists of mathematicians and scientists who bring formal academic training and deep analytical rigor to GPU kernel development. We have a demonstrated record of excellence across research, education, technical writing, and open-source contributions, pairing first principles reasoning about hardware, systems, and algorithms with hands-on kernel engineering.

Research papers

We publish research at the frontier of GPU performance and the mathematics that underpins it. Our publications include:

Lectures and talks

We share our methods publicly through technical lectures on leading platforms.

Open-source contributions

We have made foundational and ongoing contributions to the FlashAttention project headed by Tri Dao, and have also made contributions to vLLM and CUTLASS.

Blogs

We regularly publish highly respected blogs, often in the form of multi-part series, covering in detail architectural and performance engineering aspects of modern GPUs, with the goal of empowering developers to acquire skills from the ground up:

We also produce joint work with other industry leaders:

These blogs serve as useful primers for deep-dive training courses we offer via live, pre-recorded, and interactive formats. More details on these course offerings will be available soon.