2805 Bowers Ave, Santa Clara, CA 95051 | 408-730-2275
research@colfax-intl.com

Author: Paul VanKoughnett

  • DeepSeek-R1 and FP8 Mixed-Precision Training

    DeepSeek-R1 and FP8 Mixed-Precision Training

    DeepSeek has shocked the world with the release of their reasoning model DeepSeek-R1. Similar to OpenAI’s o1 and Google Gemini’s Flash Thinking, the R1 model aims to improve the quality of its replies by generating a “chain of thought” before responding to a prompt. The excitement around R1 stems from it achieving parity with o1… Go to article…