News

(1) We released the 50 diffusion steps model (instead of 1000 steps) which runs 20X faster with comparable results. (2) Calling CLIP just once and caching the result runs 2X faster for all models.
One of the most prominent claims in circulation is that DeepSeek V3 incurs a training cost of around $6 million.