DeepSeek bypasses CUDA and uses PTX, close-to-the-metal programming that makes it far more efficient, so it needs less hardware to do the same thing. That's what hit Nvidia's stock.
Dramatic optimizations do not come easy.
www.tomshardware.com
The exact opposite: Tom's Hardware should be ignored and laughed at.
NV's stock is down because their margins are going to revert to the mean. A lower barrier to entry and the increasing shift from training to inference mean NV faces more competition, and it will be harder for large customers to make an ROI on such massive CapEx investments.
Not to mention that models like this run inference nicely on 8-GPU boxes, with no need for a fancier switching setup unless you are trying to break through to a new frontier of model ability.
If the next wave of frontier models, which are far more expensive to train and run, does not offer a large enough leap in capability, NV will be forced to lower prices on future or existing orders, or risk orders being canceled and given to other IHVs or in-house silicon.
Their moat is in training; inference is a far more open game.
Oh, and they know this is inevitable: Cisco crashed when Broadcom et al. offered 90%+ of the capability for half the money.
Robotics is their next big bet to try to chase the eternal boom.
But NV is a boom-bust company; they cannot help themselves whenever they see a chance for a quick buck.