adroc_thurston
Diamond Member
- Jul 2, 2023
- 4,714
- 6,503
- 96
PTX is Nvidia low-level IR ISA, lmao. SASS is like a level below, but it's unreadable garbage.Deepseek bypasses CUDA to use PTX close to metal programming that makes it far more efficient. Thus needing less HW to do the same thing. Thats what hit Nvidias stock.
DeepSeek's AI breakthrough bypasses industry-standard CUDA, uses Nvidia's assembly-like PTX programming instead
Dramatic optimizations do not come easy.www.tomshardware.com
Granted, nothing stops anyone from writing amdgcn kernels either, so.