Discussion RDNA4 + CDNA3 Architectures Thread

Page 211 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,754
6,631
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

gdansk

Diamond Member
Feb 8, 2011
3,768
6,017
136
It worked even better for ATi.
It's a tribal market. To lead the tribe you have to kill the previous chieftain.
When they had leading software support with ... D3D 9?
I guess if AMD can pull off leading software and leading hardware at the same time it would work. But I considered the first impossible.
 

insertcarehere

Senior member
Jan 17, 2013
703
694
136
Yeah but when everybody buys GeForce by default anyway, you will spend years and much effort to try to make Radeon good for those people and in the end you find that you sold 10 cards a quarter to them.

That's the sort of short term thinking that got AMD so far behind in the first place. Uni students that get exposed to GPU programming with consumer hardware (which is functionally CUDA stack at this point) become engineers who will have significant say, both explicit and implicit, on what hardware/software ecosystem they prefer to develop in. Nvidia 'making GeForce good for those people' is a big reason for their crushing lead in datacenter revenue right now.
 

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,503
96
Because DLSS/RT spam win
No, because they undersized the die.
That's the sort of short term thinking that got AMD so far behind in the first place. Uni students that get exposed to GPU programming with consumer hardware (which is functionally CUDA stack at this point) become engineers who will have significant say, both explicit and implicit, on what hardware/software ecosystem they prefer to develop in. Nvidia 'making GeForce good for those people' is a big reason for their crushing lead in datacenter revenue right now.
This narrative is a load of bull since CUDA as you know it is exclusively a Meta product.
All the actual GPGPU stuff went absolutely nowhere, right now GPUs in DC are just matrix accelerators.
 

gdansk

Diamond Member
Feb 8, 2011
3,768
6,017
136
That's the sort of short term thinking that got AMD so far behind in the first place. Uni students that get exposed to GPU programming with consumer hardware (which is functionally CUDA stack at this point) become engineers who will have significant say, both explicit and implicit, on what hardware/software ecosystem they prefer to develop in. Nvidia 'making GeForce good for those people' is a big reason for their crushing lead in datacenter revenue right now.
AMD had early ghetto solutions on consumer hardware. E.g. early btc miners were tuned for AMD hardware, not Nvidia, and used OpenCL. But that wasn't a runaway success like ML.

People always blame AMD for this but shouldn't Khronos/OpenCL bear a lot of the blame? If every hardware vendor has a hard time maximizing performance... maybe something is wrong with it.
 

insertcarehere

Senior member
Jan 17, 2013
703
694
136
AMD had early ghetto solutions on consumer hardware. E.g. early btc miners were tuned for AMD hardware, not Nvidia, and used OpenCL. But that wasn't a runaway success like ML.

People always blame AMD for this but shouldn't Khronos/OpenCL bear a lot of the blame? If every hardware vendor has a hard time maximizing performance... maybe something is wrong with it.
Khronos group is a non-profit consortium that is composed of such disparate members as Apple, IKEA, Nvidia, Valve, Epic Games, Amazon, and ARM. Blaming them for AMD's current lack of traction is asinine when thats not their remit in the first place.
 

gdansk

Diamond Member
Feb 8, 2011
3,768
6,017
136
Khronos group is a non-profit consortium that is composed of such disparate members as Apple, IKEA, Nvidia, Valve, Epic Games, Amazon, and ARM. Blaming them for AMD's current issues is asinine when that's never been anywhere near their remit in the first place.
Is it? That AMD would be better off ignoring the "open" standard is asinine. That's basically what all other arguments say: AMD should have made some alternative crap to compete with CUDA 15 years ago. It would have flopped even harder. AMD backed the wrong horse in 2008 but how was it supposed to know that trying to fix OpenCL 10 years ago would cause it to split and its creator to basically ignore it. But so it turned out to be.
 
Reactions: Tlh97

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,503
96
Yeah, i wonder why they didn't make a 80CU RDNA4 monolith size
Because that's close to N48 and everything above was tiled anyway.
AMD should have made some alternative crap to compete with CUDA 15 years ago. It would have flopped even harder.
Well they did, HSA was a thing.
Just that GPGPU was a meme, APUs for compute were even more a meme and AMD had a long ass 5 year gap between proper GPGPU offerings.
 

reaperrr3

Member
May 31, 2024
55
188
66
Yeah, i wonder why they didn't make a 80CU RDNA4 monolith size
Making N48 with 80 CU would've been a bit more work than exactly doubling N44, and would also be somewhat bandwidth-starved unless they upgraded either the IF$ or used GDDR7, both of which would've been cost drivers.

If you meant why they didn't make a bigger chip above N48 with N31-like or higher specs once chiplets were cancelled:
Time to market, mostly.

The SKUs above N48 were all meant to be chiplet-based, and by the time they cancelled those, making another, bigger monolith above N48 would've taken too long.
Might've been a different story if they cancelled chiplet-RDNA4 earlier, then we might've gotten an "N31 specs + 24 CUs" RDNA4-based monolith, which could've spanked GB203, at least.
Though knowing Jensen, if he got info about that early enough, he would've probably just made NV aim higher with GB203 then, too.
There were rumors about 112 and 96 SM until the 84 SM info arrived, they probably down-specced GB203 during development when it became apparent they wouldn't need that many SM to beat AMD.
 
Reactions: Tlh97

marees

Senior member
Apr 28, 2024
743
898
96
Making N48 with 80 CU would've been a bit more work than exactly doubling N44, and would also be somewhat bandwidth-starved unless they upgraded either the IF$ or used GDDR7, both of which would've been cost drivers.

If you meant why they didn't make a bigger chip above N48 with N31-like or higher specs once chiplets were cancelled:
Time to market, mostly.

The SKUs above N48 were all meant to be chiplet-based, and by the time they cancelled those, making another, bigger monolith above N48 would've taken too long.
Might've been a different story if they cancelled chiplet-RDNA4 earlier, then we might've gotten an "N31 specs + 24 CUs" RDNA4-based monolith, which could've spanked GB203, at least.
Though knowing Jensen, if he got info about that early enough, he would've probably just made NV aim higher with GB203 then, too.
There were rumors about 112 and 96 SM until the 84 SM info arrived, they probably down-specced GB203 during development when it became apparent they wouldn't need that many SM to beat AMD.
No chip above N48 makes sense only if RDNA 5 is releasing soon (like in 1 year's time)
 

inquiss

Senior member
Oct 13, 2010
267
383
136
No chip above N48 makes sense only if RDNA 5 is releasing soon (like in 1 year's time)
It makes sense not because rDNA 5 is coming soon (it's not) but because ROI on AI chips is bigger. Ditch the high end RDNA4 with uncertain ROI and move those people over to get a quicker cadence on AI chips with a (relatively) more certain ROI.
 

yuri69

Senior member
Jul 16, 2013
601
1,056
136
No chip above N48 makes sense only if RDNA 5 is releasing soon (like in 1 year's time)
The summer '24 leaked roadmap shows RDNA5 to simply keep the trend of RDNA4 - leaving the segment previously covered by Navi 21 and Navi 31 out.

For AMD it makes zero sense to invest $$$ to try to compete with nV at higher price points. Consumers simply buy GeForce, like they used to do for decades. The buying decision is quite simple when one buys a ~$1000 consumer dGPU - the consumer wants the full feature set with the "standard experience". That's always been GeForce.
 
Reactions: Tlh97

gaav87

Senior member
Apr 27, 2024
452
794
96
The summer '24 leaked roadmap shows RDNA5 to simply keep the trend of RDNA4 - leaving the segment previously covered by Navi 21 and Navi 31 out.

For AMD it makes zero sense to invest $$$ to try to compete with nV at higher price points. Consumers simply buy GeForce, like they used to do for decades. The buying decision is quite simple when one buys a ~$1000 consumer dGPU - the consumer wants the full feature set with the "standard experience". That's always been GeForce.
Leaked roadmap is from 1 year ago.
Rdna1 ->RDNA2 was 1 year 4 months gap not full 2 year cycle.
 
Reactions: Tlh97 and marees
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |