Discussion RDNA4 + CDNA3 Architectures Thread

Page 307 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,755
6,635
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

In2Photos

Platinum Member
Mar 21, 2007
2,355
2,493
136
Why would AMD waste this opportunity, though?

The only way they can get a meaningful advantage out of a 32GB card is if it's significantly cheaper than the 5090.

The idea here is to have a card sit above the 9070 XT that they can ask 300-500$ more for even though it's barely faster and only like 50$ more expensive to make, but takes away "local AI LLM that fit into 32GB" desktop marketshare from the 5090.
It doesn't matter how much vRAM it has, if it isn't faster than the 5090 it isn't going to take away from the 5090 market share. Nor the 5080 for that matter. At least not in any way that's meaningful.
 

reaperrr3

Member
May 31, 2024
66
216
66
When RDNA3 had it I thought it was desperate measure to make t**d look better than it is
You have to design these things into the chip/uArch long before even knowing whether a chip is going to be worse than planned, and usually you wouldn't waste resources on implementing a hardware feature if it's just an emergency marketing backup plan with no real benefit.
 

Win2012R2

Senior member
Dec 5, 2024
697
655
96
I think the point was that 10% at high end of clock/voltage curve is quite a lot.
Sure, my point was that it seems to me having desynced clocks was bringing in marginal benefits at best (at 10% ain't going to do loads) and it looks more desperate than planned - or perhaps more likely the other clock did not hit target level, which is why desyncing happened.
 

reaperrr3

Member
May 31, 2024
66
216
66
Sure, my point was that it seems to me having desynced clocks was bringing in marginal benefits at best (at 10% ain't going to do loads) and it looks more desperate than planned - or perhaps more likely the other clock did not hit target level, which is why desyncing happened.
Well, my point was you can't desync clocks like that unless you put the frontend into its own clock domain during the chip/uarch design phase, in other words at a time when they were still thinking RDNA3 would do much better than it ended up doing.
Which they wouldn't do if there was no benefit to it, as an extra clock domain isn't free in terms of area or design cost.
 

Timorous

Golden Member
Oct 27, 2008
1,815
3,442
136
Is it even sensible to have separate clock domain when the difference in clocks is expected to be fairly low, is it mainly to save power on idle rather than perf thing?

How do we know at design time the difference in clocks was meant to be low? If RDNA 3 was designed to be very high speed then it stands to reason the expected delta between the domains was greater than it ended up being.
 
Reactions: marees
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |