Discussion RDNA4 + CDNA3 Architectures Thread

Page 254 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,754
6,631
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

Keller_TT

Member
Jun 2, 2024
113
112
76
I think I know exactly what to expect after reality has dawned. There is enough proper CES reporting info and new leaks ballpark.

A 250-260W 9070 and 300W 9070 XT MBA cards. (AMD could do a deceptive 295W naming)
Performance: 9070 a 7900GRE Pulse in raster, catching up to 4070S in RT, <5% better in some cases.
9070 XT MBA: 4070 TiS ± 5% across the board. AIBs can crank up perf by 10% for even inferior efficiency.

A few odd titles favoring AMD like CoD will be shown as uplift ballpark and it'll be a Zen 5% like launch affair after all the hype.

Does Lisa have the will to shake off Cheap & Inefficient tagline of AMD GPUs?
 
Reactions: Gideon

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,503
96
A 250-260W 9070 and 300W 9070 XT MBA cards. (AMD could do a deceptive 295W naming)
Performance: 9070 a 7900GRE Pulse in raster, catching up to 4070S in RT, <5% better in some cases.
9070 XT MBA: 4070 TiS ± 5% across the board. AIBs can crank up perf by 10% for even inferior efficiency
All of this is wrong.
The full die is 7900XTX perf, the chop is 7900XT. That's it. Both MBA, idk the wattage.
They're boring but good parts. Pricing will decide their fate more or less.
Does Lisa have the will to shake off Cheap & Inefficient tagline of AMD GPUs
No she personally killed Navi40/41/42.
 

SolidQ

Golden Member
Jul 13, 2023
1,068
1,457
96
The full die is 7900XTX perf, the chop is 7900XT.
12GB n48 is 7800XT, but for like 330$?

RDNA5 is gfx13 aka more good more better.
This time N52 or N51 max? But no N50


9070 XT MBA: 4070 TiS ± 5% across the board.
You saw already worst case scenarion for AMD in Wukong near 4080s, in Cyber RT 4070S. Normal scenario would be better
 

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,503
96
So maybe +10% clocks and +20% density?
N4 2-2 to N3 2-1 should be that or a bit more I guess? Don't remember TSM stats for N3e + N3p cumulatively.
AMD needs 3D cache under GPU
ain't happening.
hope RDNA5 will get it sorted
dead. Colossals are Very Dead. Only the simplest monodies ever on the roadmap now.
if full die real XTX, that mean RDNA5 like xx60 clas = XTX, xx70 class = 4090, xx80 class above, that usually happens with every tier on next cards
It doesn't mean anything since cost per xtor stopped declining.
 
Reactions: Tlh97 and Win2012R2

ToTTenTranz

Senior member
Feb 4, 2021
278
522
136
It doesn't mean anything since cost per xtor stopped declining.

It may (or should) not be this way forever. The only real reasons for this are TSMC having few top-of-the-line fabs doing the highest-end processes, Intel and Samsung not being able to compete at those processes, and AI chips with massive markups taking up such a huge portion of the high-end fab capacity.


It's not like all these factors are bound to stagnate forever. TSMC has the "we need to stay relevant to the world or China invades" factor but still they're building fabs for high-end processes outside Taiwan. Intel and Samsung aren't sitting idle, and AI hardware sounds increasingly like a bubble.


It's not like TSMC has enormously increased costs for their newer processes, e.g. ASML isn't charging them 20x more for their 2nm scanners than they did for the 28nm ones and the costs for their scanners are also paid off after a couple months into full production.
This is all supply and demand caused by AI demand and lack of competition over TSMC's high-end fabs.



It's both real and.. not new? AMD confirmed they were also waiting for Nvidia to play their hand about a week ago.
 

Keller_TT

Member
Jun 2, 2024
113
112
76
You saw already worst case scenarion for AMD in Wukong near 4080s, in Cyber RT 4070S. Normal scenario would be better
Wukong is software RT that rewards raster. Besides, that was AIB OC card by the look of it (330W)
I really don't care about Cyberpunk. Not a game in our house and it's sponsored by and optimized for Nvidia to show off DLSS, RT. All lighting gimmicks are meh as the setting is all artificial and can just as well be like "realism" of using another set of lights and time of day. It looks more than punk enough on PS5.

I'll judge performance by Avatar FoP, Plague's Tale Requiem, RDR2, Indiana Jones, Rachet & Clank, F1 24, sim racing titles in VR without glitches.
 

soresu

Diamond Member
Dec 19, 2014
3,501
2,782
136
So they preserved CUDA and added the DLSS lockin
CUDA is their entire compute platform that keeps the industry locked into their hardware, tensor cores just being the newest domain specific addition to that.

The compute silicon they call "CUDA cores" is the ALUs used for shading and other assorted GPU compute in games.

If nVidia had gone down AMD's path and bifurcated their GPU development with a CDNA equivalent lineage then they probably would have kept DLSS on CUDA cores if they ever made it.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |