Discussion RDNA4 + CDNA3 Architectures Thread

Page 181 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,749
6,614
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

gdansk

Diamond Member
Feb 8, 2011
3,276
5,186
136
Even if they match it on compute, they need to perform miracles to get the performance they need out of their narrow and slow memory interface.
Or use faster memory than rumoured
It wouldn't be the first time rumors are wrong (although rumors about AMD cheaping out on memory interface have an almost 100% track record...)
 

poke01

Platinum Member
Mar 8, 2022
2,584
3,411
106
I see only Nvidia taking GDDR7 shipments. who knows AMD did as well and we missed it.

we will see how GDDR7 impacts real world apps and games. But the bandwidth increase is really high.
 

ToTTenTranz

Member
Feb 4, 2021
182
313
106
Even if they match it on compute, they need to perform miracles to get the performance they need out of their narrow and slow memory interface.

Memory bandwidth requirements go down if the GPU relies on aggressive upscaling technologies (meaning lower base resolution). One such example is the PS5 Pro PSSR patches in games using lower base resolution than the original PS5 versions.


If FSR4 works as well as PSSR (and there's a good chance one is just a fork of the other), then we should get decent 4K output out of 1080p base resolution.
 

SolidQ

Senior member
Jul 13, 2023
593
747
96
2026? isn't that was rumor RTX 6xxx also 2026?
  • There is no RDNA5 code name. After RDNA4, it will be UDNA.
  • MI400 and RX9000 use the same UDNA, and the architecture uses an ALU design similar to GCN.
  • UDNA Gaming GPU is tentatively scheduled for mass production in Q2 26.
  • Sony's PS6 will use UDNA, and the CPU has not yet been determined to be ZEN4 or ZEN5. Sony's handheld will also use AMD hardware.
  • I heard that Microsoft's handheld will choose between Qualcomm and AMD? I don't know about this
  • The above information comes from the supply chain, and I don't know the specifications and performance.
 
Reactions: Tlh97 and blckgrffn

soresu

Diamond Member
Dec 19, 2014
3,323
2,599
136
MI400 and RX9000 use the same UDNA, and the architecture uses an ALU design similar to GCN.
Seems unlikely.

CDNA concentrated almost exclusively on improving specific aspects of the architecture around the new matrix units and AI/ML compute.

Doing this allowed them to keep CDNA1-4 more or less consistent in programming model

RDNA meanwhile has not simply added RT and changed to wave32.

It has evolved significantly at the base level of the µArch, and many of the changes in RDNA3 do not seem to favor gaming compute much at all.

In fact I'm inclined to think that RDNA3 was in fact setting the stage for bringing professional compute and gaming back together.

It would surprise me if GFX13 or whatever comes next is basically just GFX9 on steroids.
 

Gideon

Golden Member
Nov 27, 2007
1,842
4,380
136
It has evolved significantly at the base level of the µArch, and many of the changes in RDNA3 do not seem to favor gaming compute much at all.
This!

I find the dilemma whether it uses Zen 4 or Zen 5 equally odd. Why would it use Zen 4? By that point Zen 5 has been out for 2 years. If the want to skimp on area they can just use Zen 5c and cut it down further if needed, as they did with zen 2 in PS5 (128-bit FP unit only)
 
Last edited:

Gideon

Golden Member
Nov 27, 2007
1,842
4,380
136
MI400 and RX9000 use the same UDNA, and the architecture uses an ALU design similar to GCN.

A respected graphics engine developer also calls BS on this one.


I'll quote the whole thread (highlights mine):

Sebastian Aaltonen said:
AMD UDNA will be interesting.

CDNA3 architecture is still based on GCN 4 cycle wave64 scheduling. RDNA schedules every cycle and exposes instruction latency. Scheduler runs/blocks instructions concurrently/dynamically. RDNA is much closer to Nvidia GPUs.

Thread...



CDNA has wide matrix cores and other wide compute workload improvements, which AMD wants to bring to UDNA. It also has multi-chip scaling.

Rumors tell that RDNA4 will finally have matrix cores in consumer space. Seems that AMD is integrating matrix cores early to RDNA lineup.

My expectation is that UDNA compute unit will be RDNA4 descendant instead of CDNA3 descendant. They definitely need 1 cycle low latency scheduling in consumer space, and Nvidia does well with it in AI space too. I don't see them going back to GCN-style design for UDNA.

Integrating matrix cores to RDNA4 compute unit would be an iterative step towards UDNA. They could iterate that compute unit further to meet AI workload.

I would expect UDNA to borrow lot of CDNA3 tech for caches, memory controllers and connectivity with chiplets.

There's rumors that RDNA4 will be a small iterative improvement as they are ditching RDNA arch one year later. But i'd say it's entirely possible that RDNA4 is a an iterative step towards UDNA. First time they merge new RDNA ALU pipes with CDNA matrix units. Could be a big step.

It's important to consider the importance of AI for AMD. RDNA architecture was initially designed for AMDs most important market segment (gaming), while UDNA got the old GCN design. Now AI is most important segment for them. They want newest arch for professional AI chips too.

One might argue that simple GCN-style scheduling is still good for AI workloads, but lower latency scheduling can operate better on register pressure. And it's often better for caches to avoid running too wide workload on every compute unit. RDNA/Nvidia style arch is just better.

UDNA makes perfect sense. AMD today wants matrix units for AI based upscaling (and other client AI workloads) in consumer space. And they want their latest compute unit architecture for AI chips. AI is now a priority. GCN-style arch is outdated. Nvidia/RDNA-style is better.
 

soresu

Diamond Member
Dec 19, 2014
3,323
2,599
136
so basicaly:

CDNA4 = MI350
CDNA5/CDNA Next + RDNA5 = UDNA = MI400/new Radeon family
If the rumour is true and UDNA is coming in 2026 I'd say a domain specific (gfx focused) RDNA5 never existed in the first place (outside earlier conceptual or strategic roadmaps).

A 2026 release would mean it's probably been in the works for 2-3 years already.
 
Reactions: Tlh97 and Mopetar

blckgrffn

Diamond Member
May 1, 2003
9,343
3,514
136
www.teamjuchems.com
If the rumour is true and UDNA is coming in 2026 I'd say a domain specific (gfx focused) RDNA5 never existed in the first place (outside earlier conceptual or strategic roadmaps).

A 2026 release would mean it's probably been in the works for 2-3 years already.

Which would make it even easier to believe they would rather focus on that than all the permutations of RDNA4. It’ll be a pivotal architecture.
 
Reactions: Tlh97 and soresu

soresu

Diamond Member
Dec 19, 2014
3,323
2,599
136
Which would make it even easier to believe they would rather focus on that than all the permutations of RDNA4. It’ll be a pivotal architecture.
I had the same thought myself.

Domain specific architectures are an impediment to their market ambitions as far as ecosystem simplification goes, so the faster they progress to UDNA the better.

That being said I wish they would hurry up and release RDNA4 already.
 

marees

Senior member
Apr 28, 2024
578
639
96
I had the same thought myself.

Domain specific architectures are an impediment to their market ambitions as far as ecosystem simplification goes, so the faster they progress to UDNA the better.

That being said I wish they would hurry up and release RDNA4 already.
I think RDNA 4 being cut down to size was a combination of
  1. UDNA ambitions
  2. TSMC (packaging) constraints
  3. RDNA 3 being a misfire
All of this still doesn't explain why we don't have a monolithic Navi 49 that replaces Navi 31, unless there is a UDNA Navi 53 waiting in the wings that can do it
 
Reactions: Tlh97
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |