Discussion RDNA4 + CDNA3 Architectures Thread


DisEnchantment

Golden Member
Mar 3, 2017
1,747
6,598
136





With the GFX940 patches in full swing since the first week of March, it looks like MI300 is not far off!
Usually AMD takes around three quarters to get support into LLVM and amdgpu. Lately, since RDNA2, the window in which they push support for new devices has been much shorter, to prevent leaks.
But judging by the flurry of code in LLVM, there are a lot of commits. Maybe that is because the US Govt is starting to prepare the SW environment for El Capitan (maybe to avoid a slow bring-up like Frontier's, for example).

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in the LLVM review chains (before things get merged to GitHub), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time as Hopper, probably only a couple of months later!
Although I believe Hopper had the problem that no host CPU capable of PCIe 5 would be available in the very near future, so it might have been pushed back a bit until SPR and Genoa arrive later in 2022.
If PVC slips again, I believe MI300 could launch before it.

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 

branch_suggestion

Senior member
Aug 4, 2023
391
869
96
Depends on your definition of midrange these days, lol.

And not much has leaked on Blackwell. So while I am very pessimistic about the gen-on-gen perf improvement outside of the $$$ stuff, it is too early to be definitive about it.
Competitive can mean many things. FWIW, I certainly think RDNA4 will be cheaper to make for a given performance tier.
N4P vs N3E makes that straightforward. Coming out first is its biggest advantage overall.
 

branch_suggestion

Senior member
Aug 4, 2023
391
869
96
Nothing stops NV from making mainstream GB20x on N4p.
Very true, but NV hasn't bothered dual-sourcing nodes for a single client uarch gen since Pascal, and those were two very similar nodes. They are usually happy to mix in the previous gen to balance it out. Then again, the 700 and 600 series did do this to a large degree.
Considering DC demand for N3E, you are probably right.
RDNA4 should still hold the performance/area edge on similar nodes, so I stand by my prediction.
 

beginner99

Diamond Member
Jun 2, 2009
5,231
1,605
136
This is exactly what I don't get.

We have the canned super complex multi compute part at the halo end. We have the small monolithic dies at the low and middle end. What happened to the upper middle and high end parts? Why were they canned?
I assume everything canned was chiplet-based. This includes a GPU with just one "compute" chiplet, or however it would be organized, i.e. just the minimal config. If they can't get this to work at all, all products using the tech, from the minimal config to the monster GPU, are canned.
And even if the "single chiplet" version worked, it is probably not worth investing in the "base cost" for just 1 or 2 SKUs.

EDIT: the not-working part could also be entirely due to driver/software. If the driver team says "we are not sure we can get it working by then", they have no other option than to delay or can the products.
 

SteinFG

Senior member
Dec 29, 2021
642
773
106
Please say that we are getting an RDNA3 high end refresh. 7950XTX. Please. I beseech you!
The V/f curve is still shit without re-making the die. 3D-stacking the MCDs for a part that barely beats the 4080 isn't worth it. They can put faster VRAM on it, and nothing else really. I doubt that'll help though, as its bandwidth is already reaching 4090 levels.
 
Jul 27, 2020
19,915
13,644
146
The V/f curve is still shit without re-making the die.
I'm sure there are some engineering tweaks they could do. Or maybe the node characteristics have improved over time, letting them get less leaky dies. Or they might even have been collecting golden dies for a later 7950XTX launch, dies that not only clock higher but also leak less, consuming less power and running cooler.

If RDNA4 isn't going to target the high end, they would still want something at the top to make the extra margins. If the top RDNA4 part gets something like 85% of 7900XTX performance, they would want a halo part that is at least 25% faster than it, to get the gamers with deeper pockets to pay maybe 35% more money for 25% more performance.
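
To put rough numbers on that, here is a quick back-of-the-envelope sketch. It assumes the 25% is measured against the top RDNA4 part and that the 35% price premium is also relative to that part; the variable names are made up for illustration, and none of these figures are leaks or benchmarks.

```python
# Hypothetical figures taken from the post above, not measured data.
rdna4_vs_xtx = 0.85      # top RDNA4 part at ~85% of 7900XTX performance
halo_uplift = 1.25       # halo part 25% faster than the top RDNA4 part (assumed baseline)
halo_price_ratio = 1.35  # halo part priced ~35% above the top RDNA4 part (assumed baseline)

# Where the halo part would have to land relative to today's 7900XTX.
halo_vs_xtx = rdna4_vs_xtx * halo_uplift

# Performance per unit of money for the halo part, relative to the RDNA4 part.
perf_per_dollar = halo_uplift / halo_price_ratio

print(f"halo vs 7900XTX: {halo_vs_xtx:.4f}x")
print(f"halo perf/$ vs top RDNA4: {perf_per_dollar:.3f}x")
```

Under those assumptions the halo part would only need to be about 6% faster than a 7900XTX, and its buyers would be accepting roughly 7% worse performance per unit of money than the RDNA4 part.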
 

linkgoron

Platinum Member
Mar 9, 2005
2,408
977
136

Tigerick

Senior member
Apr 1, 2022
695
602
106
Btw, gfx1200 and gfx1201 are listed under RDNA3 arch in llvm processors

Wow, nice find. I think it is pretty much confirmed RDNA4 GPUs are based on RDNA3 architecture, even though they use the gfx1200 series. As I said, N43 and N44 are just a die shrink of RDNA3; there won't be much change in core design, and we should hear more about 7600XT and 7700 at CES 2024.
 

PJVol

Senior member
May 25, 2020
703
631
136
RDNA2 was also listed in the same group as RDNA1 before they split it up later.
That may well be. There's one more place, though, where RDNA 1 and 2 aren't split, and where RDNA 3 and 4 are likewise described under the same MachineModel scheduling definition:
Code:
//===----------------------------------------------------------------------===//
// GCN GFX10.
//===----------------------------------------------------------------------===//

def : ProcessorModel<"gfx1010", GFX10SpeedModel,
  FeatureISAVersion10_1_0.Features
>;

def : ProcessorModel<"gfx1011", GFX10SpeedModel,
  FeatureISAVersion10_1_1.Features
>;

def : ProcessorModel<"gfx1012", GFX10SpeedModel,
  FeatureISAVersion10_1_2.Features
>;

def : ProcessorModel<"gfx1013", GFX10SpeedModel,
  FeatureISAVersion10_1_3.Features
>;

def : ProcessorModel<"gfx1030", GFX10SpeedModel,
  FeatureISAVersion10_3_0.Features
>;

def : ProcessorModel<"gfx1031", GFX10SpeedModel,
  FeatureISAVersion10_3_0.Features
>;

def : ProcessorModel<"gfx1032", GFX10SpeedModel,
  FeatureISAVersion10_3_0.Features
>;

def : ProcessorModel<"gfx1033", GFX10SpeedModel,
  FeatureISAVersion10_3_0.Features
>;

def : ProcessorModel<"gfx1034", GFX10SpeedModel,
  FeatureISAVersion10_3_0.Features
>;

def : ProcessorModel<"gfx1035", GFX10SpeedModel,
  FeatureISAVersion10_3_0.Features
>;

def : ProcessorModel<"gfx1036", GFX10SpeedModel,
  FeatureISAVersion10_3_0.Features
>;

//===----------------------------------------------------------------------===//
// GCN GFX11.
//===----------------------------------------------------------------------===//

def : ProcessorModel<"gfx1100", GFX11SpeedModel,
  FeatureISAVersion11_0_0.Features
>;

def : ProcessorModel<"gfx1101", GFX11SpeedModel,
  FeatureISAVersion11_0_1.Features
>;

def : ProcessorModel<"gfx1102", GFX11SpeedModel,
  FeatureISAVersion11_0_2.Features
>;

def : ProcessorModel<"gfx1103", GFX11SpeedModel,
  FeatureISAVersion11_0_3.Features
>;

def : ProcessorModel<"gfx1150", GFX11SpeedModel,
  FeatureISAVersion11_5_0.Features
>;

def : ProcessorModel<"gfx1151", GFX11SpeedModel,
  FeatureISAVersion11_5_1.Features
>;

//===----------------------------------------------------------------------===//
// GCN GFX12.
//===----------------------------------------------------------------------===//

def : ProcessorModel<"gfx1200", GFX11SpeedModel,
  FeatureISAVersion12.Features
>;

def : ProcessorModel<"gfx1201", GFX11SpeedModel,
  FeatureISAVersion12.Features
>;
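
For anyone who wants to check this kind of grouping without eyeballing the file, here is a minimal Python sketch that pulls the processor name, scheduling model, and feature set out of `ProcessorModel` entries and groups them by scheduling model. The regex and the `group_by_sched_model` helper are my own illustration, not anything from LLVM, and the input is only a short excerpt of the definitions quoted above.

```python
import re
from collections import defaultdict

# Short excerpt of the definitions quoted above (not the full file).
TD_EXCERPT = '''
def : ProcessorModel<"gfx1036", GFX10SpeedModel,
  FeatureISAVersion10_3_0.Features
>;

def : ProcessorModel<"gfx1100", GFX11SpeedModel,
  FeatureISAVersion11_0_0.Features
>;

def : ProcessorModel<"gfx1151", GFX11SpeedModel,
  FeatureISAVersion11_5_1.Features
>;

def : ProcessorModel<"gfx1200", GFX11SpeedModel,
  FeatureISAVersion12.Features
>;

def : ProcessorModel<"gfx1201", GFX11SpeedModel,
  FeatureISAVersion12.Features
>;
'''

# Captures: processor name, scheduling model, feature-set record name.
# Illustrative pattern only; it assumes the three-argument layout shown above.
PATTERN = re.compile(r'ProcessorModel<"(\w+)",\s*(\w+),\s*(\w+)\.Features')


def group_by_sched_model(td_text):
    """Map each scheduling model to its (processor, feature set) pairs."""
    groups = defaultdict(list)
    for name, sched_model, features in PATTERN.findall(td_text):
        groups[sched_model].append((name, features))
    return dict(groups)


groups = group_by_sched_model(TD_EXCERPT)
for model, procs in groups.items():
    print(model, procs)
```

On this excerpt, gfx1200 and gfx1201 land in the `GFX11SpeedModel` group next to the gfx11xx parts while pointing at their own `FeatureISAVersion12` feature set. That only shows what the file says at this commit: a shared scheduling model in the compiler says nothing definite about how similar the hardware is.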
 

adroc_thurston

Diamond Member
Jul 2, 2023
3,549
5,116
96
Btw, gfx1200 and gfx1201 are listed under RDNA3 arch in llvm processors
That's a stub.
I think it is pretty much confirmed RDNA4 GPUs are based on RDNA3 architecture
They're nothing alike.
As I said, N43 and N44 are just a die shrink of RDNA3
They're not even called that.
So it IS very possible that 7950 XT and XTX are inbound!
They don't exist, much like X3D N31 does not exist.
 