Discussion RDNA4 + CDNA3 Architectures Thread

Page 32 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,770
6,719
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

Frenetic Pony

Senior member
May 1, 2012
218
179
116
Top was supposedly this.

9 SEDs with 270CU in total, at least It supposedly contained 30CU per SED.
I have to wonder where were ROPs and frontend placed?
This. A: Could be entirely trash, very obviously. B. Might be referring to which arch they were going with, and nothing to do with models at all. The arch shown is an arch, not a specific configuration, RDNA4 could very easily have an entire range of MCM parts that look like RDNA3 instead of being even more chiplet heavy (a good decision given non existent high end packaging supply at the moment anyway)

Regardless, looking at it again, if they didn't want to majorly swap out the cache heirarchy (the L2 cache utilization is good right now) doubling the L1 cache size would be a good idea either way. It has a pretty poor hit rate across the board, and clocking 20% higher should keep latency down even with a bigger cache size.
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,601
3,160
136
This. A: Could be entirely trash, very obviously. B. Might be referring to which arch they were going with, and nothing to do with models at all. The arch shown is an arch, not a specific configuration, RDNA4 could very easily have an entire range of MCM parts that look like RDNA3 instead of being even more chiplet heavy (a good decision given non existent high end packaging supply at the moment anyway)
It looks pretty legit to me.
It could have been an actual product.

You can say that 270(9*30)CU is a serious overkill, but with RDNA3 they were losing pretty badly. To be on par with Full Ada102 they would have needed ~50% more CU, that's 144CU in total.
270CU for next generation would be 88% more. If they wanted to get the performance crown, then this amount is not that surprising.
Regardless, looking at it again, if they didn't want to majorly swap out the cache heirarchy (the L2 cache utilization is good right now) doubling the L1 cache size would be a good idea either way. It has a pretty poor hit rate across the board, and clocking 20% higher should keep latency down even with a bigger cache size.
L1 cache is part of CU(WGP), so you would have to clock them higher too, which would raise power consumption.
 
Reactions: Tlh97 and Joe NYC

adroc_thurston

Diamond Member
Jul 2, 2023
5,237
7,317
96
It could have been an actual product.
Oh it was.
You can say that 270(9*30)CU is a serious overkill
Kinda.
but with RDNA3 they were losing pretty badly
Because it clocks a whole lot less than it should.
If they wanted to get the performance crown, then this amount is not that surprising.
They really wanted to squash some bugs but not happening for obvious reasons.
L1 cache is part of CU(WGP)
A-ha.
Well you see.
(also he meant RDNA L1 aka SA-level read-only cache, not CU-private L0).
 

PJVol

Senior member
May 25, 2020
844
823
136
P.S. Why do so many of you like to link german sites. I don't speak german.
It's just that pcgh is the only site (from my memory) that publishes hwinfo screenshots, that is a big 👍 for me.
Also they recently started to put an aggregated performance index chart for the GPUs tested, that combines all resolutions and gaming scenarios including RT.
Google translates from german nice, btw.
 
Last edited:

TESKATLIPOKA

Platinum Member
May 1, 2020
2,601
3,160
136
It's just that pcgh is the only site (from my memory) that publishes hwinfo screenshots, that is a big 👍 for me.
Google translates from german nice, btw.
I wasn't being serious. Take It as a joke from someone who is working for a German company, but is just too lazy to learn the language.
And I don't know which one is what GPU part.
You have:
GPU Core Power (VDDCR_GFX)
GPU Memory Power (VDDIO)
GPU SoC Power (VDDCR_SOC)
GPU Memory Power (VDDCI_MEM)
GPU USR Power (VddCR_USR)
What I know is TGP and TBP power.
 

Tigerick

Senior member
Apr 1, 2022
715
667
106
@TESKATLIPOKA , @DisEnchantment

When I was referring RDNA3/4 difference, I meant the performance increase in rasterization which is pretty meh from past RDNA2 with the same amount of CU. That's why I don't expect any changes on raster performance which is mostly correct. What I am missing is new features upgrade, namely ray tracing performance. As shown on Cyberpunk 2077, AMD managed to increase ray-tracing performance by almost 50% with RX7700XT which is pretty good, but still not up to nVidia level.

With new Shader ISA gfx12 and possible 3rd generation RT cores, if AMD manages to pump up additional 50% fps on ray tracing, then AMD might be reaching NV's level if the amount of CU/SM is in parity...I would say this is the biggest uncertainty on N43/N44, we shall see...

I am more convinced that N43 going to have 40CU with 12GB GDDR6 support, along with N44 which should feature 32CU and 8GB GDDR6 support. Here comes the table with AMD GPU lineup for 2024:- Pending confirmation. PS: I also throw in old RDNA2 and upcoming Strix Halo for comparison:-


ModelSRPCodenameProcessTDPDie Size
(mm2)
Shader ISART CoresCUICMemoryMemory BWTPU Perf
RX 6650 XT$399RDNA2 N23N7176 W237GFX10321st Gen3232MB128-bit GDDR6280.3 GB/s- 1%
RX 7600$269RDNA3 N33N6165 W204GFX11022nd Gen3232MB128-bit GDDR6288 GB/sBase
RX 7600 XT ?$299?RDNA4 N44N4P??GFX12?3rd Gen3232MB128-bit GDDR6288 GB/s?
RX 6750 XT$549RDNA2 N22N7250 W335GFX10311st Gen4096MB192-bit GDDR6432 GB/s+ 19%
RX 7700 ?$399?RDNA4 N43N4P??GFX12?3rd Gen4048MB192-bit GDDR6432 GB/s?
Strix HaloNARDNA3.5N3E??GFX115??4032MB256-bit LPDDR5X-8533272 GB/s?
RX 7700 XT$449RDNA3 N32N5 + N6245 W346GFX11012nd Gen5448MB192-bit GDDR6432 GB/s+ 38%
RX 7800 XT$499RDNA3 N32N5 + N6263 W346GFX11012nd Gen6064MB256-bit GDDR6620.8 GB/s+ 67%
 
Last edited:
Reactions: Tlh97

TESKATLIPOKA

Platinum Member
May 1, 2020
2,601
3,160
136
@TESKATLIPOKA , @DisEnchantment

When I was referring RDNA3/4 difference, I meant the performance increase in rasterization which is pretty meh from past RDNA2 with the same amount of CU. That's why I don't expect any changes on raster performance which is mostly correct. What I am missing is new features upgrade, namely ray tracing performance. As shown on Cyberpunk 2077, AMD managed to increase ray-tracing performance by almost 50% with RX7700XT which is pretty good, but still not up to nVidia level.

With new Shader ISA gfx12 and possible 3rd generation RT cores, if AMD manages to pump up additional 50% fps on ray tracing, then AMD might be reaching NV's level if the amount of CU/SM is in parity...I would say this is the biggest uncertainty on N43/N44, we shall see...

I am more convinced that N43 going to have 40CU with 12GB GDDR6 support, along with N44 which should feature 32CU and 8GB GDDR6 support. Here comes the table with AMD GPU lineup for 2024:- Pending confirmation. PS: I also throw in old RDNA2 and upcoming Strix Halo for comparison:-


ModelSRPCodenameProcessTDPDie Size
(mm2)
Shader ISART CoresCUICMemoryMemory BWTPU Perf
RX 6650 XT$399RDNA2 N23N7176 W237GFX10321st Gen3232MB128-bit GDDR6280.3 GB/s- 1%
RX 7600$269RDNA3 N33N6165 W204GFX11022nd Gen3232MB128-bit GDDR6288 GB/sBase
RX 7600 XT ??RDNA4 N44N4P??GFX12 ?3rd Gen ?32?128-bit GDDR6??
RX 6750 XT$549RDNA2 N22N7250 W335GFX10311st Gen4096MB192-bit GDDR6432 GB/s+ 19%
RX 7700 ??RDNA4 N43N4P??GFX12 ?3rd Gen ?40?192-bit GDDR6??
Strix HaloNARDNA3.5N3E??GFX115??4032MB256-bit LPDDR5X-8533272 GB/s?
RX 7700 XT$449RDNA3 N32N5 + N6245 W346GFX11012nd Gen5448MB192-bit GDDR6432 GB/s+ 38%
RX 7800 XT$499RDNA3 N32N5 + N6263 W346GFX11012nd Gen6064MB256-bit GDDR6620.8 GB/s+ 67%
Those specs are not very likely. If we say the smaller one is 32CU, then the bigger one should be 48CU or there is no point in giving It 50% more BW.

Then there is still the question about performance. It has to have significantly higher performance than N33 and N22, or what's the point in releasing It? Just to sell It dirt cheap so someone will buy It?

I wouldn't be surprised for RDNA4 to use N3E, If even Strix Halo IOD will use It.

Raytracing even If on par with Nvidia with these specs won't be that useful. RTX 4060TI 16GB manages only 45FPS and even that only at 1080p, at 1440p It's only 27fps in Cyberpunk.
 
Last edited:
Reactions: Tlh97 and Ranulf

Tigerick

Senior member
Apr 1, 2022
715
667
106
Those specs are not very likely. If we say the smaller one is 32CU, then the bigger one should be at least 48CU or there is no point in giving It 50% more BW.

Then there is still the question about performance. It has to have significantly higher performance than N33 and N22, or what's the point in releasing It? Just to sell It dirt cheap so someone will buy It?

I wouldn't be surprised for RDNA4 to use N3E, If even Strix Halo IOD will use It.

Raytracing even If on par with Nvidia with these specs won't be that useful. RTX 4060TI 16GB manages only 45FPS and even that only at 1080p, at 1440p It's only 27fps in Cyberpunk.
Ha, I know you won't believe it. Let me ask you:

1. Why did AMD name full die of N33 as RX7600, not 7600XT?

2. Why did AMD name cut-down version of N32 as 7700XT, not 7800 or 7700?

Something to think about....
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,601
3,160
136
Ha, I know you won't believe it. Let me ask you:

1. Why did AMD name full die of N33 as RX7600, not 7600XT?

2. Why did AMD name cut-down version of N32 as 7700XT, not 7800 or 7700?

Something to think about....
Do you really think AMD will release two RDNA4 chips and then call It as RX 7600XT and 7700 or 7800? What about the cut down versions of them?

BTW, AMD already uses 7600M XT and 7700S in mobile for N33, so using a very similar name for a different architecture looks even less likely.

Just something to think about....
 

Tuna-Fish

Golden Member
Mar 4, 2011
1,601
2,321
136
Here comes the table with AMD GPU lineup for 2024:- Pending confirmation.

Two notes:

1. The model names will be 8000-series (or some new system), even if that means lots of rebranded gpu models. The model names are not driven by common sense or by AMD, but by OEMs, and they want a shiny new model name for their new products, regardless of what it's slapped on.
2. 24Gbit modules should be commonplace next year, all the memory fabs are sampling them. So, 128-bit bus can support up to 12GB, and 192bit bus can support up to 18GB.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |