Discussion RDNA4 + CDNA3 Architectures Thread

Page 31 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,770
6,720
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

DisEnchantment

Golden Member
Mar 3, 2017
1,770
6,720
136
I find only 40CU unlikely. Just to be on par with N32, It would need 50% higher clock.
Well if the supposed 5700XT tier RDNA4 rumor is true then it wont be on par with N32.
If N43 is indeed a 40CU GPU, then it makes sense based on current product lineups. N43 is not designed for replacing current N32, it is going to replace 6700XT/6750XT 12GB and I believe it can only support 192-bit memory bus with 12GB GDDR6 as well. And I believe it will be made by TSMC's N4P process with monolithic design, not N3E as people hope so.

As for pricing, @adroc_thurston has hinted it would cost $399 at best which make sense with smaller die size and monolithic design. I am hoping AMD would go lower like $349 with OC version cost up to $399. There is going to have some overlapping between N43 (likely going to model as Radeon RX7700 non-XT series) and 7700XT especially with OC version of 7700; but I believe AMD would focus more on RX7800XT based on current pricing.
If N43 is indeed a 40CU GPU, then it makes sense based on current product lineups. N43 is not designed for replacing current N32, it is going to replace 6700XT/6750XT 12GB and I believe it can only support 192-bit memory bus with 12GB GDDR6 as well. And I believe it will be made by TSMC's N4P process with monolithic design, not N3E as people hope so.

I am just wondering why replace lower end/midrange parts with new architecture? I would expect current lower midrange 7700XT next year will be coming down to 399USD levels. Once NV adjust 4060 prices it will come down.
Which means they don't need RDNA4 at all for next year. They could just create a Strix successor and put RDNA4 inside for 16CUs and call it a day with RDNA4.
 

Tigerick

Senior member
Apr 1, 2022
715
667
106
Well if the supposed 5700XT tier RDNA4 rumor is true then it wont be on par with N32.



I am just wondering why replace lower end/midrange parts with new architecture? I would expect current lower midrange 7700XT next year will be coming down to 399USD levels. Once NV adjust 4060 prices it will come down.
Which means they don't need RDNA4 at all for next year. They could just create a Strix successor and put RDNA4 inside for 16CUs and call it a day with RDNA4.
Good question, I believe there won't be much difference between RDNA3 and RDNA4 as people hopes; remember true RDNA4's architecture with chiplet design is cancelled and merged into RDNA5 which supposedly release by end of 2025. N43 and N44 sound like extension of RDNA3 rather than original plan of RDNA4.

Another reason might be BOM especially with RX7700XT, AMD is wasting one MCD and die area with much higher TDP. With monolithic design, AMD could use much lower power and bring down the cost as well.

With chiplet design, AMD seems like not able to bring the power consumption down, that's why we don't see N32 mobile GPU in sights...thus AMD needs to create monolithic version of GPU if they want tackle notebook platform especially with upcoming Zen5's platform. I am not talking about Strix Halo but Fire Range with 16-core and X3D as well...
 
Last edited:

branch_suggestion

Senior member
Aug 4, 2023
610
1,324
96
I think there isn't enough public info about the GFX12 uArch changes to really elaborate much on config.
Still, if AMD can make a $399 part that obsoletes N32 and a $249 part that obsoletes N33, RDNA4 is easily worth it.
I am just wondering why replace lower end/midrange parts with new architecture?
Because perf/$, both for the consumer and AMD.
Which means they don't need RDNA4 at all for next year. They could just create a Strix successor and put RDNA4 inside for 16CUs and call it a day with RDNA4.
The IP has been developed, making some midrange parts that crush what NV currently has with a nice headstart over Blackwell is a very nice position for AMD.
Think RV770 vs G92/GT200.
Good question, I believe there won't be much difference between RDNA3 and RDNA4 as people hopes; remember true RDNA4's architecture with chiplet design is cancelled and merged into RDNA5 which supposedly to come out by end of 2025. N43 and N44 sound like extension of RDNA3 rather than original plan of RDNA4.
RDNA4 is GFX12, there are big changes regardless of the packaging.
Another reason might be BOM especially with RX7700XT, AMD is wasting one MCD and die area with much higher TDP. With monolithic design, AMD could use much lower power and bring down the cost as well.
Small enough parts are best mono, yeah. N32 is just big enough that the layout they went for was better, but it will never be seen again as it was mainly a test run for the new fanout that Zen6 and other stuff will be using.
With chiplet design, AMD seems like not able to bring the power consumption down, that's why we don't see N32 mobile GPU in sights...thus AMD needs to create monolithic version of GPU if they want tackle notebook platform especially with upcoming Zen5's platform. I am not talking about Strix Halo but Fire Range with 16-core and X3D as well...
RDNA3 power issues are at the core level, chiplet power hit is ~10w for N32 vs a mono fabric. AMD has been slow on getting mobile parts out for very different reasons.
 
Reactions: Tlh97 and Tigerick

eek2121

Diamond Member
Aug 2, 2005
3,270
4,798
136
Which then confirm no high end right? because if the "next-gen" doesn't best the previous gen, that by definition means no high end. Releasing it of course still makes sense in terms of power user and features.

32-40CU is too little for a next gen chip even If they scrapped High-end, this could be the slower chip.
Considering they have strix Halo with 40CU IGP, I would expect 48CU full and 40CU as a cutdown for the weaker chip.
Stronger could be 72CU and 60CU cutdown.

I got ~171mm2 for a 48CU(3SE) RDNA3 5nm GCD, based on what Locuza posted some time ago.
If I applied N3E density improvement to It.

At worst, It could be 132mm2(+30% improvement) and at best 114mm2(+50%).
Of course, we don't know how much more transistors RDNA4 uses vs RDNA3 for comparable specs, so this estimate is inaccurate.

To your argument, If It's not better to just port RDNA3 N32 to N4P and save some money in the process. That's very hard to tell when we know absolutely nothing about RDNA4.
Considering that they plan to release It, It should still have some advantage compared to RDNA3.
I personally expect significantly higher clocks. With 48CU, It would need ~25% to be on par with 7800XT.

The problem I see is not this GCD per se, but MCD and Vram. For a 48 CU at higher clock you would need comparable BW to N32.
4 MCDs would have comparable size to that GCD, even If they cost less to produce.
Does anyone have anything bad to say about the 5700XT? Didn’t think so.

AMD could stick to 40CUs if they fix whatever issues are dogging RDNA3. I could easily see an increase of 50% perf/watt over RDNA3.

Would 40CUs compete at the top end? Of course not. Would it be competitive at midrange? yep. Very competitive. I guess we will find out when it launches…

EDIT: I personally think they will go bigger, FWIW. Probably 60CUs.

IMO they should rework RDNA3 into a single 60CU monolithic design on N4P, fixing bugs and optimizing the architecture.
 
Reactions: Tlh97

TESKATLIPOKA

Platinum Member
May 1, 2020
2,602
3,161
136
Does anyone have anything bad to say about the 5700XT? Didn’t think so.

AMD could stick to 40CUs if they fix whatever issues are dogging RDNA3. I could easily see an increase of 50% perf/watt over RDNA3.

Would 40CUs compete at the top end? Of course not. Would it be competitive at midrange? yep. Very competitive. I guess we will find out when it launches…

EDIT: I personally think they will go bigger, FWIW. Probably 60CUs.

IMO they should rework RDNA3 into a single 60CU monolithic design on N4P, fixing bugs and optimizing the architecture.
Your 50% perf/W improvement could mean many things, be more specific.

Not even sure what level of performance you consider as midrange.
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,602
3,161
136
Good question, I believe there won't be much difference between RDNA3 and RDNA4 as people hopes; remember true RDNA4's architecture with chiplet design is cancelled and merged into RDNA5 which supposedly release by end of 2025. N43 and N44 sound like extension of RDNA3 rather than original plan of RDNA4.
There is no difference between RDNA4 monolith or chiplet.
The same as N33 vs N32 or N31.
With chiplet design, AMD seems like not able to bring the power consumption down, that's why we don't see N32 mobile GPU in sights...thus AMD needs to create monolithic version of GPU if they want tackle notebook platform especially with upcoming Zen5's platform. I am not talking about Strix Halo but Fire Range with 16-core and X3D as well...
There is no proof that chiplet design is the cause of missing laptop N32.

No one would pair RDNA4 monolith with Fire Range, If It's not fast enough.
And AMD was always doing bad in laptops, I doubt they would especially make a monolith mainly for laptop use.
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,602
3,161
136
Do you think N44 is 40CU? N41 was supposed go to 270CU?


I'm think same for N43 = 60CU. If N22 to 32 going 40 to 60CU, then possible N32 going to N43 with higher clock and much better RT
Who knows?
N4C(N41 or N4X) had 270CU in 9 SEDs. Or 30CU pre SED.
If you think about It, then even 40CU looks very little in comparison. That's only 14.8% of compute, ignoring frequency difference.
But N4C was a GPU using 9 Shader Engine chiplets, It could be scaled down easily.

edit:
What I want to know is what memory configuration was planned to be used with It.
If It needed the same BW requirement as N31, then It would need ~3x higher BW.
Even with 512-bit GDDR7 you would get only 2.1x better BW. So the rest would have to be compensated with more cache.
 
Last edited:

PJVol

Senior member
May 25, 2020
844
823
136
With missing clocks
Oh... what a surprise for those thinking the chiplets are the culprit.
I even dare to assume they're among those who "silently predicted" 50% perf/w failure unlike most of the rumormongers rgt, mlid, et al condemned here in this forum, who claimed 2.5x performance for top to top
 
Last edited:

TESKATLIPOKA

Platinum Member
May 1, 2020
2,602
3,161
136
Because the high end chiplet models got cancelled. I am assuming that the N32 replacement would have been the single chiplet.
If N32 replacement was already a chiplet, then the 2 surviving monoliths would perform worse than N32.
One thing would be replacing N33 with 20-30% faster chip, but what's the point of replacing N24?
Performance would be weak and I am pretty skeptical about price being low enough. And what about Vram? With anything less than 8GB Vram It would be DOA.
 
Reactions: Tlh97

PJVol

Senior member
May 25, 2020
844
823
136
They must have needed to drop the clocks back to ≤2 Ghz to make that work
Not really.
Rough math: assuming n31 gfx core 6SE ~= n41 3SE consuming 150W - ~ 20% process advantage gives 3x120 ~= 360W. Add to it 100-120W for the rest we have ~ 475W ASIC. This is not taking into account rdna3 architecture-wise lack of perf/w
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,602
3,161
136
Not really.
Rough math: assuming n31 gfx core 6SE ~= n41 3SE consuming 150W - ~ 20% process advantage gives 3x120 ~= 360W. Add to it 100-120W for the rest we have ~ 475W ASIC. This is not taking into account rdna3 architecture-wise lack of perf/w
How did you get that 90CU 3SED N41 consumes 150W and performs as N31?
 
Last edited:
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |