Discussion RDNA4 + CDNA3 Architectures Thread

Page 155 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,749
6,614
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

marees

Senior member
Apr 28, 2024
578
639
96
Hopefully with N31 being such a conservative CU increase we will see at least 1.33-1.5x more CUs on the RDNA5 hi end, possibly higher still for the enthusiast SKU if chiplets makes a super hi CU count possible without exploding overheads.
I hope there is a monolithic 6700xt/navi 48 style RDNA 5 GPU that could beat 7900xtx. I am scared after AMD nixxed many chiplet designs such as
  1. N36 = 4090 competitor
  2. N43 = original multi chip N48
  3. N42 = N31 replacement
  4. N41 = 4090 killer
  5. N40? = 5090 killer
If the chiplet style RDNA 5 design is a success then we might see numbers such as HD 9990 or even a HD 9999 😉
 

Timorous

Golden Member
Oct 27, 2008
1,775
3,333
136
I would start with 8700XT and move the x700 range to be 16GB parts. I would move x600 to 12GB parts so the 8600XT gets the twice cut 12GB N48 variant.

For N44 I would have 8500XT and 8400XT for the full and cut parts.

While AMD could go with 8800XT for the top part realistically it won't even match the 5070 so I don't think AMD would want their x800 parts priced at the level they would need to price them at to sell as it will be setting up a large price increase for RDNA 5.
 
Reactions: Tlh97 and marees

marees

Senior member
Apr 28, 2024
578
639
96
I would start with 8700XT and move the x700 range to be 16GB parts. I would move x600 to 12GB parts so the 8600XT gets the twice cut 12GB N48 variant.

For N44 I would have 8500XT and 8400XT for the full and cut parts.
Agree. High vram is the only PR win for AMD today

For ex: in BMW once you enable cinematic mode at 1080p 75% upscaled (with FG) it cripples the 1% lows of the 8gb 4060 ti card. While the 16gb 4060 ti card provides a steady 'cinematic fps' (24 fps 1% low & 30fps avg before FG)

So even nv fans acknowledge that 8gb cards cannot play 1080p max

It would be a bad PR move to have 12gb on 8700 class card
Correspondingly if the 8600xt is 12gb, it contrasts well vs the expected 8gb 5060 (5060 ti could be 12gb)
 
Last edited:
Reactions: Tlh97

jpiniero

Lifer
Oct 1, 2010
15,365
5,884
136
While AMD could go with 8800XT for the top part realistically it won't even match the 5070

I am not so sure in raster. Granted the 5070 will definitely cost more.

It would be a bad PR move to have 12gb on 8700 class card

Given how things have gone with AMD, it'd be a bigger bad PR move for the 8700 XT to be substantially more expensive than the current 7700 XT price ($389). Which it would be if it was a 16 GB model.
 
Reactions: Tlh97 and blckgrffn

reaperrr3

Junior Member
May 31, 2024
21
57
51
While AMD could go with 8800XT for the top part realistically it won't even match the 5070
GB205 allegedly has only 50 SM, only 4 more than are active on a 4070 and less than a Super.

Considering how NV usually operates, I wouldn't expect more than 48 enabled SM on a mere 5070.

Meanwhile, N48 not only has 2 WGP/4 CU more than N32, but - perhaps more importantly - 4 SE instead of 3, presumably 128 instead of 96 ROPs, in addition to RDNA4's general architectural improvements.
It'll also clock a good 20% higher than the 7800XT, if latest rumors are true.

In other words, Blackwell would have to beat RDNA4 by wider margin in terms of IPC per SM and/or clocks than Ada vs. RDNA3. Not impossible, but I wouldn't expect GB205 to beat N48 in Raster by a wide margin, if at all.

What do you folks think will be RT performance of navi 48 xtx ?

I am hoping 4070 ti super
others are saying it will definitely be less than that

50% above a 7800XT would be roughly 79XTX/4070Ti territory.

But the PS5Pro is more like RDNA3.7, has only 2 SE and either less or even no Infinity Cache, so older Raster-uArch and other weaker parts than N48.
If the PS5Pro is 50% above a 7800XT in RT, I'd expect N48XT(X) to be more like 70% above it, which would be pretty much exactly 4070Ti Super territory.
 
Reactions: Tlh97 and marees

marees

Senior member
Apr 28, 2024
578
639
96
50% above a 7800XT would be roughly 79XTX/4070Ti territory.

But the PS5Pro is more like RDNA3.7, has only 2 SE and either less or even no Infinity Cache, so older Raster-uArch and other weaker parts than N48.
If the PS5Pro is 50% above a 7800XT in RT, I'd expect N48XT(X) to be more like 70% above it, which would be pretty much exactly 4070Ti Super territory.
I sincerely hope n48 xtx trades blows with 4070 ti super on RT

That would make an immediate launch for $600 feasible

Otoh, 4070 super territory means they need to price it closer to $500 where it starts cannibalizing the 7900 GRE stocks

In other words if RT is not high then RDNA 4 cannot launch until & unless 7900 GRE stocks are sold out
 

ToTTenTranz

Member
Feb 4, 2021
182
313
106
What do you folks think will be RT performance of navi 48 xtx ?

I am hoping 4070 ti super
others are saying it will definitely be less than that


I think the poll isn't very useful because the base premise is wrong.
The PS5 shouldn't be 1.5x 7800XT in RT. In scenarios where the RT performance is a big bottleneck, the boost should be a lot more than +50%.

Sony's own docs claim something between 2x and 4x higher RT performance than the PS5.
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,555
3,104
136
50% above a 7800XT would be roughly 79XTX/4070Ti territory.


But the PS5Pro is more like RDNA3.7, has only 2 SE and either less or even no Infinity Cache, so older Raster-uArch and other weaker parts than N48.
If the PS5Pro is 50% above a 7800XT in RT, I'd expect N48XT(X) to be more like 70% above it, which would be pretty much exactly 4070Ti Super territory.
PS5Pro 50% above 7800XT in RT with similar specs?

That would mean in a real game like A Plague Tale Requiem this:
RTX 4090: 98.8FPS at 4K, with RT enabled only 79FPS (-20% compared to raster).
RTX 4070 Ti super: 62.6FPS at 4K, with RT enabled only 50.1FPS (-20% compared to raster).
7900XTX: 78.6FPS at 4K, with RT enabled only 45.6FPS (-42% compared to raster).
7800XT: 48.8 FPS at 4K, with RT enabled only 28.8FPS (-41% compared to raster).
PS5 Pro or comparable desktop equivalent: 48.8 FPS at 4K in A Plague Tale Requiem, with RT enabled 45.6FPS (-7% compared to raster).

This is simply impossible, even RTX 4090 loses 20% of It's performance after enabling RT while this PS5 Pro would loose only 7%.
Even If AMD managed only 20% reduction in perf, this PS5 Pro would still need to achieve 57FPS without RT, in other words 17% higher FPS to be on par with RX 7900XTX in RT.

 
Last edited:

gaav87

Member
Apr 27, 2024
180
380
96
SKU1 (8800xtx) 64CU 256bit 64MB Infinity Cache 16GB ddr6 20Gbps
SKU2 (8800xt) 56CU 256bit 64MB IC 16GB ddr6 18Gbps
SKU3 (8700xt) 48CU? 192bit 48MB IC 12GB ddr6 19Gbps
SKU4 (8600xt) 32CU 128bit 32MB IC 8GB ddr6 18Gbps

SKU1 (8800xtx) 50+ TFLOPs of FP32 (for 50+ must be 3.05GHz or above) [7900xt ~52.5TFLOPs of FP32]
I estimate the compute perf +/- 5% ~7900xt Gaming performance between 7900xt and 7900xtx (closer to 7900xt)
SKU2 (8800xt) Gaming perf 7900 gre
SKU3 (8700xt) Gaming perf 7800 xt
SKU4 (8600xt) performance ~7700 xt
 
Reactions: Tlh97

Mahboi

Golden Member
Apr 4, 2024
1,035
1,900
96
In hindsight, since there is nothing new to expect in performance, there is ONE thing I really want from RDNA 4. It's encoding quality.
I can do with the rest just fine. Upscale for 4K, accept that it's more or less aiming for good 1440p. But I still watch a lot of streams, and every time, I keep thinking that if you genuinely asked me to market AMD cards to streamers, I couldn't. The encoding quality for h264 is just not worth it. Ofc one can, and should, and I sure do, blame Twitch, since YT has managed AV1 streaming for months/over a year now.
But even for AV1, it's close, but below in a significant enough fashion. It typically doesn't show too badly on the video quality itself, which if you pump enough bitrate can just do it with brute force, but it's visible when almost every recording is somewhere around 10% bigger on AMD than Nvidia IIRC.

I don't know just how impactful a bunch of streamers are, but they unsurprisingly get a lot of visibility. It's kind of an abysmal market image when literally 99% of them have Nvidia.
I'm not holding any hopes for AMD h264 to get good now. Unless somehow they integrate Xilinx IP and the Xilinx IP turns out to be monstrously better than the historical AMD one.
But for AV1, it wasn't THAT far. If someone could kindly present their foot's tip at the Twitch CEO's anus and push as hard as he can inside, we might actually get AV1 before RDNA 5 comes out.

I'm just really hoping for that and a new gen of FSR that has AI image correction like NV. If we can get that, we really are only behind in driver velocity (look at how long it took for RDNA 3 to more or less fully fix VR? 7 months, and then people had to wait for Meta to get things going on their SW too?) and in RT. I still think most people don't care that much for RT. But FSR and better encoding, that's my real hopium.
 
Reactions: Tlh97

soresu

Diamond Member
Dec 19, 2014
3,323
2,599
136
tbh I'm all but disinterested in the gaming side of things and just want to see how much Blender Cycles perf increases.

Even on the Eevee side of things it's great because the big overhaul seems to have implemented ReSTIR like ray tracing, bringing the potential quality of Eevee much closer to that of Cycles.
 
Reactions: Mahboi

soresu

Diamond Member
Dec 19, 2014
3,323
2,599
136
If someone could kindly present their foot's tip at the Twitch CEO's anus and push as hard as he can inside, we might actually get AV1 before RDNA 5 comes out.
Ironically Twitch actually made a major contribution to AV1 development in the form of S-Frames, which makes their slow transition all the stranger.

Certainly at this point AV1 hw decoding access is fairly widespread, and becoming ever more so since Qualcomm and Apple finally took the stick outta their posteriors and implemented it.

Even where it isn't there on older generations there is plenty of CPU power to decode it efficiently using dav1d, which at this point is quite possibly the most well optimised video decoder library in history.

I don't think that a Vulkan GPU implementation would be a significant improvement for perf/watt over CPU given how complex AV1 is as a codec.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |