Discussion RDNA4 + CDNA3 Architectures Thread


DisEnchantment

Golden Member
Mar 3, 2017
1,762
6,682
136





With the GFX940 patches in full swing since the first week of March, it looks like MI300 is not far off!
AMD usually takes around three quarters to get support into LLVM and amdgpu. Lately, since RDNA2, the window in which they push support for new devices has been much shorter, to prevent leaks.
But judging by the flurry of code in LLVM, there are a lot of commits. Maybe that is because the US Govt is starting to prepare the SW environment for El Capitan (perhaps to avoid a slow bring-up like Frontier's).

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time as Hopper, probably only a couple of months later!
Although I believe Hopper had the problem that no host CPU capable of PCIe 5 would be available in the very near future, so it might have been pushed back a bit until SPR and Genoa arrive later in 2022.
If PVC slips again, I believe MI300 could launch before it.

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 

Keller_TT

Member
Jun 2, 2024
139
148
76
I was able to get an ASUS 9070 Prime for 530 CHF from a retailer buddy, which is about $600, and it went into a build for the kids with a Zen 4 7700 in it, and it's looking great! The 9070 with UV and clock tweaks, power limited to 250W, is pretty much at 9070 XT stock level. I spent most of yesterday testing stability and performance, and it is well under limits and runs silently.
I'm a sucker for efficiency and this gave me great joy. And now I'm up early installing games. lol.

The PC didn't have a graphics card in it; the kids have their PS5, and I had decided to bite on the PS5 Pro and sell the original for 350 bucks. The Pro and disc drive would cost me 500 Francs more, and I was OK with that as I wouldn't be looking around until the 2027 holiday season.
But the 9070's efficiency really impressed me. I couldn't pass up one of AMD's most impressive cards ever, and getting hardware has never been a problem. I am lucky to have a couple of good friends in the business. All worked out well in the end.
 

Racan

Golden Member
Sep 22, 2012
1,219
2,236
136

gaav87

Senior member
Apr 27, 2024
634
1,223
96
Anyway, this Taichi is kinda crazy (I did an ML translation from Polish, that's why it's garbled).
I can go up to 400W or down to 220W with almost the same perf...
I matched a 4090, or even beat some 4090s, in Monster Hunter at 1080p ultra,
matched an OCed 4080S in Forza Horizon at 4K maxed out with RT, while only 36% slower than a 4090 on mora...
10% over the 5070 Ti in RT Cyberpunk.
This card can go +-15% from base depending on how much power you wanna throw at it.

 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,599
3,151
136
I don't know about you guys, but I am kind of more excited about N44 than N48, but of course I am talking about the 16GB version.
Hopefully AMD will stick with 20gbps GDDR6 at least for the 16GB version, which would provide 11% higher BW compared to predecessor.
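
For anyone who wants to check the 11% figure: a minimal sketch, assuming the usual peak-bandwidth formula (bus width in bits times per-pin data rate in Gbps, divided by 8). The function name is just for illustration.

```python
def bandwidth_gb_s(bus_width_bits: int, data_rate_gbps: float) -> float:
    """Peak memory bandwidth in GB/s: bus width (bits) x per-pin rate (Gbps) / 8."""
    return bus_width_bits * data_rate_gbps / 8


old = bandwidth_gb_s(128, 18.0)  # RX 7600 XT: 128-bit at 18gbps
new = bandwidth_gb_s(128, 20.0)  # hoped-for N44 16GB: 128-bit at 20gbps
uplift_pct = (new / old - 1) * 100

print(f"{old:.0f} GB/s -> {new:.0f} GB/s (+{uplift_pct:.1f}%)")
```

This gives 288 GB/s against 320 GB/s, which is where the roughly 11% comes from.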

Known models:
Model | GPU specs (CU:Shaders:TMU:ROPs) | Frequency (Boost) | Infinity Cache | Memory (Bandwidth) | VRAM | Total Board Power | Performance (TPU 4K)
RX 7600 | 32:2048:128:64 | 1720 MHz (2655 MHz) | 32 MB | 128-bit 18gbps (288 GB/s) | 8 GB | 165 W (62.7 %) | 41.8 %
RX 7600 XT | 32:2048:128:64 | 1720 MHz (2755 MHz) | 32 MB | 128-bit 18gbps (288 GB/s) | 16 GB | 190 W (72.2 %) | 56.7 %
RX 7700 XT | 54:3456:216:96 | 1900 MHz (2544 MHz) | 48 MB | 192-bit 18gbps (432 GB/s) | 12 GB | 245 W (93.2 %) | 82.1 %
RX 7800 XT | 60:3840:240:96 | 1800 MHz (2430 MHz) | 64 MB | 256-bit 19.5gbps (624 GB/s) | 16 GB | 263 W (100 %) | 100 %
RX 9700 | 56:3584:224:128 | 2070 MHz (2520 MHz) | 64 MB | 256-bit 20gbps (640 GB/s) | 16 GB | 220 W (83.7 %) | 130 %
RX 9700 XT | 64:4096:256:128 | 2400 MHz (2970 MHz) | 64 MB | 256-bit 20gbps (640 GB/s) | 16 GB | 304 W (115.6 %) | 149.3 %

N44 could look like this ->
Model | GPU specs (CU:Shaders:TMU:ROPs) | Frequency (Boost) | Infinity Cache | Memory (Bandwidth) | VRAM | Total Board Power | Performance (TPU 4K)
RX 9600 | 32:2048:128:64 | 2070 MHz (2520 MHz) | 32 MB | 128-bit 18gbps (288 GB/s) | 8 GB | 125 W | ?
RX 9600 XT | 32:2048:128:64 | 2400 MHz (2970 MHz) | 32 MB | 128-bit 20gbps (320 GB/s) | 16 GB | 165 W | 75% ?

Performance is pretty questionable. At these "conservative" clocks it won't hit 7700XT level of performance. I think 75% of the RX 7800XT is more realistic, or 32% over the RX 7600XT, for the 16GB version.
Of course AMD will push clocks hard for the extra performance; even 3.2-3.3GHz shouldn't be out of the question, and at those clocks it should hit 7700XT level of performance, but TBP could hit 200W and efficiency would take a nosedive, although it would still be better than the 7700XT.
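
The numbers in the two paragraphs above hang together if you assume performance scales linearly with boost clock. That is a rough assumption for illustration only; real scaling is usually worse because of bandwidth limits. A quick sketch using the TPU 4K percentages from the table:

```python
# Relative 4K performance from the table above (RX 7800 XT = 100 %).
RX_7600_XT = 56.7
RX_7700_XT = 82.1

n44_xt_estimate = 75.0   # guessed performance at the 2970 MHz boost clock
n44_xt_boost_mhz = 2970

# Gain of the guessed N44 16GB card over the RX 7600 XT.
gain_over_7600xt = (n44_xt_estimate / RX_7600_XT - 1) * 100

# ASSUMPTION: performance scales linearly with clock (optimistic).
clock_for_7700xt = n44_xt_boost_mhz * RX_7700_XT / n44_xt_estimate

print(f"Gain over RX 7600 XT: {gain_over_7600xt:.0f}%")
print(f"Clock needed for RX 7700 XT level: {clock_for_7700xt:.0f} MHz")
```

Under that assumption the gain over the 7600XT works out to about 32%, and the clock needed to match the 7700XT lands inside the 3.2-3.3GHz range mentioned above.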

P.S. I kinda wonder why no one is using GDDR6W at least in laptops. It saves space, because per chip you have double capacity and the height is also smaller.
 

marees

Senior member
Apr 28, 2024
880
1,151
96
I guess it beats the 4060 ti in both raster & RT

That means it is between the 5060 & 5060 ti
 

Josh128

Senior member
Oct 14, 2022
682
1,173
106
I had never realized that the 7600XT (204mm²) is so much smaller than the 6700XT (354mm²) and is only ~10-15% slower. That's a huge die size reduction on just a tweak of the same 7nm process for very little performance loss. I don't know if they'll do it, but a ~150-175mm² RDNA4 die, clocked to the hilt, would probably be ~10-20% faster than the 6700XT and quite a potent little entry level GPU that could also be pretty affordable. If they really want some market share, that would be the way to go, IMO.
And it's gonna be hella slow. A rebrand of the 7600 would be fine enough competitor
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,599
3,151
136
I had never realized that the 7600XT (204mm²) is so much smaller than the 6700XT (354mm²) and is only ~10-15% slower. That's a huge die size reduction on just a tweak of the same 7nm process for very little performance loss. I don't know if they'll do it, but a ~150-175mm² RDNA4 die, clocked to the hilt, would probably be ~10-20% faster than the 6700XT and quite a potent little entry level GPU that could also be pretty affordable. If they really want some market share, that would be the way to go, IMO.
N22 had too much BW (192-bit controller) and Infinity Cache (96MB) compared to its raw power.
N44 shouldn't be only ~150-175mm², but >180mm² in my opinion.
 
Reactions: Tlh97 and marees

Josh128

Senior member
Oct 14, 2022
682
1,173
106
N22 had too much BW (192-bit controller) and Infinity Cache (96MB) compared to its raw power.
N44 shouldn't be only ~150-175mm², but >180mm² in my opinion.
I would think 180mm² would be the limit, unless they are planning a yet smaller N42 (which I doubt). 6700XT/2080Ti perf is still very capable even today at 1440p and 1080p. Plus you'd get better RT, FSR 4, and presumably the new video engine. 180mm² or less would allow for some fantastic yields and number of dies per wafer, and thus price. The last thing they want to do is get upended by Nvidia on 50/60 class pricing due to silicon/production costs. Slap a $279 price tag on it and watch gamers gobble it up like Puck Man.
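
To put a rough number on the dies-per-wafer point: a sketch using a common first-order approximation (wafer area over die area, minus an edge-loss term). It assumes a 300 mm wafer, ignores scribe lines, die aspect ratio, and defect yield, and the 180 mm² die is hypothetical. The 356.5 mm² figure is the Navi 48 area quoted elsewhere in this thread.

```python
import math


def gross_dies_per_wafer(die_area_mm2: float, wafer_diameter_mm: float = 300.0) -> int:
    """First-order estimate: wafer area / die area, minus an edge-loss term."""
    d = wafer_diameter_mm
    a = die_area_mm2
    return int(math.pi * d**2 / (4 * a) - math.pi * d / math.sqrt(2 * a))


n44_dies = gross_dies_per_wafer(180.0)   # hypothetical 180 mm² N44
n48_dies = gross_dies_per_wafer(356.5)   # Navi 48 area as quoted in this thread

print(f"~{n44_dies} candidate dies at 180 mm² vs ~{n48_dies} at 356.5 mm²")
print(f"ratio: {n44_dies / n48_dies:.2f}x")
```

The smaller die comes out slightly better than 2x in gross count because less area is lost at the wafer edge, and that is before any yield advantage.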

Adroc previously seemed to indicate that N44 would be a massive disappointment. I'm not sure if what I am envisioning is way beyond what it would actually be or not, but it doesn't seem like a very high bar for AMD. 🤷‍♀️
 
Reactions: Tlh97

gaav87

Senior member
Apr 27, 2024
634
1,223
96
I'm testing stability at -100mV. It does hit 3400 MHz with just the undervolt.
I did extensive testing since morning.
Mem sweet spot is 2764MHz with fast timings (any higher or lower and you lose perf)
-65mV is rock stable on at least 4 cards
-100mV vs -65mV gives a 2% perf uplift but it's not stable for me. -90mV is stable but not FurMark/3DMark stable
+450MHz offset for core is the sweet spot
+6-8% power draw is the sweet spot

Full monster OC vs. the settings above is a 1.5% difference
380W vs ~340W at full load with RT (or 288-300W in light or no-RT games)

Try these settings. I already gave them to multiple people; one guy gained 16% over a stock Hellhound with them.
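
A quick perf-per-watt check on the figures above, as a sketch only: it takes the ~340W and 380W full-load numbers and the 1.5% performance gap at face value, and the variable names are just for illustration.

```python
# Figures from the post above: tuned settings vs. full OC, full load with RT.
tuned_power_w = 340.0
full_oc_power_w = 380.0
full_oc_perf = 1.015   # full OC is ~1.5% faster than the tuned settings
tuned_perf = 1.0

extra_power_pct = (full_oc_power_w / tuned_power_w - 1) * 100

# Perf-per-watt of the tuned settings relative to the full OC.
efficiency_ratio = (tuned_perf / tuned_power_w) / (full_oc_perf / full_oc_power_w)

print(f"Full OC draws {extra_power_pct:.1f}% more power for 1.5% more perf")
print(f"Tuned settings: {(efficiency_ratio - 1) * 100:.1f}% better perf/W")
```

So the last 1.5% costs nearly 12% more power, and the tuned profile ends up about 10% more efficient.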
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,599
3,151
136
I would think 180mm² would be the limit, unless they are planning a yet smaller N42 (which I doubt). 6700XT/2080Ti perf is still very capable even today at 1440p and 1080p. Plus you'd get better RT, FSR 4, and presumably the new video engine. 180mm² or less would allow for some fantastic yields and number of dies per wafer, and thus price. The last thing they want to do is get upended by Nvidia on 50/60 class pricing due to silicon/production costs. Slap a $279 price tag on it and watch gamers gobble it up like Puck Man.
Navi 48 is 356.5 mm2.
Half of that is already 178.25mm2 and don't forget not everything is halved(PCIe, video engine etc.), that's why >180mm2. I think over 190mm2 is very likely.

It will perform a lot better than RX 6700XT, which is only 13% faster than 7600XT 16GB at QHD or 4K.

Even a 200mm2 chip is pretty cheap to produce, but the 16GB version will certainly cost more than $279, more like $379-399 MSRP.
 
Reactions: maddie and marees

GaiaHunter

Diamond Member
Jul 13, 2008
3,673
287
126
I did extensive testing since morning.
Mem sweet spot is 2764MHz with fast timings (any higher or lower and you lose perf)
-65mV is rock stable on at least 4 cards
-100mV vs -65mV gives a 2% perf uplift but it's not stable for me. -90mV is stable but not FurMark/3DMark stable
+450MHz offset for core is the sweet spot
+6-8% power draw is the sweet spot

Full monster OC vs. the settings above is a 1.5% difference
380W vs ~340W at full load with RT (or 288-300W in light or no-RT games)

Try these settings. I already gave them to multiple people; one guy gained 16% over a stock Hellhound with them.
It hasn't crashed in the games I played yet at -100mv (Guild Wars 2, Borderlands 3, MW5 and RDR2).

In RDR2 it hit 3400 but didn't stay there long, mostly it hovers around 3300 MHz.

I'll play with memory tomorrow. Don't have more time today, wife is starting to give me the stink eye...
 

maddie

Diamond Member
Jul 18, 2010
5,067
5,390
136
N22 had too much BW (192-bit controller) and Infinity Cache (96MB) compared to its raw power.
N44 shouldn't be only ~150-175mm², but >180mm² in my opinion.
Agree. Roughly 1/2 (N48 - video display/encoder engines) + N48 video display/encoders.
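
That rule of thumb can be written out as a small sketch. The area of the blocks that do not get halved (video/display engines, PCIe, etc.) is not given anywhere in this thread, so the values swept below are purely hypothetical; only the 356.5 mm² Navi 48 figure comes from the posts above.

```python
N48_AREA_MM2 = 356.5  # Navi 48 die area as quoted in this thread


def n44_area_estimate(fixed_mm2: float) -> float:
    """Half of the scalable part of N48, plus the fixed blocks kept at full size."""
    return (N48_AREA_MM2 - fixed_mm2) / 2 + fixed_mm2


# HYPOTHETICAL fixed-block areas, just to show the sensitivity.
for fixed in (0.0, 10.0, 20.0, 30.0):
    print(f"fixed blocks {fixed:>4.0f} mm² -> N44 ~{n44_area_estimate(fixed):.2f} mm²")
```

With nothing fixed you get the plain half, 178.25 mm²; every 10 mm² of non-halved logic adds 5 mm², so getting past 190 mm² would take roughly 24 mm² or more of fixed blocks.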

Performance of the XT model could approach the RX 6800 (= 1.33x RX 7600 XT), as smaller dies normally clock higher than bigger ones.
 