Discussion RDNA4 + CDNA3 Architectures Thread

Page 206 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,754
6,631
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

Win2012R2

Senior member
Dec 5, 2024
647
609
96
AW2, Cyber - all way too heavy ray tracing games, I tried to play the latter in RTX mode in 3090 but turned that off, then tried again in 4090 and still I don't think it's the perf wise - visual upgrades were just not there for loss of perf, maybe finally 5090 can run it at acceptable framerates so it will be "ok why not turning it all on", but these are real outliers. Have not tried Wukong yet (all new release gaming is on pause for me until 5090 is out).

Most likely they've tried other stuff - say Indians Jones might be very good candidate, especially with its high memory usage.
 

gaav87

Senior member
Apr 27, 2024
452
794
96
Im just messing found this funny.
For 9070xt > 5080 You just need:
3500 fMax and 380w (+15% power slider) from the "leaked" 330w 9070xt (from chiphell) and 2750mhz on mem oc and you good
 

itsmydamnation

Platinum Member
Feb 6, 2011
2,994
3,696
136
AW2, Cyber - all way too heavy ray tracing games, I tried to play the latter in RTX mode in 3090 but turned that off, then tried again in 4090 and still I don't think it's the perf wise - visual upgrades were just not there for loss of perf, maybe finally 5090 can run it at acceptable framerates so it will be "ok why not turning it all on", but these are real outliers. Have not tried Wukong yet (all new release gaming is on pause for me until 5090 is out).

Most likely they've tried other stuff - say Indians Jones might be very good candidate, especially with its high memory usage.
you mean path tracing ? ray tracing in CP2077 , witcher 3 and the the few others i have tried seem fine on 7900xtx , what i have found is having even very basic RT GI/ shadows / reflections , adds a lot ( shadows less so ) at very modest perf hit but i cant tell the difference when turning up the RT quality levels/ number of bounces.

At least to me the first order effect is like 99% of the value in RT. so if NV does have a significate advantage in RT at a given price point i expect dumb shenanigans/ pre-set configurations.
 
Last edited:

gaav87

Senior member
Apr 27, 2024
452
794
96
Saw this, but find it very unlikely that 5080 will get double mem bandwidth and boost only to same levels as before, considering that card won't be cheap this is bull for sure
OFC it's bull.
About the double mem b/w.
4070ti -> 4070ti super
+7% cuda's +33% b/w
+9% performance (TPU new perf chart)
So only cuda and b/w mean nothing w/o arch changes and clock speed increase.
 
Reactions: Kaluan

RNR_Forte

Banned
Dec 24, 2024
16
7
36
The rumor mill is a rumor tornado. Sincerely i even not know what rumor is true if performance is kinda like =~ 4080S raster and rt or not.


PS: the only thing i know is that AMD buyers/fans will be surprised positively.
 
Last edited:

ToTTenTranz

Senior member
Feb 4, 2021
276
522
136
Saw this, but find it very unlikely that 5080 will get double mem bandwidth and boost only to same levels as before, considering that card won't be cheap this is bull for sure

Nvidia could be taking advantage of GDDR7's higher bandwidth to decrease the L2 size and produce a smaller chip, i.e. make more money out of the same performance tier.
 

branch_suggestion

Senior member
Aug 4, 2023
504
1,051
96
Nvidia could be taking advantage of GDDR7's higher bandwidth to decrease the L2 size and produce a smaller chip, i.e. make more money out of the same performance tier.
The bandwidth bump is for offline rendering and ML perf beyond anything else.
Gaming it will help, but CPU bottlenecks are becoming an increasing problem unless you crank the fps virus.
 

Win2012R2

Senior member
Dec 5, 2024
647
609
96
Nvidia could be taking advantage of GDDR7's higher bandwidth to decrease the L2 size and produce a smaller chip
I reckon cache is only going up with Nvidia, at least for 4080/4090 levels, it's only AMD that cuts "infinity cache", I guess technically it still legit as half of infinity is still infinity. 4080 is clearly being positioned to take place of a 4k card, this will need good cache.

Here is the useful reminder of cache hit rate depending on size and resolution - 128 MB had only 62% hit rate at 4k, and textures ain't getting any less high rest now compared to 2020. They did not show how far it goes, but clearly 256 MB would have have provided awesome hit rates, just going up from 62% to 80% would HALF memory accesses! It was clearly necessary to get cache up to 192 or even 256 MB, then maybe 256 bit memory interface will be ok, but cheapMD took the other view.
 

Attachments

  • rdna2ichitrates.jpg
    285.9 KB · Views: 18
Reactions: Kaluan and Tlh97

reaperrr3

Member
May 31, 2024
55
188
66
I reckon cache is only going up with Nvidia, at least for 4080/4090 levels (...)
4080 [sic: presumably meant 5080] is clearly being positioned to take place of a 4k card, this will need good cache.
No offense, but that's wishful thinking.

The 5080 will rely mostly on GDDR7 to provide the additional bandwidth needed. 256bit@32Gbps is roughly the same VRAM bandwidth as the 4090, and the 4090 has only 72MB L2 active, so 64MB (like on the 4080) is close enough to hit similar performance levels, if the raw performance is there.

GB203 is ~ the same die size as AD103 with the same number of SM on virtually the same process. That makes anything more than 64MB L2 exceptionally unlikely, as 128MB would definitely increase die size substantially.

The only Blackwell that will (probably) get more cache than its predecessor will be GB202, at 128MB.
But that's in part simply because with a 512bit interface, the alternative would've been 64MB (256MB would've probably made it exceed the reticle limit, so not really an option).
Although I wouldn't be surprised if they disabled 25% of it again, since VRAM bandwidth is up by so much that 96MB L2 will be a sufficient increase over the 4090.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |