Discussion RDNA4 + CDNA3 Architectures Thread

Page 156 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,749
6,614
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

Mahboi

Golden Member
Apr 4, 2024
1,035
1,900
96
Ironically Twitch actually made a major contribution to AV1 development in the form of S-Frames, which makes their slow transition all the stranger.

Certainly at this point AV1 hw decoding access is fairly widespread, and becoming ever more so since Qualcomm and Apple finally took the stick outta their posteriors and implemented it.

Even where it isn't there on older generations there is plenty of CPU power to decode it efficiently using dav1d, which at this point is quite possibly the most well optimised video decoder library in history.

I don't think that a Vulkan GPU implementation would be a significant improvement for perf/watt over CPU given how complex AV1 is as a codec.
Not really. Tons of people still run basic Turing/Pascal era GPUs without it. AV1 decode is only a 2020 onwards thing.
Twitch is forced to allow transcoding from AV1 to h264. Especially in countries where changing hardware often means changing from a dead GPU/PC to a used one.
Now the real question is in a world where MA35D exists, where the hell is it difficult for Twitch to do that? It's an expensive card I guess if you take in bulk, but we're talking about needing a few thousands of them to at least get a basic server farm and test bed. Getting ONE batch of a few hundreds in a server room should be peanuts for a company the size of Twitch, and getting that testbed going with a few big streamers to see how far it can go is nothing. It's freaking 35W of usage per card, and each can transcode 32 h264 1080p streams. We're not talking about a farm of MI300Xs here. Even 50000 viewers boils down to about 1500 cards. And that's not 50000 viewers WATCHING, it's 50000 needing h264 transcode, which should be about 30% of all viewers give or take.

I don't know from where you think dav1d is "efficient enough that there is plenty of CPU power". Last I looked AV1 SW decode ate something around 8 times more than h264. And even Zen 3 CPUs take around 10% of their power to decode h264. I don't see 60% Zen 3 usage as "enough" and for people running old 8000ks or for the one crazy still running Bulldozer, definitely not.
 
Reactions: Tlh97 and DaaQ

soresu

Diamond Member
Dec 19, 2014
3,323
2,599
136
I don't know from where you think dav1d is "efficient enough that there is plenty of CPU power"
I was talking about mobile devices.

Anything with 3-4 cores of A76 at a reasonable clock speed can tank 1080p60 10 bpc content with ease, and possibly even 4K30 10 bpc too (4K60 might be pushing it without more cores).

Anything >X1 or comparable Axx/Mx based (NEON wise) from Apple can handle 4K60 in software with dav1d.

Like I said, it's a very efficient decoder.

I've been following dav1d development from pretty much the day it was first announced.

It's been a while since I saw a comprehensive CPU comparison with dav1d so if I'm wrong do please enlighten me.
 

ToTTenTranz

Member
Feb 4, 2021
182
312
106
Will AMD introduce RDNA 4 (in a month's time?) when RDNA 2 sales are still 1/3rd of total discrete sales ???


Does it matter? 1/3rd of very few is almost nothing.

Without new products bringing higher performance-per-dollar, AMD's sales of Radeon GPUs will continue to go in a downward spiral to the point of making the business unsustainable.
In Q2 2024, AMD's gaming division revenue went down 59% YoY, and the quarter before it was 48%. It doesn't matter if they're still selling 4 year-old GPUs that were produced during the 2nd crypto-craze with margins that will (hopefully) never happen again in the consumer market.
 

jpiniero

Lifer
Oct 1, 2010
15,365
5,884
136
Does it matter? 1/3rd of very few is almost nothing.

Without new products bringing higher performance-per-dollar, AMD's sales of Radeon GPUs will continue to go in a downward spiral to the point of making the business unsustainable.
In Q2 2024, AMD's gaming division revenue went down 59% YoY, and the quarter before it was 48%. It doesn't matter if they're still selling 4 year-old GPUs that were produced during the 2nd crypto-craze with margins that will (hopefully) never happen again in the consumer market.

Like nearly all of the Gaming revenue is from the Consoles. AMD even said they made more on Radeon YoY but that doesn't tell you that much.
 
Reactions: Tlh97 and marees

marees

Senior member
Apr 28, 2024
578
639
96
Does it matter? 1/3rd of very few is almost nothing.

Without new products bringing higher performance-per-dollar, AMD's sales of Radeon GPUs will continue to go in a downward spiral to the point of making the business unsustainable.
In Q2 2024, AMD's gaming division revenue went down 59% YoY, and the quarter before it was 48%. It doesn't matter if they're still selling 4 year-old GPUs that were produced during the 2nd crypto-craze with margins that will (hopefully) never happen again in the consumer market.
Looking at the top selling cards, I think a navi 48 xtx will sell like hot chips for $600, if it can trade blows with 4070 ti super (or even the 4070 ti) in RT

 

ToTTenTranz

Member
Feb 4, 2021
182
312
106
Looking at the top selling cards, I think a navi 48 xtx will sell like hot chips for $600, if it can trade blows with 4070 ti super (or even the 4070 ti) in RT

Yes, the consumer hasn't had any significant price-performance upgrade since 2019. We're probably paying as much per-FPS today than we were back then.

A card that brought significant gains on that front would be welcomed at least by all the major youtube channels that have been complaining about it incessantly.
 
Reactions: marees

Mahboi

Golden Member
Apr 4, 2024
1,035
1,900
96
Like I said, it's a very efficient decoder.

I've been following dav1d development from pretty much the day it was first announced.

It's been a while since I saw a comprehensive CPU comparison with dav1d so if I'm wrong do please enlighten me.
I can't.
I can't find a single good review that gives proper usage. Maybe I'm just the one who assumed wrong and it's less costly than it once was.
But I am certain that the transcoding issue is one of the problems at Twitch.
However I also am personally convinced that their code is a hellhole of years of spaghettification, and since their entire stack is on h264 since forever, they are trying to roll out a new codec through over a decade of spaghettifying optimisations.
 
Reactions: Ranulf

soresu

Diamond Member
Dec 19, 2014
3,323
2,599
136
I can't find a single good review that gives proper usage. Maybe I'm just the one who assumed wrong and it's less costly than it once was.
Probably Phoronix would be the only consistent site for dav1d perf in hardware reviews.

I don't ever save them tho, so once it comes and goes it's hit and miss if it sticks in my head.
 

Mahboi

Golden Member
Apr 4, 2024
1,035
1,900
96
Probably Phoronix would be the only consistent site for dav1d perf in hardware reviews.

I don't ever save them tho, so once it comes and goes it's hit and miss if it sticks in my head.
They're not valid, they don't give any core usage. I don't really care about the max framerate, I just am curious about CPU usage.
I still doubt than say Zen 3 can have AV1 decode without significant weight. Like my 5600x would probably take something like 30-50% usage just to watch a 1440p video. That's a CPU that hardly ever consumes more than 20% for any web tasks normally.
 

soresu

Diamond Member
Dec 19, 2014
3,323
2,599
136
They're not valid, they don't give any core usage. I don't really care about the max framerate, I just am curious about CPU usage.
I still doubt than say Zen 3 can have AV1 decode without significant weight. Like my 5600x would probably take something like 30-50% usage just to watch a 1440p video. That's a CPU that hardly ever consumes more than 20% for any web tasks normally.
Hol' up, just downloading a YT video to test my 3950X then you can infer from that.
 

soresu

Diamond Member
Dec 19, 2014
3,323
2,599
136
This may not be a particularly high bitrate example which would affect the utilisation, but it is 1440p25:


MPC-HC at the bottom left is the player.

Another screenshot with full core utilisation graph after I shutdown Firefox:
 
Last edited:
Reactions: Mahboi

soresu

Diamond Member
Dec 19, 2014
3,323
2,599
136
According to amd.com specs 5600X has 300 mhz lower boost clock than 3950X, but also 300 mhz higher base clock, though Zen3 also has higher IPC over Zen2.
 

soresu

Diamond Member
Dec 19, 2014
3,323
2,599
136
Despite AV1 being a more complex codec it is actually possible to be significantly more efficient when decoding it vs the older VP9 because more possible parallelisation routes were designed into AV1 before bitstream freeze.
 

Mopetar

Diamond Member
Jan 31, 2011
8,149
6,861
136
Now they just need to release it and all will be well. Planning to get a top tier RDNA4 at launch if raster is really at 7900XT levels.

I'm waiting to see prices, but if it's reasonable I might pick one up. The only issue is that AMD knows what reasonable is and how to tack an extra $100 on top of that because NVidia is even worse in that regard and will charge $150-$200 over reasonable.

I wouldn't mind seeing a pro refresh for the consoles either. My PS3 died recently and I'm without a BluRay player.
 
Reactions: Tlh97 and marees

jpiniero

Lifer
Oct 1, 2010
15,365
5,884
136
I wouldn't mind seeing a pro refresh for the consoles either. My PS3 died recently and I'm without a BluRay player.

I got bad news about that... the PS5 Pro is most likely $599 (or more even) and that's without the drive (which is $80)

You can pretty much take it to the bank that people are going to complain about RDNA 4's prices.
 
Reactions: Tlh97 and marees

Ghostsonplanets

Senior member
Mar 1, 2024
701
1,122
96
I wonder if AMD will really do RX 8600 and RX 8500. If they price it correctly, they'll have an advantage over Nvidia in the low-end market.

Navi 44 is a 130mm² die. AMD needs to use this advantage of an extremely small die to their fullest and be aggressive to win marketshare.
either less or even no Infinity Cache
AMD x86 Consoles don't have IC. They rely on hUMA with a single pool of memory.
No way N44 can reach that level of performance.
I agree. Folks forget that Navi 44 is still a 16 WGPs design and Navi 33 already clocked quite high. I believe N44 Radeon 8600 XT performance will be around Radeon 6750 XT or RTX 3070/Ti.
All the watts tweeted
N33<<N44<N32
This equals ~7700xt
I think it will be close to 7700xt but NOT between 7700xt and 7800xt.
Something like RTX 3070/Ti performance, yes.
 
Reactions: Tlh97 and marees

marees

Senior member
Apr 28, 2024
578
639
96
Looking at the top selling cards, I think a navi 48 xtx will sell like hot chips for $600, if it can trade blows with 4070 ti super (or even the 4070 ti) in RT

View attachment 106696
Looking at NewEgg there is $150 gap between the 7900gre & 7900xt

That is a significant hole in the segment of the market which is selling big now

If only AMD had a $600 card with RT of 4070 ti or 4070 ti super that they could launch 🤔
 
Last edited:
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |