Discussion RDNA4 + CDNA3 Architectures Thread

Page 201 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,754
6,631
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

SolidQ

Golden Member
Jul 13, 2023
1,066
1,456
96
For that AMD needs to think outside the box and stop following Nvidia.
Yeah, at least they made AFMF, which NV doesn't have. Interesting there is possible make compromise upscaling only for RT effects?

simply put the Radeon teams
feels like someone sabotage Radeon division.

quick sync is much better than NVENC.
NV pr still strong. AMD alveo is good, but that only AV1
 

tajoh111

Senior member
Mar 28, 2005
340
380
136
Have you not been parroting this since the interest/ loans were becoming due days? Back then it was CPU, now its GPU. It is tiring. I mean it is the same lengthy paragraphs of doom and gloom all the way to the OCN heydays.

If you are an insider, please keep your NDA intact, and spare us the analysis of your predictions from years ago rhetoric. Please.

It is obvious you are Intel/Nvidia through and through. Always some sort of AMD has done this, that, or the other wrong, in your opinion.

It is so tiring. Like 10 years tiring.

At least maybe just try and be somewhat constructive with your criticism from the past ten years. So it is TLDR.
I am hardly criticizing what they are doing, actually the opposite. I am agreeing with their current actions about leaving high end and refocusing their resources as the high end discrete market is just a losing battle with little chance for victory.

I said similar praise for them in the past when they kind of stripped down the GPU division(which I was berated for) to focus on CPU which gave them the resources for Ryzen.

My last two desktop builds have been AMD processors and if you didn't see previously, I am not particularly optimistic about Intel's chances in the videocard market. It's not doom and gloom, it's realistic and observing the reality of the situation.

BTW I have also made some scathing remarks about Nvidia on overclock.net. As of late however, they are an execution machine and their financials are proof of this.
 

SolidQ

Golden Member
Jul 13, 2023
1,066
1,456
96
I am agreeing with their current actions about leaving high end and refocusing their resources as the high end discrete market is just a losing battle with little chance for victory.
They still have chance make UDNA 2k gpu, with good perfomance and sell it as AI card, like 4090. People buying 7900XTX for AI. Just need good software
 

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,501
96
MS will follow Sony, next gen xbox will copy PS6 in terms of GPU/AI feature sets.

6 years ago RT/AI upscaling was barely relevant and it wasn't
Point is, there are no "RT feature sets". RTRT is frozen in time.
quick sync is much better than NVENC.
Not really. But it's good.
That’s the point, make it so good the shills can’t move the goal posts.
That's what N4c was for.
For that AMD needs to think outside the box and stop following Nvidia.
Yes. Nothing 300CUs can't solve.
simply put the Radeon teams need to be more like Ryzen teams. Be the leader instead of following the carrot.
Ryzen isn't a product, it's just server table scraps served to hungry-hungry peasants.
Something that's not possible to do in graphics.
with little chance for victory.
Victory isn't hard. Making money off it is.
As of late however, they are an execution machine and their financials are proof of this.
GB200 is literally tumbling around like a meme. Be serious.
They still have chance make UDNA 2k gpu
Doesn't exist.
RDNA5 is 3 whole dies. That's it.
 

yuri69

Senior member
Jul 16, 2013
601
1,056
136
LMAO, imagine how the whole AMD management chain responsible for the software-side performs. What are the KPIs? How is the performance review process set? This must be so hilariously bad.
9070 XT would be beyond DOA at $600. Even at $500 it's a tough sell vs $600 5070.
The great pricing strategy of AMD is to price their dGPUs as (nV - a few tens $). So yea, I fully expect them to release the RDNA4 top end with a slow RT and half-broken drivers at $600 and call it a day.
 

Keller_TT

Member
Jun 2, 2024
113
112
76
It doesn't have to, it just gotta have "AI" in the name.

It's so relevant MS hasn't made any DXR changes in 6 years. Be serious.

Unlikely. For cheaper stuff there's FSR2.
Mr. Thurston always seems to be the contrarian to the folks buying cards out there who vote with their wallets. Even the recent Sony PS5 Pro deep dive event pretty much declared the writing on the wall. That future efficiency and more bang for the silicon can only come through RT innovations and ML libraries (Project Amethyst) and traditional raster is at its limit.
As for me, I'm predominantly a Sim racer who needs the best VR performance and I dabble in Blender for some amateur personal modding. I had to sell my 7900 XT and get a 4080 because it's all so much easier with NV and I had to wait to see the Ngreedia price reduce.
 

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,501
96
We don't have any news about it
Someone already leaked the roadmap some time ago.
there who vote with their wallets
They vote for whatever NV card they can afford since ~2006 or so.
That future efficiency and more bang for the silicon can only come through RT innovations and ML libraries (Project Amethyst) and traditional raster is at its limit.
RTRT is inherently the opposite of silicon (or memory) efficiency lmao.
MAO, imagine how the whole AMD management chain responsible for the software-side performs. What are the KPIs? How is the performance review process set? This must be so hilariously bad.
This might be beyond obvious to some, but training isn't the focus at all.
 
Reactions: Tlh97 and marees

SolidQ

Golden Member
Jul 13, 2023
1,066
1,456
96
As for me, I'm predominantly a Sim racer who needs the best VR performance and I dabble in Blender for some amateur personal modding. I had to sell my 7900 XT and get a 4080 because it's all so much easier with NV and I had to wait to see the Ngreedia price reduce
That also big problem for AMD, they avoid fixes like plague. For real they need new software team, which can be creative, feels like AMD software team is 70-80 years guys
 
Reactions: Tlh97 and Keller_TT

insertcarehere

Senior member
Jan 17, 2013
703
694
136
Ryzen isn't a product, it's just server table scraps served to hungry-hungry peasants.
Something that's not possible to do in graphics.
Spoken as if Nvidia hasn't been selling plenty of rebadged Turing/Ampere/Ada GPUs for a hefty markup to the professional space for years lmao. Or thinks that tensor cores doing matrix multiplication for DLSS upscaling, can't be repurposed for matrix multiplications elsewhere..
 

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,501
96
Spoken as if Nvidia hasn't been selling plenty of rebadged Turing/Ampere/Ada GPUs for a hefty markup to the professional space for years lmao.
That's really not what they're build for and you know it.
Or thinks that tensor cores doing matrix multiplication for DLSS upscaling, can't be repurposed for matrix multiplications elsewhere..
GEMM units everywhere are a price you gotta pay for preserving CUDA onion model, and they sort of broke it with H100 anyway.
Also the volume driver Turing parts (TU11x) had no GEMM cores. Next.
 

gdansk

Diamond Member
Feb 8, 2011
3,768
6,015
136
traditional raster is at its limit.
He says as the 5090 is about to push raster another 1.5x.
And still it won't be enough for some High Quality "AAAA" games at 4K.
Raster keeps scaling despite more and more die area wasted on RT/ML crap. But it seems no one cares (you all are invited to argue why).
 
Last edited:

insertcarehere

Senior member
Jan 17, 2013
703
694
136
That's really not what they're build for and you know it.
Who cares what they're supposedly built for? 4090s are gaming cards first but the US still restricts their imports to China all the same. The Ampere family encompasses both accelerator only (GA100) as well as dual-use cards (GA102 onwards) and Blackwell will be the same.

GEMM units everywhere are a price you gotta pay for preserving CUDA onion model, and they sort of broke it with H100 anyway.
Also the volume driver Turing parts (TU11x) had no GEMM cores. Next.

Point being the tensor core tech being developed are very much dual use between consumer and professional markets.

GTX1650 will still compile most CUDA code fine, can't say the same about RDNA1 and the mess that isROCm lmao.
 

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,501
96
Who cares what they're supposedly built for? 4090s are gaming cards first but the US still restricts their imports to China all the same. The Ampere family encompasses both accelerator only (GA100) as well as dual-use cards (GA102 onwards) and Blackwell will be the same.
Everyone? A purpose-build client part is a purpose-build client part.
Not in any way, shape or form comparable to something like Vermeer or Granite Ridge.
Point being the tensor core tech being developed are very much dual use between consumer and professional markets.
No it's just a price you gotta pay for CUDA compatability. But they found other uses too.
GTX1650 will still compile most CUDA code fine, can't say the same about RDNA1 and ROCm lmao.
Glad to hear.
 

ajsdkflsdjfio

Member
Nov 20, 2024
171
117
76
He says as the 5090 is about to push raster another 1.5x.
And still it won't be enough for some High Quality "AAAA" games at 4K.
Raster keeps scaling despite more and more die area wasted on RT/ML crap. But it seems no one cares (you all are invited to argue why).
5090 pushes raster 1.5x with 1.32x the cuda cores(meaning 1.32x die size atleast likely), and 1.25x the power while costing 1.5x the previous gen (street pricing). There's a reason why raster is still able to be pushed forward, it's because Nvidia has just increased the price of their cards by 2-3x over the past decade, and pushing power(GTX titan 250w vs 5090 600w), all to keep the same level of historical uplifts(1080 was also 1.5x the 980 btw at only 50$ more, and using the same exact power). At a certain point they can't keep doing this though which is why they are leaning into upscaling technology, AI technology in general, and possibly cloud-computing like with their (unsuccesful)ventures into geforce now/nvidia shield.
 
Last edited:

Keller_TT

Member
Jun 2, 2024
113
112
76
He says as the 5090 is about to push raster another 1.5x.
And still it won't be enough for some High Quality AAAA games at 4K 120.
I'm only relaying what Mark Cenry said w.r.t what'll make sense for premium consoles going forward within a silicon budget, and to keep it all packaged nice and tight. The PS5 Pro is ~RX 6800 raster with RT closer to 4070, with the SoC power efficiency the best it can be. It's pretty much a testing ground to advance Sony's proprietary technologies and graphics stack to be as mature and efficient as it can be for PS6 (probably launching in 2027).
 

gdansk

Diamond Member
Feb 8, 2011
3,768
6,015
136
5090 pushes raster 1.5x with 1.32x the cuda cores(meaning 1.32x die size atleast likely), and 1.25x the power while costing 1.5x the previous gen (street pricing). There's a reason why raster is still able to be pushed forward, it's because Nvidia has just increased the price of their cards by 2-3x over the past decade, and pushing power(GTX titan 250w vs 5090 600w), all to keep the same level of historical uplifts(1080 was also 1.5x the 980 btw at only 50$ more, and using the same exact power). At a certain point they can't keep doing this though which is why they are leaning into upscaling technology, AI technology in general, and possibly cloud-computing like with their (unsuccesful)ventures into geforce now/nvidia shield.
The limits to raster performance remains memory bandwidth. RT needs more buffers and specialized hardware on top of that. It hits the same memory bandwidth wall but even earlier. The idea that somehow raster is any more tapped out than RT is suspect. RT has more market headroom because most people don't care about playing at 4K 120Hz+ but rather see visual flair to justify their purchase even if it's subnative and half the frame rate. But it's not a technical limit.

And jesus I thought the cloud-compute gaming meme died with the XBONE.
 

ajsdkflsdjfio

Member
Nov 20, 2024
171
117
76
The limits to raster performance remains memory bandwidth. RT needs more buffers and specialized hardware on top of that. It hits the same memory bandwidth wall but even earlier. The idea that somehow raster is any more tapped out than RT is suspect. RT has more market headroom because most people don't care about playing at 4K 120Hz+ but rather see visual flair to justify their purchase even if it's subnative and half the frame rate. But it's not a technical limit.
I don't disagree, but raster is definitely getting to the point where improvements in raster are coming at higher and higher costs to consumers, even if not tapped out in a technical sense. A side effect of transistor costs no doubt. Either way it is not sustainable because eventually people are gonna buy less and less of the new +40-50% generation that costs +30-40% percent more as well. Unless something radical happens, I doubt people want to spend 10k on a GPU in 2034.
 

gaav87

Senior member
Apr 27, 2024
452
794
96
Yeah, at least they made AFMF, which NV doesn't have. Interesting there is possible make compromise upscaling only for RT effects?


feels like someone sabotage Radeon division.


NV pr still strong. AMD alveo is good, but that only AV1
Interesting idea RT "ray generation" could be done for rayCast. But would it have any performance benefits?. Basically guessing the ray position w/o actually calculating it from the previous postion is interesting idea.
 

Joe NYC

Platinum Member
Jun 26, 2021
2,789
4,101
106
Ryzen isn't a product, it's just server table scraps served to hungry-hungry peasants.

Ryzen revenue is higher than EPYC, and client margin is improving.

New AMD CFO, Jean Hu, is not a fool (like Intel Intel's in the past which killed / sold off low margin divisions). Jean likes margin dollars as mush as margin rates.

Meaning if she can't get high margin %, will settle for margin $s at lower margin %. I think Lisa Su broadly agrees with her.

Which, to me, implies that AMD will in fact go for market share, especially on the CPU side, and at some point in GPU too. Just depends on what kind of weapons AMD has at its disposal.

Also NVidia is, IMO, worried about regulators, perhaps more than about AMD. Splitting NVidia could be a mortal blow.

So IMO, NVidia is not going to participate in any price war with AMD. Which means, that AMD, could hold its own or improve in market share of both client and datacenter divisions.

There is an ace that AMD is holding, which is Strix Halo. It can, potentially, take more market share from NVidia client dGPU than AMD's dGPU division. Which is why NVidia is moving quickly in this area, but AMD has 1 year head start and x86 license.
 

Keller_TT

Member
Jun 2, 2024
113
112
76
Did they manage to replicate the CPU bottleneck from Zen2 3700S? Sony's reasoning was that *for most games to push 60 FPS* Not 120, they don't a mid-gen CPU upgrade.
And your screenshot means nothing by itself as some devs are exceedingly good at optimization and others just suck. There are some standout results for the Pro that get the best out of the new hardware.
 

RnR_au

Platinum Member
Jun 6, 2021
2,204
5,271
106
GTX1650 will still compile most CUDA code fine, can't say the same about RDNA1 and the mess that isROCm lmao.
r/localllama folk are reporting the 5700xt cards are working ok with rocm with a couple of tweaks.

From my understanding rocm has come a very long way in the last year. Still not perfect ofcause, I managed to get it working with an RX 6600 for AI inference under linux.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |