Discussion RDNA4 + CDNA3 Architectures Thread

Page 199 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,754
6,631
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,501
96
SM 3.0 was in like 2-3 games, Far Cry HDR and splinter cell and that all.
Yeah but it was Feature of the Day You Just Can't Miss and there was that.
R520 yes was bad, but R580 easily beat any G70
Yeah R580 kicked major ass but it also released close to g80 so gg.
more a comment on the thinking that heavily discounted previous gen is the best comparison.
yeah like. GPU streep prices have been going down close to a new gen launch for 25 years.
 

SolidQ

Golden Member
Jul 13, 2023
1,067
1,457
96
Yeah but it was Feature of the Day You Just Can't Miss and there was that.
maybe for some people, but a lot friend was have X800 series, especially X800XL, because it's was cheap. I also buyed X800XL and played Doom3 etc
and SM 3.0 on 6xxx was irrelevant, because, back there GPU became obsolete much faster, than today. Like 2 years for top GPU and dead, today you can sitting on middle gpu like 6-7 years, on top even more
 

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,501
96
maybe for some people, but a lot friend was have X800 series, especially X800XL, because it's was cheap. I also buyed X800XL and played Doom3 etc
and SM 3.0 on 6xxx was irrelevant, because, back there GPU became obsolete much faster, than today. Like 2 years for top GPU and dead, today you can sitting on middle gpu like 6-7 years, on top even more
Yeah, but remember, NV product marketing is very sawwy and makes a mountain out of every molehill.
 
Reactions: KompuKare

Joe NYC

Platinum Member
Jun 26, 2021
2,790
4,102
106
Devil's Advocate: 530mm2 of silicon on a relatively complicated packaging process just only matched NV's 380mm2 die size in raster with far worse ray tracing performance and "feature set" (regardless of its value to you).

Of course, RDNA3's issues seem to stem from its architecture rather than its chiplet packaging, but it can be hard to separate the two without direct acknowledgement from AMD on what went wrong.

The problems were clock speed and power consumption (almost no increase in performance / power). On the main die. Neither is related to chiplet arrangement. The chiplet has a a bit of an overhead, but limited. Not the cause of RDNA3 not being able to clock high enough at reasonable power consumption.
 

insertcarehere

Senior member
Jan 17, 2013
703
694
136
Spoken like someone who never had CUDA setup break in horrendous ways on update.
You're talking in abstract hypotheticals and not anything related to NV devportal experience. Stop.
It does not take much experience reading Dylan's article from SemiAnalysis here benchmarking the MI300X to the NVs to see how much worse the experience is on team AMD. Even with VIP custom dev builds directly from AMD themselves specifically for the test (!) this still gets worse performance than any of NV's offerings out of the box with more bugs and other nasties. Small pricing discounts will not cut it for enterprises to tolerate these sort of issues.

Choice quotes from the article:
The only reason we have been able to get AMD performance within 75% of H100/H200 performance is because we have been supported by multiple teams at AMD in fixing numerous AMD software bugs. To get AMD to a usable state with somewhat reasonable performance, a giant ~60 command Dockerfile that builds dependencies from source, hand crafted by an AMD principal engineer, was specifically provided for us, since the Pytorch Nightly and public PyTorch AMD images functioned poorly and had version differences. This docker image requires ~5 hours to build from source and installs dependencies and sub-dependencies (hipBLASLt, Triton, PyTorch, TransformerEngine), a huge difference compared to Nvidia, which offers a pre-built, out of the box experience and takes but a single line of code.

Although AMD’s own documentation recommends using PyTorch native Flash Attention, for a couple months this summer, AMD’s PyTorch native Flash Attention kernel ran at less than 20 TFLOP/s, meaning that a modern CPU would have calculated the attention backwards layer faster than an MI300X GPU. For a time, basically all Transformer/GPT model training using PyTorch on the MI300X ran at a turtle’s pace. Nobody at AMD noticed this until a bug report was filed following deep PyTorch/Perfetto profiling showing the backwards pass (purple/brown kernels) took up far more time than the forward pass (dark green section). Normally, the backwards section should take up just ~2x as much time as the forward pass (slightly more if using activation checkpointing).
 
Reactions: xpea

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,501
96
It does not take much experience reading Dylan's article from SemiAnalysis here benchmarking the MI300X to the NVs to see how much worse the experience is on team AMD. Even with VIP custom dev builds directly from AMD themselves specifically for the test (!) this still gets worse performance than any of NV's offerings out of the box with more bugs and other nasties. Small pricing discounts will not cut it for enterprises to tolerate these sort of issues.
Yeah, that's training. Not what the customers asked for (for now, anyway).
Now please showcase your actual hands-on experience with CUDA.
 

itsmydamnation

Platinum Member
Feb 6, 2011
2,994
3,696
136
Jack twitter, and that where 40% market share came
And ?

you people are so odd.

Try working in the highend of design for an IT megacorp, people say stuff all the time.....

This thread just feels like FanFic........

What are AMD's actions.

When I look at AMD's actions , it looks like they are building top to bottom product stacks in every market they compete in except DGPU....

I say this as quite the happy owner of a 7900XTX.

You could probably argue the only reason to have a 200-300 watt class DGPU is to lop 10-15% clock off it to put it into 100-150watt laptop chassis.
 

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,501
96
AMD EPYC revenue: ~1.5 billion
34% unit share with CPU capex majorly down due to GPU overspend.
NVidia client dGPU revenue: ~3.3 billion
91% unit share.
The dGPU market is not table scraps
Yeah it is, especially in a tough comp env that forces a price war.
I think the bigger problem is AMD not having enough staff for datacenter GPU
They do.
and probably raided the client GPU division.
Client GFX IP roadmap is wholly intact. They just surrendered the actual discrete GPUs as a market.
 
Reactions: Tlh97 and Joe NYC

SolidQ

Golden Member
Jul 13, 2023
1,067
1,457
96
Their enemy is the rising semiconductor R&D cost.
Also, but when my friend change 7800XT for 4070 super because it's like 7 times faster in Blender, that minus client and there a lot people with same situation. They need fix Adobe/Blender etc perfomance. Step by step they need fix everything, because monopoly is bad for all of us
 
Reactions: Tlh97 and Shmee

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,501
96
but when my friend change 7800XT for 4070 super because it's like 7 times faster in Blender, that minus client and there a lot people with same situation.
Niche. Average NV client customer is a kid that needs a GPU.
They need fix Adobe/Blender etc perfomance. Step by step they need fix everything, because monopoly is bad for all of us
Irrelevant. They're not in the game without a halo part so any ounce of bling is just money down the drain.
 

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,501
96
Not everyone need halo product
You need a halo product to move units.
when non-forum people hearing AMD driver bad/software is bad etc. ofc you won't gain any share
It's really easy for the shills to find new ways to bash AMD GPUs with. It's a futile effort. A game long lost.
Niche, not niche. Money is money. Share is share
That only works with a chainsaw. No chainsaw and you're just drowning money for kicks.
 

adroc_thurston

Diamond Member
Jul 2, 2023
4,714
6,501
96
Seems like your agreeing with me now about struggling with R and D cost
They're not short of cash.
the discrete market battle isn't worthwhile to AMD considering other market opportunities.
It can be worthwhile, but it's also a very casino market and running expensive R&D programs to gamble is exactly the thing that almost killed AMD before.
 

DaaQ

Golden Member
Dec 8, 2018
1,725
1,213
136
I've posted this before at least twice, but I'll post it again - here's AMD's TAM strategy for Polaris the last time they were extremely behind with just a mainstream lineup (basically 480 was kind of competitive with the 1060, Nvidia was alone with 1070/1080 and of course 1080ti later on). I remember that the "cheap" 4GB (for $200) was mostly marketing and in reality it was just for the initial launch or for a very short while and then they actually stopped making them (so actual entry price was $240), but I couldn't find a source to back my memory up. When AMD is extremely behind they always claim how they're going for TAM or whatever. What are their other options? Saying that they have a weak brand? that Nvidia is pushing for RT and DLSS and that they're behind?

They could say that they've failed with their chiplet strategy for client, and how they wrecked four years of client GPUs (RDNA3/RDNA4 at least), but they don't want to do that. They've failed before with going all-in on HBM on client, and now they've made a similar mistake with chiplets.


I get it, I've had solely AMD cards since the Radeon HD 4870 (last Nvidia card was the 6600GT), but given AMD's recent behavior even if they'll have a winner, they won't significantly undercut Nvidia. They had a competitive lineup with RDNA2 and didn't really undercut Nvidia at launch MSRP-wise with the 6800xt and 6700xt. RDNA3 also (IMO) didn't really undercut Nvidia enough. They've shown this time and time again. We'll see what happens with RDNA4, but given RDNA3's pricing with the 7900xtx and 7900xt - I'm not hopeful. Only when it was very clear that RDNA3 was a dud, only then did AMD provide decently priced cards - with the 7800XT and 7900GRE.
Bold:
Looks like you only want them to undercut Nvidia to lower Nvidia pricing. You already said you are going that route.
Underlined:
Do you own a chiplet RDNA3 card? Doesn't seem a dud to me.
 

Shmee

Memory & Storage, Graphics Cards Mod Elite Member
Super Moderator
Sep 13, 2008
7,903
2,879
146
I certainly hope that AMD does not give up the competition, they have been doing relatively well with their innovation and delivering nice things. Anyway time will tell, but I hope that we can get a top end card with RDNA5/UDNA whatever...and hopefully with a name that makes sense

Anyway, I will be happy for now with my 7900XTX from Sapphire. But I do hope that someone will continue to compete with Nvidia at the higher end, as many of us don't want the melty connectors on our cards. Without AMD Radeon to do this, who would? Intel? It would be even harder for them.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |