Discussion RDNA4 + CDNA3 Architectures Thread

Page 140 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,684
6,227
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

gaav87

Junior Member
Apr 27, 2024
15
1
16
"aco: use 1.5x vgprs for gfx1151 and gfx12"
FeatureMaxHardClauseLength32,
Feature1_5xVGPRs]>;

if (family == CHIP_NAVI31 || family == CHIP_NAVI32 || family == CHIP_GFX1151 ||
gfx_level >= GFX12) {
program->dev.physical_vgprs = program->wave_size == 32 ? 1536 : 768;
program->dev.vgpr_alloc_granule = program->wave_size == 32 ? 24 : 12;
} else {
program->dev.physical_vgprs = program->wave_size == 32 ? 1024 : 512;
if (gfx_level >= GFX10_3)

if (program->gfx_level >= GFX12) {
/* Same as GFX11, except one less for VSAMPLE. */
program->dev.max_nsa_vgprs = 3;


In the code, gfx12 VGPR configuration:
- If the wave size is 32, gfx12 has 1536 physical VGPRs.
- If the wave size is 64, gfx12 has 768 physical VGPRs.
The VGPR allocation granule for gfx12 is:
-24 for a wave size of 32.
-12 for a wave size of 64.
7900xtx has 1536 physical vgprs per simd does this mean rdna4 will have 2304 vgpr per simd as the feature has not been merged to LLVM ?
 
Last edited:

Rekluse

Member
Sep 16, 2022
36
46
51
The issue is the current track record

RDNA3 Launch = Borked
RDNA4 Launch = Gimped
RDNA5 Launch = ??????

Overall it's not looking promising for AMD on the chiplet GPU front. RDNA2 was good but this is a whole different ball-game with no evidence of a working concept yet

(MI300X doesn't run games remember)
 

beginner99

Diamond Member
Jun 2, 2009
5,223
1,598
136
Any reason why it can't? Don't see why they can't scale that down to a consumer version, minus the CPU cores.
because latency between dies matters much less in compute. For gaming it's essential.

That and i don't even know if MI300x appears as just one GPU, earlier MI generations appears as 2 gpus to the OS which again is not very relevant for compute but very much so for gaming.
 
Reactions: Tlh97 and marees

marees

Member
Apr 28, 2024
101
66
61
Overall it's not looking promising for AMD on the chiplet GPU front. RDNA2 was good but this is a whole different ball-game with no evidence of a working concept yet
At this point I am just hoping that AMD doesn't lose the surface handheld to Intel

The Xbox next seems locked to RDNA 5
& PlayStation seems locked to Radeon for a long time
But the steam deck (this now sells half as much as xbox) could go nvidia+arm
 
Reactions: Tlh97 and Rekluse

poke01

Golden Member
Mar 8, 2022
1,380
1,585
106
RDNA2 was good but this is a whole different ball-game
I really liked RDNA2, it was simple and smart architecture so much so that I got a RX 6750 XT.

This generation I planned to get RDNA3 when it got discounted. Instead, I ordered an Ada GPU taking advantage of the EOFY sales. I will post the 4070S performance in the Steel Nomad thread maybe this weekend.

I dislike the new connector on the 40 series/not providing VRAM relative to the price but another than that Ada is a great architecture compared to RDNA3 and any other GPU architecture for that matter from Apple etc.

For me it’s simple, having the best efficient architecture but also a powerful architecture will get my money. That was something AMD did with RDNA2. Ampere was good too but it was horrible due to that Samsung node efficiency wise.

It can go either way with RDNA 4, let’s see what AMD does.
 
Reactions: marees
Jul 27, 2020
17,798
11,599
106
That and i don't even know if MI300x appears as just one GPU

AMD conducted a demo of a 40 billion parameter Falcon-40B model running on a single MI300X GPU, but no performance metrics were provided. Instead, the LLM wrote a poem about San Francisco, the location of AMD's event. AMD says this is the first time a model this large has been run on a single GPU.
So it must be appearing to the OS as a single GPU.
 
Reactions: Tlh97 and marees

leoneazzurro

Golden Member
Jul 26, 2016
1,005
1,597
136
Reactions: Tlh97 and marees
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |