Discussion RDNA4 + CDNA3 Architectures Thread

Page 190 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

DisEnchantment

Golden Member
Mar 3, 2017
1,754
6,631
136





With the GFX940 patches in full swing since first week of March, it is looking like MI300 is not far in the distant future!
Usually AMD takes around 3Qs to get the support in LLVM and amdgpu. Lately, since RDNA2 the window they push to add support for new devices is much reduced to prevent leaks.
But looking at the flurry of code in LLVM, it is a lot of commits. Maybe because US Govt is starting to prepare the SW environment for El Capitan (Maybe to avoid slow bring up situation like Frontier for example)

See here for the GFX940 specific commits
Or Phoronix

There is a lot more if you know whom to follow in LLVM review chains (before getting merged to github), but I am not going to link AMD employees.

I am starting to think MI300 will launch around the same time like Hopper probably only a couple of months later!
Although I believe Hopper had problems not having a host CPU capable of doing PCIe 5 in the very near future therefore it might have gotten pushed back a bit until SPR and Genoa arrives later in 2022.
If PVC slips again I believe MI300 could launch before it

This is nuts, MI100/200/300 cadence is impressive.



Previous thread on CDNA2 and RDNA3 here

 
Last edited:

Saylick

Diamond Member
Sep 10, 2012
3,798
8,666
136
Level 0 – Legacy Solutions
Level 1 – Software on Traditional GPUs
Level 2 – Ray/Box and Ray/Tri Testers in Hardware
Level 3 – Bounding Volume Hierarchy (BVH) Processing in Hardware
Level 4 – BVH Processing with Coherency Sorting in Hardware
Level 5 – Coherent BVH Processing with Scene Hierarchy Generator in Hardware

Interesting what level is RDNA4 lvl3?
Probably Level 3 at worst, Level 4 at best. RDNA2/3 was already Level 2. ARC is Level 4, Ada is Level 4.
 
Reactions: Tlh97 and Elfear

Saylick

Diamond Member
Sep 10, 2012
3,798
8,666
136
Looks like RDNA 4 finally gets something for thread coherency, which means it's Level 4.

Also, Amethyst is not about proprietary features and is a collaboration between Sony and AMD. This is good, if it means FSR4 gets to leverage what's going on under-the-hood in PSSR.
 

Mopetar

Diamond Member
Jan 31, 2011
8,201
7,027
136
I just want an efficient architecture with no major bottlenecks that can run my games at reasonable frame rates without needing to rely on upsampling or other trickery to make bigger bars solely for the purpose of making me feel better about my purchase.

I get that some people are excited about these technologies (and if supported they do give the card longer legs down the road) but I mainly view them as taking resources away from what I want. I don't care how good FSR4 is. I hope to never have to turn it on in the first place.
 

Saylick

Diamond Member
Sep 10, 2012
3,798
8,666
136
Is it more like NVIDIA's implementation, which isn't exactly double speed, but it seems to be a lot better than AMD's? I guess it can only be better than current sad state of affairs.
I tried to look into how CDNA 3's dual issue works and based on what I can understand from C&C's article, it still has its limitations, especially since it appears more suited to how GCN works vs RDNA. Nvidia's approach appears to be more flexible overall, imo.
 
Reactions: Tlh97 and Win2012R2

Bigos

Member
Jun 2, 2019
170
424
136
I am still waiting for a day when the game engine developer job - the upscaling - is actually done by game engine developers and not ISV... This stupid precedent is made by Nvidia having enough developers to create a vendor lock in...
 

marees

Senior member
Apr 28, 2024
743
898
96
Also, Amethyst is not about proprietary features and is a collaboration between Sony and AMD. This is good, if it means FSR4 gets to leverage what's going on under-the-hood in PSSR.
Looks like Amethyst is a generalized architecture (like Vulkan) for using CNNs. What nvidia is using in Tensor Cores

Sony said this is a 4 year project. So probably something for RDNA 5, 6 (or UDNA 1, 2) & PS6

For PS5 pro, sony has mentioned that they are using register memory as RAM to avoid latency for PSSR (Again no Tensor cores). I wonder if RDNA 4 uses same architecture for FSR 4 ?
 
Reactions: Tlh97 and Win2012R2
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |