Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 636 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

tsamolotoff

Member
May 19, 2019
164
280
136
Why would they increase the execution resources on the FP side, if they cannot sustain more than 2x512b loads per cycle?
considering that they've improved load/store, they could have went for 4x256 which arguably is more useful than 2x512 bit, no? AVX512 in all of its forms is very niche so i was expecting improvements that were more applicable in general if the 40% ipc rumours are to be believed
 

Josh128

Senior member
Oct 14, 2022
203
336
96
There s nothing exceptional, the guy is just unaware that his chip still consume much more than Zen 4 in this very bench which is a best case for Intel.

FI he boast 36k3 at 170W, just imagine that it s about the score of a stock 7950X3D that use barely 130W to do so, with some UV and tweaking like this one you can get at 110W.

FTR a 14900KS does 41k at stock and using 330W, so his score at 270W is not even much better overall than a stock chip.

Guess that s telling at wich point some people are in denial, seeing as great what is actually very mediocre, but hey, that s my prefered brand.
Matheus de Souza? Hes undervolting his i9 to get those purported results.
 

MS_AT

Member
Jul 15, 2024
126
319
91
considering that they've improved load/store, they could have went for 4x256 which arguably is more useful than 2x512 bit, no? AVX512 in all of its forms is very niche so i was expecting improvements that were more applicable in general if the 40% ipc rumours are to be believed
Do you mean 4x256b loads? I guess it's better to do 2x512 as you have to decode less instructions [or keep less of them in the uop cache] and calculate less addresses and x86 still has only 16GPRs, so it's easy to spill from GPRs to cache for some algos. It's also nice when you can fetch whole cacheline in one go. But I am probably a bit biased as 512b units are one of the zen5 selling points for me It's different for ARM where you have 32 GPRs and simpler instructions [but you need more of them, that's why ARM design often end up with bigger inst caches than data caches I guess]

Now in general about AVX512 being niche its thanks to Intel market segmentation policy. AVX512 brought a lot of useful instructions regardless of register size so it's nice AMD wants to make it more popular. I mean Zen4 was already showing benefits even with the limitations it had.

If one could dream, these 2ADD 2FMA 512b units could turn into 4ADD 4FMA 256b units if they encountered 256b instructions [likewise for load/store], but I guess this wouldn't fly due to issues in silicon implementation.
 

Abwx

Lifer
Apr 2, 2011
11,446
4,201
136
While we are on the Cinebench register a slide from AMD state that Strix Point has 20% better ST perf in CB 2024 than Intel s 185H MTL.

Given that SC frequency is the same for the two chips this would imply about 20% better IPC in CB 2024 for Zen 5, and quite more uplift than the 17% IPC improvement stated for CB R23 nT.

Perhaps that our resident leaker could do a CB 2024 run to check the things, but anyway if true, and because CB relevance is galacticaly overestimated by the usual crowd, this could be quite the panic in some HQs.



 

Josh128

Senior member
Oct 14, 2022
203
336
96
nothing like testing a new chip few days before release with undisclosed "daily PBO + CO" settings and a freaking car radiator to cool it to get headlines
Did you make an account here just to post ignorant things like this? Please get back to WCCFTECH and X and let the adults discuss. Else, please post your 45K R23 and locked 5GHz runs getting 43.3 R23, then we can talk.

pic.twitter.com/w1IeFG95Jt


Note: This person is a known troll and Intel shill that posts regularly on WCCFTECH and X. He will bring nothing serious to the discussion here.
 

Geddagod

Golden Member
Dec 28, 2021
1,295
1,368
106
While we are on the Cinebench register a slide from AMD state that Strix Point has 20% better ST perf in CB 2024 than Intel s 185H MTL.

Given that SC frequency is the same for the two chips this would imply about 20% better IPC in CB 2024 for Zen 5, and quite more uplift than the 17% IPC improvement stated for CB R23 nT.

Perhaps that our resident leaker could do a CB 2024 run to check the things, but anyway if true, and because CB relevance is galacticaly overestimated by the usual crowd, this could be quite the panic in some HQs.



Not bad, but ARL will likely catch up. Also, LOL at Qualcomm's chips scoring higher. Really, this isn't that impressive.
You can get fine wine already with cachyOS with their latest Zen4/5 optimised release.

Excluding the AVX-512 datasets it’s about a 14.5% gain in IPC. It’s clear that Zen 5 is a server first architecture more than every other Zen.
If Zen 5 was server, well specifically HPC, first, I would imagine it would also feature larger, core private caches, and not SMT. And even in a general server setting, I would imagine it would focus on increasing power at the lower end of the power curve, as well as not expanding core area that much (in order to increase core density).
 

Kryohi

Member
Nov 12, 2019
41
92
91
Not bad, but ARL will likely catch up. Also, LOL at Qualcomm's chips scoring higher. Really, this isn't that impressive.

If Zen 5 was server, well specifically HPC, first, I would imagine it would also feature larger, core private caches, and not SMT.
Care to explain why not SMT? Mike Clark specifically said it does not have any significant area cost, and many HPC workloads absolutely benefit from it.

And even in a general server setting, I would imagine it would focus on increasing power at the lower end of the power curve, as well as not expanding core area that much (in order to increase core density).
Isn't that what the C cores are made for?
 
Reactions: Tlh97

Josh128

Senior member
Oct 14, 2022
203
336
96
Just some numbers to chew on from GamersNexus 9950X OC video. In the vid, they did a locked 5GHz run in R23 and got 43375 score. There may be some noise/variance run to run but this gives +15.94% IPC in R23 MT for Zen 5 vs Zen 4.

-9950X 5GHz, 43375 R23: 8675 points/GHz MT

- 7950X 5.5GHz, 41149 R23 : 7482 points/GHz MT



https: //youtu.be/pYWtP4tZe30?t=820

https: //www.youtube.com/watch?v=ccg6aU7Hmjw&t=78s
 

poke01

Golden Member
Mar 8, 2022
1,874
2,370
106
Not bad, but ARL will likely catch up. Also, LOL at Qualcomm's chips scoring higher. Really, this isn't that impressive.
I would say AMD and Intel have long way to go. 5.1GHz on Strix and yet still cannot match M3 in ST performance is not a pretty sight.
Note: This person is a known troll and Intel shill that posts regularly on WCCFTECH and X. He will bring nothing serious to the discussion here.
let’s hope the person behaves here. The mods are strict and trolling is a no go.
 
Reactions: Tlh97 and Saylick

tsamolotoff

Member
May 19, 2019
164
280
136
If one could dream, these 2ADD 2FMA 512b units could turn into 4ADD 4FMA 256b units if they encountered 256b instructions
Yes, that's precisely what I'd prefer

Note: This person is a known troll and Intel shill that posts regularly on WCCFTECH and X. He will bring nothing serious to the discussion here.
He also took one of my screenshots to claim that Zen5 is worse than Zen4 when it turned out to be slower in Blender at low PPTs
 
Reactions: Tlh97 and Josh128

Kryohi

Member
Nov 12, 2019
41
92
91
Official dies sizes for the CCD of Granite Ridge (Eldora) and Strix Point:The CCD with 8x Zen 5 is 70,6 mm² – the same as Durango (CCD with Zen 4). Strix Point is growing to 232,5 mm². Both being manufactured in N4P.


Edit: according to another xitter user (do we have an official source?), zen 5 CCD has 8.315B transistors, so, ~117 MTr/mm².
 

Geddagod

Golden Member
Dec 28, 2021
1,295
1,368
106
Care to explain why not SMT? Mike Clark specifically said it does not have any significant area cost, and many HPC workloads absolutely benefit from it.
Many HPC workloads don't benefit from SMT. Stuff like memory bandwidth limitations of workloads simply not scaling mean that SMT can worsen performance in many instances due to increased competition for resources.
Isn't that what the C cores are made for?
Just because you have dense cores doesn't mean you don't want to aim for better core density with your normal products/cores as well.
I would say AMD and Intel have long way to go. 5.1GHz on Strix and yet still cannot match M3 in ST performance is not a pretty sight.
Ong
Official dies sizes for the CCD of Granite Ridge (Eldora) and Strix Point:The CCD with 8x Zen 5 is 70,6 mm² – the same as Durango (CCD with Zen 4). Strix Point is growing to 232,5 mm². Both being manufactured in N4P.

IIRC Eldora transistor density is something like 25% higher than Durango
 
Reactions: exquisitechar

DAPUNISHER

Super Moderator CPU Forum Mod and Elite Member
Super Moderator
Aug 22, 2001
29,177
23,270
146
nothing like testing a new chip few days before release with undisclosed "daily PBO + CO" settings and a freaking car radiator to cool it to get headlines
You are new here so please take the time to read the CPU forum rules. Flame bait gets extinguished fast around here. You can't go in a vendor thread and start trashing and bashing that vendor or trolling the members. If you fail to comply with the rules and guidelines your stay here will be a short one.

Don't respond to this post, as it is an Moderator comment. Moderation issues has its own forum, post there if you would like to question moderation.

Mod DAPUNISHER
 

Saylick

Diamond Member
Sep 10, 2012
3,493
7,715
136
nothing like testing a new chip few days before release with undisclosed "daily PBO + CO" settings and a freaking car radiator to cool it to get headlines
If you’re not happy about how headlines are being generated, maybe you should tell your boy Hassan to stop writing articles where he’s not fully reporting all the details of the source.
 

SarahKerrigan

Senior member
Oct 12, 2014
735
2,033
136
Many HPC workloads don't benefit from SMT. Stuff like memory bandwidth limitations of workloads simply not scaling mean that SMT can worsen performance in many instances due to increased competition for resources.

HPC is a tiny fraction of the total server market, and most server workloads are commercial code that gets quite a bit out of SMT.

Note that the RISC/UNIX world, which has been purely server for a long time, consisted exclusively of multithreaded cores by 2008, across all four silicon vendors (Intel/HP, IBM, Fujitsu, Sun/Oracle.)
 

DisEnchantment

Golden Member
Mar 3, 2017
1,732
6,510
136
Official dies sizes for the CCD of Granite Ridge (Eldora) and Strix Point:The CCD with 8x Zen 5 is 70,6 mm² – the same as Durango (CCD with Zen 4). Strix Point is growing to 232,5 mm². Both being manufactured in N4P.


Edit: according to another xitter user (do we have an official source?), zen 5 CCD has 8.315B transistors, so, ~117 MTr/mm².

If we roughly assume that around 10mm2 is for the GMI/SMU/DFT blocks, which took around ~100MTr in Z4 (from ISSCC ppt), that is 55mm2 for 8.2B Xtor, which is around 145+MTr/mm2,(Z4 has 118 MTr/mm2 for the CCX alone )



It seems evident, the moment N4P perf can hit anywhere near the needed frequency without usage of several custom cell optimizations, they focus on density instead, the side benefit is that there is no tangible change in efficiency despite the widening of few structures.
Die size went down, by ~0.6%, but this is with scribe lines, should be 65mm2 without scribe lines




140MTr/mm2 is Phoenix level density.
 
Last edited:

BeyondFPS

Banned
Jul 10, 2024
3
8
36
Moderator callouts are not allowed
It is fascinating how I answer someone that calls me troll and shill, my message gets removed, and this person's message stays there like it's totally fine. Rules for some but not for all?

If you’re not happy about how headlines are being generated, maybe you should tell your boy Hassan to stop writing articles where he’s not fully reporting all the details of the source.

Since when is Hassan my buddy?

You have already been told the only place to question moderation is the moderation discussion forum.

As to the rules: You come in, kick the metaphorical beehive, then complain when the they start stinging you. Since you read the rules, you know vendor threads are safe spaces and you violated that rule. Not going to punish others for taking umbrage at being trolled in a vendor thread.
 
Last edited by a moderator:
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |