Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 678 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

SarahKerrigan

Senior member
Oct 12, 2014
735
2,035
136
View attachment 104018
The future is HERE NOW. Just get on over to Green Hills!


This is utterly irrelevant to your original point.

First, Coremark is literally the worst benchmark there is. Dhrystone is better.

Second, I never said some compilers aren't better than others. I said that aggressive optimization for one microarchitecture doesn't really accomplish much on modern OoO cores (none of which are tiny Cortex-M's anyway.) Of course some compilers are better than others. This is much of why there are compiler vendors to begin with.

At this point you're just tossing crap at the wall in the hope something sticks. I would encourage reading a book.
 
Reactions: Nothingness

Philste

Senior member
Oct 13, 2023
248
442
96
HW Monitor shows 33.1W for the package and 28.2W for the CPU as max after a cinebench run.
Thx. So it's 15.5K@33W vs 13K @28-32.5W for 7840U. Not particularly great with 2 cores/4 threads advantage.
P cores max at 5.05 and E cores max at 3.32
Cool, so David Huangs numbers were right. What is the clock during the run with 33W? How hard are the 5c cores already gimping? I bet the normal ones are at 3.8 or something.
 

NTMBK

Lifer
Nov 14, 2011
10,301
5,302
136
Because there is no other developer here to help me out.

@NTMBK @eek2121 @quikah @repoman0 , HELP!!!!
Eh, the only specific microarchitecture we'd even think about targeting would be the PS5's Zen 2. Potentially sizing a few specific arrays to fit neatly into the L2 during a hot loop, would probably be the extent of intentional targeting we would do these days. But just profiling the code on that CPU and then optimizing based on that profile will generally lead you to accidentally prefer that uarch slightly- you'll hit the problem cases that crop up for that chip, and maybe miss other ones that might be more of an issue on other systems.

Outside of super hot algorithm loops (stuff like renderer code, physics engine etc), a lot of the time game code is more dictated by the flexibility you need to actually make a game. You could try to really maximise your register usage and minimise branch predictor misses on a chunk of code, but then game design ask you to make a big change to the feature and all your micro-optimization goes out the window and you're back at square one. Better to focus on getting big picture performance stuff right, and only go to the micro optimization right near the end of the project when things are locked down if you really need to squeeze the performance budget (and have time, and aren't too busy doing last minute bug fixes).
 

Abwx

Lifer
Apr 2, 2011
11,516
4,303
136
My interpretation was 33W CPU Package 28W Core power.

That s the case in DT but i noticed that for laptops it s sometime the package as a whole, that is, the whole laptop.

Other than this those numbers have all the characteristics of a chip that is somewhat overvolted to not only increase yields but foremost to guarantee an inconditional stability, that s more important for AMD than chasing even more perf/Watt advantage, they are already clearly in the lead so that s not as important as getting a good image.
 

Geddagod

Golden Member
Dec 28, 2021
1,295
1,368
106
You don't do these things in BIOS on Laptops. Its quite normal for them to have everything locked down. Try Fn+F or something.
The asus M16 is nice in that regard with how you can control PL1, PL2, and the fan curve without ever having to explicitly go into ur bios either.
 

majord

Senior member
Jul 26, 2015
491
622
136
That s the case in DT but i noticed that for laptops it s sometime the package as a whole, that is, the whole laptop.

Other than this those numbers have all the characteristics of a chip that is somewhat overvolted to not only increase yields but foremost to guarantee an inconditional stability, that s more important for AMD than chasing even more perf/Watt advantage, they are already clearly in the lead so that s not as important as getting a good image.
its not the rop SKU either.
 

Josh128

Senior member
Oct 14, 2022
290
403
96
I am not sure how to lock the frequency?
Hey! Maybe try to install the latest Ryzen Master. It may let you lock a low all core freq from within Windows (reboot will be needed). Even 3GHz would be fine.

Either that or turn off CPB (Core performance boost) in the BIOS-- that locks your CPU down to base frequency, at least on desktop Ryzen.
 

Abwx

Lifer
Apr 2, 2011
11,516
4,303
136
its not the rop SKU either.

Right but it should still have better efficency numbers, i explained above to which extent.

If IPC is increased by 17% you can reduce frequency by 1.17x and have the same throughput at same core count.

At 1.25x the core count you can reduce frequency by 1.25X at same throughput, so with these two parameters combined the 365 can have 1/(1.17 x 1.25) = 0.68x the frequency at same throughput, wich imply less than 0.5x the power, yet here it s about 0.65x, so that could mean a voltage that is 14% higher than the 7840U at a same frequency.

This can also mean that voltage is not that much increased but that Zen 5 uarchitectural efficency is something like 10% lower than Zen 4 and combined with a slightly higher voltage.
 

Mopetar

Diamond Member
Jan 31, 2011
8,084
6,695
136
Minor point, but at 17% IPC increase you'd only reduce the clock speed by 14.53%, (1 / 1.17).

Otherwise if you increased IPC by 100%, you'd have a very hard time getting similar performance if you reduced the clock speed by 100%.
 

Abwx

Lifer
Apr 2, 2011
11,516
4,303
136
Minor point, but at 17% IPC increase you'd only reduce the clock speed by 14.53%, (1 / 1.17).

Otherwise if you increased IPC by 100%, you'd have a very hard time getting similar performance if you reduced the clock speed by 100%.

There s no error, you noticed that i wrote 1/(1.17 x 1.25) = 0.65x for the combined frequency reductions allowed by IPC and core count.

The rest of the math is that the typical Power/F curve of TSMC process is of the form P(f) = f^2.36.

So at 0.65x the frequency power is 0.65^2.36 = 0.36x at same throughput.

Even if the process is close to a theorical perfect mosfet curve this would still be 0.65^2 = 0.42x the power.
 
Last edited:

majord

Senior member
Jul 26, 2015
491
622
136
Right but it should still have better efficency numbers, i explained above to which extent.

If IPC is increased by 17% you can reduce frequency by 1.17x and have the same throughput at same core count.

At 1.25x the core count you can reduce frequency by 1.25X at same throughput, so with these two parameters combined the 365 can have 1/(1.17 x 1.25) = 0.68x the frequency at same throughput, wich imply less than 0.5x the power, yet here it s about 0.65x, so that could mean a voltage that is 14% higher than the 7840U at a same frequency.

This can also mean that voltage is not that much increased but that Zen 5 uarchitectural efficency is something like 10% lower than Zen 4 and combined with a slightly higher voltage.

looking at the desktop Zen 5 leaks I don't think Zen 5 has much improved perf/watt in throughput.

but still, 7840U was the top silicon for the 15w -28w class so without testing halo SKU back to back t's hard to tell.. If the 370 and 365 were reduced core counts to differentiate themselves down the stack then you could probably rely on them being good quality silicon still, and assess them on a per core basis, but not so much when they're all 12 c parts (albeit with lower GPU CU on the 365) . they'll likely be both artificially limited in performance via boost clocks (st and NT) combined with higher yielding silicon with less favourable VF characteristics .


-Edit I made a mistake there , I forgot the AI 365 is 10c not 12 .
 
Last edited:
Reactions: lightmanek

Abwx

Lifer
Apr 2, 2011
11,516
4,303
136
looking at the desktop Zen 5 leaks I don't think Zen 5 has much improved perf/watt in throughput.

but still, 7840U was the top silicon for the 15w -28w class so without testing halo SKU back to back t's hard to tell.. If the 370 and 365 were reduced core counts to differentiate themselves down the stack then you could probably rely on them being good quality silicon still, and assess them on a per core basis, but not so much when they're all 12 c parts (albeit with lower GPU CU on the 365) . they'll likely be both artificially limited in performance via boost clocks (st and NT) combined with higher yielding silicon with less favourable VF characteristics .

Not easy to do an evaluation without knowing some parameters, FI if Zen 4 use the standard N5 process then N4P has 28% bettter efficency at same frequency, but if Zen 4 use a refined N5 then the difference is of course lower.

Beside we dont know what is the uarchitectural efficency of Zen 5 compared to Zen 4 but given that the core is substancially bigger it could be lower by an unknown amount.

Guess that reviews will clear all those numbers in a week or so.
 

Malachijtjfjf

Member
Oct 9, 2022
26
42
51
I’ve noticed that amd ipc numbers aren’t being met in cinebench, way lower than 17% ipc they claimed. Zen 5 ipc in cb is only on par with raptor lake which is really bad when it’s suppose to be compared to arrow lake.
 

Josh128

Senior member
Oct 14, 2022
290
403
96
@Malachi - the R23 MT numbers posted by the ES user here do actually hit about 16.5% in MT in R23, as did the locked 5GHz run done during tech day. None of the ST scores or MT scores of auto-boosting SKUs do though- most likely because they arent reaching the same frequencies as their Zen 4 counterparts. Which is pretty shitty because that Donny W dude specifically said Zen 5 has better "clock residency" than Zen 4. From literally every single leak we've seen, perhaps other than this CPU-z leak, that has obviously not been the case.

 
Reactions: lightmanek
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |