Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 674 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

gdansk

Platinum Member
Feb 8, 2011
2,826
4,169
136
True, but the rate at which anecdotes are piling up increasingly point to that being indeed the case.

Remember the recent tweet from a laptop reviewer about weirdness in Cyberpunk 2077 performance?
Hadn't heard of it personally but if measuring performance on a dual CCX laptop with separate core types there is a lot of room for scheduling tomfoolery, clock rate differences and so on.
 
Reactions: lightmanek
Jul 13, 2024
70
75
46
Hadn't heard of it personally but if measuring performance on a dual CCX laptop with separate core types there is a lot of room for scheduling tomfoolery, clock rate differences and so on.
If dual - CCX was the sole reason, why specifically mention Cyberpunk?

SMT on AMD was broken since launch of Cyberpunk, and wasn't fixed till the 2.0 patch almost 3 years after release.
 
Jul 27, 2020
19,465
13,347
146
I suspect that Zen 5 performance gains in most existing non-AVX-512 software will be meager until software is optimized and recompiled for Zen 5. I've said it before many times. Zen 5 is a FOSS powering design where compiling binaries before deployment or serious usage isn't an alien concept.

The crazy trajectory that Windows 11 seems to be on and the fact that I don't like the UI changes Microsoft has made to it in comparison to their far better Windows 10, makes me confident that I'll switch to mostly Linux based computing in the future and so Zen 5 seems like a very attractive option to me that will give me years of loyal service with incremental improvement in software performance.

I'll wait at least a year post launch for a Phoronix anniversary article on Zen 5 performance before calling it a less successful design than Zen 3 or Zen 4 were over their predecessors.
 

Nothingness

Diamond Member
Jul 3, 2013
3,015
1,942
136
I suspect that Zen 5 performance gains in most existing non-AVX-512 software will be meager until software is optimized and recompiled for Zen 5. I've said it before many times. Zen 5 is a FOSS powering design where compiling binaries before deployment or serious usage isn't an alien concept.

The crazy trajectory that Windows 11 seems to be on and the fact that I don't like the UI changes Microsoft has made to it in comparison to their far better Windows 10, makes me confident that I'll switch to mostly Linux based computing in the future and so Zen 5 seems like a very attractive option to me that will give me years of loyal service with incremental improvement in software performance.

I'll wait at least a year post launch for a Phoronix anniversary article on Zen 5 performance before calling it a less successful design than Zen 3 or Zen 4 were over their predecessors.
And you'd have to recompile all your Linux distro from scratch with the proper Zen5 compiler flags. I have never done that as I never felt the need during my 30 years of using Linux because my new CPU was always performing better by default than the previous one.

If a CPU requires recompilation of existing applications to perform well, I call that a failure. Note "perform well" is different from recompiling to extract more performance than the default on existing binaries, which I can understand and which I'd like to see benchmarked.

The last few times I tested native compiler flags on the various CPUs I used, the benefit was in the noise (except of course for new instructions, but that doesn't imply the use of specific instruction scheduling rules), so I've stopped wasting my time on that. If Zen5 requires compiler tuning and recompilation to perform well, I'd be extremely disappointed (I've seen no proof of that yet; sure there might be some outliers, but I think in general the perf uplift looks good).

TLDR: A modern successful CPU should not require recompilation of applications to perform well.
 

Nothingness

Diamond Member
Jul 3, 2013
3,015
1,942
136

Even Zen 4 sees a performance improvement when running on an Intel optimized (and thus recompiled) distro.

View attachment 103989
That likely is the use of new instructions. Do you really think Intel used AMD specific compiler flags to build their Clear Linux?

Edit: flags show that Clear Linux tuning is for Skylake. So yeah definitely no AMD tuning. You're basically proving my point: no uarch specific tuning was needed for Zen4 and I hope it will be the same for Zen5.
 
Last edited:

Kryohi

Member
Nov 12, 2019
42
92
91
And you'd have to recompile all your Linux distro from scratch with the proper Zen5 compiler flags.
Not necessarily. There are many distros nowadays that ship optimized x86_64-v4 (AVX512) binaries, and some are going further, having repositories for packages compiled for zen4 and zen5 uarchs (CachyOS). I don't know how extensive these repos are, but it should be feasible to include most common and not-so-common packages beyond the system ones.

A modern successful CPU should not require recompilation of applications to perform well
Definitely, but what's the problem if a part of the performance increase can only be obtained after recompilation? It's been the same for many years tbh, how many years did "normal users" had to wait before AVX2 became widely used?
 

Nothingness

Diamond Member
Jul 3, 2013
3,015
1,942
136
Not necessarily. There are many distros nowadays that ship optimized x86_64-v4 (AVX512) binaries, and some are going further, having repositories for packages compiled for zen4 and zen5 uarchs (CachyOS). I don't know how extensive these repos are, but it should be feasible to include most common and not-so-common packages beyond the system ones.


Definitely, but what's the problem if a part of the performance increase can only be obtained after recompilation? It's been the same for many years tbh, how many years did "normal users" had to wait before AVX2 became widely used?
I certainly agree with you. But this is different from having to do specific tuning for a uarch (rather than targetting new instructions). And that's the point I'm arguing about 😀

As far as the specific repos for zen4 go, I wonder what the benefit of fine tuning for that specific uarch vs only using new instructions supported by zen4 is. If it's less than 2/3% that's what I call noise (and I've seen examples where using the compiler native flag was producing slightly slower executables).

Edit: sorry I forgot to address a specific point. I have no problem with recompiling to extract the last bit of performance. The problem is if I have to do it to gain anything from a new CPU. That shows the CPU has issues.
 

poke01

Golden Member
Mar 8, 2022
1,952
2,478
106
CachyOS is a niche of a niche, it’s an enthusiast OS with a great community. It will be great OS to move to if MS keeps on filling Windows with crapware.
——
Here is the thing tho, CPUs are general purpose and they should work great OOTB. Unlike GPUs which require drivers and optimisations and continuous improvement , CPUs need to just work and performance should be there day 1 for 99.99% of tasks. Zen 5 lays the ground work for future Zen architectures.

We know a good architecture when we see one because it stands out not only in benchmarks but in real world tests.
 

Josh128

Senior member
Oct 14, 2022
254
374
96
I suspect that Zen 5 performance gains in most existing non-AVX-512 software will be meager until software is optimized and recompiled for Zen 5. I've said it before many times. Zen 5 is a FOSS powering design where compiling binaries before deployment or serious usage isn't an alien concept.

The crazy trajectory that Windows 11 seems to be on and the fact that I don't like the UI changes Microsoft has made to it in comparison to their far better Windows 10, makes me confident that I'll switch to mostly Linux based computing in the future and so Zen 5 seems like a very attractive option to me that will give me years of loyal service with incremental improvement in software performance.

I'll wait at least a year post launch for a Phoronix anniversary article on Zen 5 performance before calling it a less successful design than Zen 3 or Zen 4 were over their predecessors.

Igor, assuming the ES user that provided the leaks earlier still has it, is there any way it would be possible to get him to run a locked 4 or 5 GHz ST run to further analyze the apparent ST discrepancy vs AMDs +17% R23 IPC claims that we are seeing in these R23 leaks?
 
Reactions: inf64

Philste

Senior member
Oct 13, 2023
246
439
96
What claims? Did I miss something?
They claimed a 11-22% MT uplift in Blender. (11% for 9700X, 16% for 9600X, 17% for 9900X and 22% for 9950X). Blender has 23% IPC according to them. So for 17% IPC R23 you get like 8-16% uplift. 8% for 9700X, ~12% for 9600X and 9900X and 16% for 9950X.
 
Jul 27, 2020
19,465
13,347
146
They claimed a 11-22% MT uplift in Blender. (11% for 9700X, 16% for 9600X, 17% for 9900X and 22% for 9950X). Blender has 23% IPC according to them. So for 17% IPC R23 you get like 8-16% uplift. 8% for 9700X, ~12% for 9600X and 9900X and 16% for 9950X.




There. Matter settled. It's roughly 16% average IPC lift. I don't see why we should keep beating this IPC dead horse over and over. Not gonna make the dead horse run any faster

AMD footnote for Zen 5 IPC calculation:

Testing as of May 2024 by AMD Performance labs. "Zen 5" system configured with: Ryzen 9 9950X GIGABYTE X670E AORUS MASTER motherboard, Balanced, DDR5-6000, Radeon RX 7900 XTX, VBS=ON, SAM=ON, KRACKENX63 vs. "Zen 4" system configured with: Ryzen 7 7700X, ASUS ROG Crosshair X670E motherboard, Balanced, DDR5-6000, Radeon RX 7900 XTX, VBS=ON, SAM=ON, KRAKENX62 {FixedFrequency=4.0 GHz}. Applications tested include: Handbrake, League of Legends, FarCry 6, Puget Adobe Premiere Pro, 3DMark Physics, Kraken, Blender, Cinebench (n-thread), Geekbench, Octane, Speedometer, and WebXPRT. System manufacturers may vary configurations, yielding different results. GNR-03

SO

Turn off VBS, push RAM all the way up to 7200 or 8000 MT/s for Zen 5 (maybe do that for Zen 4 too if possible), change power plan to High Performance, run all benchmarks at same fixed frequency above 5 GHz and there's a good chance of improvement in the average IPC uplift.

AMD IS SANDBAGGING!!!!

But here's something new to predict for you folks:

What will be the rough average IPC uplift for Turin?

I predict ~17%

 
Reactions: lightmanek

SarahKerrigan

Senior member
Oct 12, 2014
735
2,033
136
And you'd have to recompile all your Linux distro from scratch with the proper Zen5 compiler flags. I have never done that as I never felt the need during my 30 years of using Linux because my new CPU was always performing better by default than the previous one.

If a CPU requires recompilation of existing applications to perform well, I call that a failure. [...]


TLDR: A modern successful CPU should not require recompilation of applications to perform well.

Amen.

"Developers just need to optimize for it more!" is a bad excuse made by vendors of subpar microarchitectures for significantly longer than I've been alive.
 

SarahKerrigan

Senior member
Oct 12, 2014
735
2,033
136
Jul 27, 2020
19,465
13,347
146
"Developers just need to optimize for it more!" is a bad excuse made by vendors of subpar microarchitectures for significantly longer than I've been alive.
Yeah but there will be developers who WILL optimize for Zen 5.

Think Unreal Engine.

Think AAA games.

Think Windows itself (once they release an optimized Visual Studio).

And obviously Linux developers will optimize too coz that's their passion.

And many, many others.

If you are competing in a tough market, you will do whatever is necessary to maintain your edge.
 

SarahKerrigan

Senior member
Oct 12, 2014
735
2,033
136
Yeah but there will be developers who WILL optimize for Zen 5.

Think Unreal Engine.

Think AAA games.

Think Windows itself (once they release an optimized Visual Studio).

And obviously Linux developers will optimize too coz that's their passion.

And many, many others.

If you are competing in a tough market, you will do whatever is necessary to maintain your edge.

Okay. Elaborate. What kind of optimizations do you propose? Be specific.
 
Jul 27, 2020
19,465
13,347
146
Hardly a compelling argument for "recompiling to get acceptable perf is normal."
Because that's the quickest one I could find. My reasoning is that if an Intel optimized distro is giving Zen 4 an uplift, imagine what a distro focused on Zen 4/5 performance would do.

If I had the time and resources, I would find open source versions of software before Zen 4 announcement and one year after announcement and build them both with the latest compiler and maybe then we would have more data on what I'm claiming.
 

SarahKerrigan

Senior member
Oct 12, 2014
735
2,033
136
Because that's the quickest one I could find. My reasoning is that if an Intel optimized distro is giving Zen 4 an uplift, imagine what a distro focused on Zen 4/5 performance would do.

If I had the time and resources, I would find open source versions of software before Zen 4 announcement and one year after announcement and build them both with the latest compiler and maybe then we would have more data on what I'm claiming.

Except that doesn't say what you think it does, because there are generalized compiler improvements over time - that affect all microarchitectures.

What sorts of optimizations are you proposing?
 
Reactions: Nothingness
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |