Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 781 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

MS_AT

Senior member
Jul 15, 2024
207
497
96
CB 2024 is no more using Embree, it use Maxon s renderer, so it s not comparable to R23, one more time, how did Intel gain more than 10% in ST from R15 to R23, the fact that we re talking of ST eliminate the cache possibility
Why ST negates possible advantage in private caches? Also I am not exactly sure which Intel generation you mean specifically as Skylake was at cache disadvantage and Raptor Lake at advantage. To truly judge what is the reason we would need someone to do a profiling of R23 on specific architectures to compare what the performance counters say.
and the X3D is no better than the regular chip
You are rendering tile by tile, and tiles mostly do not share data, since on Zen the L3 is a victim cache it is only fed the data that were discarded from L2, but if you do not render the same tile twice, from data side it will not give you large advantage. Now from instruction side it might be able to contain the whole program, but if there is no gain from x3d in Zen4 vs Zen4x3d that simply means the program footprint was small enough to fit in L3 of vanilla Zen4. Also for ST test, Zen4x3d will be at frequency disadvantage.
because Cinebench R20/23 are ICC compiled and are the only renderering tests where the 12900K is ahead of the 5950X, actualy it s about the only Computerbase MT benches where it was ahead.
Until we know the option used for ICC it's really hard to say. Y-Cruncher for a long time was also ICC compiled and there Zen4 did not have issues with ranking high. So while ICC is historically controversial compiler it's usage doesn't have to equate the Zen -50%.

As to why 12900k is ahead of 5950x once again I would say we would need to have a nice profiling traces to be able to tell. I have seen people jump to conclusions only to be proven wrong by the actual measurements. While it's not a satisfactory answer most of the time, it's the correct one. That is why in software world when you listen to performance focused talks they will always repeat ad nausea "measure, measure". The CPU execution pipeline is so complex that it's really hard to make correct assumptions. It's another thing that it is also hard to measure correctly
 

Kryohi

Member
Nov 12, 2019
42
92
91
Cinebench R20/23 are ICC compiled
Seriously? Who even uses ICC anymore?
Also, I might be wrong but ICC is basically deprecated software, it has been replaced by an LLVM-based compiler similar to clang (ICX).
The benchmarking landscape on the Windows side is in a seriously bleak situation imo.

Edit: had a look at ICX and it has some dubious/shady flags usage:
With ICX 2022.0.0 and later releases  -O2 and -O3 are not sufficient to enable Intel advanced loop optimizations and vectorization. To enable extra levels of loop optimizations and vectorization use the processor targeting option -x or /Qx along with a target architecture. For example, -xskylake-avx512. Or you use the -xhost or /Qxhost option to enable all available Intel optimizations and advanced vectorization for the processor of the platform where you compile your code.

I don't get why people don't simply use Clang/GCC. It's not like they're some amateur hobby projects...
 
Last edited:

sl0519

Junior Member
Aug 10, 2024
20
49
46
NBC published their review of the 9700X.

Overall very strong ST perf, FI in GB 5.0 and 5.5, Kracken, WebXPRT 3.

In gaming it s just 3% below the 14900K and 14% faster than the 14700K, FTR 21 games are tested.

The only drawback according to NBC is the strong competition price wise by AMD s own previous gen, much less by Intel given their current degradation problems.





From CB R15 to CB R20 Intel got a 10% ST uplift in respect of AMD, and another few % with R23 that was released just one year after R20, check in the review i just linked, there s CB 11.5 as well as R15, R20 and R23.

In the Maxon video i linked we can see that they endorsed Intel surely in exchange of all their software suite for free, guess that there must be some pay back as a consequence

I mean by selecting a convenient scene, because all renderings using a same soft wont yield the same results between 2 CPU as it depend of the exact arithmetic ops distributions that are used, and independently of the fact that R15, R20 and R23 all use Embree wich is Intel s in house renderer and wich is also used by Corona, now compare those renderers scores to say Blender and Vray.

Gaming performance on par with 14900k? Why's there discrepancy between NBC and other review outlets like HUB?
 

MS_AT

Senior member
Jul 15, 2024
207
497
96
I don't get why people don't simply use Clang/GCC. It's not like they're some amateur hobby projects...
http://www.numberworld.org/y-cruncher/news/2024.html#2024_5_10 at least for Y-cruncher there are regressions from switching to ICX from ICC. Still it (clang) is my compiler of choice lately.

The biggest problem is CPU companies use clang/llvm as base of their own compilers (Intel, AMD, and I think ARM too) but either don't upstream their optimizations or are very late to do it. And AMD is extremely poor here seeing upstream CLANG still doesn't have Zen5 target...
 

Timmah!

Golden Member
Jul 24, 2010
1,510
824
136
I must say I am not impressed with Mike Clark between his lousy answer on the decoders and then later telling Dr. Cutress saying how software will catch up and the Zen 6/7 guys will get credit for the work Zen 5 did.
Clearly, Mike Clark works in a wrong department. Rather than Chief Architect he would do better at marketing, as he fooled us all At very least they should rename his position to something like Chief Marketect
 

MS_AT

Senior member
Jul 15, 2024
207
497
96
They don't care since their real cash cow Turin isn't out yet.
Intel is usually year ahead of the cash cow launch. If you want to have software that is using your nice and dandy features you better make it possible to target it. Especially if you claim the software might benefit from recompilation... But AMD instead is sending misleading patches to GCC and ignoring CLANG completely. How nice software company it is
 

Abwx

Lifer
Apr 2, 2011
11,517
4,303
136
Why ST negates possible advantage in private caches? Also I am not exactly sure

You are rendering tile by tile, and tiles mostly do not share data, since on Zen the L3 is a victim cache it is only fed the data that were discarded from L2, but if you do not render the same tile twice, from data side it will not give you large advantage. Now from instruction side it might be able to contain the whole program, but if there is no gain from x3d in Zen4 vs Zen4x3d that simply means the program footprint was small enough to fit in L3 of vanilla Zen4. Also for ST test, Zen4x3d will be at frequency disadvantage.

All tiles are not equal quantity of computations wise, that s a given, beside CB use hardly more than SSE 4.2, so it should have about the same improvement as what is measured by Spec_FP, yet the ST improvement for Zen 5 is only 10-12% fo CB R20/R23 while it s 15% for CB 2024.


Until we know the option used for ICC it's really hard to say. Y-Cruncher for a long time was also ICC compiled and there Zen4 did not have issues with ranking high. So while ICC is historically controversial compiler it's usage doesn't have to equate the Zen -50%.

As to why 12900k is ahead of 5950x once again I would say we would need to have a nice profiling traces to be able to tell. I have seen people jump to conclusions only to be proven wrong by the actual measurements. While it's not a satisfactory answer most of the time, it's the correct one. That is why in software world when you listen to performance focused talks they will always repeat ad nausea "measure, measure". The CPU execution pipeline is so complex that it's really hard to make correct assumptions. It's another thing that it is also hard to measure correctly

It s impossible to know without being in the know...

As for y-Cruncher this soft use AVX512 and Zen 4 half rate AVX512 will still have a big impact even if ICC compiled.

Gaming performance on par with 14900k? Why's there discrepancy between NBC and other review outlets like HUB?

Dunno why exactly, NBC use a 4090 as well and do tests at several definitions, not only 720p, and with 21 games their sample is large enough, i ll have to crawl through the review to see what s going on there.
 

MS_AT

Senior member
Jul 15, 2024
207
497
96
All tiles are not equal quantity of computations wise, that s a given, beside CB use hardly more than SSE 4.2, so it should have about the same improvement as what is measured by Spec_FP, yet the ST improvement for Zen 5 is only 10-12% fo CB R20/R23 while it s 15% for CB 2024.
Spec_FP usage of SIMD depends on the quality of the compiler's autovectorizer and the instruction set you allow it to use. David Huang, Anandtech and Geekerwan are all using different settings here. You cannot therefore assume that it will translate to CB because CB is not making use of anything more than SSE 4.2. Not to mention Cinebench instruction and data flow might be different that SPEC average. Maybe you should try to find spec subset that correlates best.
It s impossible to know without being in the know...
Nah, just profile first. Analyze and then if something smells fishy show the traces with argumentation why they are fishy
As for y-Cruncher this soft use AVX512 and Zen 4 half rate AVX512 will still have a big impact even if ICC compiled.
Zen3 was also doing quite well against contemporary Skylake derivatives.
 

tsamolotoff

Member
May 19, 2019
174
304
136
Once again if you claim with 100% certainty the benchmark is flawed towards Intel, it would be nice if you could back this up with some relevant metrics
This is probably just a not so eloquent figure of speech, what he meant is that with Zen CPUs there was less of a gap (or bigger advantage) vis a vis Intel for more SIMD-heavy calculations (I guess no one really did cb24 with ln2 on intel properly on hwbot, or maybe the gap was narrowed somehow) :

 

Nothingness

Diamond Member
Jul 3, 2013
3,031
1,971
136
All tiles are not equal quantity of computations wise, that s a given, beside CB use hardly more than SSE 4.2, so it should have about the same improvement as what is measured by Spec_FP, yet the ST improvement for Zen 5 is only 10-12% fo CB R20/R23 while it s 15% for CB 2024.
I'm not sure I follow you. You can't expect two different benchmarks to get the same speedup. In particular when one is the geomean of several tests with different profiles and can make use of AVX2/AVX-512 depending on how it's compiled.
 

Abwx

Lifer
Apr 2, 2011
11,517
4,303
136
Spec_FP usage of SIMD depends on the quality of the compiler's autovectorizer and the instruction set you allow it to use. David Huang, Anandtech and Geekerwan are all using different settings here. You cannot therefore assume that it will translate to CB because CB is not making use of anything more than SSE 4.2. Not to mention Cinebench instruction and data flow might be different that SPEC average. Maybe you should try to find spec subset that correlates best.
Nah, just profile first. Analyze and then if something smells fishy show the traces with argumentation why they are fishy

Zen3 was also doing quite well against contemporary Skylake derivatives.

Zen 3 was doing well in y-Cruncher because of the sheer amount of cores, SKL
didnt get over 10 cores.

What i m questioning is why the 7950X had 7% advantage in ST over the 12900Ks in CB R15 and why in CB R20 the 12900KS suddenly became faster by 3% and 4% in R23, basically that s 11% artificial better IPC for Alder Lake that appeared from nowhere.

The advantage in MT shrinked as well from 48% to 38% from R15 to R20.

Comparatively in Corona that also use Embree the 7950X was 51% faster, we are talking of the same rendering engine used by both apps here, so theorically the same kind of ops flow.

Curiously only in Povray there was only a 35% difference, and precisely, as proved by member MarkPost, because this software is using AVX2 only for Intel CPUs, so basically Maxon profiled their R20/R23 tests such that they produce the same results as disabling AVX2 for AMD and enabling it for Intel.

 

Heartbreaker

Diamond Member
Apr 3, 2006
4,323
5,433
136
And care to explain how exactly its flawed in Intel's favour? Here is R24 review by C&C https://chipsandcheese.com/2023/10/22/cinebench-2024-reviewing-the-benchmark/ it contains profiling data, while not exactly related to Zen5 or RaptorLake it should be able to help you along the way.

Btw, for people worrying about Core to Core latency regression and it's effect on performance:

Simple.

Right up until Intel added E-Cores, CB was the flawless goto benchmark.

But since, Intel E-cores tipped the balance, and started winning, it's a bad bench mark.
 

Philste

Senior member
Oct 13, 2023
248
442
96
NBC published their review of the 9700X.

Overall very strong ST perf, FI in GB 5.0 and 5.5, Kracken, WebXPRT 3.

In gaming it s just 3% below the 14900K and 14% faster than the 14700K, FTR 21 games are tested.
NBC isn't really the go to Website for gaming tests. 3% slower than 14900K and 14% faster than 14700K is already impossible, normally there isn't such a gap between 14900K and 14700K.
 

rydeon95

Junior Member
Aug 13, 2024
8
2
36
hi everyone, i have a ryzen 9 9950x
with unlocked PPT it reaches 280 watts with 95 degrees, i saw in the post of @igor_kavinski
that it is much colder. what am i doing wrong? i updated the bios to the latest version, i have a x670E-F Asus Rog
custom cooling with MO-RA 420 and Watercool Heatkiller as waterblock
ambient temperature 25 degrees, liquid temperature 26 degrees
 

yottabit

Golden Member
Jun 5, 2008
1,482
513
146
Clearly, Mike Clark works in a wrong department. Rather than Chief Architect he would do better at marketing, as he fooled us all At very least they should rename his position to something like Chief Marketect
It’s not his fault they woke him up from his dream too early.

Let him close his eyes, go to sleep, and wake up in another 3 years and try again.
 

Abwx

Lifer
Apr 2, 2011
11,517
4,303
136
NBC isn't really the go to Website for gaming tests. 3% slower than 14900K and 14% faster than 14700K is already impossible, normally there isn't such a gap between 14900K and 14700K.

They also tested the 9600X and got the same results, i noticed that they used, among others, Xplane where Zen 5 does very well, but also F1 22, Far Cry, Final Fantasy, Dota 2, Strange Brigade, Metro Exodus, F1 2020, all games where Zen 5 does also very well.

Beside they use 15 games, wich is certainly no worse than the 13 games used by HWUB wich btw are all of the same kind, a first person shooter view.



are E / compact cores actually worth it in real world? or just bench number crunching to ignite flamewars in threads like this 😂
That s an argument thrown out of total ignorance, those e cores only flip the marketing driven core count, in Blender, Corona, Vray, Mental ray and any other renderer a 7950X is largely ahead, and even in CB R15, the only exception is precisely CB R20/R23 and of course the AVX2 crippling Povray that produce the same result as R20/23 -3%.
 
Last edited:

GTracing

Member
Aug 6, 2021
78
193
76
unpopular opinion HX 370 probably would be better off with 8 full zen5 instead of 4 full 8c
For gaming, definitely. For office/productivity, I'm not so sure. For battery life, I would imagine that two CCXs is better.

Seeing as strix point is a high end chip that is primarily in gaming laptops, 8 full Zen5 cores would be a better fit overall.
 
Reactions: marees
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |