Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 746 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Heartbreaker

Diamond Member
Apr 3, 2006
4,323
5,433
136
Seems like either windows 11 or nvidia drivers needs to update the thread scheduling for Zen5 (?)
That's at least my take on this data..

That graph shows improvement of only 1.5% with SMT disabled... Which isn't exactly massive.

Depending on the set of games, sometimes SMT improves the result, sometimes it makes it worse. I doubt this will lead to a rewrite of Windows scheduler.
 

Det0x

Golden Member
Sep 11, 2014
1,224
3,833
136
That graph shows improvement of only 1.5% with SMT disabled... Which isn't exactly massive.

Depending on the set of games, sometimes SMT improves the result, sometimes it makes it worse. I doubt this will lead to a rewrite of Windows scheduler.
Updated screenshots
Its 1.5% at 4k res, higher at lower res

Some more screens








 
Last edited:

Tuna-Fish

Golden Member
Mar 4, 2011
1,474
1,966
136
I think I read in some online posts that higher tREFI can result in hotter DIMMs. Is it possible to do 65536 tREFI without special cooling on DIMMs?

No, it reduces heat. But it increases sensitivity to heat, which means you have to worry more about cooling.

tREFI is the refresh interval, or how long a single row go between refreshes. The caps that hold charge in DRAM are leaky, and tREFI needs to be short enough that if a cap has charge, there is still sufficient charge in the cap to sense as 1 just before the refresh triggers. Because of fairly basic physical reasons, this time is extremely sensitive to temperature. The hotter your modules get, the shorter it needs to be.

Because a bank that is refreshing cannot be accessed, this has a fairly direct effect on average bandwidth and worst-case latency, but also because the memory manufacturers are scared of lots of ram being faulty and returned, the default is set quite low.

The correct way to adjust tREFI is to set it as high as you dare, very exhaustively test memory while also running some torture test on GPU to get your machine hot, and then if the machine passes, pull it down ~25% just to be sure it still passes when your case has more dust and/or the ambient temperature is higher. The common advice of just setting it to 64k scares me.
 
Last edited:

Heartbreaker

Diamond Member
Apr 3, 2006
4,323
5,433
136
Updated screenshots
Its 1.5% at 4k res, higher at lower res

And at lower res, you can see the same kind of thing for Zen 4 as well. This has been the reality of HT/SMT since it was first introduced decades ago. Sometimes it helps, sometimes it harms, but the overall consesnus is just turn it on and forget it.

A completely reworked core, is going to shift SMT behavior somewhat, but there isn't anything that significant going on here, except a whole lot of cope grasping at straws.
 

StefanR5R

Elite Member
Dec 10, 2016
5,891
8,759
136
Two areas in which Zen 5 notably improves over Zen 4 have been pointed out here:
  • vector arithmetic,
  • web browsers/ JITs and the likes.
Glancing over the TPU review, it appears there is another area:
  • databases.
MySQL TPC-C test:
9700X ......... 15,200 TPS (126 %)
7700X ......... 12,900 TPS (107 %)
7700 ........... 12,100 TPS (100 %)
7800X3D .... 11,700 TPS (97 %)​

MongoDB 6, time for 10M requests:
9700X ......... 67.5 s (141 %)
7700X ......... 90.0 s (106 %)
7700 ........... 95.4 s (100 %)
7800X3D .... 98.3 s (97 %)​
edit: added relative performance, Ryzen 7700 as baseline
 
Last edited:

yuri69

Senior member
Jul 16, 2013
531
948
136
Seems like either windows 11 or nvidia drivers needs to update the thread scheduling for Zen5 (?)

Don't think i've observed this behavior on my 16 core Zen5
Issues like this *might* be caused by a power-saver or bugged power plan/scheduling scheme. "Filling" all physical cores first should be the energy efficient strategy.
Two areas in which Zen 5 notably improves over Zen 4 have been pointed out here:
  • vector arithmetic,
  • web browsers/ JITs and the likes.
Glancing over the TPU review, it appears there is another area:
  • databases.
MySQL TPC-C test:
9700X ........ 15,200 TPS​
7700X ........ 12,900 TPS​
7700 .......... 12,100 TPS​
7800X3D .... 11,700 TPS​
MongoDB 6, time for 10M requests:
9700X ........ 67.5 s​
7700X ........ 90.0 s​
7700 .......... 95.4 s​
7800X3D .... 98.3 s​
Check Phoronix's DB section, the DB gains are nice.
 

Det0x

Golden Member
Sep 11, 2014
1,224
3,833
136
And at lower res, you can see the same kind of thing for Zen 4 as well. This has been the reality of HT/SMT since it was first introduced decades ago. Sometimes it helps, sometimes it harms, but the overall consesnus is just turn it on and forget it.

A completely reworked core, is going to shift SMT behavior somewhat, but there isn't anything that significant going on here, except a whole lot of cope grasping at straws.
Did you miss the following quote ?
During the course of our testing, we observed that Windows 11 was scheduling workloads on the 9700X in a manner that would try to saturate a single core first, by placing workloads on each of its logical threads. Additionally, the placement would put load on the CPPC2 "best" or "second-best" core (gold and silver in Ryzen Master)—which makes sense. However, if a highly demanding single threaded workload runs on one core, scheduling another demanding workload on the second thread of that core will result in lower overall performance. It would be better to place them on two separate cores, where they each have access to the full resources of that core.
This is not how SMT scheduling have worked in the past, nor how it should work
(for reference, check the numbers for 7700X how it behaves with SMT ON/OFF as a comparison)

Kinda seems more like its someone else that's "coping and grasping at straws" as you put it.. Why is that ?
 
Last edited:

StefanR5R

Elite Member
Dec 10, 2016
5,891
8,759
136
Seems like either windows 11 or nvidia drivers needs to update the thread scheduling for Zen5 (?)
That's at least my take on this data..
During the course of our testing, we observed that Windows 11 was scheduling workloads on the 9700X in a manner that would try to saturate a single core first, by placing workloads on each of its logical threads.
Whoa, I didn't realize that Windows is still that bad. I am forced to use it (Win 10) as application launcher at work, and ignore it as best as I can…
 
Reactions: Rigg
Jul 27, 2020
19,613
13,477
146
Some more screens
I love how the situation totally changes with SMT off in Excel and Outlook, arguably two of the most used applications in offices all around the world:



7700X with SMT disabled is closer to 9700X SMT disabled in Outlook but loses any chance whatsoever in Excel, which really fills me with delight!

Now just gotta wait for Excel 2024 with AVX-512!
 

DaaQ

Golden Member
Dec 8, 2018
1,438
1,039
136
Did you miss the following quote ?

This is not how SMT scheduling have worked in the past, nor how it should work
(for reference, check the numbers for 7700X how it behaves with SMT ON/OFF as a comparison)

Kinda seems more like its someone else that's "coping and grasping at straws" as you put it.. Why is that ?
He didn't read the article.. hopefully he will. Probably won't.
 

CouncilorIrissa

Senior member
Jul 28, 2023
520
1,995
96
Last edited:
Jul 27, 2020
19,613
13,477
146
Clearly something was very lacking in Zen 4.

9700X SMT disabled performance uplift @1080p:

Baldur's Gate 3 +6.78%

Remnant II +6.67%

Spiderman Remastered +17.17% (!!!!) and RT +15.82% (!!!!)

9700X SMT disabled performance uplift @1440p:

Baldur's Gate 3 +6.69%

Spiderman Remastered +17.22% (!!!!) and RT +18.73% (!!!!)

9700X SMT disabled performance uplift @ 2160p:

Baldur's Gate 3 +7.23%

Spiderman Remastered +5.16% and RT +8.10% (!!!!)

AND

+11.61% CS 1080p minimum fps

+12.63% Remnant II 1080p minimum fps

+60.27% (!!!!) Spiderman Remastered 1080p minimum fps

+22.50% (!!!!) Last of Us 1080p minimum fps


+8.30% Baldur's Gate 3 2160p minimum fps

+31.91% (!!!!) Spiderman Remastered 2160p minimum fps


And people are disappointed.

SMH
 

CouncilorIrissa

Senior member
Jul 28, 2023
520
1,995
96
Clearly something was very lacking in Zen 4.

9700X SMT disabled performance uplift @1080p:

Baldur's Gate 3 +6.78%

Remnant II +6.67%

Spiderman Remastered +17.17% (!!!!) and RT +15.82% (!!!!)

9700X SMT disabled performance uplift @1440p:

Baldur's Gate 3 +6.69%

Spiderman Remastered +17.22% (!!!!) and RT +18.73% (!!!!)

9700X SMT disabled performance uplift @ 2160p:

Baldur's Gate 3 +7.23%

Spiderman Remastered +5.16% and RT +8.10% (!!!!)

AND

+11.61% CS 1080p minimum fps

+12.63% Remnant II 1080p minimum fps

+60.27% (!!!!) Spiderman Remastered 1080p minimum fps

+22.50% (!!!!) Last of Us 1080p minimum fps


+8.30% Baldur's Gate 3 2160p minimum fps

+31.91% (!!!!) Spiderman Remastered 2160p minimum fps


And people are disappointed.

SMH
If these figures are from TYC review, I wouldn't put too much stock into them yet. I'd wait for someone else to verify that.
 

Hail The Brain Slug

Diamond Member
Oct 10, 2005
3,485
2,406
136
So you're buying today, right?
This is the very thing I've been railing against chatting with others. Your opinion of the launch doesn't matter if you never intended to buy.

Not to devalue or dismiss everyone's opinions, but if there was no circumstance under which you would have bought these new products, your opinion doesn't matter to AMDs bottom line.

Only people who would have bought but don't, or wouldn't but do make a difference to their bottom line.
 
Reactions: Jan Olšan
Jul 27, 2020
19,613
13,477
146
If these figures are from TYC review, I wouldn't put too much stock into them yet. I'd wait for someone else to verify that.
@B-Riz already posted one graph of Baldur's Gate 3 minimum fps improving with 9600X. At this point, at least I don't need further proof that there's some VERY good and VERY practical improvements in Zen 5, enough to warrant an upgrade for some, if not all, Zen 4 users.
 
Reactions: Rigg

Heartbreaker

Diamond Member
Apr 3, 2006
4,323
5,433
136
Some more screens

Now you are cherry picking the screens where SMT does worse, ignoring the ones where it does better.

If they would have included Intel CPUs with HT in their you would likely see a similar thing. SMT is never an exclusive win.

If 9700X is behaving worse with SMT on than previous cores, that's Zen 5 problem not that HT/SMT scheduling is wrong after it's been on the market for decades.
 

Tuna-Fish

Golden Member
Mar 4, 2011
1,474
1,966
136
That's a...non-insignificant gain in front-end bound workloads like browsing. I expected less.
2 decode clusters doing work?

I don't know if the two decode clusters can actually serve a single thread in practice. However, the µop cache does seem to work, in that the machine can do two taken branches per clock from µop cache, which is very useful on it's own when running interpreters.
 
Jul 27, 2020
19,613
13,477
146
This is the very thing I've been railing against chatting with others. Your opinion of the launch doesn't matter if you never intended to buy.
Zen 5 has launched at an unfortunate time. If we had data to see how many users upgraded in the past 6 months from their aging platform to 7800X3D or any other Zen 4 CPU, we would understand why the Zen 5 reception is lukewarm. AMD should've launched Zen 5 in January. This is a market where waiting just hurts you more because a lot of people with the upgrade itch don't wait for impending launch of new CPUs. There's a reason why it's called an "itch". People just want something new and they want it NOW.
 

CouncilorIrissa

Senior member
Jul 28, 2023
520
1,995
96
I don't know if the two decode clusters can actually serve a single thread in practice. However, the µop cache does seem to work, in that the machine can do two taken branches per clock from µop cache, which is very useful on it's own when running interpreters.
Gotta wait for C&C article to finally get some clarity on this.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |