- Mar 3, 2017
- 1,747
- 6,598
- 136
Seems like either windows 11 or nvidia drivers needs to update the thread scheduling for Zen5 (?)
That's at least my take on this data..
Updated screenshotsThat graph shows improvement of only 1.5% with SMT disabled... Which isn't exactly massive.
Depending on the set of games, sometimes SMT improves the result, sometimes it makes it worse. I doubt this will lead to a rewrite of Windows scheduler.
I think I read in some online posts that higher tREFI can result in hotter DIMMs. Is it possible to do 65536 tREFI without special cooling on DIMMs?
Updated screenshots
Its 1.5% at 4k res, higher at lower res
Works on my machine!The common advice of just setting it to 64k scares me.
Issues like this *might* be caused by a power-saver or bugged power plan/scheduling scheme. "Filling" all physical cores first should be the energy efficient strategy.Seems like either windows 11 or nvidia drivers needs to update the thread scheduling for Zen5 (?)
Don't think i've observed this behavior on my 16 core Zen5
Check Phoronix's DB section, the DB gains are nice.Two areas in which Zen 5 notably improves over Zen 4 have been pointed out here:
Glancing over the TPU review, it appears there is another area:
- vector arithmetic,
- web browsers/ JITs and the likes.
MySQL TPC-C test:
- databases.
9700X ........ 15,200 TPS7700X ........ 12,900 TPS7700 .......... 12,100 TPS7800X3D .... 11,700 TPSMongoDB 6, time for 10M requests:
9700X ........ 67.5 s7700X ........ 90.0 s7700 .......... 95.4 s7800X3D .... 98.3 s
Video transcoding, source code compilation, ...How many? And which of these are you running once a month or more often? Thanks in advance.
Did you miss the following quote ?And at lower res, you can see the same kind of thing for Zen 4 as well. This has been the reality of HT/SMT since it was first introduced decades ago. Sometimes it helps, sometimes it harms, but the overall consesnus is just turn it on and forget it.
A completely reworked core, is going to shift SMT behavior somewhat, but there isn't anything that significant going on here, except a whole lot of cope grasping at straws.
This is not how SMT scheduling have worked in the past, nor how it should workDuring the course of our testing, we observed that Windows 11 was scheduling workloads on the 9700X in a manner that would try to saturate a single core first, by placing workloads on each of its logical threads. Additionally, the placement would put load on the CPPC2 "best" or "second-best" core (gold and silver in Ryzen Master)—which makes sense. However, if a highly demanding single threaded workload runs on one core, scheduling another demanding workload on the second thread of that core will result in lower overall performance. It would be better to place them on two separate cores, where they each have access to the full resources of that core.
Whoa, I didn't realize that Windows is still that bad. I am forced to use it (Win 10) as application launcher at work, and ignore it as best as I can…Seems like either windows 11 or nvidia drivers needs to update the thread scheduling for Zen5 (?)
That's at least my take on this data..
During the course of our testing, we observed that Windows 11 was scheduling workloads on the 9700X in a manner that would try to saturate a single core first, by placing workloads on each of its logical threads.
I love how the situation totally changes with SMT off in Excel and Outlook, arguably two of the most used applications in offices all around the world:Some more screens
He didn't read the article.. hopefully he will. Probably won't.Did you miss the following quote ?
This is not how SMT scheduling have worked in the past, nor how it should work
(for reference, check the numbers for 7700X how it behaves with SMT ON/OFF as a comparison)
Kinda seems more like its someone else that's "coping and grasping at straws" as you put it.. Why is that ?
That's a...non-insignificant gain in front-end bound workloads like browsing. I expected less.Updated screenshots
Its 1.5% at 4k res, higher at lower res
Some more screens
View attachment 105032
View attachment 105033
View attachment 105034
View attachment 105035
View attachment 105036
If these figures are from TYC review, I wouldn't put too much stock into them yet. I'd wait for someone else to verify that.Clearly something was very lacking in Zen 4.
9700X SMT disabled performance uplift @1080p:
Baldur's Gate 3 +6.78%
Remnant II +6.67%
Spiderman Remastered +17.17% (!!!!) and RT +15.82% (!!!!)
9700X SMT disabled performance uplift @1440p:
Baldur's Gate 3 +6.69%
Spiderman Remastered +17.22% (!!!!) and RT +18.73% (!!!!)
9700X SMT disabled performance uplift @ 2160p:
Baldur's Gate 3 +7.23%
Spiderman Remastered +5.16% and RT +8.10% (!!!!)
AND
+11.61% CS 1080p minimum fps
+12.63% Remnant II 1080p minimum fps
+60.27% (!!!!) Spiderman Remastered 1080p minimum fps
+22.50% (!!!!) Last of Us 1080p minimum fps
+8.30% Baldur's Gate 3 2160p minimum fps
+31.91% (!!!!) Spiderman Remastered 2160p minimum fps
And people are disappointed.
SMH
So you're buying today, right?And people are disappointed.
9800X3D for meSo you're buying today, right?
This is the very thing I've been railing against chatting with others. Your opinion of the launch doesn't matter if you never intended to buy.So you're buying today, right?
@B-Riz already posted one graph of Baldur's Gate 3 minimum fps improving with 9600X. At this point, at least I don't need further proof that there's some VERY good and VERY practical improvements in Zen 5, enough to warrant an upgrade for some, if not all, Zen 4 users.If these figures are from TYC review, I wouldn't put too much stock into them yet. I'd wait for someone else to verify that.
Some more screens
That's a...non-insignificant gain in front-end bound workloads like browsing. I expected less.
2 decode clusters doing work?
Zen 5 has launched at an unfortunate time. If we had data to see how many users upgraded in the past 6 months from their aging platform to 7800X3D or any other Zen 4 CPU, we would understand why the Zen 5 reception is lukewarm. AMD should've launched Zen 5 in January. This is a market where waiting just hurts you more because a lot of people with the upgrade itch don't wait for impending launch of new CPUs. There's a reason why it's called an "itch". People just want something new and they want it NOW.This is the very thing I've been railing against chatting with others. Your opinion of the launch doesn't matter if you never intended to buy.
Gotta wait for C&C article to finally get some clarity on this.I don't know if the two decode clusters can actually serve a single thread in practice. However, the µop cache does seem to work, in that the machine can do two taken branches per clock from µop cache, which is very useful on it's own when running interpreters.