Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 741 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Timorous

Golden Member
Oct 27, 2008
1,748
3,239
136
Hey, you underestimate the number of people who run Cinebench R23 continuously on cold winter days and nights especially to stay warm!

If they want to stay warm Intel is the way to go for that...

Depending on workflow for some use cases even if a piece of software is not very well threaded you can just run multiple instances of it or you can have a rendering / encoding task going while working on some other task and having more cores allows you to dedicate more resources to running those background tasks for what might be a single threaded primary app that you are currently working in.
 

inquiss

Member
Oct 13, 2010
179
261
136
The least they can do is give dissimilar CCDs to 9900X, as in one 8 core CCD + one 4 core CCD so at least it wouldn't get ignored so much by gamers. There is only one local retailer in UAE who got Ryzen 9000 CPUs and they are already out of 9700X. 9600X and 9900X are still in stock.
I don't think that works. How does that work with binning strategy? How many chips need to be binned down to 4 cores? It also doesn't help with cross cc'd talk, you still have that same issue
 

B-Riz

Golden Member
Feb 15, 2011
1,594
756
136
I'm sure that AMD knew about how mediocre Zen 5 was when they were deciding TDPs. Why did they choose 65W for them? To cripple them even more? It lost in a lot of benchmarks to 7600X/7700X because of that low TDP. I have a couple ideas why:
1. They wanted at least something good out of this gen, so leaned on "power efficiency" story (forgetting that 65W Zen4 parts exist).
2. They wanted X3D to appear a lot more powerful than regular chips.
3. OEMs and SIs asked for 65W chips to be right there on launch, so they can put them in crappy B840 motherbaords.
4. They drank their own kool aid and decided TDPs before knowing performance?
Most likely the first one, but it was interesting to think about.

.1% gaming lows says this thing is a bad arse frame ripper!

 

DisEnchantment

Golden Member
Mar 3, 2017
1,747
6,598
136
TBH we already know the design rationale. They wanted a new forward-looking base design - >4-wide decode, 6 ALUs, 8-wide dispatch, and 512b-compatible bandwidth.

That's about it. A new base with a lot of cut corners and odd features.
You are being too generous with the kind of questions being asked by "tech" journalist who got access to Mike Clark, Mahesh Subramony et al. What a waste.
Only Cheese was able to ask some good questions but clearly he did not get enough time, because these architects probably also need to talk to someone like Jarred Walton from Tom's HW, or Jayz two cents. Who knows
 

StefanR5R

Elite Member
Dec 10, 2016
5,891
8,759
136
Depending on workflow for some use cases even if a piece of software is not very well threaded you can just run multiple instances of it or you can have a rendering / encoding task going while working on some other task and having more cores allows you to dedicate more resources to running those background tasks for what might be a single threaded primary app that you are currently working in.
Networked PCs are supporting this type of applications inexpensively.
 
Jul 27, 2020
19,613
13,477
146
Jul 27, 2020
19,613
13,477
146
I don't think that works. How does that work with binning strategy? How many chips need to be binned down to 4 cores?
It could work if they at least create some new 9905X SKU with dissimilar CCDs and look at the sales data after a year to find out what consumers prefer. This is now the 3rd generation where x900X SKU seems to be disliked and always has to be gotten rid of through firesales. They should just stop with this stupid SKU.

They also have the option of creating 9930X or something like that with one 8 core CCD and one 6 core CCD inbetween 9900X and 9950X for 28 threads and I bet it would sell better than 9900X.
 
Reactions: Tup3x

MS_AT

Senior member
Jul 15, 2024
207
497
96
It may help because most games won't go above 16 threads which will ensure no thread needs to go outside the 8 core CCD. On 6 core CCD, the number of threads prove insufficient and some game/gpu driver threads need to be moved to the other CCD.

Like look here: https://www.techspot.com/review/2811-cpu-cache-vs-cores/
That depends on nature of communication, what you advocate would help if the threads are talking to each other heavily, but if they are just workers spawned to do something and die then CCD2CCD overhead doesn't matter that much and 6C x 6C could be better due to more balanced nature [same amount of LLC per core, where 8x4 would require more complex scheduling logic to extract max performance]. After all there are some gaming benchmarks were 7900x is leading 7700x.
 

Josh128

Senior member
Oct 14, 2022
290
403
96
"And that fabric and IOD, seems they stopped working on it because that one single guy went on paternity leave."

lol what is this about? That sound like complete BS.
 
Reactions: Thibsie

CakeMonster

Golden Member
Nov 22, 2012
1,492
653
136
I remember when Zen3 launched there was speculation that 5900X might perform better in some workloads than 5950X because it would have more cache per thread. I don't think that proved to be true, but theoretically I can understand how you could imagine that.
 

static shock

Member
May 25, 2024
93
44
51
Is lacking love of Strix Halo here. No one understand how much this SoC is important to the whole computer market.
3N with RX 6750 XT performance.
Gpus becoming part of the CPU is future and m1 ultra is here to prove how much is possible to do an Ultimate APU.
Zero attention for the STXH tablet/notebook console that Lenovo is doing.
 

B-Riz

Golden Member
Feb 15, 2011
1,594
756
136
I remember when Zen3 launched there was speculation that 5900X might perform better in some workloads than 5950X because it would have more cache per thread. I don't think that proved to be true, but theoretically I can understand how you could imagine that.

5950X would always perform better; it gets a top binned primary 8 core then a mid tier 8 core.

5900X was a mid tier 6 core and a lower tier 6 core.

5900 was a sleeper good chip, it was binned for low power performance, but opened up when given more power to work with.
 

StefanR5R

Elite Member
Dec 10, 2016
5,891
8,759
136
It may help because most games won't go above 16 threads which will ensure no thread needs to go outside the 8 core CCD.
And who assigns CPU affinities to the game threads to ensure that they don't wander off to another cache domain?

This test was done without locking the core clocks.

Edit,
That depends on nature of communication, what you advocate would help if the threads are talking to each other heavily, but if they are just workers spawned to do something and die then CCD2CCD overhead doesn't matter that much and 6C x 6C could be better due to more balanced nature [same amount of LLC per core, where 8x4 would require more complex scheduling logic to extract max performance]. After all there are some gaming benchmarks were 7900x is leading 7700x.
Incidentally, the Linux kernel's process scheduler ignores last level cache domains in its scheduling decisions for threads of the same process. This is in contrast to NUMA nodes, for which the kernel implements the default policy to try and keep all threads of a process on the same node. I don't know why such a policy is not applied WRT cache domains, but I suspect it is because the optimum for performance cannot be predicted by the kernel. I don't know what Windows' process scheduler does in this regard.
 
Last edited:

inquiss

Member
Oct 13, 2010
179
261
136
It may help because most games won't go above 16 threads which will ensure no thread needs to go outside the 8 core CCD. On 6 core CCD, the number of threads prove insufficient and some game/gpu driver threads need to be moved to the other CCD.

Like look here: https://www.techspot.com/review/2811-cpu-cache-vs-cores/

View attachment 104907
Not sure what you're saying here. Yes some games require more threads than 6, but how does that help when windows will put threads on both CCDs. You still get cross talk in 8+4, plus that doesn't address the point around binning. It doesn't make commercial sense, which is backed up by the fact we haven't seen this config.
 

inquiss

Member
Oct 13, 2010
179
261
136
It could work if they at least create some new 9905X SKU with dissimilar CCDs and look at the sales data after a year to find out what consumers prefer. This is now the 3rd generation where x900X SKU seems to be disliked and always has to be gotten rid of through firesales. They should just stop with this stupid SKU.

They also have the option of creating 9930X or something like that with one 8 core CCD and one 6 core CCD inbetween 9900X and 9950X for 28 threads and I bet it would sell better than 9900X.
12 core CPU's don't sell well, agreed. More cores is such a niche anyway. AMD has a killer product with 8 cores and cache. If you're into games you buy that. If you really want more cores on a consumer platform then 16c is the way to go. 12c is sort of unloved for good reason. It isn't core maxing, it isn't gaming maxing. An esoteric config within the 12c space, that costs money to assemble and explain to people doesn't make business sense, and we haven't seen AMD do it for precisely that reason.
 

CouncilorIrissa

Senior member
Jul 28, 2023
520
1,995
96
Is lacking love of Strix Halo here. No one understand how much this SoC is important to the whole computer market.
3N with RX 6750 XT performance.
Gpus becoming part of the CPU is future and m1 ultra is here to prove how much is possible to do an Ultimate APU.
Zero attention for the STXH tablet/notebook console that Lenovo is doing.
Plenty of people are interested in STXH, it's just that we're far away from its release. The hot topic is Granite Ridge dumpster fire.

I am very interested in learning how much Zen 5 is held back (if at all) by the antiquated uncore that should be fixed with the Halo.
 

inquiss

Member
Oct 13, 2010
179
261
136
Is lacking love of Strix Halo here. No one understand how much this SoC is important to the whole computer market.
3N with RX 6750 XT performance.
Gpus becoming part of the CPU is future and m1 ultra is here to prove how much is possible to do an Ultimate APU.
Zero attention for the STXH tablet/notebook console that Lenovo is doing.
Strix Halo I really want to be successful. This is what AMD can do when the naysayers in OEMs stop holding them back from doing something truly fun. Combining RAM between CPU and GPU makes so much sense.
 

StinkyPinky

Diamond Member
Jul 6, 2002
6,883
1,096
126
12 core CPU's don't sell well, agreed. More cores is such a niche anyway. AMD has a killer product with 8 cores and cache. If you're into games you buy that. If you really want more cores on a consumer platform then 16c is the way to go. 12c is sort of unloved for good reason. It isn't core maxing, it isn't gaming maxing. An esoteric config that costs money to assemble and explain to people doesn't make business sense, and we haven't seen AMD do it for precisely that reason.

I feel personally attacked
 

MS_AT

Senior member
Jul 15, 2024
207
497
96
And who assigns CPU affinities to the game threads to ensure that they don't wander off to another cache domain?
I would assume game developer as they best know the nature of their own workload. Look up Cyberpunk settings for CPU, they have very basic option for this exposed to the user. Generally there are windows api that let you outright pin workloads to specific cores [though they are generally not recommended as you might shoot yourself in the foot] or to suggest to the kernel it should keep specific threads together on specific sets of cores. It's even mentioned in publicly available resources from AMD, but I don't have them at hand and can't look them up right now.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |