- Mar 3, 2017
- 1,687
- 6,243
- 136
They already support both AVX512-VNNI and AVX512-BF16.AVX-VNNI support
100% performance increase per core going from Zen3 -> Zen5, really? What do you base that on?
Also, Zen5 may very well have memory controller and other CPU design improvements + more cache, making it less memory bandwidth hungry than Zen3, all else equal.
Of course, more memory bandwidth is always better. But in several workloads the difference was not that big. And note that we’re talking about twice the amount of cores (64), and half the bandwidth (DDR4 3200). So with 32C and DDR5 6400 you’ll have 4x the memory bandwidth per core compared to that, when using the same number of channels.
If (according to sources like OC3D, GN, TS) a 24c CPU from another brand matches a 7950x in MT while having a 1% difference between DDR4 and DDR5, then there is no basis to say memory bandwidth is an issue in the first place. A 24c Zen 4 CPU would be fine on DDR4 and a desktop Zen 5 CPU with 32c let alone 24c would also still be fine on DDR5 for the market segment that its targeting.
Thanks for finding this. Though in several of these tests, 5965WX (24 cores) is as fast or faster than 5995WX (64 cores). Maybe a 16-core CPU would be even faster...Here's an article. 1, 2, 4, 8, memory channels and 32, 64 cores tested.
AMD Threadripper PRO Memory Channel Performance Scaling
https://www.pugetsystems.com/labs/articles/amd-threadripper-pro-memory-channel-performance-scaling/
Thanks for this find![Phoronix] also tested memory scaling on the 5965WX and found that in some workloads there is an astronomical difference in performance. In others not so much but if the 5965WX was only using 4 channels it would be closer to a 14900K in this chart on the basis of that scaling test.
Consider the chances you are NOT being paid for it, which IMO is true for much bigger part of the world than it isnt. In US, cashiers at stores can be paid as much money per hour, as some people with master degrees in some parts of Europe.I also do not see a reason to offer more. If you do need more mt than what 16 Zen5 cores can offer then chances are you are being paid for it and you should probably run the numbers on a TR system because long term it will pay for itself.
What's wrong with threadripper 7970X? (or 9970X for Zen 5 when it comes out)For me would 32 core Zen 5 be worth easily $2k, maybe 2.5k. Only alternatives I have are second-hand Epyc CPUs or maybe Siena.
Similar bandwidth arguments were around before release of 16 core Zen 2. Both the Puget and Phoronix reviews illustrate that there are workloads where memory bandwidth does not play a significant role (Phoronix - compilation, Puget - rendering). So there are some of us running such workloads and waiting for AMD to release CPUs with more cores, as we know we would be benefiting from it. So the argument of "more cores with 2CH memory are not needed" is not valid *in general* - please do not judge for all of us. The main reason that AMD is not increasing core count is lack of competition and their own profits cannibalization, but it is not that there would not be users of such systems.
For me would 32 core Zen 5 be worth easily $2k, maybe 2.5k. Only alternatives I have are second-hand Epyc CPUs or maybe Siena. Those are the markets such hypothetical 32 core would cannibalize.
Total costs: the $2-2.5k for 32 core Zen 5 translate to $1.5k-2k in 32 core Zen 4 and the motheboards cost also additional $500. I rather built two machines with 7950X for $3k.What's wrong with threadripper 7970X? (or 9970X for Zen 5 when it comes out)
That is surely part of the consideration that AMD is doing. I still expect the higher-products cannibalization being more relevant to AMD's economical decision.There are users for everything. TR is already super niche in DIY and even as a pro workstation lineup and I think higher core count Ryzen desktop parts would also be super niche.
It all comes down to the fact the juice is not worth the squeeze.
See this is below of what Zen3 did and you can't deliver below of what Zen3 did with a fatter tock.18% single-threaded total performance improvement over Zen 4, and 15% in multi-threaded load: higher gain in FP (wider core), but lower gain in INT loads (consumption)
See this is below of what Zen3 did and you can't deliver below of what Zen3 did with a fatter tock.
Not at AMD (or anywhere else that matters) really.Sure you can
Not to the degree that matters here.but it is entirely possible to miss targets
^^^^ This. AMD is intentionally sandbagging the core count on desktop CPUs now. This time I think it'll be Intel that leads the dance to higher core count, and AMD will then reluctantly follow.I am staying curious if Intel will be the one to push the core count, whether the leaks about second batch of Arrow Lake with 8+32 cores are sound, as I expect that would push AMD into releasing higher core parts.
They don't have to do anything.This time I think it'll be Intel that leads the dance to higher core count, and AMD will then reluctantly follow.
Yet even though they maybe didn't need to, AMD did release 24-, 32-, and 64-core CPUs (and actually also a 96-core CPU) to the DIY market, in the Zen 4 generation. Just because the price-per-core of these CPUs is higher than, say, that of a run-of-the-mill 8-core CPU, and they only work in top-tier mainboards¹ doesn't mean that these products don't exist or cannot be bought. Yes, their price level is certainly somewhat influenced by what the direct competitor is offering.But same logic is valid to 16 core from Zen 2 lineup. The reason why they released it back then was that they needed the complete performance lead, while now such a 32 core part is not needed.
I commented earlier on how CPU image renderer benchmarks fail to reflect real applications. Now, code compilation: If somebody works with a large code base on a desktop computer, i.e. a single-user PC, then they compile the code just incrementally much of the time. Which has low parallelism. IOW, high parallelism is rare for the use case of code compilation, making it a niche.Both the Puget and Phoronix reviews illustrate that there are workloads where memory bandwidth does not play a significant role (Phoronix - compilation, Puget - rendering).
Are we doing The Price is Right rules?Total costs: the $2-2.5k for 32 core Zen 5 translate to $1.5k-2k in 32 core Zen 4 and the motheboards cost also additional $500. I rather built two machines with 7950X for $3k.
That is surely part of the consideration that AMD is doing. I still expect the higher-products cannibalization being more relevant to AMD's economical decision.
But same logic is valid to 16 core from Zen 2 lineup. The reason why they released it back then was that they needed the complete performance lead, while now such a 32 core part is not needed. I am staying curious if Intel will be the one to push the core count, whether the leaks about second batch of Arrow Lake with 8+32 cores are sound, as I expect that would push AMD into releasing higher core parts.
To the main topic of the thread, if people are betting the performance lead, here is my guess (based on just reading this thread). 18% single-threaded total performance improvement over Zen 4, and 15% in multi-threaded load: higher gain in FP (wider core), but lower gain in INT loads (consumption) - but I wish hyping by @adroc_thurston of the front-end is true and I am wrong. Zen 4 already increased MT performance a lot, so I think it will be more challenging without significant manufacturing node jump. Where is the guess from @inf64? What if reviews are out in 3 months, you do not wish to miss the chance extend your record of successful guesses .
No.And are they making a new IOd with more GMI links?
8P + 2x16E has been rumored for second batch of Arrow Lake. Would be very good for a lot of the MT use cases being discussed here.Otherwise a 32 core part would have to be all dense cores and not worth buying in the slightest. An emergency edition product not worth making.
Would be very good for a lot of the MT use cases being discussed here.
You mean the subset of the market that you happen to be interested in. But not everyone only plays games.But not the target market which is games.
DIY CPU is pretty much games.But not everyone only plays games.
For games a 16C Zen5 is also not the best option thanks to 2 CCDs.But not the target market which is games.
Good for Intel but it doesn't suddenly make a 32 Zen 5C product desirable. If IOd is the same then 24 cores mismatched between Zen 5 and Zen 5C seems the maximum useful configuration that doesn't give up 1T performance.8P + 2x16E has been rumored for second batch of Arrow Lake. Would be very good for a lot of the MT use cases being discussed here.