Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

DisEnchantment · Sep 29, 2022

Speculate at will

Fjodor2001 · Jan 19, 2024

Ajay said:
The companies buying TR CPUs are doing it for their professional engineers designing semiconductor chips or simulating stresses on mechanical designs, 3D design and rendering artists, etc... The cost of the machine pales in comparison to the cost of paying those people.

Then they might as well buy EPYC CPUs.

It’s questionable whether the volume of TR CPUs sold is sufficient to justify their existance going forward. DT + EPYC could be sufficient, if split so DT is up to 32C and above that EPYC (for Zen 5/6). Then no need for a separate tiny TR niche category squeezed inbetween.

Fjodor2001 · Jan 19, 2024

Thunder 57 said:
Also, what's up with these slow ass forums? Anyone else having issues?

They’ve downgraded to a 4C CPU since they heard that ST perf is all that matters. 🤣

itsmydamnation · Jan 19, 2024

Fjodor2001 said:
Then they might as well buy EPYC CPUs.

It’s questionable whether the volume of TR CPUs sold is sufficient to justify their existance going forward. DT + EPYC could be sufficient, if split so DT is up to 32C and above that EPYC (for Zen 5/6). Then no need for a separate tiny TR niche category squeezed inbetween.

what are you on about , TR is an EPYC platform , you know they have multiple platforms for different targets........

Fjodor2001 said:
They’ve downgraded to 4C CPUs since they heard that ST perf is all that matters. 🤣

4C is probably the most common VM size you will find for server VM's ...... i know this must hurt you deeply...

show us on the dolly where the 4C CPU touched you.......

Goop_reformed · Jan 19, 2024

adroc_thurston said:
Well it ain't gonna change anything since he's getting a $349 6 core in but a few months.

So 6 cores zen 5 performs about the same as 8 cores zen 4. 1-6/8 = 25%. Hmmmm

TESKATLIPOKA · Jan 19, 2024

Goop_reformed said:
So 6 cores zen 5 performs about the same as 8 cores zen 4. 1-6/8 = 25%. Hmmmm

Are you sure?
With your calculation 4 core vs 8 cores would be 1-4/8= 50%.

It should have been
8/6 = 1.3333 -> 33.33%
or
1/(6/8) = 1.3333 -> 33.33%

FlameTail · Jan 19, 2024

Will Strix Halo iGPU actually be more efficient than a dGPU?

According to rumours, total SoC TDP for Strix Halo is 125W. Not sure how much of that is the GPU, but I have a feeling that the GPU is going to be less efficient than say... a low wattage RTX 4070 for example.

branch_suggestion · Jan 19, 2024

FlameTail said:
Will Strix Halo iGPU actually be more efficient than a dGPU?

According to rumours, total SoC TDP for Strix Halo is 125W. Not sure how much of that is the GPU, but I have a feeling that the GPU is going to be less efficient than say... a low wattage RTX 4070 for example.

Strix Halo, first and foremost is about enabling laptops designs that are impossible with CPU+dGPU designs.
Both in regards to packaging and cooling. You pay a premium on the advanced packaging cost of putting it all together.
The config is RDNA3.5 2SE/4SA/20WGP, 32MB of MALL and 256-bit LPDDR5X-8533 for 270GB/s of membw. N4P is presumed and maximum GPU power is 125W.
I'll assume a 5% IPC increase and +20% clocks at iso power/-40% power at iso clocks vs N31/32. The closest parts are either the 7700XT or the good old 6700XT.
Using these numbers I think it can hit 7700XT clocks at ~110W, adding for IPC and an extra clock bump I will put final STX-Halo GPU performance @125W at 85% of the 7700XT. That places performance as similar to Navi22, which has the same core config at slightly over half the power.
The 4070 is a fatter part, at 125W it would probably be faster, though the numbers I have chosen are conservative without knowing STX1 numbers yet and knowing how bad clock/power scaling is on RDNA3.
4070M on the other hand would be smashed, which is the far more relevant comparison.

EDIT: SoC tile is N3E and I really should know better, I'll revise performance estimate to between the 3070 and 7700XT. It will probably match a 125W desktop 4070 and fit nicely in the huge perf gap between the 4070M and 4080M.
Cutdown AD104 perf, as was alluded earlier.

FlameTail · Jan 19, 2024

branch_suggestion said:
Strix Halo, first and foremost is about enabling laptops designs that are impossible with CPU+dGPU designs.

Yes yes. I know the drill.

branch_suggestion said:
The config is RDNA3.5 2SE/4SA/20WGP, 32MB of MALL and 256-bit LPDDR5X-8533 for 270GB/s of membw. N4P is presumed and maximum GPU power is 125W.

What is MALL?

branch_suggestion said:
I'll assume a 5% IPC increase and +20% clocks at iso power/-40% power at iso clocks vs N31/32. The closest parts are either the 7700XT or the good old 6700XT.
Using these numbers I think it can hit 7700XT clocks at ~110W, adding for IPC and an extra clock bump I will put final STX-Halo GPU performance @125W at 85% of the 7700XT. That places performance as similar to Navi22, which has the same core config at slightly over half the power.
The 4070 is a fatter part, at 125W it would probably be faster, though the numbers I have chosen are conservative without knowing STX1 numbers yet and knowing how bad clock/power scaling is on RDNA3.
4070M on the other hand would be smashed, which is the far more relevant comparison.

4070M 125W gets smashed???

4050M 60W is already 3x perf of 780M.
4070M 125W will be atleast 2x that of the 4050 here. Hence 6x the 780M.

Strix Halo GPU vs Radeon 780M;
• 3.33x the CU count
• RDNA3.5 (vs RDNA3)
• N3E
• 273 GB/s

Based on these, I think we can optimistically say Strix Halo GPU will be like 4x-5x that of 780M.

rainy · Jan 19, 2024

FlameTail said:
What is MALL?

Memory Attached Last Level aka Infinity Cache.

branch_suggestion · Jan 19, 2024

FlameTail said:
What is MALL?

Infinity Cache, if you rather, the memory attached last level cache which is L3 in this case.

FlameTail said:
4070M 125W gets smashed???
4050M 60W is already 3x perf of 780M.
View attachment 920894070M 125W will be atleast 2x that of the 4050 here. Hence 6x the 780M.

WLE is a glorified mobile benchmark, also only a single point of data. 780M is consuming way less power here and would get increasingly memory bottlenecked at higher power.
Mainly though, you massively overestimate the 4070M, it is only about 40-50% faster than the 4050M.
NV's laptop parts this gen are a complete joke, 4070M is AD106, which isn't much better than AD107. AMD isn't competitive, so they could do this, the parts are basically equal to the previous gen parts of equivalent branding.
Not beating the 4070M comfortably would be a failure.

jpiniero · Jan 19, 2024

Strictly using Timespy, the average 4070M is like 10% faster than the average 40 CU 6800M. Course that's RDNA2, but I also don't think the power draw of the average Strix Halo laptop is going to be that high either. Like I'd expect it to be like 65 W max.

branch_suggestion · Jan 19, 2024

jpiniero said:
Strictly using Timespy, the average 4070M is like 10% faster than the average 40 CU 6800M. Course that's RDNA2, but I also don't think the power draw of the average Strix Halo laptop is going to be that high either. Like I'd expect it to be like 65 W max.

About 15% behind on average from what I've seen. Lends very high confidence to Strix Halo beating the 4070M handily.

Glo. · Jan 19, 2024

dr1337 said:
And in 5 years advanced nodes will be so dense and also relatively cheap that this sounds like a joke

People have constantly said node prices are increasing yet compute-per-dollar has only gone up as time goes on. For most people in 2018, 7nm and 5nm sounded like myths, and here we are in 2024 with relevant foundries steaming ahead to angstrom scale.

Logic density gains only slow down with smaller nodes, and memory density already is not scaling with smaller nodes.

I wonder where will come those massive gains in density to maintain moore's law? Magic? Vibranium Wafers?

Glo. · Jan 19, 2024

FlameTail said:
Yes yes. I know the drill.

What is MALL?

4070M 125W gets smashed???

4050M 60W is already 3x perf of 780M.
View attachment 920894070M 125W will be atleast 2x that of the 4050 here. Hence 6x the 780M.

Strix Halo GPU vs Radeon 780M;
• 3.33x the CU count
• RDNA3.5 (vs RDNA3)
• N3E
• 273 GB/s

Based on these, I think we can optimistically say Strix Halo GPU will be like 4x-5x that of 780M.

RX 6500M has 64 bit memory bus, with 128 GB/s bandwidth, which means its close to 780M Bandwidth. With 6400 MHz DDR5 780M has 102 GB/s bandwdith available, which should be enough for it to be close to 6500M, right?

It isn't. 6500M is 50% faster GPU than 780M, solely thanks to 16 MB of Infinity Cache.

Strix Halo will have 32 MB of Infinity Cache, which will be available above 256 bit LPDDR5 bus and 270 GB/s bandwdith.

It will easily be competing with 4070M, because all of the bottlenecks should be alleviated because of the Infinity Cache.

Shivansps · Jan 19, 2024

Glo. said:
RX 6500M has 64 bit memory bus, with 128 GB/s bandwidth, which means its close to 780M Bandwidth. With 6400 MHz DDR5 780M has 102 GB/s bandwdith available, which should be enough for it to be close to 6500M, right?

It isn't. 6500M is 50% faster GPU than 780M, solely thanks to 16 MB of Infinity Cache.

Strix Halo will have 32 MB of Infinity Cache, which will be available above 256 bit LPDDR5 bus and 270 GB/s bandwdith.

It will easily be competing with 4070M, because all of the bottlenecks should be alleviated because of the Infinity Cache.

remember the 780M is sharing the banwidth with the CPU, so the actual bandwidth could be considerably lower than that.

Tuna-Fish · Jan 19, 2024

Fjodor2001 said:
Then they might as well buy EPYC CPUs.

It’s questionable whether the volume of TR CPUs sold is sufficient to justify their existance going forward. DT + EPYC could be sufficient, if split so DT is up to 32C and above that EPYC (for Zen 5/6). Then no need for a separate tiny TR niche category squeezed inbetween.

No. Epyc has lower clocks, which hurts on many workloads. Also, the market for low-end TR is, and has always been, customers who need more PCIe, not customers who need more threads. 32C AM5 would be useless for them.

TR has a market between Ryzen and EPYC, so long as it requires nothing expensive to make. TR is basically EPYC tuned differently, which makes it very cheap to make. They could never profitably make a different IO die for TR, for example, but as long as it uses the EPYC one it will be fine.

Kepler_L2 · Jan 19, 2024

Glo. said:
Logic density gains only slow down with smaller nodes, and memory density already is not scaling with smaller nodes.

I wonder where will come those massive gains in density to maintain moore's law? Magic? Vibranium Wafers?

CFET, TMD and 3D stacking. Biggest problem going forward is cost, the "free lunch" aspect of Moore's Law is over.

TESKATLIPOKA · Jan 19, 2024

Glo. said:
RX 6500M has 64 bit memory bus, with 128 GB/s bandwidth, which means its close to 780M Bandwidth. With 6400 MHz DDR5 780M has 102 GB/s bandwdith available, which should be enough for it to be close to 6500M, right?

It isn't. 6500M is 50% faster GPU than 780M, solely thanks to 16 MB of Infinity Cache.

What solely thanks to 16MB IC, that's not true.
6500M/6550M has 25/41% higher BW than 780M even without IC and It's not shared between CPU and IGP.
Both of them also have 33% more CU than 780M.

Glo. said:
Strix Halo will have 32 MB of Infinity Cache, which will be available above 256 bit LPDDR5 bus and 270 GB/s bandwdith.

It will easily be competing with 4070M, because all of the bottlenecks should be alleviated because of the Infinity Cache.

It's interesting how both you and @branch_suggestion already know that It will easily compete or beat 4070M without even knowing IGP's frequency.

Fjodor2001 · Jan 19, 2024

Tuna-Fish said:
Also, the market for low-end TR is, and has always been, customers who need more PCIe, not customers who need more threads. 32C AM5 would be useless for them.

Of course, but I was not not talking about the low-end TR models at 12/16C. But those that mainly need more PCIe lanes can also get a low-end Epyc.

adroc_thurston · Jan 19, 2024

FlameTail said:
4070M 125W gets smashed???

Yea that's easy.

FlameTail said:
Will Strix Halo iGPU actually be more efficient than a dGPU?

Yeah kinda the point of shipping console SoCs in laptop.

Glo. · Jan 19, 2024

TESKATLIPOKA said:
What solely thanks to 16MB IC, that's not true.
6500M/6550M has 25/41% higher BW than 780M even without IC and It's not shared between CPU and IGP.
Both of them also have 33% more CU than 780M

What I mean by this, is that the end result in perf difference is much bigger than just by counting CUs/ALUs, Clock frequency and memory bandwidth, even added up.

TESKATLIPOKA said:
It's interesting how both you and @branch_suggestion already know that It will easily compete or beat 4070M without even knowing IGP's frequency.

We're talking about the highest end TDP configuration. And that should easily compete/beat 4070M, thanks to IC. In essence, what Strix Point Halo needs to do to compete or beat 4070M is to achieve 6850M performance.

Joe NYC · Jan 19, 2024

Fjodor2001 said:
Reverse Moore's law. No we're not seeing that yet. Stagnated Moore's law is bad enough.

Maybe China will one day save the Moote's Law

Fjodor2001 · Jan 20, 2024

Joe NYC said:
Maybe China will one day save the Moote's Law

You mean if/when China is able to produce chips on latest node using only China developed tech?

TESKATLIPOKA · Jan 20, 2024

Joe NYC said:
Maybe China will one day save the Moote's Law

They won't because there is no Moote's law.

P.S. I don't think this is the correct thread for this debate.

RnR_au · Jan 20, 2024

Fjodor2001 said:
You mean if/when China is able to produce chips on latest node using only China developed tech?

Not likely in our life time under the current administration. ASML is the ace the West holds tightly to their chest.

Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Golden Member

Diamond Member

Diamond Member

Platinum Member

Senior member

Platinum Member

Diamond Member

Senior member

Diamond Member

Senior member

Senior member

Lifer

Senior member

Diamond Member

Diamond Member

Diamond Member

Golden Member

Senior member

Platinum Member

Diamond Member

Diamond Member

Diamond Member

Platinum Member

Diamond Member

Platinum Member

Golden Member