Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 241 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Fjodor2001

Diamond Member
Feb 6, 2010
3,923
403
126
The companies buying TR CPUs are doing it for their professional engineers designing semiconductor chips or simulating stresses on mechanical designs, 3D design and rendering artists, etc... The cost of the machine pales in comparison to the cost of paying those people.
Then they might as well buy EPYC CPUs.

It’s questionable whether the volume of TR CPUs sold is sufficient to justify their existance going forward. DT + EPYC could be sufficient, if split so DT is up to 32C and above that EPYC (for Zen 5/6). Then no need for a separate tiny TR niche category squeezed inbetween.
 

itsmydamnation

Platinum Member
Feb 6, 2011
2,863
3,413
136
Then they might as well buy EPYC CPUs.

It’s questionable whether the volume of TR CPUs sold is sufficient to justify their existance going forward. DT + EPYC could be sufficient, if split so DT is up to 32C and above that EPYC (for Zen 5/6). Then no need for a separate tiny TR niche category squeezed inbetween.
what are you on about , TR is an EPYC platform , you know they have multiple platforms for different targets........

They’ve downgraded to 4C CPUs since they heard that ST perf is all that matters. 🤣

4C is probably the most common VM size you will find for server VM's ...... i know this must hurt you deeply...

show us on the dolly where the 4C CPU touched you.......
 

FlameTail

Diamond Member
Dec 15, 2021
3,151
1,800
106
Will Strix Halo iGPU actually be more efficient than a dGPU?

According to rumours, total SoC TDP for Strix Halo is 125W. Not sure how much of that is the GPU, but I have a feeling that the GPU is going to be less efficient than say... a low wattage RTX 4070 for example.
 
Reactions: Joe NYC

branch_suggestion

Senior member
Aug 4, 2023
294
630
96
Will Strix Halo iGPU actually be more efficient than a dGPU?

According to rumours, total SoC TDP for Strix Halo is 125W. Not sure how much of that is the GPU, but I have a feeling that the GPU is going to be less efficient than say... a low wattage RTX 4070 for example.
Strix Halo, first and foremost is about enabling laptops designs that are impossible with CPU+dGPU designs.
Both in regards to packaging and cooling. You pay a premium on the advanced packaging cost of putting it all together.
The config is RDNA3.5 2SE/4SA/20WGP, 32MB of MALL and 256-bit LPDDR5X-8533 for 270GB/s of membw. N4P is presumed and maximum GPU power is 125W.
I'll assume a 5% IPC increase and +20% clocks at iso power/-40% power at iso clocks vs N31/32. The closest parts are either the 7700XT or the good old 6700XT.
Using these numbers I think it can hit 7700XT clocks at ~110W, adding for IPC and an extra clock bump I will put final STX-Halo GPU performance @125W at 85% of the 7700XT. That places performance as similar to Navi22, which has the same core config at slightly over half the power.
The 4070 is a fatter part, at 125W it would probably be faster, though the numbers I have chosen are conservative without knowing STX1 numbers yet and knowing how bad clock/power scaling is on RDNA3.
4070M on the other hand would be smashed, which is the far more relevant comparison.

EDIT: SoC tile is N3E and I really should know better, I'll revise performance estimate to between the 3070 and 7700XT. It will probably match a 125W desktop 4070 and fit nicely in the huge perf gap between the 4070M and 4080M.
Cutdown AD104 perf, as was alluded earlier.
 
Last edited:

FlameTail

Diamond Member
Dec 15, 2021
3,151
1,800
106
Strix Halo, first and foremost is about enabling laptops designs that are impossible with CPU+dGPU designs.
Yes yes. I know the drill.
The config is RDNA3.5 2SE/4SA/20WGP, 32MB of MALL and 256-bit LPDDR5X-8533 for 270GB/s of membw. N4P is presumed and maximum GPU power is 125W.
What is MALL?
I'll assume a 5% IPC increase and +20% clocks at iso power/-40% power at iso clocks vs N31/32. The closest parts are either the 7700XT or the good old 6700XT.
Using these numbers I think it can hit 7700XT clocks at ~110W, adding for IPC and an extra clock bump I will put final STX-Halo GPU performance @125W at 85% of the 7700XT. That places performance as similar to Navi22, which has the same core config at slightly over half the power.
The 4070 is a fatter part, at 125W it would probably be faster, though the numbers I have chosen are conservative without knowing STX1 numbers yet and knowing how bad clock/power scaling is on RDNA3.
4070M on the other hand would be smashed, which is the far more relevant comparison.
4070M 125W gets smashed???

4050M 60W is already 3x perf of 780M.
4070M 125W will be atleast 2x that of the 4050 here. Hence 6x the 780M.

Strix Halo GPU vs Radeon 780M;
• 3.33x the CU count
• RDNA3.5 (vs RDNA3)
• N3E
• 273 GB/s

Based on these, I think we can optimistically say Strix Halo GPU will be like 4x-5x that of 780M.
 
Last edited:

branch_suggestion

Senior member
Aug 4, 2023
294
630
96
What is MALL?
Infinity Cache, if you rather, the memory attached last level cache which is L3 in this case.
4070M 125W gets smashed???
4050M 60W is already 3x perf of 780M.
View attachment 920894070M 125W will be atleast 2x that of the 4050 here. Hence 6x the 780M.
WLE is a glorified mobile benchmark, also only a single point of data. 780M is consuming way less power here and would get increasingly memory bottlenecked at higher power.
Mainly though, you massively overestimate the 4070M, it is only about 40-50% faster than the 4050M.
NV's laptop parts this gen are a complete joke, 4070M is AD106, which isn't much better than AD107. AMD isn't competitive, so they could do this, the parts are basically equal to the previous gen parts of equivalent branding.
Not beating the 4070M comfortably would be a failure.
 

jpiniero

Lifer
Oct 1, 2010
14,831
5,444
136
Strictly using Timespy, the average 4070M is like 10% faster than the average 40 CU 6800M. Course that's RDNA2, but I also don't think the power draw of the average Strix Halo laptop is going to be that high either. Like I'd expect it to be like 65 W max.
 

branch_suggestion

Senior member
Aug 4, 2023
294
630
96
Strictly using Timespy, the average 4070M is like 10% faster than the average 40 CU 6800M. Course that's RDNA2, but I also don't think the power draw of the average Strix Halo laptop is going to be that high either. Like I'd expect it to be like 65 W max.
About 15% behind on average from what I've seen. Lends very high confidence to Strix Halo beating the 4070M handily.
 

Glo.

Diamond Member
Apr 25, 2015
5,761
4,666
136
And in 5 years advanced nodes will be so dense and also relatively cheap that this sounds like a joke

People have constantly said node prices are increasing yet compute-per-dollar has only gone up as time goes on. For most people in 2018, 7nm and 5nm sounded like myths, and here we are in 2024 with relevant foundries steaming ahead to angstrom scale.
Logic density gains only slow down with smaller nodes, and memory density already is not scaling with smaller nodes.

I wonder where will come those massive gains in density to maintain moore's law? Magic? Vibranium Wafers?
 

Glo.

Diamond Member
Apr 25, 2015
5,761
4,666
136
Yes yes. I know the drill.

What is MALL?

4070M 125W gets smashed???

4050M 60W is already 3x perf of 780M.
View attachment 920894070M 125W will be atleast 2x that of the 4050 here. Hence 6x the 780M.

Strix Halo GPU vs Radeon 780M;
• 3.33x the CU count
• RDNA3.5 (vs RDNA3)
• N3E
• 273 GB/s

Based on these, I think we can optimistically say Strix Halo GPU will be like 4x-5x that of 780M.
RX 6500M has 64 bit memory bus, with 128 GB/s bandwidth, which means its close to 780M Bandwidth. With 6400 MHz DDR5 780M has 102 GB/s bandwdith available, which should be enough for it to be close to 6500M, right?

It isn't. 6500M is 50% faster GPU than 780M, solely thanks to 16 MB of Infinity Cache.

Strix Halo will have 32 MB of Infinity Cache, which will be available above 256 bit LPDDR5 bus and 270 GB/s bandwdith.

It will easily be competing with 4070M, because all of the bottlenecks should be alleviated because of the Infinity Cache.
 
Reactions: Tlh97

Shivansps

Diamond Member
Sep 11, 2013
3,873
1,527
136
RX 6500M has 64 bit memory bus, with 128 GB/s bandwidth, which means its close to 780M Bandwidth. With 6400 MHz DDR5 780M has 102 GB/s bandwdith available, which should be enough for it to be close to 6500M, right?

It isn't. 6500M is 50% faster GPU than 780M, solely thanks to 16 MB of Infinity Cache.

Strix Halo will have 32 MB of Infinity Cache, which will be available above 256 bit LPDDR5 bus and 270 GB/s bandwdith.

It will easily be competing with 4070M, because all of the bottlenecks should be alleviated because of the Infinity Cache.
remember the 780M is sharing the banwidth with the CPU, so the actual bandwidth could be considerably lower than that.
 

Tuna-Fish

Golden Member
Mar 4, 2011
1,421
1,750
136
Then they might as well buy EPYC CPUs.

It’s questionable whether the volume of TR CPUs sold is sufficient to justify their existance going forward. DT + EPYC could be sufficient, if split so DT is up to 32C and above that EPYC (for Zen 5/6). Then no need for a separate tiny TR niche category squeezed inbetween.

No. Epyc has lower clocks, which hurts on many workloads. Also, the market for low-end TR is, and has always been, customers who need more PCIe, not customers who need more threads. 32C AM5 would be useless for them.

TR has a market between Ryzen and EPYC, so long as it requires nothing expensive to make. TR is basically EPYC tuned differently, which makes it very cheap to make. They could never profitably make a different IO die for TR, for example, but as long as it uses the EPYC one it will be fine.
 

Kepler_L2

Senior member
Sep 6, 2020
460
1,895
106
Logic density gains only slow down with smaller nodes, and memory density already is not scaling with smaller nodes.

I wonder where will come those massive gains in density to maintain moore's law? Magic? Vibranium Wafers?
CFET, TMD and 3D stacking. Biggest problem going forward is cost, the "free lunch" aspect of Moore's Law is over.
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,423
2,914
136
RX 6500M has 64 bit memory bus, with 128 GB/s bandwidth, which means its close to 780M Bandwidth. With 6400 MHz DDR5 780M has 102 GB/s bandwdith available, which should be enough for it to be close to 6500M, right?

It isn't. 6500M is 50% faster GPU than 780M, solely thanks to 16 MB of Infinity Cache.
What solely thanks to 16MB IC, that's not true.
6500M/6550M has 25/41% higher BW than 780M even without IC and It's not shared between CPU and IGP.
Both of them also have 33% more CU than 780M.
Strix Halo will have 32 MB of Infinity Cache, which will be available above 256 bit LPDDR5 bus and 270 GB/s bandwdith.

It will easily be competing with 4070M, because all of the bottlenecks should be alleviated because of the Infinity Cache.
It's interesting how both you and @branch_suggestion already know that It will easily compete or beat 4070M without even knowing IGP's frequency.
 

Fjodor2001

Diamond Member
Feb 6, 2010
3,923
403
126
Also, the market for low-end TR is, and has always been, customers who need more PCIe, not customers who need more threads. 32C AM5 would be useless for them.
Of course, but I was not not talking about the low-end TR models at 12/16C. But those that mainly need more PCIe lanes can also get a low-end Epyc.
 

Glo.

Diamond Member
Apr 25, 2015
5,761
4,666
136
What solely thanks to 16MB IC, that's not true.
6500M/6550M has 25/41% higher BW than 780M even without IC and It's not shared between CPU and IGP.
Both of them also have 33% more CU than 780M
What I mean by this, is that the end result in perf difference is much bigger than just by counting CUs/ALUs, Clock frequency and memory bandwidth, even added up.
It's interesting how both you and @branch_suggestion already know that It will easily compete or beat 4070M without even knowing IGP's frequency.
We're talking about the highest end TDP configuration. And that should easily compete/beat 4070M, thanks to IC. In essence, what Strix Point Halo needs to do to compete or beat 4070M is to achieve 6850M performance.
 
Reactions: Tlh97
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |