Speculation: Ryzen 4000 series/Zen 3

Page 31 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

amd6502

Senior member
Apr 21, 2017
971
360
136
Well probably lots of wishful thinking on my part since i want my next notebook to be ULP and have a threadripping mode too. You almost surely are correct on that and unfortunately i'ts going to be another year, end 2020 till we know for sure.

Right now I think essentially zero chance on SMT4 and guesstimate of very low chance (~3% or 1:30) only of 4-way multithreading .

Hopefully Nosta finds more interesting patents.
 

Richie Rich

Senior member
Jul 28, 2019
470
229
76
Put a fork in it, SMT4 is dead. It was never a thing. There was never any evidence to suggest it.
I don't care about SMT4 itself, I want wide core with high IPC. Me as a customer I demand good products. And I cannot be happy that Apple mobile CPU A11 from 2017 is much stronger then Zen 3 will be in 2020. That's a shame for x86.

I'm pretty sure AMD wouldn't leak SMT4 feature on that Zen 2 presentation. Yeah, hope's still alive.
On the other side. Canceling ambitious projects might be the reason why Keller left AMD again in 2015. He left AMD in 1999 when they canceled his high performance Hammer CPU. History repeats? I wouldn't be surprised.
 
Last edited:

Kedas

Senior member
Dec 6, 2018
355
339
136
1) TSMCs High Volume Production ready for 7nm+ !!

2) AMDs Design of ZEN 3 is ready.

3) We know it's the same AM4 socket (Ryzen 5000, Zen 4 is DDR5)

So what are we waiting for?
 
Reactions: lightmanek

Gideon

Golden Member
Nov 27, 2007
1,709
3,927
136
How are people still parroting this meme
Why is this a meme? I agree that apple cores are different, they sacrifice considerable density to achive what they do, etc ... but they are considerably wider and considerably faster in most general-purpose operations, even when you disregard full stack optimizations, etc.

E.g. if Zen4 ends up being 50% wider, then why is this a "meme"?

EDIT: I get if people don't take specint2006 and other synthetic workloads seriously, but there have been plenty of other cases. One software dev tried some single-threaded self-made workloads (impossible to "optimize" by vendor or full-stack advantages, apple supposedly has) on both Desktop Mac Pro and IPhone and the phone still had the same (or better) ST performance, despite running Ghz's slower. A12/A13 has the best IPC in the business, and this was also said by Anandtech's Andrei.
 
Last edited:

DrMrLordX

Lifer
Apr 27, 2000
21,794
11,143
136
I don't care about SMT4 itself, I want wide core with high IPC. Me as a customer I demand good products. And I cannot be happy that Apple mobile CPU A11 from 2017 is much stronger then Zen 3 will be in 2020. That's a shame for x86.

A11 isn't really stronger than Zen2. What makes you think it'll be stronger than Zen3? It might be stronger at some arbitrary low-power point but that's functionally meaningless. Anyway, SMT2 will do quite well on a wider core. Instead of 25-30% performance improvement, we might see more gains. 40% is not too much to hope for in applications that can't saturate the pipeline with just one thread per core.

So what are we waiting for?

For Zen2 to go through its product cycle. July 2020 here we come!
 

DisEnchantment

Golden Member
Mar 3, 2017
1,684
6,227
136
New AMD Patent Application
Prefetch data from RAM into L3 to reduce latency. With those big L3s this could mean something.

20190294546 - PREFETCHER BASED SPECULATIVE DYNAMIC RANDOM-ACCESS MEMORY READ REQUEST TECHNIQUE

A method includes monitoring a request rate of speculative memory read requests from a penultimate-level cache to a main memory. The speculative memory read requests correspond to data read requests that missed in the penultimate-level cache. A hit rate of searches of a last-level cache for data requested by the data read requests is monitored. Core demand speculative memory read requests to the main memory are selectively enabled in parallel with searching of the last-level cache for data of a corresponding core demand data read request based on the request rate and the hit rate. Prefetch speculative memory read requests to the main memory are selectively enabled in parallel with searching of the last-level cache for data of a corresponding prefetch data read request based on the request rate and the hit rate.
 

Vattila

Senior member
Oct 22, 2004
805
1,394
136
What if Zen 3 has an interposer? How would that change number of links?

It should have no effect. The interconnect between L3 slices is implemented on the CPU chiplet.

An interposer should allow more complex, wider, faster and more efficient interconnect between chiplets, though, since a silicon interposer allows much finer metal layers and much lower energy-per-bit.
 

Richie Rich

Senior member
Jul 28, 2019
470
229
76
A11 isn't really stronger than Zen2.
A12 has 158% of Skylake IPC in SPECint. A11 is slower but not much because it has 6xALUs too. It is nice example that 6xALU core needs some evolution steps to get max performance (pick the lowest fruits).

we might see more gains. 40% is not too much to hope for
Good point. If Zen 2 SMT2 can gain +20% more performance this means average ALU loading is 80%. With 6xALU core you have base of 150%... so theoretically there might be +70% gain ( to Zen 2). However according to Zen 3 ST it would be around 40%. That's massive gain.
 

NTMBK

Lifer
Nov 14, 2011
10,269
5,134
136
It should have no effect. The interconnect between L3 slices is implemented on the CPU chiplet.

An interposer should allow more complex, wider, faster and more efficient interconnect between chiplets, though, since a silicon interposer allows much finer metal layers and much lower energy-per-bit.

Of course, an active interposer could move some logic off the compute die and into the interposer, opening up all sorts of options for topology.
 

Ajay

Lifer
Jan 8, 2001
16,094
8,104
136
New AMD Patent Application
Prefetch data from RAM into L3 to reduce latency. With those big L3s this could mean something.

20190294546 - PREFETCHER BASED SPECULATIVE DYNAMIC RANDOM-ACCESS MEMORY READ REQUEST TECHNIQUE

A method includes monitoring a request rate of speculative memory read requests from a penultimate-level cache to a main memory. The speculative memory read requests correspond to data read requests that missed in the penultimate-level cache. A hit rate of searches of a last-level cache for data requested by the data read requests is monitored. Core demand speculative memory read requests to the main memory are selectively enabled in parallel with searching of the last-level cache for data of a corresponding core demand data read request based on the request rate and the hit rate. Prefetch speculative memory read requests to the main memory are selectively enabled in parallel with searching of the last-level cache for data of a corresponding prefetch data read request based on the request rate and the hit rate. View attachment 11717

Wait a minute here. How much memory bandwidth does Zen3 have in order to have significant enough read throughput to make speculative reads?!
Heh, and why does that patent show only 4 cores? I don't this this Patent is for Zen3. Seems like AMD is engaged in some patent bracketing or something.
 

soresu

Platinum Member
Dec 19, 2014
2,941
2,164
136
A12 has 158% of Skylake IPC in SPECint. A11 is slower but not much because it has 6xALUs too. It is nice example that 6xALU core needs some evolution steps to get max performance (pick the lowest fruits).
Ah, but at what clock freq does power consumption jump through the roof due to uArch mobile optimisations?

And we already know the intrinsic vector length limits of NEON SIMD are below that of AMD, let alone Intel with AVX512.

Of course this will change in the future with SVE2, but that is then, this is now.

There still seems to be a gulf between benchmarking the 2 platforms that respects all possible performance avenues, and vector/SIMD length is a big one in certain use cases.
 

soresu

Platinum Member
Dec 19, 2014
2,941
2,164
136
Of course, an active interposer could move some logic off the compute die and into the interposer, opening up all sorts of options for topology.
I thought the whole point of the interposer was interconnect, surely integrating the IO would be the better use case then?
 
Reactions: DarthKyrie

scannall

Golden Member
Jan 1, 2012
1,947
1,638
136
Ah, but at what clock freq does power consumption jump through the roof due to uArch mobile optimisations?

And we already know the intrinsic vector length limits of NEON SIMD are below that of AMD, let alone Intel with AVX512.

Of course this will change in the future with SVE2, but that is then, this is now.

There still seems to be a gulf between benchmarking the 2 platforms that respects all possible performance avenues, and vector/SIMD length is a big one in certain use cases.
iOS is OSX, with a touch interface. Should be pretty easy to compare.
 

soresu

Platinum Member
Dec 19, 2014
2,941
2,164
136
iOS is OSX, with a touch interface. Should be pretty easy to compare.
Should be, and yet we still have these strangely limited benchmarks that miss a crucial area of modern CPU performance in the SIMD execution.

Dunno how we would go about comparing them though - perhaps dav1d would suffice to at least test an AVX2 cpu vs a NEON cpu, but dav1d lacks AVX512 code at present to compare further.
 

Gideon

Golden Member
Nov 27, 2007
1,709
3,927
136
Should be, and yet we still have these strangely limited benchmarks that miss a crucial area of modern CPU performance in the SIMD execution.

Dunno how we would go about comparing them though - perhaps dav1d would suffice to at least test an AVX2 cpu vs a NEON cpu, but dav1d lacks AVX512 code at present to compare further.
Geekbench has had AVX512 for ages, why not use it?
 

DrMrLordX

Lifer
Apr 27, 2000
21,794
11,143
136
A12 has 158% of Skylake IPC in SPECint.

Yeah, but . . .

Ah, but at what clock freq does power consumption jump through the roof due to uArch mobile optimisations?

. . . ah, you beat me to it. A11 (and A12) don't clock high enough to be 158% faster than Skylake, Zen2, or Zen3. Only the Apple design team really knows why someone, somewhere hasn't tried making a higher-clocked version of their cores to set the world on fire. They ARE impressive in what they've gained over the years. AMD should be taking notes. But let's be realistic here.

Bringing it back to Zen3 . . . yes, I think AMD could benefit from a wider core. And I'll repeat my point that SMT2 (not 4) will make it pretty easy for everyday users to exploit that wider core. I just don't think AMD needs to lose any sleep over the possibility that Zen3 might be slower than some Apple SoC at such a low clockspeed that nobody's really going to care about that comparison anyway. Zen3 might lock horns with an Axx variant in the notebook sector, eventually. But the software they run will be so different that it'll be hard to make reliable comparisons between the two.
 

Richie Rich

Senior member
Jul 28, 2019
470
229
76
Apple SoC at such a low clockspeed that nobody's really going to care about that comparison anyway
Don't be influenced by desktop CPUs. Server 64c Epyc 7742 has a base frequency 2.25 GHz (may boost to 2.5 GHz within TDP). Apple A13 runs at 2.66 GHz too.... so for servers is freqency identical however performance is around +50% higher for fruit machine A13. Power consumption for A13 is around 4W, subtract consumption of GPU and idling/sleeping 5 more cores, it can be 3.5W x 64c = 224 W (Epyc has TDP 225W). Pretty comparable consumption with massive performance gain +50%.

6xALUs is killing feature. That is loud alarm for Intel and AMD and they should lose a sleep. So far they are lucky that this 6xALU beast is bounded in iPhone only thanks to Apple management. Steve Jobs was very challenging person and IMHO he would had a courage to change server business by server version of their 6xALU beast (Apple needs for their cloud service thousands servers too). And cloud service allows to keep HW in Apple's hands by selling service instead of HW.

Zen 3 with 6xALUs will be already 3 years behind Apple in CPU technology (A11 appeared in 2017). If Zen 3 won't be wide core, then it is tragedy for x86 and ARM with Cortex A78 will take server and laptop markets. Don't forget how ended up superior archs like IBM PowerPC, Itanium, Motorola 68000, DEC Alpha - all these were smashed by cheap, mass produced and thus faster evolving black horse called x86. Today the history repeats, just this time the black horse is ARM.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |