Discussion Speculation: Zen 4 (EPYC 4 "Genoa", Ryzen 7000, etc.)

Page 297 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Vattila

Senior member
Oct 22, 2004
805
1,394
136
Except for the details about the improvements in the microarchitecture, we now know pretty well what to expect with Zen 3.

The leaked presentation by AMD Senior Manager Martin Hilgeman shows that EPYC 3 "Milan" will, as promised and expected, reuse the current platform (SP3), and the system architecture and packaging looks to be the same, with the same 9-die chiplet design and the same maximum core and thread-count (no SMT-4, contrary to rumour). The biggest change revealed so far is the enlargement of the compute complex from 4 cores to 8 cores, all sharing a larger L3 cache ("32+ MB", likely to double to 64 MB, I think).

Hilgeman's slides did also show that EPYC 4 "Genoa" is in the definition phase (or was at the time of the presentation in September, at least), and will come with a new platform (SP5), with new memory support (likely DDR5).



What else do you think we will see with Zen 4? PCI-Express 5 support? Increased core-count? 4-way SMT? New packaging (interposer, 2.5D, 3D)? Integrated memory on package (HBM)?

Vote in the poll and share your thoughts!
 
Last edited:
Reactions: richardllewis_01

Exist50

Platinum Member
Aug 18, 2016
2,452
3,101
136
Considering FP workloads need the most bandwidth and Zen 4c is rumored to halve the AVX512 throughput over Zen 4 the approach you describe makes perfect sense to me.
I'd take those rumors about halving AVX512 throughput with a grain of salt. My understanding is that Zen 4 and 4c are pretty much architecturally identical.

That said, cloud tends to be int heavy, and relatively insensitive to L3 cache. I think they could get away with less die to die connectivity if necessary.
 

Exist50

Platinum Member
Aug 18, 2016
2,452
3,101
136
All AMD said is that the ISA was compatible. They didn't say anything about performance.
They didn't, but that doesn't mean it has changed either. And frankly, AVX512 is probably the worst thing to target from a design effort standpoint. I'm not even sure it's able to be cracked, and even if its possible, that would be a substantial engineering investment for dubious gain. I'd be willing to bet we see the same behavior across Zen 4 and 4c.
 

coercitiv

Diamond Member
Jan 24, 2014
6,387
12,812
136
Has anyone seen motherboard costs for the x670 chipsets?
I saw 1 article that showed the lowest (no PCIE 5.0 slot) for over 300$ USD
All I can tell is the prices will be higher than Z690 on average, and this is based on exactly one of those early listed boards, the MSI’s PRO X670-P WIFI at around $300.

When compared with the Z690-A PRO WiFi, which started out as a $220 board, the newer X670-P comes with an upgraded VRM of 14+2+1 80A stages versus 14+1+1 55A stages. The VRM cooling also loks to be improved in terms of metal mass, as does the chipset cooler. On top of that the AMD board seems to be a 7-layer PCB design, versus 6-layer on the Intel board (we need confirmation on this though). Last comes the audio part, which gets an upgrade as well.

So we have a "low cost" X670 board that is already an upgrade over the $220 board from last year, with higher BOM. On top of that we add inflation. Even if we take into consideration the "early adopter tax", I think the MSI PRO X670-P WIFI will hover around $250-270 after the price settles to a reasonable level. And as you already said, this is mostly a PCIe 4.0 board, only one M2 slot is PCIe 5.
 

Timmah!

Golden Member
Jul 24, 2010
1,462
725
136
These boards were model compared to the same x570 model was 220$ ish which compared to the same x670 model at 500$.
The 300$ board was the lowend x670 chipset.
Which is where my concern comes from.
That just seems outlandish.

i saw the preliminary prices for Asus boards on wccftech. I hope its not true and someone pulled those from you know where, cause cheapest 670E was like 800 EUROs, the Hero was like 1100 EUROs and Crosshair Extreme 1475 EUROs. I was like what. You could have WRX80 boards at those prices. It would explain the "E" in the naming though, extreme for extreme pricing. Cause sure as hell there is nothing "extreme" about it otherwise.
 

NostaSeronx

Diamond Member
Sep 18, 2011
3,688
1,222
136
I'm not even sure it's able to be cracked, and even if its possible, that would be a substantial engineering investment for dubious gain. I'd be willing to bet we see the same behavior across Zen 4 and 4c.
Port 0 and Port 1, every Intel AVX512 design.


AVX512 has been cracked from the start. Client so far has never seen the dedicated AVX512.

Family 19h FPU:

Purple = AVX512 (Full decode width)
Red = Low AVX512
Blue = High AVX512

Zen4c cuts FE/LSU/FPU but keeps full Integer execution.
Red&Blue execution is temporal with Zen4c: Port 0, 2, 4 are the only units available.
 
Last edited:

NostaSeronx

Diamond Member
Sep 18, 2011
3,688
1,222
136
Please don't waste my time trying to correct me if you don't even understand the terms I'm using.
You used the term cracked. So, far no one has explicitly stated AVX512 will not be decoded fully.

AVX512 is full-width till it hits NSQ(FPU Mapper) then it splits to Scheduler 0 and Scheduler 1... or it loads first half to Scheduler 0 then the second half to Scheduler 0.

Cracked via decode where it is a double macro-op = not occurring
Cracked via execution = Same as Intel's client cores as Port0&Port1
 

LightningZ71

Golden Member
Mar 10, 2017
1,659
1,942
136
Port 0 and Port 1, every Intel AVX512 design.
View attachment 66099

AVX512 has been cracked from the start. Client so far has never seen the dedicated AVX512.

Family 19h FPU:
View attachment 66102
Purple = AVX512 (Full decode width)
Red = Low AVX512
Blue = High AVX512

Zen4c cuts FE/LSU/FPU but keeps full Integer execution.
Red&Blue execution is temporal with Zen4c: Port 0, 2, 4 are the only units available.

Incorrect on AVX512 support (two ports on every design...)

[For Ice Lake Client, carried over to Rocket Lake, Alder Lake and we speculate Raptor Lake CLIENT]
"For the execution ports, now that Intel has moved AVX-512 into the mainstream Core design, there are a few changes. AVX-512 is now supported on Port 0 (FMA, ALU, Shift) and Port 5 (ALU, Shuffle). There is only one AVX-512 port, but also a 256-bit FMA port, so either 1x512-bit or 2x256-bit throughput is possible" - https://www.anandtech.com/show/14514/examining-intels-ice-lake-microarchitecture-and-sunny-cove/3

That's half the throughput of AVX-512 on their Xeon SKUs. This is evident on benchmarks that focus on that support. Client supports the operations, but with less throughput.
 

NostaSeronx

Diamond Member
Sep 18, 2011
3,688
1,222
136
Incorrect on AVX512 support (two ports on every design...)

[For Ice Lake Client, carried over to Rocket Lake, Alder Lake and we speculate Raptor Lake CLIENT]
"For the execution ports, now that Intel has moved AVX-512 into the mainstream Core design, there are a few changes. AVX-512 is now supported on Port 0 (FMA, ALU, Shift) and Port 5 (ALU, Shuffle). There is only one AVX-512 port, but also a 256-bit FMA port, so either 1x512-bit or 2x256-bit throughput is possible" - https://www.anandtech.com/show/14514/examining-intels-ice-lake-microarchitecture-and-sunny-cove/3

That's half the throughput of AVX-512 on their Xeon SKUs. This is evident on benchmarks that focus on that support. Client supports the operations, but with less throughput.


1. AVX512 occupies port 1 when executed in port 0.
2. AVX512 does not exist in port 5 when in client parts. Port 5 in client is AVX256, except when it is not exact behavior below;
Exact behavior in the field:
Port 0 -> Port 1 = AVX512 MUL or ADD or FMA
Port 5 -> Port 1 = AVX512 MISC (adds can go this way for speed.)
Two port execution:
Skylake-X: P0+P1
Palmcove/Cannonlake: P0+P1
Sunnycove/Icelake: P0+P1
Willowcove/Tigerlake: P0+P1
Goldencove/Alderlake: P0+P1 or P1+P5 depending on the ZMM instruction.

Port 0 and Port 1 is the behavior that AMD is using. Which is cracked execution.

AMD Zen4:
Port 0+1 = Multiply AVX512
Port 2+3 = Addition AVX512 (FMA: 0+1 bridge to 2+3; 0->2 + 1->3 simultaneously)
Port 4+5 = Misc AVX512

AMD Zen4c:
Port 0 = Half-rate Multiply AVX512
Port 2 = Half-rate Addition AVX512
Port 4 = Half-rate Misc AVX512

Neoverse N1/N2/AmpereOne (Zen4c is against this) versus Neoverse V1 (Zen4 is against this)
 
Last edited:
Reactions: igor_kavinski

nicalandia

Diamond Member
Jan 10, 2019
3,331
5,282
136
Latest Cinebench R23 ST Run on an ES sample. Cinebench can only detect 256 Threads, but this is a 384T CPU. If CBR23 could take advantage of all cores, we might be looking past 160,000 points.



ST: 1300 Points at 3.7 Ghz
MT(256T)110,000 Points at 2.15 Ghz
 
Last edited:

Vattila

Senior member
Oct 22, 2004
805
1,394
136
Not quite what i wanted, but my point was in principle about more focus on aesthetics and you seem to get it, as your designs looks significantly better and less busy than the original. Good work.

Thanks. Feel free to make your own — would be cool to see what you had in mind. I think there is a lot of possibilities for unexplored creativity on the IHS. Check out the Clark and Su editions in my edited post.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |