Discussion Speculation: Zen 4 (EPYC 4 "Genoa", Ryzen 7000, etc.)

Page 431 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Vattila

Senior member
Oct 22, 2004
805
1,394
136
Except for the details about the improvements in the microarchitecture, we now know pretty well what to expect with Zen 3.

The leaked presentation by AMD Senior Manager Martin Hilgeman shows that EPYC 3 "Milan" will, as promised and expected, reuse the current platform (SP3), and the system architecture and packaging looks to be the same, with the same 9-die chiplet design and the same maximum core and thread-count (no SMT-4, contrary to rumour). The biggest change revealed so far is the enlargement of the compute complex from 4 cores to 8 cores, all sharing a larger L3 cache ("32+ MB", likely to double to 64 MB, I think).

Hilgeman's slides did also show that EPYC 4 "Genoa" is in the definition phase (or was at the time of the presentation in September, at least), and will come with a new platform (SP5), with new memory support (likely DDR5).



What else do you think we will see with Zen 4? PCI-Express 5 support? Increased core-count? 4-way SMT? New packaging (interposer, 2.5D, 3D)? Integrated memory on package (HBM)?

Vote in the poll and share your thoughts!
 
Last edited:
Reactions: richardllewis_01

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
25,751
14,781
136
@eek2121
Well, you need to take performance into account as well. If the AVX test is much faster even downclocked, then total energy consumption for a given workload very likely is smaller. The main reason for the behaviour should be that the occupancy of execution units is much higher with AVX.
I know nobody else wants to hear about DC performance (max utilization of all cores 24/7) but if you really want to know, check the BOINC log files in regular and avx-512 tasks. The 7950x in both categories just blows away the 13900k. In points, even the 64 core EPYCs with the current project. Now if I had a 7773x maybe that would all change, but I don't have the money for that. But it does beat the 7763 Milans I have.
 
Reactions: Tlh97 and Mopetar

mmaenpaa

Member
Aug 4, 2009
82
145
106
@Det0x
Your numbers are a bit better as the ones from CnC. But they again show quite some lack of bandwidth compared to RTL.

@mmaenpaa
So you have this shiny new 7700X. Might you give the Micro Bench posted by @Det0x a try? 😉 I am really interested how a single CCD ZEN4 might turn out bandwidth wise.

System is at Asrock defaults except EXPO @6000. Windows 11 Pro latest as of today. It seems I need to upgarde bios to 1.11.AS03 (beta) which enchances S3 mode (=corp speak, it may now work 🤣)

 

Det0x

Golden Member
Sep 11, 2014
1,061
3,105
136
Interesting. Now that I have my new system built, I've been doing some Aida64 runs and one thing I noticed is that my memory latency scores are higher than what I thought they would be using DDR5 7400. The lowest latency score I've had so far is 61.7ns. This version I am using is a more recent build so they cannot truly be compared, but I definitely think that AMD has the advantage in memory latency due to being able to use Gear 1 with DDR5. If whoever did that run with CL30 DDR5 6000 memory, he would have gotten into the 5xns range for sure.

But Zen 4's read, write and copy scores leave much to be desired to be honest. Not sure what's up with that.
This is how aida64 looks for my maxed out 7950x
(dont know where nicalandia got that other aida64 screenshot, but it's certainly not my system)


System is at Asrock defaults except EXPO @6000. Windows 11 Pro latest as of today. It seems I need to upgarde bios to 1.11.AS03 (beta) which enchances S3 mode (=corp speak, it may now work 🤣)

View attachment 70711View attachment 70712View attachment 70713
Wow thats very low latency, have to give single CCD a try when i get home later today

That is 58.5ns @ ~65mb (not max value that's ~1gigabyte)
 
Last edited:

Carfax83

Diamond Member
Nov 1, 2010
6,841
1,536
136
This is how aida64 looks for my maxed out 7950x
(dont know where nicalandia got that other aida64 screenshot, but it's certainly not my system)
View attachment 70714

Now that is definitely some tweaked memory! What voltage are you using to get the timings so low? And yeah, that latency is extremely low to the point that I doubt anything other than a massively tweaked and overclocked Raptor Lake CPU could match it.

BTW, could you also show your cache benchmarks, especially the L3 ones.

Here's one of mine at 7600mhz:

 
Reactions: Tlh97 and ZGR

Det0x

Golden Member
Sep 11, 2014
1,061
3,105
136
Now that is definitely some tweaked memory! What voltage are you using to get the timings so low? And yeah, that latency is extremely low to the point that I doubt anything other than a massively tweaked and overclocked Raptor Lake CPU could match it.

BTW, could you also show your cache benchmarks, especially the L3 ones.

Here's one of mine at 7600mhz:

Here is one of my older screenshots
6200MT/s 2200mhz FCLK.
(my new settings is 6400MT/s 2133mhz FCLK)

Running 1.53 vdimm with custom cooling on memory

Btw we have a competition thread going over at overclock.net with this new memory benchmark:

New Memory Bandwidth & Latency Competition

I will post my 5800x3d and 7950x with single and dual CCD numbers in that benchmarkthread when i get home from work later today
Would be interesting if you guys here could share your numbers over there also.
I'm especially keen on seeing bandwidth numbers from one of @Markfw's epic systems if possible.. Currently there is a dual socket Xeon laying waste in the bandwidth department over there
 

Carfax83

Diamond Member
Nov 1, 2010
6,841
1,536
136
Here is one of my older screenshots
6200MT/s 2200mhz FCLK.
(my new settings is 6400MT/s 2133mhz FCLK)
View attachment 70715
Running 1.53 vdimm with custom cooling on memory

You have to have a lot of knowledge to tweak at such a high level so my hat is off to you. And AMD deserves credit for how brutally fast their L3 cache is with Zen 4.

My own score is somewhat depressed because I have mine underclocked and undervolted. I'm done with balls to the wall overclocking and tweaking though. No longer have the time or patience. I just want things nice and easy these days

Btw we have a competition thread going over at overclock.net with this new memory benchmark:

New Memory Bandwidth & Latency Competition
https://www.overclock.net/threads/new-memory-bandwidth-latency-competition.1801933/#post-29068256

Nothing but hardcore dudes over at Overclock.net to be sure. I used to have a membership over there a long time ago but if my membership is still valid I don't have a clue what my password would have been.
 

DisEnchantment

Golden Member
Mar 3, 2017
1,687
6,243
136
I'm especially keen on seeing bandwidth numbers from one of @Markfw's epic systems if possible
Time for @Markfw to step up to dual socket Genoa X and dual socket SH5 APUs(if they are at all dual socket). SH5 APUs will have V Cache + HBM.
These big boys are only for folks going for the ultimate epeen with the most demanding loads.
 

BorisTheBlade82

Senior member
May 1, 2020
667
1,022
136
System is at Asrock defaults except EXPO @6000. Windows 11 Pro latest as of today. It seems I need to upgarde bios to 1.11.AS03 (beta) which enchances S3 mode (=corp speak, it may now work 🤣)

View attachment 70711View attachment 70712View attachment 70713
Thanks a lot. That pretty much confirms that single CCD SKUs are severely bandwidth limited through IFoP (not that this would be a big concern for Games and most other workloads).
Could you do me a favour and test the write bandwidth in isolation as well?
 

nicalandia

Diamond Member
Jan 10, 2019
3,331
5,282
136
I will post my 5800x3d and 7950x with single and dual CCD numbers in that benchmarkthread when i get home from work later today
It would be interesting to know the latency numbers for the 5800X3D and compare it with 5800X and see the latency jump on the 5800X when it runs out of 32MiB L3 and jumps to System RAM while the 5800X3D keeps those low until it runs out of L3 at the 96MiB mark
 

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
25,751
14,781
136
Here is one of my older screenshots
6200MT/s 2200mhz FCLK.
(my new settings is 6400MT/s 2133mhz FCLK)
View attachment 70715
Running 1.53 vdimm with custom cooling on memory

Btw we have a competition thread going over at overclock.net with this new memory benchmark:

New Memory Bandwidth & Latency Competition

I will post my 5800x3d and 7950x with single and dual CCD numbers in that benchmarkthread when i get home from work later today
Would be interesting if you guys here could share your numbers over there also.
I'm especially keen on seeing bandwidth numbers from one of @Markfw's epic systems if possible.. Currently there is a dual socket Xeon laying waste in the bandwidth department over there
They run linux. They will not even start to load win 10 with 256 threads. Do you have a suggestion of what to run ? I can maybe later today. The dual Milan system would probably be the fastest.
 

Det0x

Golden Member
Sep 11, 2014
1,061
3,105
136
5800x3d @ 1800:3600 XMP timings

Latency = 78.82ns


Dram read bandwidth = 37.89 GB/s


7950x single CCD (8cores only) "6200MT/s" 2133 FCLK

Latency = 71.22ns


Dram read bandwidth = 67.70 GB/s


As a side note, it looks like the 5800x3d gets much of its gaming prowess from around the 100mb datasize..
Most games must use that data size.. (?)

V-cache difference is showing from 65mb upto 262mb
(5800x3d vs my "8core" Zen4 below)

Latency:
vs

Bandwidth:
vs
 
Last edited:

nicalandia

Diamond Member
Jan 10, 2019
3,331
5,282
136
5800x3d @ 1800:3600 XMP timings


Main v-cache difference is from 65mb upto 262mb

Thanks for sharing those numbers, but I was more interested on the L3 access latency numbers for Zen3X3D vs Zen4 and where most of the gaming gains come from.

Here are the Results Highlighted.

5800X3D


One CCD 7950X




Look at the Latency numbers at 1024 KB mark. Thanks to Zen4 larger L2$ it's just 3.78ns that right there is a huge help on games and general purpose apps(Zen3X3D is 10.75ns)

No lets move to the 98304 KB On Zen4 it's 52.28ns and on Zen3X3D is 26.22ns That has a huge impact on many games(not all of them of course)
 

nicalandia

Diamond Member
Jan 10, 2019
3,331
5,282
136


Against SPR-SP will be even more area efficient since Ice lake xeons are 6.5 mm^2, Server grade Golden Cove are about 8 mm^2


This is the AMD EPYC 9654P which is a Single Socket CPU

 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |