Discussion Speculation: Zen 4 (EPYC 4 "Genoa", Ryzen 7000, etc.)

Page 369 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Vattila

Senior member
Oct 22, 2004
805
1,394
136
Except for the details about the improvements in the microarchitecture, we now know pretty well what to expect with Zen 3.

The leaked presentation by AMD Senior Manager Martin Hilgeman shows that EPYC 3 "Milan" will, as promised and expected, reuse the current platform (SP3), and the system architecture and packaging looks to be the same, with the same 9-die chiplet design and the same maximum core and thread-count (no SMT-4, contrary to rumour). The biggest change revealed so far is the enlargement of the compute complex from 4 cores to 8 cores, all sharing a larger L3 cache ("32+ MB", likely to double to 64 MB, I think).

Hilgeman's slides did also show that EPYC 4 "Genoa" is in the definition phase (or was at the time of the presentation in September, at least), and will come with a new platform (SP5), with new memory support (likely DDR5).



What else do you think we will see with Zen 4? PCI-Express 5 support? Increased core-count? 4-way SMT? New packaging (interposer, 2.5D, 3D)? Integrated memory on package (HBM)?

Vote in the poll and share your thoughts!
 
Last edited:
Reactions: richardllewis_01

naad

Member
May 31, 2022
64
176
76
People forget not all benchmarks are either one core or "embarrassingly parallel" loads, Raptor lake has a whole 8 cores more than Raphael, any workload(plenty of those in PTS) not putting those 8 extra core to use will make the 13900k look like last gen perf compared to 7950x

tldr for workloads:

1-8 core = tie with 2-3% differences here and there
8-16 core = Raphael clearly stomps
16-24 core = mostly tie, around 2-5% difference again


IMO from the plethora of benchmarks I've seen around most of them usually stop scaling at around 10-12 cores
 

Abwx

Lifer
Apr 2, 2011
11,167
3,862
136
People forget not all benchmarks are either one core or "embarrassingly parallel" loads, Raptor lake has a whole 8 cores more than Raphael, any workload(plenty of those in PTS) not putting those 8 extra core to use will make the 13900k look like last gen perf compared to 7950x

tldr for workloads:


16-24 core = mostly tie, around 2-5% difference again

Above 8T the 2-5% difference is only at 32T.

ST Zen 4 or RPL = 100.

8T will yield 800 for both RPL and Zen 4.

ST perf of small core is 60.

9 threads yield 900 for Zen 4 and 860 for RPL
10 threads is 1000 for Zen and 920 for RPL.

16T is 1600 for Zen and 1280 for RPL.

Above 16T the Zen 4 advantage will gradually decrease and tie is at 32T, the greater advantage is hence at 16T with 25% better throughput.
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,222
136
Also we have to take into account that small E cores have half the AVX2 width so anything that is using AVX and is multithreaded up to 32 threads is going to be hampered versus 32T Raphael that has 16 "fat" cores (32T) with full width AVX(2). Also, take into account the fact that E cores are going to be running at much lower MT boost clocks VS Zen 4 cores - all in all it's no surprise why 7950X will be overall superior product.
 

Abwx

Lifer
Apr 2, 2011
11,167
3,862
136
I took account of the e cores lower frequencies but not of AVX2 throughput, all in all and on average it should be close to those numbers.
 

naad

Member
May 31, 2022
64
176
76
The capped memory bandwidth kinda worries me for EPYC, but I guess anyone needing max bandwidth will be using NPS4 anyway, plus 12 channels is a lot more than the 8 the competition has
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,222
136
The capped memory bandwidth kinda worries me for EPYC, but I guess anyone needing max bandwidth will be using NPS4 anyway, plus 12 channels is a lot more than the 8 the competition has
Well, memory BW is capped because the chart shows the same DDR5 6000 numbers, other variables are not important for it.
 
Reactions: lightmanek

PJVol

Senior member
May 25, 2020
622
556
136
What is this "the competition" language, anything like this ?
"Imagine, the task force guy says, telling a passenger on arrival that a dildo
kept her baggage on the East Coast. Sometimes it's even a man. It's airline
policy not to imply ownership in the event of a dildo. Use the indefinite
article.
A dildo.
Never your dildo.
Never, ever say the dildo accidentally turned itself on.
A dildo activated itself and created an emergency situation that required
evacuating your baggage"
(c) 1996 Chuck Palahniuk
 

Kaluan

Senior member
Jan 4, 2022
503
1,074
106
The capped memory bandwidth kinda worries me for EPYC, but I guess anyone needing max bandwidth will be using NPS4 anyway, plus 12 channels is a lot more than the 8 the competition has
Capped? What do you mean? That's not really a memory scaling chart, it's a (memory in relation to) FCLK scaling one.

If anything it just means FCLK = 2000 is enough for optimum use of 6000MTs DDR5... as we've already kinda known.

6400MTs (or more) + going beyond 2000MHz FCLK might be a different story. ("golden sample" talk aside).

Also Raphael IOD/IMC =/= Genoa IOD/IMC, I wouldn't bother speculating too much based on one for the other.

As some of the leakers and speculations said, people are overestimating how memory sensitive Zen4 is.
 

naad

Member
May 31, 2022
64
176
76
Capped? What do you mean? That's not really a memory scaling chart, it's a (memory in relation to) FCLK scaling one.

If anything it just means FCLK = 2000 is enough for optimum use of 6000MTs DDR5... as we've already kinda known.

6400MTs (or more) + going beyond 2000MHz FCLK might be a different story. ("golden sample" talk aside).

Also Raphael IOD/IMC =/= Genoa IOD/IMC, I wouldn't bother speculating too much based on one for the other.

As some of the leakers and speculations said, people are overestimating how memory sensitive Zen4 is.

My mistake, didn't look properly, didn't see it was fclock scaling, sorry @inf64 as well
 

Det0x

Golden Member
Sep 11, 2014
1,057
3,092
136
AMD Zen 4 Memory Latency / Bandwidth

Seems to be diminishing returns above 2000 FCLK with static 1:1 UCLK 3000mhz and MCLK 6000MT/s

Now we only need to find out how high we can run the UCLK and MCLK 1:1 with locked FCLK at 2000mhz
View attachment 67751

It's actually not diminishing returns, the graph is simply not using a consistent scale for the X-axis

Here's a corrected graph using the data points included. Note the gaping hole at 1900 MHz.
Looks close to linear for latency, while read and copy might just be logarithmic
 
Last edited:

PJVol

Senior member
May 25, 2020
622
556
136
Looks like Zen3/4 six-core didn't fit )
Just curious of 7600X uplift, my 5600x scored 178 and 190, stock and tuned in 3.3, though still impressive for the 8-core parts and even more so for 16.
 
Last edited:

Antey

Member
Jul 4, 2019
105
153
116
Remember that time when this guy richie rich was all over here with his IPC/PPC comparisons using just geekbench and such and you all hated him? well, i think it's interesting how different the situation is nowadays in that matter.

(1) Info - TOP 20 of the World's Most Powerful CPU Cores - IPC/PPC comparison | AnandTech Forums: Technology, Hardware, Software, and Deals

He estimated that the A14 would have 558 PPC and it tourned out to be close to 530, and it's funny because he estimated that the A15 would have 640 PPC! and it was once again 530. And now, the A16, even though this CPU isn't part of his estimations, but it's again, 540-545!

Four iterations (A13, A14, A15 and A16) and the apple big cores have increased their (geekbench) ''IPC'' just 8%.

And in the x86 world the Zen 4 now has 395-400 PPC! That's just 36-38% more ''IPC'' for the ARM world, and what about the X3? wasn't it 11% higher IPC than the previous core? what is it going to be? 460 PPC? a mere 15% better than their x86 counterparts?

at this rate...

Just noting that
 

DrMrLordX

Lifer
Apr 27, 2000
21,805
11,159
136
Remember that time when this guy richie rich was all over here with his IPC/PPC comparisons using just geekbench and such and you all hated him? well, i think it's interesting how different the situation is nowadays in that matter.

It's really not worth stirring up that nonsense again. Everyone can see now what Apple has in mind for desktop/workstation, despite the fact that the performance of their cores on actual applications (rather than GB5) is somewhat muddied by the extensive use of fixed-function hardware. In any case, AMD is not worried about any such PPC comparisons, for good reason.
 

Abwx

Lifer
Apr 2, 2011
11,167
3,862
136

Asterox

Golden Member
May 15, 2012
1,028
1,786
136
As expected considering the global situation,..................................huh

 

Kaluan

Senior member
Jan 4, 2022
503
1,074
106
Looks like Zen3/4 six-core didn't fit )
Just curious of 7600X uplift, my 5600x scored 178 and 190, stock and tuned in 3.3, though still impressive for the 8-core parts and even more so for 16.
It did ~229 (so should've been at the bottom), but apparently not in that version.
I wouldn't put much weight in comparison there, particularly across the different versions of Blender. I think they had runs in one of the versions were 4700G seemed faster than 5800X.
Anyway the general idea seems to be that 7600X is slightly faster than 5800X and 12600K in productivity (which also means > efficiency than both, but unclear if it can muster better efficiency than 5600X's 76W @ typical load).

My mistake, didn't look properly, didn't see it was fclock scaling, sorry @inf64 as well
Hehe, no worries mate. I was pretty confused of what the chart was actually showing at first too.
 
Last edited:

Det0x

Golden Member
Sep 11, 2014
1,057
3,092
136

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
25,741
14,773
136
Sadly its a papir weight without supporting motherboard.
I think and hope there will be good amounts of stock this Ryzen release..
Lets hope so... I have memory, case, PSU HSF(AIO) and video card all ready to go. I just need CPU, motherboard and NVME. Waiting on that hoping for PCIE 5.0 compatible. If not PCIE 4.0 will do.
 
Reactions: Tlh97 and Kaluan
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |