Speculation: Ryzen 3000 series

Page 69 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

naukkis

Senior member
Jun 5, 2002
779
636
136
Signals travel really fast in such a short space. Propagation delay will be really small by putting IO on a separate die. The buffering logic will probably be the main contributor of signal delay. I'd be surprised if it added even close to 30 ns of latency.

2700x random access latency is ~83ns with 2666, so maybe 7ns more. But that es sample is with single-channel memory - random access latency should be a bit lower with dual-channel configuration - and we don't know memory timings which might be still suboptimal with ES.
 

dnavas

Senior member
Feb 25, 2017
355
190
116
I'm not as familiar with the inner workings of IF and such, how much serializing are they doing from chiplet to IO, does anyone know? Is it fully parallel?

That's why I used the inter-CCX numbers for Ryzen7. All things being equal, you're not going to get better than on-chip, and lo and behold, we did, so, all things aren't equal, they're a lot better. I was more curious why someone thought those numbers looked fishy -- lack of rise between cross-CCX L3 access and memory maybe? Wasn't sure....

2700x random access latency is ~83ns with 2666, so maybe 7ns more.

Note: I was factoring in some improvements to memory access with Zen2, thus my 60/30 split :shrug:
 

DrMrLordX

Lifer
Apr 27, 2000
21,805
11,159
136
Does it? Cross-CCX latency was 120ns-ish on the 1800X, so if the separate IO die is only adding (say) 30ns to RAM access, that actually sounds on the better side of what could have been. Clearly the separate I/O die is going to add latency, that isn't a surprise....

The memory controller was already separated from the CCXs by an IF link on Zen/Zen+ so the propagation delay to a IF-connected I/O die shouldn't be that much different.

2700x random access latency is ~83ns with 2666, so maybe 7ns more. But that es sample is with single-channel memory - random access latency should be a bit lower with dual-channel configuration - and we don't know memory timings which might be still suboptimal with ES.

Right, we don't know the timings. Subtimings do matter. I have learned the hard way that if you raise the wrong one, memory performance can go straight to hell.

If Zen2 is still 2CCX design it has 2 L3 caches like Zen1, and memory access latency rises after 16MB L3 chunk like it does after 8MB with Zen1.

That's possible, or it's 16MB per chiplet, and inter-chiplet latency is killing it.
 

Topweasel

Diamond Member
Oct 19, 2000
5,436
1,655
136
I'm not sure it is following the same pattern.
That TR graph shows clearly that there's a difference between going across CCX than going to memory. It goes cross CCX between 8MB and 16MB, then to memory beyond that.
The ES goes straight from 16MB L3 to what looks like memory by the point it hits 32MB. I would be expecting the big jump to be after 32MB if there was 32MB L3, but we see that it is already taking that hit at 32MB.
No idea why it then drops off slightly beyond that though. Seems unusual.

But it evens out with memory when it goes to the second chip. Seems like the 7ns jump at 16mb replaces the old 80ns jump when switching CCX's and the big jump at 32 is because the software tricks itself out and thats really because at that size it's at the intersection of on die and off die.
 

Timorous

Golden Member
Oct 27, 2008
1,727
3,152
136
2700x random access latency is ~83ns with 2666, so maybe 7ns more. But that es sample is with single-channel memory - random access latency should be a bit lower with dual-channel configuration - and we don't know memory timings which might be still suboptimal with ES.

As the clockspeed and memory speed are so low it is entirely possible the IO die is not running at full speed either and we all know memory latency improves as IF speed increases.
 

coercitiv

Diamond Member
Jan 24, 2014
6,394
12,826
136
2700x random access latency is ~83ns with 2666, so maybe 7ns more. But that es sample is with single-channel memory - random access latency should be a bit lower with dual-channel configuration - and we don't know memory timings which might be still suboptimal with ES.
For reference, I ran the test on a 1600X with 2133 CL15 RAM in single-channel, and the result was 93.8ns.
 

Grumpy Cat

Junior Member
Jan 25, 2019
3
4
41
Is everyone expecting the 8 core Ryzen 3000 to be a single chiplet like Lisa Su held up?
I'm thinking that 2 four core chiplets might be faster.
With only a single chiplet isn't half the I/O chip to chiplet bandwidth being wasted.
I think there is a reason we don't know the cache, memory configuration. Remember Rome isn't a NUMA design.
 
Reactions: Olikan

DrMrLordX

Lifer
Apr 27, 2000
21,805
11,159
136
Is everyone expecting the 8 core Ryzen 3000 to be a single chiplet like Lisa Su held up?

I expect there to be an SKU that matches the ES they demonstrated, which presumably was one chiplet plus an I/O die.

that 2 four core chiplets might be faster.
With only a single chiplet isn't half the I/O chip to chiplet bandwidth being wasted.

Uh? So far as I can tell, the I/O die just has two external IF links. Leaving one of those IF "dead" is meaningless. You don't want two 4c chiplets anyway since you'll create the same (or worse) thread migration/L3 cache snoop penalties we see on Zen/Zen+. At least in a Zen CCX, we have a direct IF crossbar between CCXs (one hop). In Zen2, there will be two hops from one chiplet to the other.
 

Tuna-Fish

Golden Member
Mar 4, 2011
1,422
1,759
136
Regarding that benchmark, and the latency graph, the various articles are suggesting that the steep rise in latency between 16MB and 32MB indicates 32MB L3 cache, but if the rise is between the two then that must indicate that its 16MB L3, not 32MB, surely? What am I missing?

The current assumption is that each chiplet consist of two CCX, and each CCX has 4 cores and 16MB of cache. So a 12-core chip would have one core of each CCX disabled for 4x3core CCX and each CCX with 16 MB of cache. Since the L3 is a victim cache to it's "own" CCX, no other CCX can benefit from it.
 

IEC

Elite Member
Super Moderator
Jun 10, 2004
14,359
5,017
136
I expect there to be an SKU that matches the ES they demonstrated, which presumably was one chiplet plus an I/O die.

Uh? So far as I can tell, the I/O die just has two external IF links. Leaving one of those IF "dead" is meaningless. You don't want two 4c chiplets anyway since you'll create the same (or worse) thread migration/L3 cache snoop penalties we see on Zen/Zen+. At least in a Zen CCX, we have a direct IF crossbar between CCXs (one hop). In Zen2, there will be two hops from one chiplet to the other.

Agreed. It's also why I suspect the single chiplet models will be best for *gaming*, while those of us who want more cores will certainly be interested in 16c/32t models.
 

Timorous

Golden Member
Oct 27, 2008
1,727
3,152
136
I was posting in a console speculation thread else where after reading a user suggest the console CPU might have less cache and it got me thinking.

What if single chiplet designs only have half of the L3 cache and multi chiplet designs use all of it. It might be a way to upsell the 12c and 16c models as well as another way to make use of partially defective chips.
 

Kedas

Senior member
Dec 6, 2018
355
339
136
Agreed. It's also why I suspect the single chiplet models will be best for *gaming*, while those of us who want more cores will certainly be interested in 16c/32t models.
Well I assume there will also be some 'gaming mode' for >8 core versions like they have now.
 

PPB

Golden Member
Jul 5, 2013
1,118
168
106
Two 4-core chiplets on the 8-core sounds ok. They can save the 8-core chiplets for Threadripper and Epyc until yields improve more. I think 16-core desktop will come later.
What makes you think the yields of an 80mm2 7nm chip are low?

People here underestimate the enormous benefit AMD has found in doing your uncore on your n-1 node: you largely increase yields on your important bit, the n node core die.
 

Mockingbird

Senior member
Feb 12, 2017
733
741
106
I am sick of you calling this fake tv. The guy got a few things wrong last video. That does not make it fake. He got some right also.
Shut the hell up !

Aside from braking the forums rules (http://www.portvapes.co.uk/?id=Latest-exam-1Z0-876-Dumps&exid=threads/anandtech-forum-guidelines.60552/), you just disregarded you own moderator request.


If you have an issue with moderation,
you create a post in moderator discussions.
You do NOT do it here.


AT Mod Usandthem
 
Last edited by a moderator:
Reactions: Grazick
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |