Ryzen: Strictly technical

Page 67 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.
Status
Not open for further replies.

Ajay

Lifer
Jan 8, 2001
16,094
8,106
136
There's a limited number of pins on AM4. Future products will likely have the same integrated I/O capabilities. Eight cores on AM4 (what is really a mainstream platform) I think is plenty for the next several years. All AMD needs to do is commit to IPC increases while maintaining the same or less power. Any clock improvements would only be a bonus. That seems to be AMD's forward focus is IPC improvements.
Wouldn't it make sense for AMD to modify the CCXs @ 7nm to,say, 6 cores each?
It would certainly be a large boost for their sever line (+50% on cores alone).
 

itsmydamnation

Platinum Member
Feb 6, 2011
2,864
3,417
136
Wouldn't it make sense for AMD to modify the CCXs @ 7nm to,say, 6 cores each?
It would certainly be a large boost for their sever line (+50% on cores alone).
Yes it does,

Also by increasing the CCX size, cache coherency and memory inter connect complexity stays the same. CCX's keep track of their own L1/L2's. Memory controllers/cache directory keep track of what memory is currently in each CCX.

It also looks like intel will bring 6 cores to laptop ranges as well, so amd will want to "keep up" there as well.
 

Ajay

Lifer
Jan 8, 2001
16,094
8,106
136
Yes it does,

Also by increasing the CCX size, cache coherency and memory inter connect complexity stays the same. CCX's keep track of their own L1/L2's. Memory controllers/cache directory keep track of what memory is currently in each CCX.

It also looks like intel will bring 6 cores to laptop ranges as well, so amd will want to "keep up" there as well.

Yes, because then the 7nm follow on APU can have 'up to' six cores with just one CCX - perfect, IMHO. We'll just have to wait and see.
 
Reactions: CatMerc

Jan Olšan

Senior member
Jan 12, 2017
312
403
136
I don't expect that. That would raise the number of "virtual sockets" from 8 to 12, if you used 2S motherboard. Which isn't trivial, if you look at past Opterons, then the first MCM generation (2-die) and now Naples (4-die), the maximum number of chips possible in a single system has always stayed the same - eight, or equivalent of 8S platform back when they used single-chips. With 2-die (Magny Cours), they had to cut number of sockets to 4S, now Naples (4-die) only offers 2S.

From that it is clear that raising the number of chips would be a problem. They would need to redo their logic so that each MCM is "virtually" just one socket again and not 4 like it is now.
 
Reactions: Space Tyrant

maddie

Diamond Member
Jul 18, 2010
4,787
4,771
136
Isn't the benefits of fabbing tiny die on 7nm late 2018 greater than increasing core #? From what we see, there are many more masking operations needed for 7nm than 14nm, at least before EUV is used. One would imagine defect management will be an issue. I think the greatest possibility is staying with a 4 core enhanced IPC CCX and pushing the clocks.

A ~ (100-110)mm^2 Zen2 8 core @ 4.5-5.0+ GHz would be a very attractive product.
 
Reactions: beginner99

moinmoin

Diamond Member
Jun 1, 2017
4,994
7,765
136
I don't expect that. That would raise the number of "virtual sockets" from 8 to 12, if you used 2S motherboard. Which isn't trivial, if you look at past Opterons, then the first MCM generation (2-die) and now Naples (4-die), the maximum number of chips possible in a single system has always stayed the same - eight, or equivalent of 8S platform back when they used single-chips. With 2-die (Magny Cours), they had to cut number of sockets to 4S, now Naples (4-die) only offers 2S.

From that it is clear that raising the number of chips would be a problem. They would need to redo their logic so that each MCM is "virtually" just one socket again and not 4 like it is now.
All good points. But going by the roadmap you linked AMD will have separate dies for 1x dualcore CCX and 1x quadcore CCX at the low end and for use in APUs as well as 2x quadcore CCX and as you wrote 3x quadcore CCX. Considering how well AMD does off a single die atm I'm not so sure adding more dies to the portfolio is a big priority versus improving IF/MCM scalability.
 

Jan Olšan

Senior member
Jan 12, 2017
312
403
136
Existence of the dualcore is contrary to the "they will want everything to have the same CCX", that is true. But that said, having to develop a 6core, 4core AND 2core CCX is bigger pita/expense than 2core and 4core.
 

mtcn77

Member
Feb 25, 2017
105
22
91
While true that 40% clock speed uplift would entail 3.5>4.5GHz Ryzen 7's, I think the true benefit is again from the process scalability in bringing forth 24-core Rippers at the same power envelope as current Ryzen's.
 

Jan Olšan

Senior member
Jan 12, 2017
312
403
136
Does anybody know the status of the Zen bug that leads to segfaults in GCC compilations under Linux?
I saw a claim that it is going to be fixed by microcode update, but with no source to base the claim on.

Finally, AMD shareholders that were concerned about previous reports of problems with Ryzen chip performance on Linux systems can breathe a sigh of relief. The firm now reports those issues have seemingly been fixed via a microcode update.

https://www.benzinga.com/analyst-ra...ip-dynamics-may-favor-advanced-micro-devices-
 

Dresdenboy

Golden Member
Jul 28, 2003
1,730
554
136
citavia.blog.de

KTE

Senior member
May 26, 2016
478
130
76
Buildzoid (Actually Hardcore Overclocking) reviews the VRM on the MSI B350M Gaming Pro (cheapest B350 motherboard):
https://youtu.be/3BYAdyO9nZo

TLDW: The board is 3+2 and at realistic maximum ratings (125C, 100A, 1.42V vcore) the board will be dissipating 22W of heat through the vcore VRMs. He is pretty critical of the FETs used by MSI (which are the same PK616BA and PK632BA FETs on all their AM4 boards) because they are 1) cheap and 2) inefficient. Which is actually okay IMO on a $69.99 motherboard (price has dropped since I purchased).

BTW I wasn't able to set vcore voltage higher than 1.40V on any EFI for this board, so MSI clearly knows the board's limits and added an extra margin for safety.
MSI has always got away with close to maximum possible on the VRMs. Whatever they allow is usually clean but pushing limits that bum out pretty quickly.

Same since the Phenom.

Sent from my HTC 10 using Tapatalk
 

BradC

Junior Member
Apr 24, 2017
19
15
81
Does anybody know the status of the Zen bug that leads to segfaults in GCC compilations under Linux?
I saw a claim that it is going to be fixed by microcode update, but with no source to base the claim on.

This just keeps dragging on. Easy to reproduce fault, radio silence from AMD. I just want to swap my old FX-8350 for an 1800X, but I keep bumping into this rotten segfault.
 

moinmoin

Diamond Member
Jun 1, 2017
4,994
7,765
136
This just keeps dragging on. Easy to reproduce fault, radio silence from AMD. I just want to swap my old FX-8350 for an 1800X, but I keep bumping into this rotten segfault.
"Easy to reproduce" is kind of wrong, people are still playing hit the pot trying to reduce the test cases to the actual cause. The best effort so far appears to be this script:
https://github.com/hayamdk/ryzen_segv_test
But it appears to run fine using GCC 7.0.1 instead the currently common GCC 6.3.0 (and haven't seen any tests with the most recent GCC 6.4.0 yet). Also there are some indications that error rate improved with AGESA 1.0.0.6a.
 

french toast

Senior member
Feb 22, 2017
988
825
136
Isn't the benefits of fabbing tiny die on 7nm late 2018 greater than increasing core #? From what we see, there are many more masking operations needed for 7nm than 14nm, at least before EUV is used. One would imagine defect management will be an issue. I think the greatest possibility is staying with a 4 core enhanced IPC CCX and pushing the clocks.

A ~ (100-110)mm^2 Zen2 8 core @ 4.5-5.0+ GHz would be a very attractive product.
We are going to see the real ambition of AMD with zen2 (not Zen+ pinnacle ridge H1 2018).
Im hoping for some major improvements to uarch, AVX256, maybe double L2, 12mb L3, 6 core CCX, 4x SMT, increase in execution units making the core wider? Maybe some special low latency optimised hbm L4 cache for sever or on CCX for APU?
If that gives 15℅ IPC lift, plus an 15-20℅ increase in SMT yield and a 10℅ clock increase over pinnacle ridge for a little more power consumption that would be awesome on 7nm.
Hopefully some improvements to infinity fabric bandwidth and latency as icing on the cake.

I hope they don't go the cheap easy route by keeping 4 core CCX and same width cores to increase clock speed, as we know clocks won't go much higher than 5ghz down the road as processes are scaling better with power consumption than frequency increase, it would be short sighted to have minor IPC bump and push clocks, if they make a wider core that will scale better imo.

Obviously they would have internal modelling and projections of the right Balance, I just hope if there if a close call and a clear choice to make from keeping a moderately wide core and pushing clocks with process or going wider core and minimum clock increase they will take the latter option.
Only if it makes sense of course.
 
Last edited:

CatMerc

Golden Member
Jul 16, 2016
1,114
1,153
136
Wouldn't it make sense for AMD to modify the CCXs @ 7nm to,say, 6 cores each?
It would certainly be a large boost for their sever line (+50% on cores alone).
Starship, the 7nm Zen EPYC CPU, is 48 cores according to rumors.

Options:
1. Each CCX contains 6 cores
2. Each die contains 3 CCX's
3. Each package contains 6 dies

I think option 1 is most likely, as options 2 and 3 massively complicate inter-die and inter-CCX communication. If they want to keep one hop between each die like they have currently, and don't want to increase wiring complexity exponentially.
 

BradC

Junior Member
Apr 24, 2017
19
15
81
"Easy to reproduce" is kind of wrong, people are still playing hit the pot trying to reduce the test cases to the actual cause.

Sorry, by "easy to reproduce" I should have made clear that I hit it about 10 times a day (at least) on a staging system that I'm really like to put into production. AGESA upgrade made zero difference and I shouldn't have to fart around with different version compilers or options that my UEFI doesn't have in any case. This shouldn't happen on plain old x86 code that works on every other CPU reliably. My point was there appears to be no outward community interaction on the part of AMD and a boat load of people putting relatively reliable test cases together because the bug *is* so damn easy to hit.

Sure it's not a 10 line test case that crashes every time (yet), but AMD would have a bucket load more knowledge and instrumentation at their disposal. I would assume they'd be looking into it, it's just odd to get zero feedback at all.

I certainly could not recommend anyone put a Zen based system into a production environment until they get it sorted. I've seen other processes die the same way gcc does, it's just a lot more "random" and consequently harder to reproduce. That sort of unpredictability does not instil confidence. I'm sure they'll get it sorted, but a quick "hey we are actually really looking into this" would be helpful. So far I haven't really seen that level of engagement.
 

Jan Olšan

Senior member
Jan 12, 2017
312
403
136
Option 1 also complicates stuff, though.

The cores in one CCX are according to the block diagrams connected similarly to dies in Naples - each core has a link with each other. You can do that with 3 links per node if you have 4 cores in CCX, but if there were six, there would need to be 5 links in each of those cores - 15 paths in the CCX crossing between the cores, an awful lot of wiring. With 4 cores, you only have 6 paths and just two cross each other.
 

wildhorse2k

Member
May 12, 2017
180
83
71
If they introduce some kind of simplified mesh like Skylake-X has, then 6 core CCX makes sense. Not every core needs to talk to every core directly. But we would lose the sweet advantage of Ryzen - many core CPU that can have low latencies of 4 core CPU if threads are placed correctly.

Option 2 leads to even longer rectangular die or wasted space. Based on this I think either option 3 or 1 (if they solve the link problem well) based on your proposition.

But there is another option you haven't considered. 48 cores can also be done with 3 x 16 cores. 16 cores means 4 CCX in a single square die. That should be easy to do and will also increase performance as cross CCX latencies are expected to be better than cross die. I think this is what AMD should pursue.

With Ryzen/Threadripper/Epyc AMD had no choice and had to use the same die as they were on brink of bankrupcy. But now with funds flowing in they can design 2 CPUs - mainstream one (2 CCX Ryzen) and server one (4 CCX Threadripper/Epyc). I don't think it makes sense to keep increasing core counts in mainstream so soon. They also need 4/6 core options, so they definitely need 2 CCX version.
 
Last edited:

LightningZ71

Golden Member
Mar 10, 2017
1,659
1,944
136
Having more than two CCX units per die breaks the interconnect arrangement between individual Zen dies in an MCM and in 2P in epyc. That would require a wholesale tear up of their interconnect fabric. For platform stability purposes, Zen must keep sockets aND external interfaces the same. This means that improvements are strictly kept in the die. With that in mind, this is what we're left with:

Larger L3
Improved schema for L3
Larger L2
Lower latency L2
More cores in the CCX
Improved DDR4 controller

I don't see them re-floor planning individual cores at this point. Maybe some minor errata tweaks and a touch up on critical paths, but that's it. Now, they can refloorplan the whole uncore as they will have more area to work with in their packages with a smaller feature size. This means that there is room to resize things, like the L3. They can reshape the CCXs into a 3x2 grid, making intercore communications less wiring ugly while still expanding to 6 cores in each ccx. They can even do an 8 core ccx, but I don't think that they would be able to expand the L3 sufficiently to keep 1mb per core in the whole die.
 
Status
Not open for further replies.
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |