Speculation: AMD's 7nm processors will all be APUs

Page 2 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Vattila

Senior member
Oct 22, 2004
805
1,394
136
I suppose they could build [GPU-less] parts from salvaged dies where the GPU is defective.

That's a very good point. This is what AMD has done before. With all the production using the same die, there may be just enough dies with a defective GCX to satisfy the small minority of the HEDT market that absolutely do not want (to pay for) it.
 

Vattila

Senior member
Oct 22, 2004
805
1,394
136
makes me wonder if you could integrate something like that into the CCX itself.

Having a separate IP building block — my hypothetical GCX — seems more flexible and scalable. This way it can be mixed and matched with CCXs, e.g. for customers in the semi-custom segment. Integrating the graphics into the CCX makes it less reusable, in particular for scalable discrete graphics where you want to replicate many graphics building blocks (GCXs) with no CPUs.
 

firewolfsm

Golden Member
Oct 16, 2005
1,848
29
91
HBM on a processor would act as a massive L4 cache and buffer between ram. It could be a real game changer.
 

itsmydamnation

Platinum Member
Feb 6, 2011
2,866
3,418
136
HBM on a processor would act as a massive L4 cache and buffer between ram. It could be a real game changer.
HBM latency isn't any better then DDR, so all it would be is a bandwidth multiplier, but that isn't free on the latency side. For your typical latency sensitive workloads HBM likely adds nothing and will infact decrease performance.
 
Reactions: ao_ika_red

DrMrLordX

Lifer
Apr 27, 2000
21,807
11,161
136
The reason to include an HBM2 buffer would be for the iGPU. It would not do a whole lot to help CPU performance.
 
Reactions: DarthKyrie

Ancalagon44

Diamond Member
Feb 17, 2010
3,274
202
106
It is an interesting theory because the 48 core count is a little odd. I don't find the idea of AMD going to 6 cores per CCX particularly compelling.

But, this would mean that for every single CPU sold in the server, threadripper and high end desktop markets, AMD will be wasting die space. So, I'm not sure it is really worth it from AMD's point of view. They could easily get by with what they are doing now - make one die for server, threadripper and desktop, and one APU die for mobile and low end desktop.

The only case where I think they would consolidate that into a single die is if the savings were massive. But, considering the volumes involved in either having one die or multiple, I just can't see the savings being that big.
 

Qwertilot

Golden Member
Nov 28, 2013
1,604
257
126
What they really could do with doing is getting the time gap between the server CPU and the APU much smaller next time. Presumably the extra money, more established everything etc should help hugely with that.
 

PeterScott

Platinum Member
Jul 7, 2017
2,605
1,540
136
What they really could do with doing is getting the time gap between the server CPU and the APU much smaller next time. Presumably the extra money, more established everything etc should help hugely with that.

It looks like that gap continues in 2019. It makes sense that they build a standalone CPU first (and for GPU upgrades alone as well) to work out all the bugs alone before integrating them. In 2019 Desktop gets Zen 2 cores, APU just gets Raven Ridge (Zen 1) on a new process:

 

el etro

Golden Member
Jul 21, 2013
1,581
14
81
It could reuse 704SPs Vega IGPs for higher-end models(Ryzen 3, 5 and 7), but is just my wish...
 

LightningZ71

Golden Member
Mar 10, 2017
1,661
1,945
136
My thoughts:

The initial rollout is three 4-core CCX modules on a flexible die for use in TR, Epyc and standard desktop Ryzen with no GCX.

There will follow an APU that is two 4-core CCX modules with a GCX. This replaces RR.

Eventually, there will be a power and cost reduced die with one 4-core CCX and a scaled down GCX for the small form factor, low power market.

The major enabling factors for this are that shrinking the existing footprint or expanding it for non integrated Doc applications will incur extra cost. Keep the die size and interface grid the same and you keep generational costs down. Instead, use the extra die space carefully, like in expanding caches, maybe adding an L4 cache if there is indeed extra die area left over.

With the shrink, it allows the APU to gain a CCX and live in the same footprint as 14/12 nm RR, which puts it on par with i7 and below to cover all of Intel's normal desktop line. The HEDT line doesn't really need an iGPU. Neither will 12 core ryzen.

If AMD needs a high end part with GCXs, they can use a pair of APU dies in a TR version that has 16 cores and a pair of iGPUs with a mountain of bandwidth.
 
Reactions: Lodix

PeterScott

Platinum Member
Jul 7, 2017
2,605
1,540
136
I'll ask for the third time, is there any source for this slide other than informatica cero (that huge watermark)?

Not that I have seen.

This is obviously not a slide for public consumption, so you can't really expect multiple leaks of it.

I doubt it's a fake. Usually on a fake, people will put it some real juicy details, that are not in this one.
 

Topweasel

Diamond Member
Oct 19, 2000
5,436
1,655
136
Yes, except it wont be useless in 90% of the target market.

To fully compete with Intel, AMD needs a die in the bulk of the market that requires integrated graphics. The hypothetical die provides that. Whether AMD finds it cost-effective to develop a smaller separate die for the high volume low-end market is a good question. If so, it will of course include graphics. Whether AMD develops a dedicated die for the low volume HEDT market is a bad question. And whether it is cost-effective to develop a separate die for the server market is also doubtful. Compared to server ASP, the extra manufacturing cost of including a GCX is miniscule, and server volume is miniscule compared to the mainstream.

Also, the GCX is a compute feature in itself that, with the right software, can be put to good use in both the HEDT and the server markets. See my earlier replies on HSA in this thread.

But, most important, consider the benefit of cost and time-to-market with having only one die to develop, qualify and manufacture across all processor SKUs. Note the increasing cost for mask sets on 7nm. Note the current long delay between Ryzen and Ryzen Mobile using separate dies on 14nm.

I think people miss what the market actually is and how AMD is trying to impact that.

90% of the users need igpu, but not 8 cores. The existence of Ryzen is about having a die that they can use in as many markets as possible but allow them to create an awesome server chip. In that sense Ryzen is AMD's enthusiast CPU built on server chip. Nothing more nothing less. We think of $200-$300 CPU's as general user chips because of the impact of going below that with i3's. But the reality is this is enthusiast pricing, Ryzen is an enthusiast CPU not a general user CPU. In the few circumstances where an 8 core CPU is needed, throwing a $40 GPU into the computer would mean nothing. But people don't buy $1000+ computers for home they buy the $500-$700 systems.

So APU (4c+GPU) is Mainstream.
Ryzen is Enthusiast
ThreadRipper is HEDT
EPYC is Server

It will stay that way because AMD has little to gain for the incredibly small market of 6c+ req system's that would be greatly impacted price wise by including a cheap discrete graphics card. Specially if it in anyways negatively affects their ability to make a competitive server CPU.

This does take the sheen off the 1600 and 1600x with Coffee Lake but future R5's can be 8c and 10c cpu's in the future. It also with the APU's Intel having an iGPU 6 core is going to be weird with AMD still only offering 4c APU's. But the GPU's will be loads better and we don't know how Intel is going to handle mobile CPU's with CoffeeLake and future releases. It's possible CFL mobiles will still max out at 4c8t.
 

moinmoin

Diamond Member
Jun 1, 2017
4,994
7,765
136
Not that I have seen.

This is obviously not a slide for public consumption, so you can't really expect multiple leaks of it.

I doubt it's a fake. Usually on a fake, people will put it some real juicy details, that are not in this one.
It does contain a couple oddities, like a font different from the previous set of Ryzen slides, "k APU" and Bristol Ridge containing Polaris. The watermarked version has been quoted and reported everywhere, a second source without that watermark would essentially prove its validity.
 

PeterScott

Platinum Member
Jul 7, 2017
2,605
1,540
136
It does contain a couple oddities, like a font different from the previous set of Ryzen slides, "k APU" and Bristol Ridge containing Polaris. The watermarked version has been quoted and reported everywhere, a second source without that watermark would essentially prove its validity.

I agree some confirmation would be nice. We are reading tea leaves when we look that far out.

AMD releases earning and has a investor conference call later this month, so hopefully something new gets added to their standard slide deck.
 

Vattila

Senior member
Oct 22, 2004
805
1,394
136
Based on the recently released die shot of the Raven Ridge chip, I made a quick mock-up of what my proposed 12-core 7nm APU die might look like with 3 CCXs and 1 GCX (11 Vega CUs).



Nice and square, and at 7nm it should be less than the size of the 14nm Raven Ridge, as the CCX takes up much less than half of the die of the latter.

PS. Here is the original Raven Ridge die shot:

https://hexus.net/media/uploaded/2017/10/ffef815e-8764-4771-98d5-9e92ba8803ba.png
 

Yotsugi

Golden Member
Oct 16, 2017
1,029
487
106
Based on the recently released die shot of the Raven Ridge chip, I made a quick mock-up of what my proposed 12-core 7nm APU die might look like with 3 CCXs and 1 GCX (11 Vega CUs).



Nice and square, and at 7nm it should be less than the size of the 14nm Raven Ridge, as the CCX takes up much less than half of the die of the latter.

PS. Here is the original Raven Ridge die shot:

https://hexus.net/media/uploaded/2017/10/ffef815e-8764-4771-98d5-9e92ba8803ba.png
3CCX means a LOT of coherency traffic.
Kinda bad idea when the whole idea of CCX is to maintain cache locality and consistent latencies through the chip/MCM.
 

Vattila

Senior member
Oct 22, 2004
805
1,394
136
Here is a mock-up of my hypothetical Starship chip using a 2.5D interposer with 4 stacks of High Bandwidth Memory and 4 7nm APU dies, each with 3 CCXs (12 cores) and 1 GCX (11 CUs), for a total of 48 cores and 44 CUs.

 

SPBHM

Diamond Member
Sep 12, 2012
5,058
410
126
the IGP is pretty big and useless for too many applications, I think they are going to try replicating the current strategy.
 

Vattila

Senior member
Oct 22, 2004
805
1,394
136
the IGP is pretty big and useless for too many applications

Don't think of my hypothetical GCX as merely an IGP. Think of it in HSA terms, as a non-x86 processor specialised for embarrassingly parallel compute tasks, of which there are many in the server world, in the business world with big data, and increasingly in the consumer world as well, with todays media-rich applications and games. Also, think of it as an alternative to AVX512 and ever widening vector registers burdening the general-purpose x86 core.

For high-end gaming, think of it, not as a useless IGP sitting there idle, but as a supplement to the x86 cores and the discrete graphics card, available for games to help out with the rendering, do matrix calculations, physics or any other heavy parallel compute tasks, for which a general-purpose x86 core is not best suited.

Including a GCX on the die is also a solution to the "dark silicon" problem, in which you cannot light up all the transistors you can fit on a 7nm die at the same time within given power constraints — so you use the transistor budget to provide more functionality instead. Having both general-purpose cores and parallel compute cores on future chips seams a good use of transistors to me.

At one point in history even the FPU was considered costly and useless for too many applications to integrate on the die. As I type now, the FPU and 128-bit vector units across all my 8 Zen cores are sitting there mostly idle.

The only question is whether the time has come for the GCX at 7nm, or whether the transistor budget is still better spent elsewhere.
 
Last edited:

Yotsugi

Golden Member
Oct 16, 2017
1,029
487
106
Think of it in HSA terms, as a non-x86 processor specialised for embarrassingly parallel compute tasks
These are few; just use a dedicated coprocessor for them.
It's wasted die area and nothing more.
 

DrMrLordX

Lifer
Apr 27, 2000
21,807
11,161
136
HSA would be great if we had useful developer tools for it. Carrizo/Bristol Ridge has been a prime target for something like that for some time, but there's still nothing really there.
 

HurleyBird

Platinum Member
Apr 22, 2003
2,726
1,342
136
I would love to see all future Zen products starting at the 7nm node become APUs, but probably not in the way that you think.

Rather than putting a big GPU on the die, which on a high end product would most often be wasted die space, I'd like to see the smallest GPU possible. Just the display controller and a handful of shaders, maybe even as little as 1 CU.

For starters, this eight-twelve core processor would be a much better match for notebooks, being able to exploit switchable graphics for low idle power. Instead of super chunky desktop replacements, it could service a High End NoteBook role, or HENB, which right now is a market that doesn't really exist yet.

For servers, a separate video card or display controller is no longer necessary to get display output.

For desktops, you can now start your PC without a video card, which is convenient and makes it easier to troubleshoot. No more confusion from less knowledgeable buyers why the video output on their motherboard doesn't work. You also gain additional display outputs for more monitors, HDMI receivers, and the like. There are markets that either want a lot of displays, or don't need any GPU grunt, and an eight-twelve core Zen processor with the bare minimum IGP would be an ideal fit for them.

Having some kind of small IGP that takes up as little die space as possible opens up a new market for the die, and is a small added bonus / convenience everywhere else.
 
Last edited:
Reactions: Excessi0n
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |