New Zen microarchitecture details

Page 203 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

lolfail9001

Golden Member
Sep 9, 2016
1,056
353
96
Even with the Polaris 11 GPU a Quad Core RavenRidge APU will be sub 200mm2.
Actually, with Polaris 11 GPU, RR APU would be over 200mm^2 with ease. You are replacing a CCX (44mm^2 or something) with P11 (123mm^2). Basically it would be like 260-270mm^2, if 180-190 estimation is on point.
 

Glo.

Diamond Member
Apr 25, 2015
5,763
4,667
136
Nice,

Even with the Polaris 11 GPU a Quad Core RavenRidge APU will be sub 200mm2.
You didn't accounted that 1024 GCN core Vega, made from two 8 CU Shader Engines will be slightly bigger than Polaris 11. So it will deffinitely be bigger in die size than 200 mm2. Bigger, but not bigger than Polaris 10. So the end-user prices can range from 170 to 350$, and AMD still will make profit on this die.
 
Reactions: Drazick

CatMerc

Golden Member
Jul 16, 2016
1,114
1,153
136
Actually, with Polaris 11 GPU, RR APU would be over 200mm^2 with ease. You are replacing a CCX (44mm^2 or something) with P11 (123mm^2). Basically it would be like 260-270mm^2, if 180-190 estimation is on point.
There would be a lot of variables at play here. For example inter-CCX communication wouldn't need to be present, nor would the links required for the Naples MCM packaging. And the GPU on Raven wouldn't have its 128bit GDDR5 controller, rather a 64bit DDR4 controller (To combine with the controller on the CCX to create a dual channel system).

I'm not agreeing or disagreeing, just saying that it's not that simple.
 

Glo.

Diamond Member
Apr 25, 2015
5,763
4,667
136
More importantly: what core count is that CPU. 4C/8T? Or 6C/12T? 4C/8T at 299$ is meh, even with 4.2 GHz core clock. 6C/12T at 4.2 GHz, and 299$ is amazing deal.
 
Reactions: Drazick

Doom2pro

Senior member
Apr 2, 2016
587
619
106
If AMD Zen based APUs use the MCM technique, it doesn't matter how big Zen is or how big the iGPU is... They will be individual dies fabbed on separate wafers with their own respective yields.

All AMD has to do is bin them and then glue them together on an organic package with Infinity Fabric. This would be killer for AMD as in the past they have had to merge CPU and GPU designs and libraries and then compromise on a single library density for both on a single mask set and then reap the yields from the resulting large die.

Now they can use separate library densities and mask sets for each respective CPU and GPU without having to compromise on library densities, and reaping the benefits of yields from two separate designs and then link them together in an MCM with Infinity Fabric.

Tada, Zen based APU with Vega Graphics in an MCM with separate dies and no need to scavenge dies with defective iGPUs.
 
Last edited:
Reactions: Drazick

CatMerc

Golden Member
Jul 16, 2016
1,114
1,153
136
So I uh, got bored and had nothing better to do... So...

I present - FrankenDie:


11 CU's
16 ROP's
I stretched and squashed various parts, but I made sure they always take the same area, and I didn't overdo it by turning a block into a long string that goes around the die or anything like that lol.

Edit: Realized that 16 ROPs would probably be overkill for 11 CU's, so here's the 8 ROPs variant:
 
Last edited:

Doom2pro

Senior member
Apr 2, 2016
587
619
106
So I uh, got bored and had nothing better to do... So...

I present - FrankenDie:



11 CU's
16 ROP's
I stretched and squashed various parts, but I made sure they always take the same area, and I didn't overdo it by turning a block into a long string that goes around the die or anything like that lol.

Since you have the free time, now make an MCM package with a 4c/8t Zen SOC die + a Vega GPU die on separate dies connected via Infinity Fabric.

And if AMD doesn't have a 4c/8t design and only uses 8c/16t designs for 4c SKUs use a salvaged 8c/16t die and a Vega die on an MCM...

Now die size and yields don't matter so much.
 
Last edited:
Reactions: Drazick

Glo.

Diamond Member
Apr 25, 2015
5,763
4,667
136
I have been digging, and came to conclusion, that if Raven Ridge APU will have HBM2, and it will be available to both: CPU and GPU, that means we are looking not at 4C/8T + 16 CU MCM package, one, separate design.

HBM2 memory controller must be on the die, to get it functional for both: CPU and GPU. Overall die size should not be bigger than 220mm2 for 4C/8T + 16 CU design.

Ryzen CPU is 180-190 mm2. So there is some of room to play with for "Frankensteining" the Raven Ridge APU from Ryzen + Polaris dies .
 
Reactions: Doom2pro
Mar 10, 2006
11,715
2,012
126
If AMD Zen based APUs use the MCM technique, it doesn't matter how big Zen is or how big the iGPU is... They will be individual dies fabbed on separate wafers with their own respective yields.

APUs won't be MCM, they'll be integrated. On-die integration is always better for performance than MCM.
 

Doom2pro

Senior member
Apr 2, 2016
587
619
106
APUs won't be MCM, they'll be integrated. On-die integration is always better for performance than MCM.

That is assuming the older on-die integration performance is significantly better than what Infinity Fabric can offer and since we don't know the latter, we can't assume the former.

I would not be surprised if Infinity Fabric is the glue that connects Zen and Vega into an APU, after all AMD has been hyping IF hard core like it's the second coming of Hyper Transport.

Do you really doubt their inter module communication IP considering they practically invented HBM and Hypertransport?
 
Reactions: Drazick

raghu78

Diamond Member
Aug 23, 2012
4,093
1,475
136
That is assuming the older on-die integration performance is significantly better than what Infinity Fabric can offer and since we don't know the latter, we can't assume the former.

I would not be surprised if Infinity Fabric is the glue that connects Zen and Vega into an APU, after all AMD has been hyping IF hard core like it's the second coming of Hyper Transport.

Do you really doubt their inter module communication IP considering they practically invented HBM and Hypertransport?

Infinity fabric is the key technology which powers everything AMD is building going forward. IF will be present in every AMD product going forward - CPUs, GPUs and APUs. imo IF might be the most important AMD innovation since the original Athlon 64 and Hypertransport. IF brings a level of granularity and scalability unseen before. IF will scale from on die to on package to across sockets, across PCI-E links and maybe even more . AMD has not yet revealed all the details but IF is a huge deal.

http://semiaccurate.com/2017/01/19/amd-infinity-fabric-underpins-everything-will-make/
 
Reactions: Drazick

krumme

Diamond Member
Oct 9, 2009
5,956
1,595
136
IF looks like Jim Keller most important work to me. And i asume he is the co author of this as he was on hypertransport.

If amd opens up for others it will be one of the most important concepts ever for microprocessers as it enable a new way computing is done.

Most noteworthy is imo the separation of control/safety (latency) and data (bandwith). Makes it far more flexible.

I think hypertransport pales in comparison to it. Its the kind of modular platform thinking that the car industry have been exploiting and refining for decades. We need more of that.
 

scannall

Golden Member
Jan 1, 2012
1,948
1,640
136
Mark Papermaster was very clear about no HBM2 in 2017. The cost wasn't there yet. Unfortunate. 2018? Possible. 2019? Probable.

Perhaps someone more learned than I can answer this though. When HBM2 is cost effective, is there any particular reason AMD couldn't use separate dies for the CPU and GPU in an APU? With the HBM2 feeding just the GPU portion, and the CPU going out to DDR4? With the infinity mesh wrapping it up with a bow on top?
 

raghu78

Diamond Member
Aug 23, 2012
4,093
1,475
136
Canard PC just stated that all cores boost is to be set between 3.65 and 3.8GHz depending on yields.

This give weight to BJT2 statement that they could hit as high as 4GHz@95W with all cores, if not at launch this should require only a slight maturation of the specific productions lines at GF.

https://www.cpchardware.com/ryzen-les-frequences-a-la-loupe/

I can definitely foresee a 3.8 Ghz 8C/16T with 4 Ghz Turbo right in time for Skylake-X. Process maturity, production ramp yield learning and maybe a stepping if needed. Voila you have it ready.

Mark Papermaster was very clear about no HBM2 in 2017. The cost wasn't there yet. Unfortunate. 2018? Possible. 2019? Probable.

Perhaps someone more learned than I can answer this though. When HBM2 is cost effective, is there any particular reason AMD couldn't use separate dies for the CPU and GPU in an APU? With the HBM2 feeding just GPU portion, and the CPU going out to DDR4? With the infinity mesh wrapping it up with a bow on top?

HBM2 will be used as a common L4 cache as AMD APUs support a single address space. CPU and GPU can share pointers. This is only possible with a design where HBM2 is a high bandwidth cache. AMD Vega's High bandwidth cache controller is a dead giveaway that this is the design of the future.
 
Reactions: Drazick

BeepBeep2

Member
Dec 14, 2016
86
44
61
HBM2 will be used as a common L4 cache as AMD APUs support a single address space. CPU and GPU can share pointers. This is only possible with a design where HBM2 is a high bandwidth cache. AMD Vega's High bandwidth cache controller is a dead giveaway that this is the design of the future.
I was thinking eSRAM / eDRAM?
 

raghu78

Diamond Member
Aug 23, 2012
4,093
1,475
136
I was thinking eSRAM / eDRAM?

Both these are not options due to the capacity and die size/cost constraints. HBM2 is the best option as you get massive bandwidth which GPUs need at low power and they can be manufactured on cheaper DRAM processes.
 
Reactions: Drazick

BeepBeep2

Member
Dec 14, 2016
86
44
61
Both these are not options due to the capacity and die size/cost constraints. HBM2 is the best option as you get massive bandwidth which GPUs need at low power and they can be manufactured on cheaper DRAM processes.
Xbox One - 32MB eSRAM GPU cache (218 GB/s)
Broadwell Iris Pro 6200/P6300 - 128MB eDRAM shared L4

Not saying I would be betting money on it happening but I felt it could be a possibility. Raven Ridge is not going to have HBM2 and though there is easy space for 16 CUs it is still going to be bandwidth choked if it has only 8.

Having a memory controller design that can work with a large cache could probably use 256MB eDRAM effectively?
 

NostaSeronx

Diamond Member
Sep 18, 2011
3,689
1,224
136
Raven Ridge is not going to have HBM2 and though there is easy space for 16 CUs it is still going to be bandwidth choked if it has only 8.
Shhh.. no one needs to know.
https://www.rambus.com/memory-and-interfaces/ddrn-phys/hbm/
  • East-West orientation (PHY can be placed in corner of die)
http://i.imgur.com/OOHK1bH.jpg
I circled the HBM2 phys for you. It is part of the CCX. ;P
(Check Fiji's HBM Phys then compare and correlate to those.. hint: they match.)

With the addition of Vega and Infinity Fabric and High Bandwidth Cache Controllers in the L3(CCX) & Uncore(Vega) => HBM is assured.

16 CUs are not needed because Vega CUs have the ability to shoot 128 32-bit ops compared to Polaris->Southern Islands 64 32-bit ops. With that and plus Vega CUs can split into smaller SIMDs as well to increase utilization absolute destroying previous generations. Hence, why Raven Ridge is mostly 12 CUs aka 1536 32-bit ops.
 
Last edited:

lopri

Elite Member
Jul 27, 2002
13,211
597
126
This is only tangentially related to the earlier disclosed die sizes of AMD and Intel's 14nm cores.



v. mobile SOCs


But still interesting. ^^
 
Reactions: psolord

AtenRa

Lifer
Feb 2, 2009
14,003
3,361
136
Actually, with Polaris 11 GPU, RR APU would be over 200mm^2 with ease. You are replacing a CCX (44mm^2 or something) with P11 (123mm^2). Basically it would be like 260-270mm^2, if 180-190 estimation is on point.

1. RavenRidge APU doenst need all the IC that SummitRidge has for Servers. Quad Core + PCH + 128bit DDR-4 should be not more than 100mm2
2. You have to take-off Polaris 128bit GDDR-5 memory controllers (16-20mm2), you only need the DDR-4 on the CPU. So the die comes down to ~100mm2

1+2 = max 200mm2
 
Reactions: Drazick
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |