[Techpowerup] AMD "Zen" CPU Prototypes Tested, "Meet all Expectations"

Page 14 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

beginner99

Diamond Member
Jun 2, 2009
5,228
1,603
136
I don't think that AMD will price its 8 core Zen particularly cheaply and there is a very simple reason why.

Agree. And for mobile it also makes sense to price it normal to high (at least if ti delivers). If you price it cheaply it will end up in crappy, cheap and slow laptops which further hurts AMD brand. Also a cheapo laptop with crap screen will have lower battery life than one with better components. Also hurts. You want your SOC paired with good components, especially the screen as that is becomign the most power hungry part like it already is on smartphones.
 
Mar 10, 2006
11,715
2,012
126
Are you saying Intel can never recover from their Pentium 4 debacle

AMD stumbles one time and you automatically assume they will stumble again when they have done so much that clearly shows they are on the right track? They even brought in a great deal of specialized talent (more than just Jim Keller, he's just the biggest name), but some who had FinFet experience.

With what we know about Zen (which is more than many seem to realize), it should be quite an interesting CPU. The FPU is what we don't really know it will perform, though I've done some work towards estimating that:

http://looncraz.net/ZenAssignments.html

Between the launch of the Pentium 4 in the year 2000 and the launch of Core 2 Duo in 2006, Intel's R&D spending rose sharply from just north of $3 billion to nearly $6 billion, meaning that Intel actively upped its investments substantially between the Pentium 4 "failure" (and yes, Pentium 4 did suck) and Core 2 "salvation".

AMD has shed talent and its R&D budget now sits below $1b/yr in today's dollars, and this includes what should be a fairly major GPU effort. Just FYI.
 
Last edited:
Mar 10, 2006
11,715
2,012
126
Intel has you convinced that it is difficult for them to give you more performance, that's just what they want you to think.

If Intel wanted to give you more, they'd give you more. The only factor is achievable profit margin. Intel could absolutely give bigger performance jumps, but it would be unusual for them do so.

I would like you to explain this thought a little more, please. What about giving bigger performance jumps gen/gen would impact Intel's profit margins?
 

itsmydamnation

Platinum Member
Feb 6, 2011
2,896
3,466
136
Between the launch of the Pentium 4 in the year 2000 and the launch of Core 2 Duo in 2006, Intel's R&D spending rose sharply from just north of $3 billion to nearly $6 billion, meaning that Intel actively upped its investments substantially between the Pentium 4 "failure" and Core 2 "salvation".

AMD has shed talent and its R&D budget now sits below $1b/yr in today's dollars, and this includes what should be a fairly major GPU effort. Just FYI.

How much does it cost to design a CPU core......... R&D as a total value is irrelevant without understand how it is being spent (you dont).

AMD isn't going to do to anything revolutionary here, its going to be evolutionary, and many parts of both bulldozer and cat cores are in really good shape to use as a base. For all bulldozers suckieness it actually did lots of good things they just all ended up horribly bottlenecked :sneaky:....

op/loop cache, decode, PRF, prefetch,predictors, load and store, DVFS. AMD already have a good handle on TSV, MCM as well. So i bet we are going to see evolutions of these area's. We know we are seeing changes in both INT and FP execution units. But it looks like the caching system will be completely new.

So now how much R&D $ does it cost of 4 years to bring that kind of product to market? how are you coming to that figure? Without that your point is irrelevant.
 
Mar 10, 2006
11,715
2,012
126
How much does it cost to design a CPU core......... R&D as a total value is irrelevant without understand how it is being spent (you dont).

You're right, my bad, I have absolutely no clue what I'm talking about. Things are going just swimmingly for AMD and life is going to be just smooth sailin' for them from now on. Yep.
 
Mar 10, 2006
11,715
2,012
126
So would you care to explain how giving less performance leads to bigger profits? Seems to me more performance gains would lead to more sales (new sales and upgrades), thus more profits. Do you really think Intel is purposely holding back performance, especially with the increasing competition from ARM and Mobile?

This is such an excellent, excellent point. Intel gains nothing by dribbling out small performance increases and giving customers little reason to buy new systems. It seems like an awful waste to pour all of that money into new architectures, new processes, and marketing of new products if you're going to artificially constrain what you're going to build to the point that the products simply aren't that compelling.

It is a simple fact that the easy gains have been made long ago and every incremental gain becomes increasingly difficult, especially when trying to do it in a power constrained envelope.

Another excellent point. Skylake is basically the industry's leading architecture in terms of ST performance (high frequency + high perf/clock), so trying to actually build on top of that is no easy feat.
 

itsmydamnation

Platinum Member
Feb 6, 2011
2,896
3,466
136
You're right, my bad, I have absolutely no clue what I'm talking about. Things are going just swimmingly for AMD and life is going to be just smooth sailin' for them from now on. Yep.

good work,

1. you moved the goal posts
2. you didn't address a single point
3. you then said i said something that i didn't.

congrats on the trifecta....... :\
 
Mar 10, 2006
11,715
2,012
126
Sure. Intel doesn't have to worry about performance to drive sales, they have no competition. The user upgrade market is quite small compared to the OEM market as well, so they can make more money by release only small upgrades and charging those who make the jump each time (or many of the times) out the yang.

The real money is in servers. There, Intel is in the same situation. They have no competition, so they only advance as much as they must to keep sales up and margins high.

While x86 gains are certainly not extremely simple, if you look at the changes Intel has made to the Core design to get us what we have today it is quite clear that they have only been doing what it takes to provide a small boost that will keep people interested and give the OEMs a more expensive CPU to purchase.

Every change is incremental and simple (for the most part). These are things we said Intel could do when Core 2s were out and about to increase performance (full AGUs, another ALU and FPU port, more scheduler slots, improved branch prediction, etc.) together with a few things that are more unique (reduced misprediction penalties).

Almost everything Intel has done has been to give the illusion of struggling to create more performance. They keep a GPU on these dies that will often go completely unused in higher end systems... they could EASILY take that off and use that die space to provide more performance. Either with two more cores or with wider cores.

The biggest hurdle to overcome is the microcode itself. When you add an ALU you have to know what you will have execute with it, same with an extra FPU pipe.

Want to know how to make it wildly faster? Easy:

8xALU/FPU pipes, 4x AGUs, 1x Store AGU, 1x Store Data, double the caches, double the instruction fetch, double the register pool, etc. Assign four logical cores to this core and use a single unified scheduler.

Now, each thread can potentially access 2x MUL, 2x DIV, 8x ADD, 8xLEA, 4x LOAD, 5xSTORE, 8x fop, etc... all at once.

You throw all of the instructions into the same cache, without any regard for which thread owns what, and you pull the instructions. When an instruction goes through decoding the register is chosen in the pool based on the thread to which the instruction belongs, and the data is loaded, and the instruction continues like nothing else is different.

This is completely doable, and would easily fit in the die space (it would actually be a bit smaller than two cores, but should perform just as well). With Intel's economies of scale this would probably even be affordable for those who *really* need high IPC.

Of course, this will NOT double IPC, as there's always a point in the code where you start working a bit too far out of the context and there are legacy concerns, but this is worth a good 50% single threaded boost.

You would then have Celerons and Pentiums with one super-fast core and two or more threads.

Boy, you sure do make it sound easy to build an "Intel-killer" chip to take advantage of the fact that Intel is just lazing about dribbling out performance increases. Maybe AMD should hire you to run their CPU division?
 

TechGod123

Member
Oct 30, 2015
94
1
0
Boy, you sure do make it sound easy to build an "Intel-killer" chip to take advantage of the fact that Intel is just lazing about dribbling out performance increases. Maybe AMD should hire you to run their CPU division?

Please note that he suggested those modifications. No one is necessarily saying that it would be easy to implement those changes but for a company like Intel who you point out continuously as having a large Research and Development budget, they *should* be able to. You also mention how the engineers at AMD aren't as good as Intel's either. Why can't the combination of pure genius and immense amount of money come up with an extremely powerful CPU? One change in particular which was relating to removing the iGPU in the 6500K or the 6600K, why doesn't intel do that? Why not free up transistors for the CPU?
 

A5

Diamond Member
Jun 9, 2000
4,902
5
81
Please note that he suggested those modifications. No one is necessarily saying that it would be easy to implement those changes but for a company like Intel who you point out continuously as having a large Research and Development budget, they *should* be able to. You also mention how the engineers at AMD aren't as good as Intel's either. Why can't the combination of pure genius and immense amount of money come up with an extremely powerful CPU? One change in particular which was relating to removing the iGPU in the 6500K or the 6600K, why doesn't intel do that? Why not free up transistors for the CPU?

Because those products don't sell in enough volume (or at a high enough price point) to justify a different mask and separate production line.

If you want a CPU without the iGPU, they will gladly sell you a Haswell-E or Xeon.

Intel as a whole does not care *at all* about enthusiasts. They've spent the last 5 years focusing on performance/watt because Dell and Apple buy more laptop CPUs in a quarter than our whole market segment does in a year.
 

MrTeal

Diamond Member
Dec 7, 2003
3,595
1,768
136
This so much. The unified socket will be glorious. Pick up a decent mobo and you'll have great upgrade paths available. Way better than paying more for an X99 mobo or one that is designed for OCing or one that has these extra features that the other ones don't have because of different chipsets etc...This immediately makes Zen better for me if it matches Haswell.

That, and Zen+ will most likely use the AM4 socket as well which means an even bigger upgrade path.

You know there are good technical reasons for moving to X99 and LGA2011, right? AMD hasn't released any official confirmation that the 8C/16T Zen will be available in dual channel AM4, let alone Zen+, unless something new has come out in the last couple months and hasn't been wildly reported.

It's not hard to imagine a lot of situations where a 4GHz 8-core Zen with Haswell IPC would be platform constrained by Summit Ridge with 16 PCIe links and a dual channel DDR4 MC.
 
Last edited:

el etro

Golden Member
Jul 21, 2013
1,581
14
81
How much does it cost to design a CPU core......... R&amp;D as a total value is irrelevant without understand how it is being spent (you dont). <br />
<br />
AMD isn't going to do to anything revolutionary here, its going to be evolutionary, and many parts of both bulldozer and cat cores are in really good shape to use as a base. For all bulldozers suckieness it actually did lots of good things they just all ended up horribly bottlenecked <img src="images/smilies/familiar/sneaky.gif" border="0" alt="" title="Sneaky" smilieid="40" class="inlineimg" />....<br />
<br />
op/loop cache, decode, PRF, prefetch,predictors, load and store, DVFS. AMD already have a good handle on TSV, MCM as well. So i bet we are going to see evolutions of these area's. We know we are seeing changes in both INT and FP execution units. But it looks like the caching system will be completely new.<br />
<br />
So now how much R&amp;D $ does it cost of 4 years to bring that kind of product to market? how are you coming to that figure? Without that your point is irrelevant.
<br />
<br />
Hey itsmydammnation, do you think that the biggest part of Zen IPC increase come from beefing up the excution engine per core(like adding two ALUs and doubling the per core FPU)?
 

Abwx

Lifer
Apr 2, 2011
11,433
4,195
136
Thanks for posting the numbers ;-)

Just goes to show what I've had to tell so many people, there's really no reason why Zen shouldn't be able to clock well on 14nm LPP - beyond AMD's lack of experience with it and the new IP involved (if anything's going to slow down the clock speeds, I think it'll be the caches).

You re welcome..

Process looks good and we know that there s a faster iteration than the fast one (LVT) in theses tests.

As for explainin the uarch that s not my specialty but looks that there s worse than me as there s some belief by there that AMD has beefed the Integer exe ressources by 100% and FP exe ressources by 150-200% just to grab 25% better IPC than EXV.

FTR Zen FPU is twice the whole FPU of a module, i guess that there will be hugely disappointed people when looking at the poll...
 
Last edited:

Abwx

Lifer
Apr 2, 2011
11,433
4,195
136
OK, so confirmed it 100%: A9 is LPP not LPE.

LOL.

You should mail a few sites to tell them that they are wrong and that you know better...

Anyway this has nothing to do with Zen other than some people hoping that the process is no good.
 
Mar 10, 2006
11,715
2,012
126
LOL.

You should mail a few sites to tell them that they are wrong and that you know better...

What's so funny? Unlike most of the sites that have been blindly parroted that A9 is built on 14nm LPE, I went and did the work required to learn what process A9 is actually built on. This information has been verified with multiple industry sources, so I have confidence in it. You can "LOL" all you want though.
 
Last edited:

Abwx

Lifer
Apr 2, 2011
11,433
4,195
136
What's so funny? Unlike most of the sites that have been blindly parroted that A9 is built on 14nm LPE, I went and did the work required to learn what process A9 is actually built on. This information has been verified with multiple industry sources, so I have confidence in it. You can "LOL" all you want though.

Verified and no link posted...

Because Hardware.fr dont have their own sources, neither does THG....

FTR the A9 is completely irrelevant since it use a RVT variant that is two step below in speed (and two step above in leakage..) than the sLVT wich interest us.

Doesnt the A9 work at 1.85GHz..?.

Did you pay attention to the datas i posted and wich point the RVT as being the best at this frequency.?.

To get back on topic the LVT was showed at 2.41GHz, that s 640MHz higher than in the RVT testbed, and Zen will use at least the sLVT if not better since GF can improve the speed further at the expense of some leakage, this is not a phone CPU we re talking about but of a 8C/16T HEDT CPU.
 
Mar 10, 2006
11,715
2,012
126
Verified and no link posted...

Because Hardware.fr dont have their own sources, neither does THG....

Well you can choose to believe them or me, I honestly don't care. Just providing information to anybody in this thread who happens to be interested.
 

Dresdenboy

Golden Member
Jul 28, 2003
1,730
554
136
citavia.blog.de
Looncraz, it's not that easy to add units, reduce misprediction penalties, and so on. Scheduler complexity and power consumption would quickly go up. Pipelines aren't changed that easily etc. Design tools were less advanced in the past.

Soon or later you'll be able to actually play with your ideas.
 

looncraz

Senior member
Sep 12, 2011
722
1,651
136
Looncraz, it's not that easy to add units, reduce misprediction penalties, and so on. Scheduler complexity and power consumption would quickly go up. Pipelines aren't changed that easily etc. Design tools were less advanced in the past.

Soon or later you'll be able to actually play with your ideas.

No doubt, but in this day of synthesized designs and capable tools there's really not much excuse. In fact, for the last decade it has been rather simple to add new logical units into a CPU and have the physical design finish itself.

AMD just spent three years doubling up everything in their CPU. Bulldozer was almost completely synthesized and was a much greater effort than Intel has put into any of their CPUs in one jump (except for the Pentium Pro, Penitum 4, and Itanium, perhaps :sneaky.

Intel can't be blamed for doing the incremental design changes, it's the safe route. They don't have a reason to try and push the design pace any more than they are. Adding a couple extra cores isn't child's play, but it's something that can be done in a couple months. Adding execution resources requires a great deal of programming and profiling even after the hardware layout is done. Intel has the resources, they could rather easily plan to make a double-wide core with a unified scheduler and get it done in just a couple years time.

They could go as far as to power and clock gate each execution unit so that they are only sucking power when in use.

If they had serious competition and really needed to do it, they could certainly do so.
 

Tuna-Fish

Golden Member
Mar 4, 2011
1,458
1,904
136
No doubt, but in this day of synthesized designs and capable tools there's really not much excuse. In fact, for the last decade it has been rather simple to add new logical units into a CPU and have the physical design finish itself.

...

They could go as far as to power and clock gate each execution unit so that they are only sucking power when in use.

You appear to have no idea what it costs to add a new execution unit. The cost isn't die area (ALUs take minimal area on modern chips), but clock speed. The clock speed of chips is limited by the longest path, both in transistors that need to switch and wire distance that needs to be crossed, that has to be traversable in a single clock cycle. This longest path is almost invariably found in the forwarding network that shuffles results from one execution unit to another. Adding more execution units is very easy, but it also directly reduces the maximum clock speed of your chip.
 

nismotigerwvu

Golden Member
May 13, 2004
1,568
33
91
I am not so sure about that. The Performance rating was outright disasterous. It took K6 to actually compete with Pentiums.



You did exactly what I said you would and confused IPC and outright performance. The PR system was a dumb move all around, but clock for clock the K5 was really quite good in terms of integer performance. Floating point...that would have to wait (K6 2 if you count SIMD, but K7 for x87 performance). The issue with the K5 was just that they could only get to about 116mhz if memory serves correct. Even then, outside of gaming it held up well enough for where it was priced. Not a great chip by any means though.
 

tynopik

Diamond Member
Aug 10, 2004
5,245
500
126
The clock speed of chips is limited by the longest path, both in transistors that need to switch and wire distance that needs to be crossed, that has to be traversable in a single clock cycle. This longest path is almost invariably found in the forwarding network that shuffles results from one execution unit to another. Adding more execution units is very easy, but it also directly reduces the maximum clock speed of your chip.

it seems that chips are more often impacted by thermal limits than design limits when it comes to clockspeed (especially in mobile which is all anyone cares about)
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |