[Techpowerup] AMD "Zen" CPU Prototypes Tested, "Meet all Expectations"

Page 11 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Fjodor2001

Diamond Member
Feb 6, 2010
3,988
440
126
Can you please point out what this has to do with this thread, or his post, because honestly I don't get it. Why do you change subject every now and then only to talk about Shintai?

Arachnotronic says he longs for Zen to become a failure => That will lead to monopoly which he previously said he was against (in previous discussions we've had with ShintaiDK on that topic, thus the reference to that) => Therefore I wonder why Arachnotronic longs for Zen to become a failure, because it would go against his previous beliefs.
 

Abwx

Lifer
Apr 2, 2011
11,433
4,195
136
ZEN architecture is not based on Bulldozer CMT but in SMT like Intels. Also each ZEN Core has 50% more execution resources (4x ALUs + 2x AGUs vs 2+2) than what Bulldozer Integer Core has.

100% more integer exe ressources than an EXV core, 150% more FP exe ressources than a whole module...


The problem is not in the IPC, but if the architecture and the 14nm LPP process can achieve 4GHz or above at the designated TDP.

The 14nm LVT, the one whose perf/Watt were published, would double the perf/watt of a chip like Kaveri, and there s a faster sLVT enhancement..


Whatever you say

Can't wait for Zen to arrive. I was on these forums when Derpdozer was being hyped to the moon and I will be sure not to miss the discussion when Zen ultimately turns out to be a disappointment.

I have no doubt that you you ll be deeply disappointed...
 
Last edited:

Burpo

Diamond Member
Sep 10, 2013
4,223
473
126
We're not disappointed that you're finally off vacation from us. D:
 

looncraz

Senior member
Sep 12, 2011
722
1,651
136
What exactly does that mean? Is that a good thing?

Yes, it's a good thing. It means they didn't differentiate the AGUs and the AGUs can be used for any memory op. So, it can do two loads at the same time or two stores at the same time, or one of each. There were rumors that one was load-only and the other was store-only. I think it came from a misreading of the gcc patch (since the patch creates a znver1-load and znver1-store distinction... but when you look at the definition it is znver1_agu, which is defined as znver1_agu0 | znver1_agu1, meaning either unit can do either (I missed this until yesterday and believed the rumor until then)).

AGU Specialization is used to limit how many connections you need to make to an AGU and to reduce the complexity of handling multiple AGUs and generated addresses at once elsewhere in the design. Intel's Core chips (up until Sandy Bridge) used this design. AMD, however, has had full AGUs since the Phenom era, IIRC (a requirement of the clustered design), which was even considered one of their weak points against the Core architecture by some (entirely as a side-effect of the cluster design (1AGU + 1ALU attached to a queue, which complicated scheduling and meant stalls were more likely)).

Sandy Bridge has two full AGUs and they discovered the greater benefit came from just the extra store AGU capability (the extra load was still helpful, but really only for SMT). Haswell has those plus an addiitional store-only AGU. It gained them relatively little, which illustrates just how well two individually addressable fully functional AGUs can operate.

If we compare Zen to Sandy Bridge, we learn a few things that are interesting.

Sandy Bridge has 3 ALUs and 2 AGUs.
Zen has 4 ALUs and 2 AGUs.

Sandy Bridge shares its 3 ALU pipes with floating point instructions.
Zen has 4 dedicated floating point pipes.

There are certain cases where this pipeline dedication will be a meaningful benefit for Zen, such as when one thread is integer heavy and the other is floating point heavy (quite common, actually) while using SMT, or when on thread uses a great deal of each (which is fairly common in unoptimized complex loops).
 

Abwx

Lifer
Apr 2, 2011
11,433
4,195
136
We're not disappointed that you're finally off vacation from us. D:

A big LOWL..since i voluntarly didnt post theses weeks...

It s just that i wont prop up a forum that favour troll posts and ad hominem like the one you just posted, funny that the first answer i got prove this point...
 

Burpo

Diamond Member
Sep 10, 2013
4,223
473
126
You missed us! Just glad to see you posting again (reaches for popcorn)
 
Last edited:

TechGod123

Member
Oct 30, 2015
94
1
0
Yes, it's a good thing. It means they didn't differentiate the AGUs and the AGUs can be used for any memory op. So, it can do two loads at the same time or two stores at the same time, or one of each. There were rumors that one was load-only and the other was store-only. I think it came from a misreading of the gcc patch (since the patch creates a znver1-load and znver1-store distinction... but when you look at the definition it is znver1_agu, which is defined as znver1_agu0 | znver1_agu1, meaning either unit can do either (I missed this until yesterday and believed the rumor until then)).

AGU Specialization is used to limit how many connections you need to make to an AGU and to reduce the complexity of handling multiple AGUs and generated addresses at once elsewhere in the design. Intel's Core chips (up until Sandy Bridge) used this design. AMD, however, has had full AGUs since the Phenom era, IIRC (a requirement of the clustered design), which was even considered one of their weak points against the Core architecture by some (entirely as a side-effect of the cluster design (1AGU + 1ALU attached to a queue, which complicated scheduling and meant stalls were more likely)).

Sandy Bridge has two full AGUs and they discovered the greater benefit came from just the extra store AGU capability (the extra load was still helpful, but really only for SMT). Haswell has those plus an addiitional store-only AGU. It gained them relatively little, which illustrates just how well two individually addressable fully functional AGUs can operate.

If we compare Zen to Sandy Bridge, we learn a few things that are interesting.

Sandy Bridge has 3 ALUs and 2 AGUs.
Zen has 4 ALUs and 2 AGUs.

Sandy Bridge shares its 3 ALU pipes with floating point instructions.
Zen has 4 dedicated floating point pipes.

There are certain cases where this pipeline dedication will be a meaningful benefit for Zen, such as when one thread is integer heavy and the other is floating point heavy (quite common, actually) while using SMT, or when on thread uses a great deal of each (which is fairly common in unoptimized complex loops).

Thanks for the detailed explanation.
 

TechGod123

Member
Oct 30, 2015
94
1
0
Whatever you say

Can't wait for Zen to arrive. I was on these forums when Derpdozer was being hyped to the moon and I will be sure not to miss the discussion when Zen ultimately turns out to be a disappointment.

Define disappointment. Cause it sure as hell isn't looking like one despite what some Intel shills would have you believe.
 

looncraz

Senior member
Sep 12, 2011
722
1,651
136
Because the focus seems to be on bulldozer's module design. The problem is that even if you take an 8 core bulldozer and turn off half of each module, it is still slow as dirt compared to pretty much every i5. An i5-6600k scores almost 8000 in passmark. The aforementioned bulldozer configured as 4 "real" cores only scores around 4500 in passmark. This is with all the modular design constraints removed. So even putting aside the module vs SMT design issues, AMD still needs to pull roughly 80% more performance out of their hat. How?

You can't actually get rid of all of the module overhead, it's always present to some degree. There are several extra stages in the pipeline because of it, and then there are dedicated schedulers for each core, plus one for the FPU you would not otherwise need. Then, there is the constant minimal overhead when writing out results to the shared cache through the WCC (probably only a cycle here, though). Any way you look at it the constant module overhead is always there and is simply made worse by the other core in a module being active.

It's like when you're making an algorithm thread-safe while using shared resources. Even if all you are doing is reading the resource you have to gain a lock to prevent anyone else from writing to it. Most of the time no one else will try to change the resource while you are reading it, but it only takes one time for it to happen to cause a disaster.

By the time you are done wrapping all of your resources in locks, you have reduced the performance of your algorithm when running on one thread. This time, however, you can run it on two or more threads and get multiples of times faster overall. But, no matter what, that extra overhead is always there.
 
Mar 10, 2006
11,715
2,012
126
Arachnotronic says he longs for Zen to become a failure => That will lead to monopoly which he previously said he was against (in previous discussions we've had with ShintaiDK on that topic, thus the reference to that) => Therefore I wonder why Arachnotronic longs for Zen to become a failure, because it would go against his previous beliefs.

I don't long for Zen to be a failure, but the expectations are being inflated at such a rapid rate that it will almost certainly be a disappointment by the time it actually hits the shelves.
 

TechGod123

Member
Oct 30, 2015
94
1
0
I don't long for Zen to be a failure, but the expectations are being inflated at such a rapid rate that it will almost certainly be a disappointment by the time it actually hits the shelves.

I'm expecting Haswell level performance on average. I won't be disappointed especially since Skylake was overhyped as hell...it was hyped arguably more than Zen and it just turned out to be a disappointment.
 
Mar 10, 2006
11,715
2,012
126
Define disappointment. Cause it sure as hell isn't looking like one despite what some Intel shills would have you believe.

Better perf/watt than Skylake as one poster in this thread suggested. At this rate? The expectation ahead of launch will be that Zen will take the absolute performance crown, performance per watt crown, and be much cheaper to produce than any competing Intel chip.
 
Mar 10, 2006
11,715
2,012
126
I'm expecting Haswell level performance on average. I won't be disappointed especially since Skylake was overhyped as hell...it was hyped arguably more than Zen and it just turned out to be a disappointment.

I am expecting Haswell level performance per clock, but lower peak frequency and far fewer power management features. It may prove to be a competent low cost alternative to Intel products in desktops but I don't think it will be impressive in notebooks and other perf/watt sensitive use cases.
 

Fjodor2001

Diamond Member
Feb 6, 2010
3,988
440
126
Better perf/watt than Skylake as one poster in this thread suggested. At this rate? The expectation ahead of launch will be that Zen will take the absolute performance crown, performance per watt crown, and be much cheaper to produce than any competing Intel chip.

I don't think there is anyone on the forum that expects all of that from Zen. Thus they won't be disappointed either if it performs below that.

I think a lot of people will be happy with "just" Ivy/Haswell-E performance. Some may even be ok with SB-E performance assuming it is priced lower than Intel's HEDT alternatives.
 
Last edited:

TechGod123

Member
Oct 30, 2015
94
1
0
Better perf/watt than Skylake as one poster in this thread suggested. At this rate? The expectation ahead of launch will be that Zen will take the absolute performance crown, performance per watt crown, and be much cheaper to produce than any competing Intel chip.

And that's an idiotic belief to hold. Keller isn't a God but what I do believe was that he was in charge of a very competent group of engineers and I am in no way expecting better performance or even better performance per watt than Skylake at all. My expectations are reasonable, just because some people are crazy doesn't mean everyone is hyping Zen up that much.
 

TechGod123

Member
Oct 30, 2015
94
1
0
I am expecting Haswell level performance per clock, but lower peak frequency and far fewer power management features. It may prove to be a competent low cost alternative to Intel products in desktops but I don't think it will be impressive in notebooks and other perf/watt sensitive use cases.

Why lower peak frequency? They're using Samsung's LPP 14nm node right? Should be able to achieve fairly high clocks.

You're forgetting something here, if it has Haswell level performance per clock, DDR4 support and a unified socket, then that instantly becomes about 5x more appealing to an enthusiast that isn't a fanboy of any brand.
 
Mar 10, 2006
11,715
2,012
126
I don't think there is anyone on the forum that expects all of that from Zen. Thus they won't be disappointed either if it performs below that.

Sure there are.

I think a lot of people will be happy with "just" an 8 core Ivy/Haswell-E performance. Some may even be ok with SB-E performance assuming it is priced lower than Intel's HEDT alternatives.

If AMD prices its 8 core Zens at $899 and it delivers 8-core Haswell-E performance, will this bring the benefit to the consumers that you hope for?
 

Abwx

Lifer
Apr 2, 2011
11,433
4,195
136
You missed us! Just glad to see you posting again (reaches for popcorn)

If you say so..

Although there s not much left, if anything, to post, seems that about eveything was already debated by some experts in doom and gloom...


I am in no way expecting better performance or even better performance per watt than Skylake at all. My expectations are reasonable, just because some people are crazy doesn't mean everyone is hyping Zen up that much.

Exact perfs we dont know other than the published 40%, as for perf/Watt it can be better than its Intel counterpart, GF high perf 14nm LPP LVT caracteristics have been published and efficency numbers are better than Intel s 14nm for some important parameter.
 

looncraz

Senior member
Sep 12, 2011
722
1,651
136
Wow. Intel has literally been spending billion$ on trying to eke out all of the performance/watt it can from each architecture and you think AMD is going to go from the lousy Bulldozer lineage to a new architecture built on a relatively shoestring budget on a foundry process optimized for mobile products and wind up with better perf/watt in key PC/server use cases? OK.

It's actually not that hard to understand. Intel spends billions on the fab research, not the CPU development. The CPU development, itself, is actually relatively cheap. It's even cheaper if you already have the IP and don't have to design it.

Zen has four simple ALUs, basically the ones in Excavator with some tweaks (basically just reassigning what instructions go where, as you'd expect).

Zen has two full AGUs, exactly like the ones in Excavator. Probably the exact same ones, maybe with some minor updates.

Zen has a four fully-feature decoders. Exactly like the ones in Excavator. Probably just some minor tweaks.

Zen has a new cache system, probably more similar to the Stars cache system than to Excavator. This means no more WCC, no more concurrency issues, and fewer, cheaper, cache stalls. This is where some design effort was truly needed. They don't need to beat the pants off Intel here, they need to beat the pants of Excavator. That should be easy.

The next thing Zen has that is new is its data fabric. It was created by the same guy that created the awesome HyperTransport tech. We should expect big things.

All in all, Zen vs Excavator is almost like Apple's A7 vs Apple's A6.

The A7 has nearly double the IPC of the A6, but a good half of that is from the added AGU (2 vs 1) and things that don't translate to Zen vs Excavator.

I mention this because the A7 was designed by a familiar face and looks awfully familiar

 
Mar 10, 2006
11,715
2,012
126
Why lower peak frequency? They're using Samsung's LPP 14nm node right? Should be able to achieve fairly high clocks.

On what do you base this notion? Samsung's 14nm LPP is, very rightfully, optimized for low power mobile applications since that's what Samsung's largest foundry customer (Apple) was gunning for. What makes you think that a high IPC/high frequency design will be particularly feasible on this process, especially designed by whatever team is left at AMD which to be frank is not exactly the industry's A-team?

You're forgetting something here, if it has Haswell level performance per clock, DDR4 support and a unified socket, then that instantly becomes about 5x more appealing to an enthusiast that isn't a fanboy of any brand.

OK, so why will it be any more appealing than the quad core Skylake chips you can buy today or the 6-8 core Broadwell-E chips that you'll be able to buy in a few months?
 

Fjodor2001

Diamond Member
Feb 6, 2010
3,988
440
126
Sure there are.
Seems strange. But in that case I've missed it.
If AMD prices its 8 core Zens at $899 and it delivers 8-core Haswell-E performance, will this bring the benefit to the consumers that you hope for?
For me personally, I think it would have to be priced a bit lower than that. Hard to say exactly how much lower, since it depends on the options available at the time Zen is released. But if priced competitively enough, sure it'll benefit the customers. At least it won't be worse than what we have now with not much competition in the HEDT and server segments at all.
 
Last edited:

TechGod123

Member
Oct 30, 2015
94
1
0
On what do you base this notion? Samsung's 14nm LPP is, very rightfully, optimized for low power mobile applications since that's what Samsung's largest foundry customer (Apple) was gunning for. What makes you think that a high IPC/high frequency design will be particularly feasible on this process, especially designed by whatever team is left at AMD which to be frank is not exactly the industry's A-team?



OK, so why will it be any more appealing than the quad core Skylake chips you can buy today or the 6-8 core Broadwell-E chips that you'll be able to buy in a few months?

Oh I don't know, maybe the whole unified thing??? You have to pay a whole lot for an X99 mobo and a Haswell E chip. If Zen has just one socket and mobo(which is true, looking at AM4) then you might be able to throw in a Zen based Athlon and then go for a higher performance Zen CPU when you have the cash as opposed to having to buy a new mobo AND a new CPU just to get the enthusiast CPU.

As for the process node, I'm pretty sure they're using a revised version but I think Looncraz is certainly more knowledgeable of the specific tweaks that were made to Samsung's manufacturing process.
 
Last edited:

Abwx

Lifer
Apr 2, 2011
11,433
4,195
136
On what do you base this notion? Samsung's 14nm LPP is, very rightfully, optimized for low power mobile applications since that's what Samsung's largest foundry customer (Apple) was gunning for. What makes you think that a high IPC/high frequency design will be particularly feasible on this process, especially designed by whatever team is left at AMD which to be frank is not exactly the industry's A-team?

To make things clear Apple use the 14nm LPE.

14nm LPP exist in different threshold voltage, HVT , RVT, LVT, sLVT.

This list start from the less leaky/slower iteration to the more leaky/faster, the LVT at 2.4GHz has 2.3x better efficency than GF s 28nm HPP used for Kaveri.

Edit : The A9 likely use the 14nm LPE RVT wich is optimised up to 1.8GHz with leakage 4x lower than the LVT.
 
Last edited:

AtenRa

Lifer
Feb 2, 2009
14,003
3,361
136
Apple is using Samsungs 14nm LPE, not LPP.

LPP is suited for more than Mobile and low clocks.

 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |