Bulldozer ES benchmark is out!

Page 3 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Skurge

Diamond Member
Aug 17, 2009
5,195
1
71
Here's the thing. How much do you realistically think AMD can change in the next 2 months? Yes, it really looks like the chip is crippled and broken, but I don't think the crippling is intentional. It seems like the chip really has a lot that needs to be fixed/improved, but AMD has a very short timeframe in which to do it. The benchmark results right now are horrendous for a chip that's supposed to compete with 2500K, 2600K, and Ivy Bridge when it comes out early next year.

In contrast, Intel's Sandy Bridge ES chips benchmarked pretty close to the final versions, and Anandtech ran those benchmarks 3-4 months before the official release of SB chips (August 2010). Here we are 3 months away from Bulldozer's "official" release and AMD still can't produce engineering samples that can perform half-decently?

You make it seem like this particular ES is the best the have atm, I'm pretty sure its a few months old so they had more than just 2month to fix it. Now, I don't pretend to know much about how CPUs work, but it looks to me like this sample has about a 1ghz clock deficit to the top bulldozer part amongst other things mentioned. Like cache writing being disabled. How much performance will fixing that give? I don't know, but I'm pretty sure its more than just 10%.

Also, do you think if bulldozer performed this badly, much worse than an i7 860, they would price it the same as a 2600K?

It might not be a magical performer, but it looks like it might be pretty competitive with SB (Intel's latest architecture) and that is more than you can say for AMD for the last 6yrs.
 

hamunaptra

Senior member
May 24, 2005
929
0
71
Sandy Bridge is a completely new microarchitecture, not a die-shrink. Nehalem was the architecture for the first gen Core i3/i5/i5 chips, and Wolfdale was the 32nm die-shrink. Next year, Ivy Bridge will be the die-shrink of Sandy Bridge.

This is why it's depressing news that AMD cannot get decently performing ES chips out just a few months before release, when Intel was able to do the same 3-4 months before the final SB release.

Um, SB is not a completely new uarch. It is an enhancement of nehalem.
Hell, if you wanna get technical, SB, nehalem, C2D all pretty much get their roots from the p6 uarch.
 

formulav8

Diamond Member
Sep 18, 2000
7,004
522
126
Sandy Bridge ES chips 3-4 months before release were performing exactly on par with their final retail counterparts.

See the difference?

Just because Intel had a ES working 3 months before launch means nothing. Intel could have had ES samples for a year for all you know. You can NOT gauge Intel's timeframe to be AMD's timeframe.
 

lol123

Member
May 18, 2011
162
0
0
Um, SB is not a completely new uarch. It is an enhancement of nehalem.
Hell, if you wanna get technical, SB, nehalem, C2D all pretty much get their roots from the p6 uarch.
Sandy Bridge might not have been a completely new architecture (arguably no microarchitecture is completely new), but it was a complete redesign. It was not an incremental update to Nehalem.
 

dx11101

Member
Jun 6, 2011
45
2
71
at 2.8 GHz it looks like AMD's new chip is slower than i7 clock for clock in these benches. That is not a good sign assuming the numbers are legit of course. Their is still hope though because from this you cant give the official "suck" verdict just yet.
 

notty22

Diamond Member
Jan 1, 2010
3,375
0
0
Until recently, AMD's time frame was now, granted, imo, they have known for months, that they are xx months away. As of now, if they are supposed to be selling cpu's in September, they should have something 'ship' ready. Not sure who it benefits to not leak, or demonstrate its performance. We have tech sites, putting on the front pages, reviews for the new 990fx m/b's and the disappointment to have no cpu's to go with them ? To me, thats only going to keep mounting pressure, tension about the 'unknown' performance. Which shakes confidence.
 

hamunaptra

Senior member
May 24, 2005
929
0
71
Sandy Bridge might not have been a completely new architecture (arguably no microarchitecture is completely new), but it was a complete redesign. It was not an incremental update to Nehalem.

Um dude, it was a very incremental design. They added AVX and took out the ROB and put a PRF in its place also added trace cache, tacked on a crappy GPU. Made a few other tweaks here and there, other than that, its pretty much unchanged from nehalem.
 

garagisti

Senior member
Aug 7, 2007
592
7
81
...We are beyond original launch date...

I read somewhere that original NDA date has still not expired... so you're wrong on that account. Someone already mentioned somewhere that Llano was to be launched on 14th of June. BD's launch date was scheduled after Llano, as confirmed by many.

EDIT:
Also some people are murmuring about a certain donanimhaber slide being approximation of performance... that is 10 pts in cinebench 11. Hell i don't know... but sure as hell it will be faster than Thuban.
 
Last edited:

out.of.order

Junior Member
Jan 22, 2011
18
0
0
Sandy Bridge is a completely new microarchitecture, not a die-shrink. Nehalem was the architecture for the first gen Core i3/i5/i5 chips, and Wolfdale was the 32nm die-shrink. Next year, Ivy Bridge will be the die-shrink of Sandy Bridge.

This is why it's depressing news that AMD cannot get decently performing ES chips out just a few months before release, when Intel was able to do the same 3-4 months before the final SB release.

i think you have confused westmere with wolfdale

btw also considering the ES, the beta bios, some lack of optimizations etc. etc. etc.... the IPC seems to be lower than a phenom... :whiste: at least for the single core, for the whole CPU don't know
 
Mar 10, 2006
11,715
2,012
126
Um dude, it was a very incremental design. They added AVX and took out the ROB and put a PRF in its place also added trace cache, tacked on a crappy GPU. Made a few other tweaks here and there, other than that, its pretty much unchanged from nehalem.

No. You obviously have no idea what you're talking about.

List of changes:
- Reintroduction of trace cache (this is actually big!)
- L3 cache latency reduced; now not part of the uncore, so it runs at full speed (this is also huge!)
- Branch predictor is new
- AVX
- PRF added
- Load/Store buffers increased in size
- Ring bus (really nice if you want to scale # of cores)
- on-die graphics...
- ROB buffers increased from 128 to 168
 

996GT2

Diamond Member
Jun 23, 2005
5,212
0
76
This is hilarious because people actually believe that BD will perform like this POS ES.

I think it's reasonable to believe that the final retail BD will be in the same ballpark. AMD's obviously working to fix and improve it, but it's not like the final retail product will perform an order of magnitude better...

A big part of the performance improvements will come from the much higher clock speed on the retail Bulldozer chips (>3.5 GHz vs. 2.8 GHz), but clock for clock I'm not expecting more than a ~10% improvement from this ES.
 
Last edited:

hamunaptra

Senior member
May 24, 2005
929
0
71
No. You obviously have no idea what you're talking about.

List of changes:
- Reintroduction of trace cache (this is actually big!)
- L3 cache latency reduced; now not part of the uncore, so it runs at full speed (this is also huge!)
- Branch predictor is new
- AVX
- PRF added
- Load/Store buffers increased in size
- Ring bus (really nice if you want to scale # of cores)
- on-die graphics...
- ROB buffers increased from 128 to 168


That doesnt exactly spell "redesign" ... pretty much just enhancements. Like I stated.
You stated "List of changes" - exactly my point - just changes.
Whereas BD in nearly all ways departs from stars....
 

Arkadrel

Diamond Member
Oct 19, 2010
3,681
2
0
Just wondering, why doesn't AMD switch to using an LGA for their sockets? Is it just to keep backwards compatibility with AM2/AM3? Or is it cheaper/easier to manufacture, or something?


Haveing the "pin's" on the CPU adds extra cost, makeing a product with less is bound to make said product cheaper.

Intel pushes that onto the motherboard manufactors.

Why doesnt AMD use LGA? who knows <.< maybe they prefer the pins on their chips instead of motherboard? ei. bent pin = new cpu buy instead of new motherboard buy, or maybe its a cost thingy? cpu cheaper than motherboard, so the risk should be on the cpu?

Anyways... as far as I know, LGA or not, makes zero differnce to anything, it comes down to a question of where you want to put risk of bent pins, and greed/cost of production of the cpu.



about the performance...

ES of a bulldozer, with beta bioses and whatnot... rumors of horrid B0 steppings.

Lets wait for the final product to come out before we look at some video that shows bad performance on a ES and claim bulldozer will perform like that (anything else stinks of fanboyisme).
 
Last edited:

intangir

Member
Jun 13, 2005
113
0
76
No. You obviously have no idea what you're talking about.

List of changes:
- Reintroduction of trace cache (this is actually big!)
- L3 cache latency reduced; now not part of the uncore, so it runs at full speed (this is also huge!)
- Branch predictor is new
- AVX
- PRF added
- Load/Store buffers increased in size
- Ring bus (really nice if you want to scale # of cores)
- on-die graphics...
- ROB buffers increased from 128 to 168

Agreed, the expanded uop cache is really huge. In addition, the two address generation units can now each do either a load or store, instead of being a dedicated load AGU and a dedicated store AGU like on Nehalem. So Sandy Bridge can do two loads or stores in a single cycle, whereas Nehalem could only do one.

David Kanter at realworldtech called Sandy Bridge an entirely different microarchitecture from the P6.
http://www.realworldtech.com/page.cfm?ArticleID=RWT091810191937&p=10
David Kanter said:
Sandy Bridge is a fundamentally new microarchitecture for Intel. While it outwardly resembles Nehalem and the P6, it is internally far different. The essence of an out-of-order microarchitecture is tracking, re-ordering, renaming and dynamically scheduling operations to achieve the limit of data flow. Nehalem and Westmere rely on the same mechanisms that date back to the original P6. Sandy Bridge changes the underlying out-of-order engine and uses the more efficient approach taken by the EV6 and P4. That one change alone qualifies Sandy Bridge as a different breed entirely from the P6. But, there are changes in almost every other aspect of the design. The uop cache is a huge improvement for the front-end, largely by eliminating many of the vagaries of x86 fetch and decode. The implementation is quite clever and achieves many of the aims of the P4’s trace cache, but in a far more efficient and reliable manner. AVX improves execution throughput and most importantly, the more flexible memory pipelines benefit almost all workloads.
 

hamunaptra

Senior member
May 24, 2005
929
0
71
Well, my bad =P I didnt think it was that much different. Just pick and choose parts from P4 and P6 and integrate them into one thing. Hence why I dont see it as a big change. OTOH, stars -> BD is a much more drastic change. Hardly anything in BD represents any uarch of AMD's past, much less a normal x86 design. To me, thats a true redesign.
 

dmens

Platinum Member
Mar 18, 2005
2,271
917
136
Um dude, it was a very incremental design. They added AVX and took out the ROB and put a PRF in its place also added trace cache, tacked on a crappy GPU. Made a few other tweaks here and there, other than that, its pretty much unchanged from nehalem.

Um, no. Single thread benchmark speedup shows it's a lot more than an incremental design.
 

intangir

Member
Jun 13, 2005
113
0
76
Well, my bad =P I didnt think it was that much different. Just pick and choose parts from P4 and P6 and integrate them into one thing. Hence why I dont see it as a big change. OTOH, stars -> BD is a much more drastic change. Hardly anything in BD represents any uarch of AMD's past, much less a normal x86 design. To me, thats a true redesign.

Well, if you break down the Bulldozer changes into a list, would it really look much different?

- Widened decode width to 4
- Added scheduling support for two threads
- Added PRF
- Decreased ALUs and AGUs to two dedicated ones per thread
- Added AGU capability for simple ALU instructions
- Split FPU into two halves which can be assigned to different threads
- Added AVX support
- Added FMA4

What about these things makes it a unique new microarchitecture?
 

richierich1212

Platinum Member
Jul 5, 2002
2,741
360
126
I think it's reasonable to believe that the final retail BD will be in the same ballpark. AMD's obviously working to fix and improve it, but it's not like the final retail product will perform an order of magnitude better...

A big part of the performance improvements will come from the much higher clock speed on the retail Bulldozer chips (>3.5 GHz vs. 2.8 GHz), but clock for clock I'm not expecting more than a ~10&#37; improvement from this ES.

Ok so SuperPi isn't AMD's strongest benchmark, but you're saying you believe that BD will be even slower than old Athlon X2 processors? LOL.

It's crippled for a reason, because people just like the guy from ChipHell who somehow gets their hands on an engineering sample and doesn't follow NDA and wants to benchmark it. Do you think AMD wants that to happen?
 

app

Junior Member
Jun 9, 2011
1
0
0
It's said that the es sample is from foxconn, and was sold online, so I believe the owner is free to post the info on forums, there really is no NDA for him to follow.
 

996GT2

Diamond Member
Jun 23, 2005
5,212
0
76
Ok so SuperPi isn't AMD's strongest benchmark, but you're saying you believe that BD will be even slower than old Athlon X2 processors? LOL.

It's crippled for a reason, because people just like the guy from ChipHell who somehow gets their hands on an engineering sample and doesn't follow NDA and wants to benchmark it. Do you think AMD wants that to happen?

Slower than an Athlon X2? Where do you get that from? Not sure if srs...

Looking at the Cinebench results (a multithreaded benchmark), the ES is about 10&#37; slower than a Core i7 860. With some tweaks and fixes I believe they can probably get it to match the i7-860 clock for clock in multithreaded applications, which wouldn't be too bad. Then add in the speed bump from 2.8 to >3.5 GHz and the retail Bulldozer chips should hopefully be competitive with Sandy Bridge, at least in multithreaded apps...

Single-threaded performance is a totally different story though, and not in favor of AMD unfortunately.
 

nonameo

Diamond Member
Mar 13, 2006
5,949
3
76
It's said that the es sample is from foxconn, and was sold online, so I believe the owner is free to post the info on forums, there really is no NDA for him to follow.

Except that ES processors are AMD property, and if AMD didn't give it to him then it could be considered stolen?
 

hamunaptra

Senior member
May 24, 2005
929
0
71
Well, if you break down the Bulldozer changes into a list, would it really look much different?

- Widened decode width to 4
- Added scheduling support for two threads
- Added PRF
- Decreased ALUs and AGUs to two dedicated ones per thread
- Added AGU capability for simple ALU instructions
- Split FPU into two halves which can be assigned to different threads
- Added AVX support
- Added FMA4

What about these things makes it a unique new microarchitecture?

Add to that -

Improved prefetch
Drastically improved branch predictors
Drastic revamp of entire cache hierarchy
Many sections of the pipelines decoupled from the others
The fact there is sharing of very specific parts of uarch and other parts not. Stars shares...what the L3?

On paper the core and fundamentals BD uarch looks NOTHING like stars.
This cant be said of SB vs Nehalem.
Reading real world tech articles, when looking at BD vs stars - um...the graphs are very different. When reading about SB vs nehalem they look very similar...
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |