AMD Bulldozer Engineering Sample CPU Overclocked to 4.63GHz

Page 6 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Phynaz

Lifer
Mar 13, 2006
10,140
819
126
Even with 4 way SLI ?...

An then, if only a few cores are at work, it s just more amazing
since BD make use of a single module for 2T and 2 modules for 3 or 4T
while a 990X make use of a full core for each thread in such case...

No. Windows will not schedule two threads on the same module if it doesn't have to.
 

Abwx

Lifer
Apr 2, 2011
11,204
3,910
136
It makes me wonder... if the L2 cache really is disabled, yet it performs on par with an i7-990, that is very very good news for AMD. It makes me wonder if they did not disable it purposely, just to hide its awesomeness from intel.

Disabling the L2 cache would ruin any CPU s perfs..
That said, even erroneous, that s still a too smart guess compared to the answers it got from some clueless poster..
 

Abwx

Lifer
Apr 2, 2011
11,204
3,910
136
No. Windows will not schedule two threads on the same module if it doesn't have to.

If it hasnt to, it wont of course..!!

But if two threads are to be scheduled, they will be sent in a single
module , not to two modules..

Do you only know how threads are handled by BD, or is it just
a wild assumption ?...

She also hinted that scheduling two threads on the same module and leaving the other three modules idle, so they cold drop into a low-power state, might be the best path to power-efficient performance.

http://techreport.com/articles.x/19514
 
Last edited:

Phynaz

Lifer
Mar 13, 2006
10,140
819
126
But if two threads are to be scheduled, they will be sent in a single
module , not to two modules..

What part of 'no' didn't you understand? Windows will schedule threads across modules before it puts two threads on the same module.

On an eight core BD, each module will get one running thread until the thread count gets to five, then modules will get multiple threads.

Putting more than one thread on a module when it's not necessary will result in a performance decrease, so why would it get scheduled that way?

It's exactly the same as Hyperhtreading, Windows will schedule threads so only one thread lands on a core, until you are out of physical cores.
 
Last edited:

Arzachel

Senior member
Apr 7, 2011
903
76
91
No. Windows will not schedule two threads on the same module if it doesn't have to.

Didn't JFAMD state the exact opposite? That module usage will be will be kept as low as possible so they can be gated off, to get the most out of turbo?
 

Abwx

Lifer
Apr 2, 2011
11,204
3,910
136
What part of 'no' didn't you understand? Windows will schedule threads across modules before it puts two threads on the same module.

On an eight core BD, each module will get one running thread until the thread count gets to five, then modules will get multiple threads.

Putting more than one thread on a module when it's not necessary will result in a performance decrease, so why would it get scheduled that way?

It's exactly the same as Hyperhtreading, Windows will schedule threads so only one thread lands on a core, until you are out of physical cores.

It s not the same as hyperthreading , and that s why you misled yourself.

HT mandate that each core is first used for a single thread since
this provide the best efficency in IPC.
When the thread count is more than the physical core count,
then virtual cores are used as a last ressort, with little gain in perfs,
that s why it s scheduled the way you re talking about.

With BD, if two thread are to be scheduled, they are sent to a single
module , allowing to gate off the three inactive modules and using
the saved TDP to boost even more the active module frequency.

If 4 threads are to be executed, same thing, two modules are used
in lieu of four , allowing to shut off two complete module and increase
the two active one s frequencies.
 

OCGuy

Lifer
Jul 12, 2000
27,227
36
91
Since AMD Phenom II X6 1100T can beat Intel Core I5 2500K in almost all multithreaded applications at the same price point, i would say BD will be much faster at the same apps even than SB 2600K.

I'm sure you all would like a link to support what i say,

http://www.anandtech.com/show/4310/amd-phenom-ii-x4-980-black-edition-review/1











No mater how fast your IPC will be, more cores are better in multithreaded applications and since BD will have both higher IPC and more Cores (than Phenom II) we really have to wait and see the real silicon and what is capable off.


Those dont bode well for AMD actually. Getting smoked by a previous generations 4 and 6 cores with HT...with 6 core sandy and IB right around the corner...
 

notty22

Diamond Member
Jan 1, 2010
3,375
0
0
Why did AMD quit BAPCo board: Poor Bulldozer performance on SYSmark 2012 or ...




So is AMD being a “big baby” for quitting BAPCo, or has Intel’s purported influence over SYSmark just reached the breaking point for other semiconductor companies? Either way, there will now be even more scrutiny over which benchmarks reviewers use to measure new processors, like AMD’s Bulldozers, and what those results are. Obviously, AMD needs its Llano and Bulldozer chips to test as competitively as possible to blunt Intel’s sales advantage with its Sandy Bridge processors. But what “competitively” means and how it’s measured will now draw more attention than ever.
 

Arzachel

Senior member
Apr 7, 2011
903
76
91
Thanks, I looked it up, per my previous post.

Referring to:http://forums.anandtech.com/showthread.php?p=31151015#post31151015

JFAMD said:
Yes, people are starting to come around. Everyone was getting all wrapped around "how do I spread my threads out across modules so that I have one thread per module. Yes, you get a performance increase with that, but it is marginal. However, running threads on the same module would allow for a.) sharing of L2 cache for apps that are utilizing the same data set and b.) the other modules to be shut down, reducing power and increasing the ability to boost.

Lots of people don't get it. You have a maximum amount of power that the processor can consume. You may be better off concentrating the power on fewer modules to achieve higher clocks than try to spread threads out to get 100% of the L2 resources.
 

sm625

Diamond Member
May 6, 2011
8,172
137
106
What I want to know is will we be able to configure windows so that it schedules one thread per module, or two threads per module. I would like to be able to try both and see which works best. It is an interesting choice.

a) Two modules for two threads = more floating point resources per thread.
b) One module for two threads = more potential clock speed.

I still think option "a" would be faster.
 

podspi

Golden Member
Jan 11, 2011
1,982
102
106
What I want to know is will we be able to configure windows so that it schedules one thread per module, or two threads per module. I would like to be able to try both and see which works best. It is an interesting choice.

a) Two modules for two threads = more floating point resources per thread.
b) One module for two threads = more potential clock speed.

I still think option "a" would be faster.

Sounds like this is the case, but 'b' will have a higher perf/watt. Very useful for server workloads where that sort of thing is important. Most home users will prefer to spread threads out. Probably Windows will have some power options. Perhaps in SP2?
 

drizek

Golden Member
Jul 7, 2005
1,410
0
71
It all comes down to how much power your motherboard can deliver and how much cooling you have.
 

Topweasel

Diamond Member
Oct 19, 2000
5,436
1,655
136
What I want to know is will we be able to configure windows so that it schedules one thread per module, or two threads per module. I would like to be able to try both and see which works best. It is an interesting choice.

a) Two modules for two threads = more floating point resources per thread.
b) One module for two threads = more potential clock speed.

I still think option "a" would be faster.

Its doubtful you would see a difference at all. The shared FPU stuff doesn't happen unless a very specific type of code is processed. Chances are you wouldn't find a benchmarking system outside some newer server-load stuff that would use it.
 

AtenRa

Lifer
Feb 2, 2009
14,003
3,361
136
What I want to know is will we be able to configure windows so that it schedules one thread per module, or two threads per module. I would like to be able to try both and see which works best. It is an interesting choice.

a) Two modules for two threads = more floating point resources per thread.
b) One module for two threads = more potential clock speed.

I still think option "a" would be faster.

I had almost the same conversation with Dresdenboy, ill quote

Originally Posted by AtenRa

I don't remember where i read this but like Intel's HyperThreding, the second thread to be executed is going to the next free core and not to the HT core (when there is one available) does the same happens with BD modules ???

When we have one thread to be executed, then core 1 of Module 1 will be used, but when we have a second thread then Core 1 of the Module 2 will be used and so on.

So this 80% performance is only applicable when all cores are being used, for example if we have a dual Module BD and we have to execute 4 threads. ???



Originally Posted by Dresdenboy


This is a power efficiency problem and might be handled by a new scheduler in Windows 8 (I heard something like that). So far I'd say since AMD's cores are seen as physical cores, the OS currently wouldn't consider the module architecture. Power wise a further activated module (to run one additional thread) would instantly cause sth. like ~35% of max. power due to leakage and activity power levels (according ISSCC paper). This might be lower as long as CPU load is low and frequency+voltage could be lowered. But while this 35% (or 10-20% w/ lower voltage/freq) would mean a lot, the same thread running on the first module's second core might cause a lower power consumption increase. So Turbo might help a bit. But overall this is not easy to answer. It might depend on energy saving policies etc.
 

Ares1214

Senior member
Sep 12, 2010
268
0
0
I think that thats still ES, even if its B2 i dont think its production quite yet. A good water system could see 5.3 GHz maybe, maybe more.
 

skipsneeky2

Diamond Member
May 21, 2011
5,035
1
71
took me 8.97s to do 1 million on 2500k clocked to 4.3ghzs.

Don't think super pi is that efficient could use a update?
 

bridito

Senior member
Jun 2, 2011
350
0
0
I dunno. That last figure before PI value output -> pi data.txt still starts with a 1 and it's not in the position of a single but a double digit. If it's that OC'd and still scoring over 10s it seems to my relatively uneducated in the ways of superpi mind that this is still a CPU that's going to have to work its silicon off to match even a 2500K let alone an SB-E/IB. But then again, that's just my humble opinion.
 

Minerva

Platinum Member
Nov 18, 1999
2,115
11
81
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |