First Steamroller processor core exposure

Page 17 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Vesku

Diamond Member
Aug 25, 2005
3,743
28
86
I think AM3+ is pretty much dead. Steamroller looks to be FM2+ only. With the launch of those 220W TDP Vishera's I even have doubts we will see improved Pile Driver (2014 Server roadmap) on AM3+.
 

AtenRa

Lifer
Feb 2, 2009
14,003
3,361
136
I think AM3+ is pretty much dead. Steamroller looks to be FM2+ only. With the launch of those 220W TDP Vishera's I even have doubts we will see improved Pile Driver (2014 Server roadmap) on AM3+.

I do believe we will see PD2 on AM3+ (if no SR for Servers for 2014), maybe in Q4 2013.
 

ShadowVVL

Senior member
May 1, 2010
758
0
71
I was hoping for a FX-6400 SR as my next upgrade.It will be a shame if amd doesnt atleast give us a hexacore.
 

NostaSeronx

Diamond Member
Sep 18, 2011
3,689
1,223
136
Steamroller Mainstream/Consumer -> FM2+/FM3
Steamroller Server/Enthusiast -> G34+/GC34/GC36
Excavator Mainstream/Consumer -> H2015/(Socket HM1)
Excavator Server/Enthusiast -> GC34/GC36
 

NostaSeronx

Diamond Member
Sep 18, 2011
3,689
1,223
136
Do you have a source for that?
The usual rumor mills WCCFTech, DonanımHaber, Blogspot, etc. :sneaky:

Bulldozer/Piledriver Front-ends -> Vertical Multithreading(Fine-grain)
Steamroller/Excavator Front-ends -> Simultaneous multithreading(CMP-like)
Bulldozer/Piledriver FPU Front-end -> Vertical Multithreading(Coarse-grain)
Steamroller/Excavator FPU Front-ends -> Simultaneous multithreading(CMP-like)

Cores in all four architectures are all single-threaded and only produce one logical processor in task managers.

Bulldozer/Piledriver 32-nm PDSOI (VLIW)
Steamroller 28-nm Bulk and FDSOI(3xx+ mm² dies) (CI GCN)
Excavator 14-nm FinFET or FDSOI (PI GCN)
 

LegSWAT

Member
Jul 8, 2013
75
0
0
The usual rumor mills WCCFTech, DonanımHaber, Blogspot, etc. :sneaky:

Bulldozer/Piledriver Front-ends -> Vertical Multithreading(Fine-grain)
Steamroller/Excavator Front-ends -> Simultaneous multithreading(CMP-like)
Bulldozer/Piledriver FPU Front-end -> Vertical Multithreading(Coarse-grain)
Steamroller/Excavator FPU Front-ends -> Simultaneous multithreading(CMP-like)
You, or those, must be equipped with inconceivably powerful precognitive capabilities.:hmm:
Just a few more questions before sb. from WTFtech, Dona, etc. tells my fortune:
1) Are you/they sure about those threads per module for SR & ExC?
2) Which architecture does that mysterious die-shot belong to?
3) Who is to say the socket/platform situation is not going to change dramatically with DDR4? Wouldn't it be rational for the company to scrap some of those server platforms altogether and unify them for DDR4 and APU use?
4) Will AMD still be among us by the time Excavator is scheduled to come out?
5) Will the secretly planned alien invasion have begun by that point of time, and if so, will they buy excavator APUs? And what's their marketshare?

Lots of questions, but I' just feel so lucky to finally meet a level 87 psychic medium! Otherwise sorry for OT blather, but this level of supreme uber-clairvoyance is just a tad too much for me!
 

podspi

Golden Member
Jan 11, 2011
1,982
102
106
5) Will the secretly planned alien invasion have begun by that point of time, and if so, will they buy excavator APUs? And what's their marketshare?

The date already passed. :biggrin:

You, or those, must be equipped with inconceivably powerful precognitive capabilities.:hmm:
Just a few more questions before sb. from WTFtech, Dona, etc. tells my fortune:
1) Are you/they sure about those threads per module for SR & ExC?
2) Which architecture does that mysterious die-shot belong to?
3) Who is to say the socket/platform situation is not going to change dramatically with DDR4? Wouldn't it be rational for the company to scrap some of those server platforms altogether and unify them for DDR4 and APU use?
4) Will AMD still be among us by the time Excavator is scheduled to come out?
...

I don't know how legit NostaSeronx's predictions/sources are, but I do want to put in my 2c.

1) I would be very surprised if AMD adopted SMT for integer workloads. They seemed pretty certain that SMT was not a great idea before.

2) This one I have no idea.

3) I think it makes a lot of sense for server/enthusiast platforms to combine, given AMD's huge marketshare in both of these markets. Kind of a bummer, though I am happy I never ended up wasting money on an AM3+ setup (though I did buy AM3 in preparation for BD when it was supposed to be compatible, so hmmm ).

4) Yes. Though lots of people will say no, AMD is not going anywhere. That being said, I wouldn't guarantee the owners will be the same people . RR is doing better than I expected, here is hoping they can right the ship in time to save us current shareholders. But the engineering teams and IP still have a lot of value, I can't see it being wound down.
 

NostaSeronx

Diamond Member
Sep 18, 2011
3,689
1,223
136
@LegSWAT
1. Steamroller still retains two threads per module. It is that both cores get instructions per cycle rather one core gets instructions per cycle. The cores are still single threaded and do not use SMT.
2. Steamroller, the die shot was released slightly before Steamroller's design methodology was explained.
3. The socket/platform will change for DDR4.
4. XV is planned for 2016, so I don't know.
5. That is dependent if you include Jaguar in the x86, Intel vs AMD marketshare.

Bulldozer/Piledriver:
Front-end(1 thread)
2 cores(2 threads)
FPU F-E(1 thread)
2 execution units(2 threads)

Steamroller/Excavator:
Front-end(2 threads)
2 cores(2 threads)
FPU F-E(2 threads)
2 execution units(2 threads)
 
Last edited:

Exophase

Diamond Member
Apr 19, 2012
4,439
9
81
Steamroller/Excavator:
Front-end(2 threads)
2 cores(2 threads)
FPU F-E(2 threads)
2 execution units(2 threads)

So every part of the bulldozer module executes two threads but doesn't employ SMT. In other words it's two completely independent cores, except for L2 cache.

Wait, that doesn't sound right.

More likely to believe what AMD has said all along for Steamroller - separate decoders now but the rest of the front end (branch prediction, TLBs, L1 cache, ifetch) is still shared and the SIMD core is still shared.
 

NostaSeronx

Diamond Member
Sep 18, 2011
3,689
1,223
136
yadda yadda yadda
Obviously, you didn't read what I posted.
~~~~~~~~~~~~~
--Some Extra Stuff--

Ancestry Table:
http://i.imgur.com/3cdqWYa.png
Left Piledriver vs Right Steamroller

Branch Predictor:
http://i.imgur.com/KIqPzXD.png
Bottom Piledriver vs Top Steamroller

Frontend:
http://farm4.staticflickr.com/3704/9243291686_9f14d92737_o.png
Bottom Piledriver vs Top Steamroller

FPU Frontend:
http://farm8.staticflickr.com/7441/9243291388_a5d77bf365_o.png
Bottom Piledriver vs Top Steamroller.

L1 DTLB?:
http://i.imgur.com/tV6IRq7.png
Left Piledriver vs Right Steamroller

Immediate Value Storage:
http://i.imgur.com/Wetk6Fm.png
Left Piledriver vs Right Steamroller

Instruction Payload Storage:
http://i.imgur.com/8q6kHMs.png
Left Piledriver vs Right Steamroller

L1 Data:
http://i.imgur.com/SuZVyZt.png
Left Piledriver vs Right Steamroller

MMX Unit:
http://i.imgur.com/GXE5wP8.png
Bottom Piledriver vs Top Steamroller

Physical Register Files for the cores:
http://i.imgur.com/zqES8NA.png
Left Piledriver vs Right Steamroller

Register Rename:
http://i.imgur.com/04GbH8y.png
Left Piledriver vs Right Steamroller

Retire Queue:
http://i.imgur.com/R8KJVLO.png
Left Piledriver vs Right Steamroller

Wake Pick:
http://i.imgur.com/05QwQZE.png
Top Piledriver vs Bottom Steamroller
 
Last edited:

LegSWAT

Member
Jul 8, 2013
75
0
0
So every part of the bulldozer module executes two threads but doesn't employ SMT. In other words it's two completely independent cores, except for L2 cache.

Wait, that doesn't sound right.
The schematics posted on the previous page would about answer that.
http://cdn3.wccftech.com/wp-content/uploads/2013/07/AMD-Steamroller-vs-Bulldozer.jpg

Which convention?

I assume it's not the convention of WCCFTech, DonanımHaber, Blogspot, and etc.
 

NostaSeronx

Diamond Member
Sep 18, 2011
3,689
1,223
136
Which convention?
Tuesday, June 4: 10:30am - 12:00pm

2.1 Physical Design Methodologies on AMD’s CPU Cores


This presentation will cover the physical design methodologies and CAD flows used on AMD’s low-power (Jaguar) and high-performance (Steamroller) CPU cores. Both CPU cores used a wide variety of construction techniques including full-custom macros, custom-placed blocks and synthesis/P&R. Each of these methodologies along with corresponding tool flows will be discussed. We will also give an overview of the tools and methodologies used for static timing, IR, electro-migration, power consumption analysis, leakage recovery and clock/voltage domain crossing. The presentation will highlight some of the unique tool flows developed around in-house and industry-standard CAD tools.
^-- assuming it is this one. There are a couple more like this discussing both Steamroller and Jaguar.
Branch Prediction Thread domain = SMT
Instruction Fetch Thread domain = SMT
Decode Thread domain = SMT
Dispatch Thread domain = SMT
Floating Point Frontend = SMT
Integer Core 1 = Single Thread
Integer Core 2 = Single Thread
Floating Point Execution Unit 1 = Single Thread
Floating Point Execution Unit 2 = Single Thread
Floating Point Backends = Single Thread
Shared L2 = SMT
 
Last edited:

wlee15

Senior member
Jan 7, 2009
313
31
91
The schematics posted on the previous page would about answer that.
http://cdn3.wccftech.com/wp-content/uploads/2013/07/AMD-Steamroller-vs-Bulldozer.jpg



I assume it's not the convention of WCCFTech, DonanımHaber, Blogspot, and etc.

That's only an interpretation based on publicly known information based on mainly last years Hot Chips Steamroller presentation. I would note that the diagram looks suspiciously like the diagrams that Hiroshige Goto makes on pc.watch.co.jp.
 

NostaSeronx

Diamond Member
Sep 18, 2011
3,689
1,223
136
care to share then?
Not really. I would prefer you wait and see rather than completely agree with me.

May 2011: http://rc.partners.org/sites/default/files/Partners_AMD_HP Refresher May 2011.pdf
October 2011: http://www.eteknix.com/amd-to-turn-to-tsmc-to-save-bulldozer/
August 2012: http://www.hotchips.org/wp-content/...10-Surround-Computing-Era-Papermaster-AMD.pdf
October 2012: http://www.phoronix.com/scan.php?page=news_item&px=MTIwNDY

May 2011
•PCIe integrated into CPU for direct link to GPU
•GPU access to x86 memory
•Enhanced server-class RAS
•OpenCL driver enhancements
•Greater unification:

  • •DMA memory and cache access w/ accelerators
•Server Fusion Architecture: Integration of CPU, GPU and communications with shared memory model
•Dynamic load balancing
Berlin APU for now.

October 2011:
Transition to TSMC
The issue with this rumor was that it for for 28-nm GloFo not 32-nm GloFo.

August 2012:
Design to tune up integer execution
bandwidth:
In concert with feeding the core faster
More register resources, same latency
More intelligent scheduling

Design to decrease average load latency:
Minimum latency is only part of story
Faster handling of data cache misses
Accelerate store-to-load forwarding

No compromises two thread performance
Increase instruction cache size(I-Cache Misses Reduced by 30%)
Enhance instruction pre-fetch
Mispredicted Branches Reduced by 20%
including those testing transaction processing => TSX

October 2012:
Fetching and decoding logic is different from previous fam15 processors. Fetching is done every two cycles rather than every cycle and two decode units are available. The decode units therefore decode four instructions in two cycles.

There is more but this should get you to find what you need to look for.
 

itsmydamnation

Platinum Member
Feb 6, 2011
2,863
3,417
136
Not really. I would prefer you wait and see rather than completely agree with me.

i always thought it looks too real to be fake so to speak. enough the same, enough different. But i still dont know what the die shot is. I was hoping if you had a PDF from a presentation or something with the die in it i could see for myself.

without that i am unwilling to assume it is SR at this stage, its to different from hotcips last year. that said given the general size of the core to the size of the L2 im guessing its a 28nm design not 20nm.
 

CHADBOGA

Platinum Member
Mar 31, 2009
2,135
832
136
i always thought it looks too real to be fake so to speak. enough the same, enough different. But i still dont know what the die shot is. I was hoping if you had a PDF from a presentation or something with the die in it i could see for myself.

without that i am unwilling to assume it is SR at this stage, its to different from hotcips last year. that said given the general size of the core to the size of the L2 im guessing its a 28nm design not 20nm.

Hasn't AMD in the last few years admitted to putting out a photoshopped die shot to throw off Intel?

This could be the same thing again.
 

itsmydamnation

Platinum Member
Feb 6, 2011
2,863
3,417
136
Hasn't AMD in the last few years admitted to putting out a photoshopped die shot to throw off Intel?

This could be the same thing again.

not quite the same thing, my understanding is in all die shot AMD have released they have photoshop "sensitive" structures. That nothing like photoshopping an entire die.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |