Intel Broadwell Thread

Sweepr

Diamond Member
May 12, 2006
5,148
1,142
131
Deserves its own thread.








CPU

All told, Intel is shooting for a better than 5% IPC improvement over Haswell. This is similar to Ivy Bridge (4%-6%), though at this stage in the game Intel is not talking about expected clockspeeds or the resulting overall performance improvement. Intel has made it clear that they don’t regress on clockspeeds, but beyond that we’ll have to wait for further product details later this year to see how clockspeeds will compare.

To accomplish this IPC increase Intel will be relying on a number of architectural tweaks in Broadwell. Chief among these are bigger schedulers and buffers in order to better feed the CPU cores themselves. Broadwell’s out-of-order scheduling window for example is being increased to allow for more instructions to be reordered, thereby improving IPC. Meanwhile the L2 translation lookaside buffer (TLB) is being increased from 1K to 1.5K entries to reduce address translation misses.



GPU

In Haswell-Y Intel used a GT2 configuration, which was composed of a single slice that in turn contained 2 sub-slices. In Intel’s GPU architecture the sub-slice is the smallest functional building block of the GPU, containing the EUs (shaders) along with caches and texture/data/media samplers. Each EU in turn was composed of 2 4-wide vector SIMDs, with 10 EUs per sub-slice.

For Broadwell Intel is not changing the fundamental GPU architecture, but they are rebalancing the number of EUs per sub-slice and increasing the number of sub-slices overall. As compared to Haswell, Broadwell’s sub-slices will contain 8 EUs per sub-slice, with a complete slice now containing 3 sub-slices. Taken altogether this means that whereas Haswell-Y was a 2x10EU GPU, Broadwell-Y will be a 3x8EU GPU.

The ramifications of this is that not only is the total number of EUs increased by 20% from 20 to 24, but Intel has greatly increased the ratio of L1 cache and samplers relative to EUs. There is now 25% more sampling throughput per EU, with a total increase in sampler throughput (at identical clockspeeds) of 50%. By PC GPU standards increases in the ratio of samplers to EUs is very rare, with most designs decreasing that ratio over the years. The fact that Intel is increasing this ratio is a strong sign that Haswell’s balance may have been suboptimal for modern workloads, lacking enough sampler throughput to keep up with its shaders.

What Intel is doing however is reiterating the benefits of their 14nm process in this case, noting that because 14nm significantly reduces GPU power consumption it will allow for more thermal headroom, which should further improve both burst and sustained GPU performance in TDP-limited scenarios relative to Haswell.



Core M (Broadwell-Y)

But even that’s still not enough, and for Core M Intel went so far as to give Broadwell-Y its own die and design a low-power optimized version of their 14nm process just for it. This variant is designed to further reduce power consumption by optimizing the resulting transistors for lower power, lower voltage, lower clockspeed operation. By doing this Intel was able to further reduce power consumption in all of the major areas over what would be a traditional 14nm Intel process.

In the resulting 14nm Broadwell-Y process, voltage, capacitance, and leakage were all improved over the traditional 14nm process. Capacitance was reduced by 15%, minimum operating voltages by 10%, and leakage was improved to the point that it reduced SoC power consumption by 10%. The combination of all of these factors serves to significantly lower power consumption over what Intel’s traditional 14nm process would provide.

Working in concert with Broadwell-Y’s 14nm process, Intel has made a number of other changes to further reduce power consumption. Some of these are reflected in Broadwell’s overall design and others are unique to Broadwell-Y, such as using a lower TjMax for Broadwell-Y. By reducing the maximum operating temperature of the chip Intel has been able to realize further power savings by further reducing voltage and decreasing the maximum amount of temperature-dependent leakage the SoC will experience.

www.anandtech.com/show/8355/intel-broadwell-architecture-preview
 
Last edited:

Phynaz

Lifer
Mar 13, 2006
10,140
819
126
But even that’s still not enough, and for Core M Intel went so far as to give Broadwell-Y its own die and design a low-power optimized version of their 14nm process just for it.

Wow.
 

NTMBK

Lifer
Nov 14, 2011
10,320
5,347
136
Looking forward to seeing this in devices. My Haswell-Y 2-in-1 is a pretty sweet device, but eliminating the fans and dropping the thickness would be the real cherry on top.
 

Hulk

Diamond Member
Oct 9, 1999
4,464
2,387
136
Don't get me wrong, I'm happy for any Broadwell information, but we knew about 30% power reduction many months ago and we also know from experience 5% IPC improvement is about what to expect. There are some other good tidbits but I'm personally looking for some sku specs.
 
Mar 10, 2006
11,715
2,012
126
Looking forward to seeing this in devices. My Haswell-Y 2-in-1 is a pretty sweet device, but eliminating the fans and dropping the thickness would be the real cherry on top.

Agreed. I just hope to goodness that the 2-in-1 convertible designs come with passable keyboards...
 

witeken

Diamond Member
Dec 25, 2013
3,899
193
106
This 'Broadwell + 14nm = Core M' IDM cooperation is really working out great. Unfortunately Core M is only aiming at $600+.

The information about Gen8 confuses me. It was said to be a giant improvement, but Anand says there are no architectural improvement (in contradiction with the slide). Is it me or does it seem like they could have already done this (sub-)slice balancing in Gen7(.5)? So what performance and power improvements will the raw Gen8 architecture bring? I remember from the Iris Pro review that the amount of FPS/GFLOPS was weak compared to the competition, so if this balancing brings them on par with the competition, I guess we'll see an additional 20% or so improvement with the 20% EU increase.
 

witeken

Diamond Member
Dec 25, 2013
3,899
193
106
Don't get me wrong, I'm happy for any Broadwell information, but we knew about 30% power reduction many months ago and we also know from experience 5% IPC improvement is about what to expect. There are some other good tidbits but I'm personally looking for some sku specs.

Now we know more specific details and we know how they managed to do all those things, which I think is more interesting than plain benchmarks.

BTW, (some of) the SKUs' specs already leaked. 4.5W TDP, cTDP of 4W, 0.8-1.1GHz base speed to over 2GHz turbo.
 
Mar 10, 2006
11,715
2,012
126
This 'Broadwell + 14nm = Core M' IDM cooperation is really working out great. Unfortunately Core M is only aiming at $600+.

The information about Gen8 confuses me. It was said to be a giant improvement, but Anand says there are no architectural improvement (in contradiction with the slide). Is it me or does it seem like they could have already done this (sub-)slice balancing in Gen7(.5)? So what performance and power improvements will the raw Gen8 architecture bring? I remember from the Iris Pro review that the amount of FPS/GFLOPS was weak compared to the competition, so if this balancing brings them on par with the competition, I guess we'll see an additional 20% or so improvement with the 20% EU increase.

It seems that Intel has fixed some of the bottlenecks that plagued the Gen 7/7.5 architecture in graphics/gaming workloads.
 

Hulk

Diamond Member
Oct 9, 1999
4,464
2,387
136
Now we know more specific details and we know how they managed to do all those things, which I think is more interesting than plain benchmarks.

BTW, (some of) the SKUs' specs already leaked. 4.5W TDP, cTDP of 4W, 0.8-1.1GHz base speed to over 2GHz turbo.


Not trying to be "cheeky" but they did it the same way they always do it. Process shrink and some minor architectural improvements.
 

AtenRa

Lifer
Feb 2, 2009
14,003
3,361
136
Two things,

The 14nm they currently using is the LP, HP will be ready only in 2015 products.

And secondly, It has just began. Broadwell-Y iGPU is bigger than its CPU cores. Intel has become a GPU company and that will continue to the rest of the mainstream and mobile chips from here on.

Lets see how that will affect pricing.
 

witeken

Diamond Member
Dec 25, 2013
3,899
193
106
Not trying to be "cheeky" but they did it the same way they always do it. Process shrink and some minor architectural improvements.
Have you actually listened to the webcast? Here's the summary of what they did in the previous generations and the webcast was about how they'll improve it with Broadwell:

 

Roland00Address

Platinum Member
Dec 17, 2008
2,196
260
126
So in sum the i5 haswell y processor that turbos to 2 GHz should had similar CPU performance of the i5 3317u at 1.7 GHz to 2.6 GHz, assuming it can sustain it turbo for an extended period of time. The i5 3317u was the CPU in the 1st surface and we can now get that performance in iPad form factors.
 
Last edited:

witeken

Diamond Member
Dec 25, 2013
3,899
193
106
I just started reading AnandTech's coverage and I see that, surprisingly, it isn't Anand, CPU editor, who wrote the 2 articles but Ryan smith, GPU editor. It seems Anand is busy reviewing smartwatches.
 

Sweepr

Diamond Member
May 12, 2006
5,148
1,142
131
For Broadwell Intel is not changing the fundamental GPU architecture, but they are rebalancing the number of EUs per sub-slice and increasing the number of sub-slices overall. As compared to Haswell, Broadwell’s sub-slices will contain 8 EUs per sub-slice, with a complete slice now containing 3 sub-slices. Taken altogether this means that whereas Haswell-Y was a 2x10EU GPU, Broadwell-Y will be a 3x8EU GPU

So it's 3x8 (24 EUs) for Broadwell-Y - presumably GT2 and 2x8 (16 EUs) for Cherry Trail / Braswell, what about Broadwell GT1? Not sure if such SKU exists, but still...
 

witeken

Diamond Member
Dec 25, 2013
3,899
193
106
So it's 3x8 (24 EUs) for Broadwell-Y - presumably GT2 and 2x8 (16 EUs) for Cherry Trail / Braswell, what about Broadwell GT1? Not sure if such SKU exists, but still...

1x8 for low-end CT.

GT3 will be 2x24
 

witeken

Diamond Member
Dec 25, 2013
3,899
193
106
I just started reading AnandTech's coverage and I see that, surprisingly, it isn't Anand, CPU editor, who wrote the 2 articles but Ryan smith, GPU editor. It seems Anand is busy reviewing smartwatches.

Okay, he will probably do the extended preview based on the new information at IDF.
 

know of fence

Senior member
May 28, 2009
555
2
71
Couldn't be any less excited about 3-5 W TDP fanless. Full stagnation of performance to fit "core" into ever smaller form factors. Intel is going to far with this Moore's Law thing, they shrink the devices and battery now as well, not just the silicon.
Also Broadwell volume ramp in 2015, and Bi-gate(?) is the new Tri-gate.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |