AMD Carrizo Pre-release thread

Page 103 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

paffinity

Member
Jan 25, 2007
89
1
71
Apparently, the AVX2 support of Excavator isn't necessarily a good thing. At least on y-cruncher:

http://www.overclock.net/t/1560230/jagatreview-hands-on-amd-fx-8800p-carrizo/400_100#post_24310470

we can see AVX2 mode being only a hair faster than SSE3. Maybe if AVX2 apps were coded with Excavator in mind, it wouldn't be such a problem, but at that point, one may as well use xOP.

Excavator supports 256-bit AVX2 instructions, but it needs to split them up into 2 micro instructions. So any 256-bit AVX2 instruction will look worse or be a wash compared to 128-bit AVX/SSE.
 

Dresdenboy

Golden Member
Jul 28, 2003
1,730
554
136
citavia.blog.de
Apparently, the AVX2 support of Excavator isn't necessarily a good thing. At least on y-cruncher:

http://www.overclock.net/t/1560230/jagatreview-hands-on-amd-fx-8800p-carrizo/400_100#post_24310470

we can see AVX2 mode being only a hair faster than SSE3. Maybe if AVX2 apps were coded with Excavator in mind, it wouldn't be such a problem, but at that point, one may as well use xOP.
AVX2 as AVX is less flexible to decode and handle than XOP. The latter has the advantage of 3 operand encoding over SSE.
 

Soulkeeper

Diamond Member
Nov 23, 2001
6,714
143
106
Apparently, the AVX2 support of Excavator isn't necessarily a good thing. At least on y-cruncher:

http://www.overclock.net/t/1560230/jagatreview-hands-on-amd-fx-8800p-carrizo/400_100#post_24310470

we can see AVX2 mode being only a hair faster than SSE3. Maybe if AVX2 apps were coded with Excavator in mind, it wouldn't be such a problem, but at that point, one may as well use xOP.

Nice catch. I've come across posts by the stilt a lot he does good work.
Shame to see the avx/avx2 score poorly, but the xop speed makes up for it I guess.
On another note the linux software raid is said to utilize avx2 for large speedups.

If you're into coding and instruction latencies on different CPUs interest you http://www.agner.org/optimize/ agner has some nice code, his data has been used for the instruction choosing in gcc.
 
Last edited:

Dresdenboy

Golden Member
Jul 28, 2003
1,730
554
136
citavia.blog.de
Nice catch. I've come across posts by the stilt a lot he does good work.
Shame to see the avx/avx2 score poorly, but the xop speed makes up for it I guess.
On another note the linux software raid is said to utilize avx2 for large speedups.
A SW RAID with AVX2? Ok, maybe for faster mem copying and some data processing. But anything 128b wide should be enough. as this would already max out the mem subsystem at some xy GB/s, enough for a bunch of parallel hard drives or SSDs.
 

Soulkeeper

Diamond Member
Nov 23, 2001
6,714
143
106
A SW RAID with AVX2? Ok, maybe for faster mem copying and some data processing. But anything 128b wide should be enough. as this would already max out the mem subsystem at some xy GB/s, enough for a bunch of parallel hard drives or SSDs.

On boot it benchmarks the various supported types, sse, sse2, avx, avx2, etc. and chooses the fastest.
wiki says "required AVX2, AVX is not sufficient" just why I mentioned this.
https://git.kernel.org/cgit/linux/k.../?id=2c935842bdb46f5f557426feb4d2bdfdad1aa5f9
that's the avx2 commit dunno how it'd fare on carrizo if we had a desktop version to test.

Output from a recent intel:
[ 0.165172] raid6: sse2x1 5136 MB/s
[ 0.182199] raid6: sse2x2 6378 MB/s
[ 0.199230] raid6: sse2x4 7460 MB/s
[ 0.216267] raid6: avx2x1 9906 MB/s
[ 0.233299] raid6: avx2x2 11382 MB/s
[ 0.250332] raid6: avx2x4 13363 MB/s
[ 0.250333] raid6: using algorithm avx2x4 (13363 MB/s)
[ 0.250335] raid6: using avx2x2 recovery algorithm

Sorry for the distraction, Just a random detour of the thread I guess.
 
Last edited:

DrMrLordX

Lifer
Apr 27, 2000
22,021
11,595
136
Excavator supports 256-bit AVX2 instructions, but it needs to split them up into 2 micro instructions. So any 256-bit AVX2 instruction will look worse or be a wash compared to 128-bit AVX/SSE.

Right. I've heard of that before. Theoretically, I think someone coding with an AVX/AVX2 target could compensate for the Construction Core requirement that 256-bit SIMD instructions be split without specifically resorting to xOP, but I'm not experienced enough to know more about that. Something coded specifically with AVX/AVX2-capable Intel processors in mind will not offer such compensation, forcing XV to split the instructions in hardware.

Nice catch. I've come across posts by the stilt a lot he does good work.

Yes, he does. He was rather gracious in providing data on those few FP benchmarks that I requested of him (at static clockspeeds no less). XV actually shows quite a bit of improvement over SR in anything that (apparently) relies upon some of the more-recent x86 ISA extensions.

If you're into coding and instruction latencies on different CPUs interest you http://www.agner.org/optimize/ agner has some nice code, his data has been used for the instruction choosing in gcc.

I have heard of Agner's. I'm too much of a neophyte to make effective use of that right now.
 
Last edited:

cbn

Lifer
Mar 27, 2009
12,968
221
106
Some clockspeed numbers (from Stilt) in Prime95 Large FFT using the FX-8800P:

http://www.overclock.net/t/1560230/jagatreview-hands-on-amd-fx-8800p-carrizo/350#post_24278342

20W cTDP= 1990 Mhz
42W cTDP= 3050 MHz

Extrapoltaing from the chart below, two modules @ 10W cTDP would have come out to be ~1540 Mhz.



So XV has ~78% of the clockspeed at 5W module compared to 10W module.

Furthermore, XV is greater than 20% faster than Kaveri @ 5W module and slightly less than 10% faster than Kaveri @ 10W module. At 4W module, it looks like the XV clockspeed (probably around 1375 Mhz) increases to ~30% faster than Kaveri.

So the strength of the XV design (as we have read many times before) is at low power consumption. However, If that marketing graph holds up in real life the gains are most pronounced at 5W module and particularly 4W module.

So at 4W per module, I wonder how these XV micro cores compare to cat cores at the same power consumption?
 

cbn

Lifer
Mar 27, 2009
12,968
221
106
As an example of different cooling gear for different TDPs I thought these teardowns of the 13.3" and 15" Apple Mac Pro Retina (late 2013) models were interesting:

https://www.ifixit.com/Teardown/MacBook+Pro+13-Inch+Retina+Display+Late+2013+Teardown/18695



https://www.ifixit.com/Teardown/MacBook+Pro+15-Inch+Retina+Display+Late+2013+Teardown/18696



The first laptop is the 13.3" model with a 28W Haswell dual core.

The second laptop is the 15" model with a 47W Haswell quad core with Iris Pro graphics.

Copper heatsink size is about the same for both, but the 47W model uses two fans (each with blowing air into a very small aluminum finned area). This compared to the 28W model which only uses one fan, but notice the aluminum finned area is much larger.

So overall, I would say both 28W and 47W set-ups have the following amount of material:

Copper heatsink material: About the same for both the 47W and 28W
heatpipe: twice as much for the 47W
fans: twice as much for the 47W
aluminum finned material (this is the part next to the fan): About the ~same total weight for both, the 47W has less of it for each fan.

So it appears the amount of metal used for cooling isn't really that different between the two TDPS, but difference in ventilation is much in favor of the 47W.

In the contrast to the 28W Haswell 13" Mac Book Pro and 47W 15" Haswell 15" Mac Book Pro (both shown above) we have the 4.5W Broadwell 12" Mac Book Retina (shown below):

https://www.ifixit.com/Teardown/Retina+Macbook+2015+Teardown/39841



Since the chip is so low power it doesn't need a fan and notice the small size of the system board PCB (Core M located on the far left outlined in red, below):



And the small heatsink for the Core M processor:



 
Last edited:

Blitzvogel

Platinum Member
Oct 17, 2010
2,012
23
81
So what is a good balance of TDP between the graphics and CPU cores for general gaming? Seems like 800 MHz on the graphics is pointless if the CPU cores can't reach 1 GHz because of such a low cTDP in all of the available Carrizo notebooks I've seen available to the US.
 

Dresdenboy

Golden Member
Jul 28, 2003
1,730
554
136
citavia.blog.de
On boot it benchmarks the various supported types, sse, sse2, avx, avx2, etc. and chooses the fastest.
wiki says "required AVX2, AVX is not sufficient" just why I mentioned this.
https://git.kernel.org/cgit/linux/k.../?id=2c935842bdb46f5f557426feb4d2bdfdad1aa5f9
that's the avx2 commit dunno how it'd fare on carrizo if we had a desktop version to test.

Output from a recent intel:
[ 0.165172] raid6: sse2x1 5136 MB/s
[ 0.182199] raid6: sse2x2 6378 MB/s
[ 0.199230] raid6: sse2x4 7460 MB/s
[ 0.216267] raid6: avx2x1 9906 MB/s
[ 0.233299] raid6: avx2x2 11382 MB/s
[ 0.250332] raid6: avx2x4 13363 MB/s
[ 0.250333] raid6: using algorithm avx2x4 (13363 MB/s)
[ 0.250335] raid6: using avx2x2 recovery algorithm

Sorry for the distraction, Just a random detour of the thread I guess.
Just had a look at the code. They use a lot of AVX2 logical and add operations. With these instructions Skylake has 2x to 4x the throughput of Carrizo.
 

Blitzvogel

Platinum Member
Oct 17, 2010
2,012
23
81
it seems oems, consumers and even AMD don't care about carrizo. The crazies like me can't sustain it.

Carrizo would be a pretty good budget media chip if it was allowed to use higher TDP and paired with better screens and chasis. So far, I've only seen it in the Toshiba Radius and that one HP Envy model with freakin' 1366 x 768 monitors, HDDs, and locked to 15W TDP. It's being treated like a budget chip even though it seems to be giving i3s and i5s a run for their money.

Funny enough, I went to HPs website to build an AMD Envy 15z laptop, and the A10-8800p could not be paired with a full HD screen, only the 8700p. Also, the 8 GB option is only 1 DIMM, (6 GB is 2 + 4 GB Dimms) and there is no SSD or even hybrid drive option. I'm starting to think there is deliberate sabotage going on via contracts with Intel.
 
Last edited:

DrMrLordX

Lifer
Apr 27, 2000
22,021
11,595
136
I'm starting to think there is deliberate sabotage going on via contracts with Intel.

Occam's Razor. The interesting bit is that there are some more configuration options available to European buyers for these chips than there are for North American folks. It was the same way with mobile Kaveri, more-or-less.
 

ET

Senior member
Oct 12, 1999
521
33
91
Occam's Razor. The interesting bit is that there are some more configuration options available to European buyers for these chips than there are for North American folks. It was the same way with mobile Kaveri, more-or-less.

Occam's Razor is that people are stupid. That's usually the correct reason.
 

richaron

Golden Member
Mar 27, 2012
1,357
329
136
No doubt AMD need to pull a Chromebook Pixel if they're ever going to get their best reviewed and loved. Design a premium product in house, not for the profit, but to redesign public perception and challenge OEMs.

*waits for purile arguments from anti-AMD-superfriends*
 
Last edited:

DrMrLordX

Lifer
Apr 27, 2000
22,021
11,595
136
Occam's Razor is that people are stupid. That's usually the correct reason.

No, Occam's Razor states that:

"Among competing hypotheses, the one with the fewest assumptions should be selected."

We have no direct evidence that anyone involved in selecting Carrizo laptop configurations is fundamentally stupid (and plenty of evidence points to the contrary), so that conclusion requires (potentially erroneous) assumptions on our part. If you will observe the above complaints about Carrizo configurations, the logical conclusions are that OEMs are being cheap/lazy or that they're being "encouraged" to deliberately misconfigure Carrizo. The former argument holds less water since OEMs are raising BoM needlessly with dGPUs, and since the difference in 15w and 35w cooling configurations often amounts to little more than a few fans (read: not a significant increase in BoM).

In other words, it would have been easy for OEMs to permit 35w configurations on Carrizo chips by simply cutting the dGPUs and adding the required fans, which presumably would have reduced BoM overall and would have significantly improved end-user experience. The remaining margin could have gone to changing 1x8GB RAM configurations to 2x4GB configurations, again improving user experience.

Enough already !! us AMDhopeians should apply for a job at AMD for sure now

I don't think they're hiring, unless you're willing to work at maybe half the industry standard and happen to be among the best in your field.
 

ET

Senior member
Oct 12, 1999
521
33
91
No, Occam's Razor states that:

"Among competing hypotheses, the one with the fewest assumptions should be selected."

You look at Intel configurations, and they come by default with 1 8MB DIMM, with a 12GB dual channel option, and then you see a Carrizo configuration, and it's not that different. So instead of thinking "hey, they're just doing something similar to what they do with Intel" you think "Intel must be doing something behind the scenes to sabotage AMD" and then claim that you need fewer assumptions for that.

Perhaps "people are stupid" wasn't the right way to say it (it's just the underlying truth, and perhaps 'ignorant' is a better word). HP (as an example) is selling the configurations it thinks will sell best and earn it the most. Most people have no clue about dual channel, and have no clue what AMD is. From HP's point of view AMD is simply a way to provide reasonable power CPU's for a low price to non-discerning customers. Battery life is a bigger selling point than "CPU/GPU power", and 1 DIMM is cheaper than 2 DIMMs. The assumption that Intel gets in the way requires the assumption that it makes financial sense for HP to sell more powerful AMD laptops in the first place.
 
Last edited:

DrMrLordX

Lifer
Apr 27, 2000
22,021
11,595
136
You look at Intel configurations, and they come by default with 1 8MB DIMM, with a 12GB dual channel option, and then you see a Carrizo configuration, and it's not that different.

You speak as though DIMM configurations are the only problems inherent to these Carrizo laptops.
 

ET

Senior member
Oct 12, 1999
521
33
91
You speak as though DIMM configurations are the only problems inherent to these Carrizo laptops.

Well, mostly. The main other complaint I see is that they're only 15W and not 35W, but AMD has been pushing the chips as low power chips, that's what they mainly talk about in their presentations. So why would you need Intel to explain that OEM's are using this configuration? And again, battery life is a bigger selling point than computing power and the Intel configurations of these laptops also have 15W versions.

Really, I haven't seen anything that can't be explained by actual logic (i.e., OEM's making choices based on their understanding of the market), and though I did call this 'stupidity', that simply because what sells well (or OEM's think sells well) isn't necessarily what takes full advantage of the technology. (Although if AMD is to be believed, running the chips at low power is precisely what takes full advantage of the work it's done.)
 

erunion

Senior member
Jan 20, 2013
765
0
0
No, Occam's Razor states that:

"Among competing hypotheses, the one with the fewest assumptions should be selected."

We have no direct evidence that anyone involved in selecting Carrizo laptop configurations is fundamentally stupid (and plenty of evidence points to the contrary), so that conclusion requires (potentially erroneous) assumptions on our part. If you will observe the above complaints about Carrizo configurations, the logical conclusions are that OEMs are being cheap/lazy or that they're being "encouraged" to deliberately misconfigure Carrizo. The former argument holds less water since OEMs are raising BoM needlessly with dGPUs, and since the difference in 15w and 35w cooling configurations often amounts to little more than a few fans (read: not a significant increase in BoM).


I'm sorry but if you think a secret conspiracy between Intel and OEM is in line with Occam's razer, you're completely misapplying it.

If we accept Occam's Razer, that the simplier answer is the better answer, we can conclude that OEMs don't prioritize AMD laptops because consumers don't prioritize AMD laptops.

Your explanation introduced a third party, Intel, needlessly.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |