Anandtech:Intel's Skylake-SP Xeon VS AMD's EPYC 7000

Page 5 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Mopetar

Diamond Member
Jan 31, 2011
8,011
6,459
136
So that AVX2 FP score means that intel's approach of building monster avx units doesn't work and its better to have more smaller units?

It works fine (remember than Intel has 4 fewer cores as well) from a design approach, but for what Intel is charging it doesn't make a lot of financial sense. I suppose if you're building a 4P or 8P box, then Intel wins by default, but for a 2P box, AMD offers similar performance for ~$12,000 less.
 

moinmoin

Diamond Member
Jun 1, 2017
4,994
7,765
136
So that AVX2 FP score means that intel's approach of building monster avx units doesn't work and its better to have more smaller units?
For sub AVX512 definitely. And considering the other issues Intel's approach introduced (downclocking cores whenever AVX is used, still heating up chips way beyond their TDP as the i9 chips have shown) the worth versus more power efficient dedicated GPUs is imo not there.
 

Nothingness

Platinum Member
Jul 3, 2013
2,765
1,421
136
EPYC 7601 ,TDP 180W..$4200
Xeon 8180 ,TDP 205W..$10009


SPEC CPU2006



CINT2006 Rates

AMD EPYC 7601(AVX2) 2360 HTML CSV PDF PS Text Config
intel Xeon 8180(AVX512) 2930 HTML CSV PDF PS Text Config


------------------------------
CFP2006 Rates

AMD EPYC 7601(AVX2) 1840 HTML CSV PDF PS Text Config
intel Xeon 8180(AVX2) 1890 HTML CSV PDF PS Text Config
I guess forcing AVX2 for CFP2006 means that using AVX-512 resulted in lower scores.

This can mean several things:
  1. poor compiler; I kind of doubt it as icc has supported AVX-512 for years
  2. SPEC benchmarks don't have enough loops that benefit from using 512-bit vectors, but enough that the reduced clock has to be enabled, resulting in lower scores
  3. AVX-512 simply stinks on Skylake-SP.
I bet on 2.
 
Reactions: Arachnotronic

Nothingness

Platinum Member
Jul 3, 2013
2,765
1,421
136
This article. They've run SPEC2006 suite on the R7 1800X, i7 6900K, i7 7700K and A12-9800 using MSVC and ICC2017 compilers with SSE, AVX and AVX2 flags. AVX2 results were only marginally better.
I don't feel like paying to read it. Does AVX get a good speedup? Most 256-bit FP instructions came with AVX. AVX2 would only gain something if FMA can be used.
 

Malogeek

Golden Member
Mar 5, 2017
1,390
778
136
yaktribe.org
I still think that anything less than 15 gig database does not show much. How big is the Anandtech forum database ? Its still rather small compared to what I am used to I bet.
AT is a fairly large forum and as a XenForo forum owner I wouldn't be surprised if it's over 10Gb. Xenforo has a lot of caching entries and indexing.

Throw it in a VM with a lot of memory allocated, do some appropriate InnoDB tweaking and see how some intensive queries perform.
 

tamz_msc

Diamond Member
Jan 5, 2017
3,865
3,729
136
I don't feel like paying to read it. Does AVX get a good speedup? Most 256-bit FP instructions came with AVX. AVX2 would only gain something if FMA can be used.
I've read that. Extremely minor difference between SSE3 and AVX, somewhat better with AVX2(5-8%). I guess people need to start testing with SPEC2017. Ryzen seems to have strong non-AVX FP performance, the 1800X is 12% faster than the 6900K in single threaded SPECfp2006.
 

StefanR5R

Elite Member
Dec 10, 2016
5,690
8,263
136
So that AVX2 FP score means that intel's approach of building monster avx units doesn't work and its better to have more smaller units?
From what I understood:
  • The two AVX512 FMACs per core can only be used with AVX512 aware code. Which in turn gives a performance uplift over AVX256 or AVX128 only if the data are organized suitably, and due to scaling laws the uplift is less than 100 % in theory, and far less than 100 % in practice, even with perfectly suitable data.
  • Without special AVX512 code, one of the two AVX512 FMACs of Skylake-SP will be unused, and the other will be repurposed as two AVX256 FMACs.
  • In comparison, Zen/ Zeppelin/ EPYC has got two AVX128 FMACs per core, and it splits AVX256 operations such that these two FMACs perform on half of the vector data each, in parallel.
  • This sounds at first as if Zen had only 1/2 of the theoretical AVX256 throughput as Skylake, but AFAIU it actually has more than that due to differences of the capabilities of the AVX pipelines in Zen compared to Skylake.
  • However, there are many floating point workloads out there which cannot be vectorized easily, and lots of legacy code which technically could be optimized but won't be anytime soon, or ever. And Zen's FP units look to perform very well with such code, very different from AMD's previous core designs.
For sub AVX512 definitely. And considering the other issues Intel's approach introduced (downclocking cores whenever AVX is used, still heating up chips way beyond their TDP as the i9 chips have shown) the worth versus more power efficient dedicated GPUs is imo not there.
Re heating up: Power draw of i9 CPUs (on overclocker boards with overclocker BIOS and possibly user-configured power limits) is of no indication for power draw of Xeon CPUs. I expect that Skylake-SP behaves like previous Xeon generations and won't have a time-averaged power draw greater than TDP.

Re downclocking and energy use: If you have workloads suitable for AVX and code optimized for AVX, the throughput and the energy efficiency will exceed that of classic code despite lower clocks and despite higher energy use. Increased vector sizes will have diminishing returns on real workloads though, to varying degree.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |