Intel Skylake / Kaby Lake

Page 436 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

RichUK

Lifer
Feb 14, 2005
10,341
678
126
Lifted from overclockers.co.uk forum.

Thai Prices: https://gameolo.com/product-category/คอมประกอบ/cpu/cpu-intel?orderby=price-desc

INTEL i9-7920X - 12c/24t @ 2.90GHz (No turbo listed) - 160W.
L2 CACHE -12 x 256KB
L3 CACHE - 16.5MB
£1300 + VAT / $1671

INTEL i9-7900X - 10c/20t @ 3.30/4.50GHz - 160W
L2 CACHE - 10 x 256KB
L3 CACHE - 13.75MB
£855 + VAT / $1099

INTEL i9-7820X - 8c/16t @ 3.60/4.20GHz - 140w
L2 CACHE - 8 x 256KB
L3 CACHE - 11MB
£515 + VAT / $659

INTEL i9-7800X - 6c/12t @ 3.50/4.00GHz - 140W
L2 CACHE - 6 x 256KB
L3 CACHE - 8.25MB
£345 + VAT / $439
 
Reactions: Drazick

inf64

Diamond Member
Mar 11, 2011
3,864
4,546
136
So at what base (MT) clock will 18C/36T part run? Can we assume 2.5Ghz? If AMD targets 1800X clocks for their top of the line Threadripper equivalent, the 18C/36T part running at 2.5Ghz base will have a tough time competing in non-AVX256 bit optimized MTed workloads, which is basically a 90+% of what we have in use today. The saving grace could be the advanced Turbo intel plans to use on it though, we will have to wait and see how that one works in practice.
 
Reactions: Drazick

blue11

Member
May 11, 2017
151
77
51
So sorry but Xeon Platinum tops out at 28 cores.
28 Skylake cores are worth far more than 32 Zen cores, especially given how the latter is actually a 4-way NUMA system, with independent memory controllers on each node.

So at what base (MT) clock will 18C/36T part run? Can we assume 2.5Ghz? If AMD targets 1800X clocks for their top of the line Threadripper equivalent, the 18C/36T part running at 2.5Ghz base will have a tough time competing in non-AVX256 bit optimized MTed workloads, which is basically a 90+% of what we have in use today. The saving grace could be the advanced Turbo intel plans to use on it though, we will have to wait and see how that one works in practice.
AMD TDPs are imaginary anyway. Once you OC to 3.5 GHz, the power draw in the same workloads will probably be the same. In AVX workloads, obviously Intel will draw more power, but it will also do twice (or maybe 4x) the work.

And more importantly above 12.000 usd. We are way past what is relevant. But who knows. Perhaps a new supply of high price hedt cpu will move demand for 2k cpu.
That is not only a rumor, but also incomplete information as the Platinum 8180 replaces the E7 product line. The E5-2699Av4 has a MSRP of only $4500, so there will likely be a Xeon SKU in the $4000-5000 range with 24+ cores, perhaps up to 26 cores.

Lifted from overclockers.co.uk forum.

Thai Prices: https://gameolo.com/product-category/คอมประกอบ/cpu/cpu-intel?orderby=price-desc

INTEL i9-7920X - 12c/24t @ 2.90GHz (No turbo listed) - 160W.
L2 CACHE -12 x 256KB
L3 CACHE - 16.5MB
£1300 + VAT / $1671

INTEL i9-7900X - 10c/20t @ 3.30/4.50GHz - 160W
L2 CACHE - 10 x 256KB
L3 CACHE - 13.75MB
£855 + VAT / $1099

INTEL i9-7820X - 8c/16t @ 3.60/4.20GHz - 140w
L2 CACHE - 8 x 256KB
L3 CACHE - 11MB
£515 + VAT / $659

INTEL i9-7800X - 6c/12t @ 3.50/4.00GHz - 140W
L2 CACHE - 6 x 256KB
L3 CACHE - 8.25MB
£345 + VAT / $439
These prices are disappointing but not unexpected. I said as much in other threads, but those expecting Zen to cause an Intel HEDT price crash are going to be disappointed. Since Intel does not actually sell any HEDT CPUs, they have no incentive to price competitively. It appears their response to AMD "ThreadRipper" is just to put out another SKU at the top end (at a price point nobody wants), so they can claim to have more cores for marketing/PR reasons.

I would rather move to LGA3647 for 6-channel memory controllers instead of buying 16+ cores on quad-channel. Even on Haswell, the quad-channel memory interface was limiting on the larger dies. Since the stock memory frequency is bumped to DDR4-2667, not even DDR4 OC at 3200 MHz can make up the bandwidth gap. I also find it doubtful that 18 cores could be run at 4 GHz within thermal constraints.

Clock speed will probably be in the upper 2 GHz range, given the increased TDP on Skylake-X (165 W) compared to the E5-2695v4 (120 W). An interesting reference is the Puget Systems all-core turbo chart, which shows the E5-2695v4 with an actual operating frequency of 2.6 GHz. I expect the i9-7980XE (purported) to reach the lower 3 GHz range in turbo frequency.

On the other hand, the much-rumored "competition" has apparently failed to affect pricing. I expect the 18C SKU to cost over $3000, given the $1700 price point of the 12C.
 
Last edited:
Reactions: Drazick

inf64

Diamond Member
Mar 11, 2011
3,864
4,546
136
On the other hand, the much-rumored "competition" has apparently failed to affect pricing. I expect the 18C SKU to cost over $3000, given the $1700 price point of the 12C.
If that is the price then nobody sane will buy it. They will buy either 10/12C parts from intel or Threadripper.
 
Reactions: Drazick

VirtualLarry

No Lifer
Aug 25, 2001
56,554
10,171
126
On the other hand, the much-rumored "competition" has apparently failed to affect pricing. I expect the 18C SKU to cost over $3000, given the $1700 price point of the 12C.
Maybe Intel will make bank with their larger-core-count HEDT CPUs. They should thank AMD for the introduction of Ryzen. I think that Ryzen has single-handedly caused enthusiasts (and even mainstreamers) to move to that platform, which is itself rather HEDT-like, being a direct competitor to BDW-E, Intel's HEDT platform. Meaning, I think AMD is growing the HEDT market with Ryzen, or at least getting open-minded Intel fans to think in terms of "more cores", more being 6-8, rather than the core-anemic i7-7700K.
 
Reactions: Drazick

tamz_msc

Diamond Member
Jan 5, 2017
3,865
3,729
136
28 Skylake cores is worth far more than 32 Zen cores, especially given how the latter is actually a 4-way NUMA system, with independent memory controllers on each node.
'Worth' is determined in part by cost.

Ryzen engineering motherboards had 2:1 DF:Memclock in debug mode, and it is not known how Snowy Owl and Naples would have it configured. Broadwell-E already showed latency issues with 22+ cores, and my hunch is that the revised cache configuration in Skylake-X is aimed primarily at alleviating that problem. What it'll do in case of lower core counts is anybody's guess.

I don't want to go nitpicking about the details regarding to what extent Naples is NUMA-like(the technical thread has tons of info about that), but historically AMD has been more comfortable with NUMA than Intel since the Opteron heydays.
 
Reactions: Drazick

Sweepr

Diamond Member
May 12, 2006
5,148
1,142
131
So at what base (MT) clock will 18C/36T part run? Can we assume 2.5Ghz?

I see you missed Xeon Gold specs. They have a 2.7 GHz base 18C part at 165W with full speed AVX-512 enabled. If my source is correct this monolithic 7980XE beast might be clocked even higher for desktops (with more limited AVX-512 support). And then we have the >4 GHz Turbo and possible gains from the new cache structure improving performance per clock.

If AMD targets 1800X clocks for their top of the line Threadripper equivalent, the 18C/36T part running at 2.5Ghz base will have a tough time competing in non-AVX256 bit optimized MTed workloads, which is basically a 90+% of what we have in use today. The saving grace could be the advanced Turbo intel plans to use on it though, we will have to wait and see how that one works in practice.

AMD just lost any hope of having the performance lead on desktops. But look on the bright side, Threadripper could do well against the cheaper 10C-12C Skylake-X variants.
 
Last edited:

scannall

Golden Member
Jan 1, 2012
1,960
1,678
136
Maybe Intel will make bank with their larger-core-count HEDT CPUs. They should thank AMD for the introduction of Ryzen. I think that Ryzen has single-handedly caused enthusiasts (and even mainstreamers) to move to that platform, which is itself rather HEDT-like, being a direct competitor to BDW-E, Intel's HEDT platform. Meaning, I think AMD is growing the HEDT market with Ryzen, or at least getting open-minded Intel fans to think in terms of "more cores", more being 6-8, rather than the core-anemic i7-7700K.
I don't think Ryzen is 'stealing' a lot of market share. There has been a lot of pent up demand for more cores for a long time now. By people unwilling to pay the Intel 'premium' for anything more than 4 cores. Those are sales Intel wouldn't have gotten anyway. And at this point, they don't have to bring down prices. Just fiddle with the SKU's will be enough to keep most Intel fans happy.
 
Reactions: tential

blue11

Member
May 11, 2017
151
77
51
'Worth' is determined in part by cost.

Ryzen engineering motherboards had 2:1 DF:Memclock in debug mode, and it is not known how Snowy Owl and Naples would have it configured. Broadwell-E already showed latency issues with 22+ cores, and my hunch is that the revised cache configuration in Skylake-X is aimed primarily at alleviating that problem. What it'll do in case of lower core counts is anybody's guess.
Naples is certainly not going to improve memory latency (which is already poor on 1-node Zen) by introducing NUMA and a 75% probability of cross-node memory traffic in "1S" installations. Perhaps it will also have a 4-way NUMA programming model like Intel Cluster-On-Die, with all the performance scaling problems you would imagine from a 4S cluster trying to compete with a 1S CPU.

I don't want to go nitpicking about the details regarding to what extent Naples is NUMA-like(the technical thread has tons of info about that), but historically AMD has been more comfortable with NUMA than Intel since the Opteron heydays.
This statement is one I have seen countless times on these forums now. It is also one that makes zero sense, as NUMA scaling problems are physical problems and have nothing to do with emotions or "comfort." We already have seen how well Zen scales on even a single node, with all the threads about CCX performance catastrophes, and Naples will have four times as many of those node boundaries.
 
Last edited:
Reactions: wildhorse2k

tamz_msc

Diamond Member
Jan 5, 2017
3,865
3,729
136
I see you missed Xeon Gold specs. They have a 2.7 GHZ base 18C part with full speed AVX-512 enabled. If my source is correct it might be even higher for desktops with more limited AVX-512 support.
Base clock speeds which are listed in the specifications have got nothing to do with AVX512. Full/half speed AVX512 doesn't affect the listed base and boost clocks because there are separate multipliers for AVX workloads since the v3 Xeons.
 
Reactions: Drazick

blue11

Member
May 11, 2017
151
77
51
I see you missed Xeon Gold specs. They have a 2.7 GHz base 18C part at 165W with full speed AVX-512 enabled. If my source is correct this monolithic 7980XE beast might be clocked even higher for desktops with more limited AVX-512 support. And then we have the >4 GHz Turbo and possible gains from the new cache structure improving performance per clock
We will have to wait and see if AVX-512 is a feature of the XCC die or merely an e-fuse configuration. The lineup of Skylake-SP shows full AVX-512 only being enabled on XCC SKUs (which starts at 16C), but that could easily be a segmentation strategy instead of a technical detail. Chances are still very high that AVX-512 will be half speed on HEDT, and even if it were full speed, we would just get endless threads of "enthusiasts" bitching about it messing with their overclocking.
 
Last edited:

Karnak

Senior member
Jan 5, 2017
399
767
136
AMD just lost any hope of having the performance lead on desktops. But look on the bright side, Threadripper could will do well against the cheaper 10C-12C all Skylake-X variants.
FTFY

Zen's multithreading is extremely strong and there is no doubt that the 16C/32T Threadripper will do very well against the >12C SKL-X SKUs.

And if we are talking about pricing... I know who will be the winner there. By a lot.

And not soldering SKL-X... Epic fail from Intel if true.
 
Last edited:

mikk

Diamond Member
May 15, 2012
4,244
2,299
136
Really amazing how Intel was able to keep this secret until (almost) Computex. Everyone I talked to just a few days ago reiterated the previous leak, even Intel in the presentation below.

8th Gen Intel Core 'Coffee Lake-S' + Z370 Chipset to Launch in August/September (4C and 6C 'K' Options)



65W and 95W as expected. Some people should wake up finally.


14C/16C/18C for SKL-X is a big surprise.

These prices are disappointing but not unexpected.


You should better wait for the official price list and for the sales start. Such pre-release listings are often much higher, especially when we talk about an unknown asian shop.
 
Reactions: Sweepr

tamz_msc

Diamond Member
Jan 5, 2017
3,865
3,729
136
Naples certainly isn't going to improve memory latency (which is already poor on 1-node Zen) by introducing NUMA and a 75% probability of cross-node memory traffic in "1S" installations. Perhaps it will also have a 4-way NUMA programming model like Intel Cluster-On-Die, with all the performance scaling problems you would imagine from a 4S cluster trying to compete with a 1S CPU.
Naples is hardly any more NUMA than Ryzen. There is a possibility that DF clocks may be enabled to run at twice the memory speed.
This statement is one I have seen countless times on these forums now. It is also one that makes zero sense, as NUMA scaling problems are physical problems and have nothing to do with emotions or "comfort."
Well for starters, Infinity Fabric means that in theory Naples can approach node-interleaved configuration more than NUMA, and node-interleaving had significant performance benefits on AMD with no effect on Intel in purely I/O bound scenarios.
http://www.anandtech.com/show/4486/server-rendering-hpc-benchmark-session/6
 
Reactions: Drazick

blue11

Member
May 11, 2017
151
77
51
Naples is hardly any more NUMA than Ryzen. There is a possibility that DF clocks may be enabled to run at twice the memory speed.
Are you asking how four nodes are "hardly any more" NUMA than one node? Even if Naples has this amazing INFINITY interconnect with infinite bandwidth, it does nothing for latency, which will be made even worse by the fact that each node's L3 and memory controllers will be working independently. In fact, even within one node, the L3 isn't uniform! If 4-way NUMA scales so well, why are there hardly any 4S installations in the field?

Irrelevant stuff from last decade.
 
Last edited:
Reactions: wildhorse2k

tamz_msc

Diamond Member
Jan 5, 2017
3,865
3,729
136
Are you asking how four nodes are "hardly any more" NUMA than one node? Even if Naples has this amazing INFINITY tearsofjoy interconnect with infinite bandwidth, it does nothing for latency, which will be made even worse by the fact that each node's L3 and memory controllers will be working independently.
Such authority, much wow.

Do you work for AMD that you can say this with such assertiveness?
In fact, even within one node, the L3 isn't even uniform! If 4-way NUMA scales so well, why are there hardly any 4S installations in the field?
I don't wish to do the work for you, but when software correctly identifies 16MB L3 as 2x8MB, performance improves a lot. So it is a matter of optimization.

Irrelevant stuff from last decade.
So CFD which is purely I/O bound, the real test for NUMA latency issues, is irrelevant? Whatever floats your boat. Are you telling me that people no longer do fluid dynamics on CPUs?
 
Reactions: Drazick

JoeRambo

Golden Member
Jun 13, 2013
1,814
2,105
136
So CFD which is purely I/O bound, the real test for NUMA latency issues, is irrelevant? Whatever floats your boat. Are you telling me that people no longer do fluid dynamics on CPUs?

Actually it is you who has no clue. Even in that article it's written that it is FP load that has completely RANDOM memory access in certain sized chunks.

So with NUMA the following happens - app has no clue where to allocate memory and it gets allocated on lets say NODE1 (socket1). Now only Socket1 threads have decent latency and other 3 run disastrous latency and choke HT links ( and also bad things happen due to that memory controller being overloaded).

With Interleave, all 4 controllers get to work, even if suboptimal latency for all 4 CPUs it increases scaling from disastrous to okayish.

So please take your BS claims about "AMD benefiting from Interleaving" to some other forum, it is workload that is specific, not AMD.

Reality is the less NUMA nodes You have and the more uniform pools of resources You have - the better the perf. Anything above 2 NUMA nodes requires specific tuning to really shine.
 

tamz_msc

Diamond Member
Jan 5, 2017
3,865
3,729
136
Actually it is you who has no clue. Even in that article it's written that it is FP load that has completely RANDOM memory access in certain sized chunks.
This is precisely what an I/O bound workload looks like. CFD, or for that matter solving any differential equation on a grid is that - I/O bound.
With Interleave, all 4 controllers get to work, even if suboptimal latency for all 4 CPUs it increases scaling from disastrous to okayish.

So please take your BS claims about "AMD benefiting from Interleaving" to some other forum, it is workload that is specific, not AMD.
Even with suboptimal latency it is far better than having to do remote access. The Intel Xeon CPU had already got a more advanced protocol than the Opteron, hence it does not benefit from interleaving substantially.
 
Reactions: tjf81 and Drazick

Sweepr

Diamond Member
May 12, 2006
5,148
1,142
131
65W and 95W as expected. Some people should wake up finally.

14C/16C/18C for SKL-X is a big surprise.

Indeed it is. And some people thought Intel would stand still and not react properly. We get 14nm++ 4C/6C Coffee Lake-S half an year earlier than expected and possibly one of their biggest gen-to-gen performance increases on HEDT. Core i9-7980XE will absolutely demolish Core i7-6950X - 80% more cores/threads, better performance per clock and likely higher Turbo clocks.

You should better wait for the official price list and for the sales start. Such pre-release listings are often much higher, especially when we talk about an unknown asian shop.

True, especially for Core i9-7920X. But from this early listing we can already see:

- Core i9-7900X 10C/20T 3.3-4.5 GHz cheaper than Core i7-6900K
- Core i9-7820X 8C/16T 3.6-4.5 GHz cheaper than Core i7-6850K
- Core i9-7800X 6C/12T 3.5-4.0 GHz cheaper than Core i7-6800K and barely more expensive than Core i7-7700K

Exciting times ahead.
 
Last edited:
Reactions: TheF34RChannel

JoeRambo

Golden Member
Jun 13, 2013
1,814
2,105
136
This is precisely what an I/O bound workload looks like. CFD, or for that matter solving any differential equation on a grid is that - I/O bound.

Even with suboptimal latency it is far better than having to do remote access. The Intel Xeon CPU had already got a more advanced protocol than the Opteron, hence it does not benefit from interleaving substantially.


And? It's just one specific workload. Can't make up BS claim about AMD CPU's somehow avoiding NUMA specific issues just by enabling Interleaving.
 

beginner99

Diamond Member
Jun 2, 2009
5,231
1,605
136
Lifted from overclockers.co.uk forum.

Thai Prices: https://gameolo.com/product-category/คอมประกอบ/cpu/cpu-intel?orderby=price-desc

INTEL i9-7920X - 12c/24t @ 2.90GHz (No turbo listed) - 160W.
L2 CACHE -12 x 256KB
L3 CACHE - 16.5MB
£1300 + VAT / $1671

INTEL i9-7900X - 10c/20t @ 3.30/4.50GHz - 160W
L2 CACHE - 10 x 256KB
L3 CACHE - 13.75MB
£855 + VAT / $1099

INTEL i9-7820X - 8c/16t @ 3.60/4.20GHz - 140w
L2 CACHE - 8 x 256KB
L3 CACHE - 11MB
£515 + VAT / $659

INTEL i9-7800X - 6c/12t @ 3.50/4.00GHz - 140W
L2 CACHE - 6 x 256KB
L3 CACHE - 8.25MB
£345 + VAT / $439

That pricing wouldnot make much sense. I mean the difference between 8 and 10 core is way, way too large.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |