Question Intel Mont thread

Page 14 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.
Jul 27, 2020
23,462
16,510
146
They have a long way to go before they can challenge Zen 6 by converting this E-core into a P-core. Just like Lion Cove, they may end up hobbling their savior unified P-core through various deficiencies and oversights. And they ABSOLUTELY need HT.
 

Kosusko

Senior member
Nov 10, 2019
200
186
116
It's a shame that they only focused on IPC at 1c/1T (or one cluster 4c/4T) and not on the overall performance (four cluster 16c/16T) of the Skymont E-core.
 

Hulk

Diamond Member
Oct 9, 1999
5,058
3,537
136
They have a long way to go before they can challenge Zen 6 by converting this E-core into a P-core. Just like Lion Cove, they may end up hobbling their savior unified P-core through various deficiencies and oversights. And they ABSOLUTELY need HT.
If we keeping bashing and putting down Intel perhaps we can help them go out of business?

Then Zen 6 can come out in 2036 instead of 2026. No reason to get it out next year if there is no competition right?
 

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
26,800
15,810
136
If we keeping bashing and putting down Intel perhaps we can help them go out of business?

Then Zen 6 can come out in 2036 instead of 2026. No reason to get it out next year if there is no competition right?
I don't want Intel to go out of business. I want them to wake up and start doing the RIGHT things, like AMD did in 2017 with Zen, after 10 years of crap !

Raptor lake is like Bulldozer, except worse. Bulldozer did not degrade and lock up, just was a bad product.
 

ajsdkflsdjfio

Member
Nov 20, 2024
185
132
76
Which vector extension are they missing compared to Zen 2?
No I mean versus Zen4/5 vector performance isn't comparable because it lacks vector extensions and other vector design optimizations. Comparing Skymont to Zen 2 is like comparing a 6950xt to 3060.

Even compared to Zen2, skymont has a 128-bit SIMD unit width versus Zen2's 256-bit width, which also puts skymont at a disadvantage in vector performance as well.
 

Hitman928

Diamond Member
Apr 15, 2012
6,604
12,103
136
No I mean versus Zen4/5 vector performance isn't comparable because it lacks vector extensions and other vector design optimizations. Comparing Skymont to Zen 2 is like comparing a 6950xt to 3060.

Even compared to Zen2, skymont has a 128-bit SIMD unit width versus Zen2's 256-bit width, which also puts skymont at a disadvantage in vector performance as well.

But isn’t that relevant when talking about E-cores being able to compete with P-cores at lower area and power?
 
Reactions: lightmanek

ajsdkflsdjfio

Member
Nov 20, 2024
185
132
76
But isn’t that relevant when talking about E-cores being able to compete with P-cores at lower area and power?
Relevant sure, but how much vector performance contributes to the performance of the core in your average consumer workloads is debatable.

Ray tracing is also a factor when considering the performance of two different graphics cards, but many people would say that rasterization is much more important than ray-tracing thus it's unfair to compare a 6950xt to a 3060 based solely on ray-tracing.

I mean if vector performance is so important why did Intel fuse off avx-512 silicon in multiple generations of golden-cove and it's derivatives. Workloads such as Lib264 encoding demonstrated here would have benefited a ton from AVX-512. If vector performance was so important for the general use-cases of consumers then intel left double digit performance improvements on the table.

There also is the question of just how easy it is to solve this specific issue. I'm not CPU architect but it seems to me that having relatively weak vector performance is much less of a problem to fix than having weak int/fp performance to begin with. With skymont already being such a good base, why would adding features and improvements to beat a p-core so unfeasible?? It's already 1/3 of the area of golden cove while offering similar IPC in many scenarios while being able to clock 5Ghz without being designed for high clocks. I'd say there's much room left for improvement.
 

Hitman928

Diamond Member
Apr 15, 2012
6,604
12,103
136
Relevant sure, but how much vector performance contributes to the performance of the core in your average consumer workloads is debatable. Ray tracing is also a factor when considering the performance of two different graphics cards, but many people would say that rasterization is much more important than ray-tracing thus it's unfair to compare a 6950xt to a 3060 based solely on ray-tracing.


These are general purpose cores meant to be sold in pretty much every segment, if you're going to completely replace the P-core, it needs to be able to compete in pretty much every category with at least decent competitiveness.

I mean if vector performance is so important why did Intel fuse off avx-512 silicon in multiple generations of golden-cove and it's derivatives.

Because the E-cores didn't support it and they couldn't figure out how to make a mixed instruction set big-little CPU work in software. AVX-512 is all kinds of messy as well for Intel but that's a whole different discussion.

Workloads such as Lib264 encoding demonstrated here would have benefited a ton from AVX-512. If vector performance was so important for the general use-cases of consumers then intel left double digit performance improvements on the table.

Yep, and it just led to further confusion and difficulties around adopting AVX-512. They got criticized for it quite a bit.

There also is the question of just how easy it is to solve this specific issue. I'm not CPU architect but it seems to me that having relatively weak vector performance is much less of a problem to fix than having weak int/fp performance to begin with. With skymont already being such a good base, why would adding features and improvements to beat a p-core so unfeasible?? It's already 1/3 of the area of golden cove while offering similar IPC in many scenarios while being able to clock 5Ghz without being designed for high clocks. I'd say there's much room left for improvement.

I'm not saying it's impossible, or even ill advised, for Intel to move to their E-core architecture entirely, just that the comments of adding a few tweaks and having a world class, small, efficient core top the charts is not realistic. It may compete extremely well in many thread scenarios and in many integer heavy workloads, but will still get smoked in several work loads. To get to a more fully competitive design will significantly increase power and area. Adding full width support for wider vector operations not only increases power and area on its own, but you have to be able to feed the beast as well which means expanding other parts of the core. Even in lightly threaded integer workloads, you either need to increase IPC or frequency enough to compete with designs clocking 20+% higher, neither of which is easy and will add additional area and power. The point is, by the time you do all that, the E-core is not going to look so much like an E-core anymore. If done right, it will probably be much better than what the P-core team can put out, but it's not going to be a few tweaks here and there and you get an E-core competing in the competitive P-core landscape. That's just not realistic.
 
Last edited:

ajsdkflsdjfio

Member
Nov 20, 2024
185
132
76
These are general purpose cores meant to be sold in pretty much every segment, if you're going to completely replace the P-core, it needs to be able to do compete in pretty much every category with at least decent competitiveness.
No one is debating that, but nonetheless skymont cores are excellent for what they are designed to do and a good showing of what might come in the future.
Because the E-cores didn't support it and they couldn't figure out how to make a mixed instruction set big-little CPU work in software. AVX-512 is all kinds of messy as well for Intel but that's a whole different discussion.
And also maybe because it's not entirely useful for consumers??? I get your argument for all these AVX support for datacenter but I really don't see how it's such a big deal for 99% of consumers buying a cpu for their PC.
Yep, and it just led to further confusion and difficulties around adopting AVX-512. They got criticized for it quite a bit.
Criticized for wasting die space and money on AVX-512 sure. But I don't think anyone was claiming that somehow giving AVX-512 support for CPUs was going to revolutionize their product. Zen5 is the first full fat AVX-512 implementation and yet it barely improves upon zen4 in any meaningful way for consumers.
I'm not saying it's impossible, or even ill advised, for Intel to move to their E-core architecture entirely, just that the comments of adding a few tweaks and having a world class, small, efficient core top the charts is not realistic. It may compete extremely well in many thread scenarios and in many integer heavy workloads, but will still get smoked in several work loads.
For consumers, a few tweaks would indeed lead to a world class, small, efficient core while also being much cheaper to produce. I literally linked a benchmark video where they turned off all the P-cores except one and left all 16 e-cores on which led to faster gaming performance than the default configuration. Ofc some leeway should be given here because the default configuration has scheduling issues, but if you compare the results to the 8P test and the new FIXED cyberpunk benchmarks then you see it is pretty close if not on par.

I never claimed it could outperform P-cores in every edge-case scenario and in every single productivity test, but getting 90-95% of the performance of current P-cores in consumer applications? ABSOLUTELY that is possible, it's been proven in gaming workloads and unless the rest of consumer-related workloads are all extremely vector-heavy, I don't see how that would be any different. The ONLY workloads that it seems to be significantly behind in are vector workloads and in how many of those workloads is it vector heavy enough to be a significant bottleneck? Below is a good post on exactly how useful AVX-512 is for the average consumer.

To get to a more fully competitive design will significantly increase power and area. Adding full width support for wider vector operations not only increases power and area on its own, but you have to be able to feed the beast as well which means expanding other parts of the core. Even in lightly threaded integer workloads, you either need to increase IPC or frequency enough to compete with designs clocking 20+% higher, neither of which is easy and will add additional area and power. The point is, by the time you do all that, the E-core is not going to look so much like an E-core anymore. If done right, it will probably be much better than what the P-core team can put out, but it's not going to be a few tweaks here and there and you get an E-core competing in the competitive P-core landscape. That's just not realistic.
No one claimed that designing an E-core to replace all the functions of the P-core would result in a core 1/4 the size of the current P-core. I agree, the core would look much bigger than an actual e-core maybe more like 60-70% the size of a P-core rather than 25-33% which we see now. To fully replace the P-core would take significant work but at the same time what CPU architecture doesn't take significant work?

I obviously don't think they can replace the current P-cores outright without another product cycle worth of design and work, but they can absolutely sell an E-core only product with a few tweaks and have it be a cheap high nT, zen4 1T performance chip in most CONSUMER applications. As I said before the results are literally already there with 1P 16E core gaming, for people not doing heavy productivity work that specifically relies on vector performance, the current E-cores are perfectly sufficient, especially with minor tweaks and increased clock speeds.

I don't believe that an E-core product with only a few tweaks would outright beat the current lineup of Zen5/Arrowlake either. But if you can get 90% of the performance, essentially N-1 performance of Zen4/12-13th gen, and then sell that 16-core product for 200$ it'd already be an insane product for pretty much everyone with a cpu older than 3-4 years or a cpu with 8 or less cores. For example currently I am on a 9600k with 6-cores and around half of the single thread performance of the newest CPUs, not to mention the nT performance. If i could pay 200$ for a 16-core CPU (even without hyperthreading) that has Zen4 performance for most consumer applications and gaming, and much better power efficiency compared to even leading edge I would immediately upgrade no issue.

OFC this product is all theoretical because even though it may be a good product on paper, it may cost lots of engineering time and money for a low-margin budget product that might not sell well due to not being the newest and greatest thing. Also it might mess up Intel's own segmenting of their products and overall would not be good for their product stack. Regardless it is absolutely the case that an E-core only cpu using skymont with a few tweaks/higher clocks would be great for MOST consumers especially since the die cost of such a cpu would be a fraction of other 16-core or 24-core CPUs.
 

CouncilorIrissa

Senior member
Jul 28, 2023
619
2,401
96
I mean if vector performance is so important why did Intel fuse off avx-512 silicon in multiple generations of golden-cove and it's derivatives. Workloads such as Lib264 encoding demonstrated here would have benefited a ton from AVX-512. If vector performance was so important for the general use-cases of consumers then intel left double digit performance improvements on the table.
It was disabled because of scheduling problems on hetero-ISA (and also because it's bad optics to have a CPU that performs better in a small subset of tasks with E-cores disabled, because some mobos actually allowed enabling AVX-512) CPUs IIRC.

Sapphire Rapids had AVX-512 support.
 

ajsdkflsdjfio

Member
Nov 20, 2024
185
132
76
Yeah and SKT is present in exactly 0 value parts. Next.
U behave like an imbecile in every thread I see you in lol, at least stay in AMD threads if you're gonna behave like this. Arrow lake is shit, BMG is largely unprofitable as it stands, lion cove is dog and P-cores have no future. Foundry likely has made huge advances but is unlikely to be profitable for the next half-decade if not longer.

Regardless you can realize all those things and still acknowledge that Skymont is indeed impressive. No need to be a one dimensional simpleton!!


No personal attacks allowed.


esquared
Anandtech Forum Director
 
Last edited by a moderator:

ajsdkflsdjfio

Member
Nov 20, 2024
185
132
76
It was disabled because of scheduling problems on hetero-ISA (and also because it's bad optics to have a CPU that performs better in a small subset of tasks with E-cores disabled, because some mobos actually allowed enabling AVX-512) CPUs IIRC.

Sapphire Rapids had AVX-512 support.
Yea but how much performance did that really leave on the table for the average consumer?? It'd be nice if those products did include AVX-512 support but once again unless you're doing heavy productivity work that also happens to rely heavily on vector performance, you'd be perfectly fine. It's like how some people are very adamant on Intel releasing a 12-core HEDT product with only P-cores and AVX-512 support but to be perfectly honest most people don't care and it's a tiny segment of the consumer market.

Also side note but AVX-512 is not that much of a die bloater as you may think. Look at Skylake(no AVX) vs Skylake-X(AVX-512 for HEDT) core size and you see they are very similar. The original skylake-cores do have certain features that are AVX-512 oriented such as an unused/extra register file meant for AVX-512 workloads but even that barely takes up a couple percentage points of the overall core size.
 

adroc_thurston

Diamond Member
Jul 2, 2023
5,360
7,534
96
U behave like an imbecile in every thread I see you in lol, at least stay in AMD threads if you're gonna behave like this
Meds.
Skymont is indeed impressive
It's fine. Z4c-ish with castrated SIMD is pretty good but not exactly jack off materiel either.
If you want a real impressive LITTLE, look at AAPL stuff.
 
Reactions: lightmanek

ajsdkflsdjfio

Member
Nov 20, 2024
185
132
76
It's fine. Z4c-ish with castrated SIMD is pretty good but not exactly jack off materiel either.
If you want a real impressive LITTLE, look at AAPL stuff.
Clock it at 4.9 instead of 4.6 and it's way close to Z4 rather than Z4c. Regardless for x86 it certainly is exciting especially given it's rumored to be the future of Intel products, and compared to the P-cores and previous E-cores it is jack-off material. Clearwater forest may be the first time Intel has had a competitive data center product in 5-10 years.

IDK much about apple but unfortunately they're ARM/MacOS only on top of being exorbitantly priced when it's anything above entry-level so really not that exciting for me personally.
 

MS_AT

Senior member
Jul 15, 2024
555
1,167
96
Even compared to Zen2, skymont has a 128-bit SIMD unit width versus Zen2's 256-bit width, which also puts skymont at a disadvantage in vector performance as well.
It is not so simple. Skymont has 4x128b symetrical SIMD units, Zen2 has asymetrical units so for inner loops of things like matrix multiply they are equally matched.

I mean if vector performance is so important why did Intel fuse off avx-512 silicon in multiple generations of golden-cove and it's derivatives
Past stupid decision when Intel thought it owned the marked, so they thought market segmentation will be good for them as they would be able to milk avx512 as a bonus feature. So it got completly ignored in consumer market until Rocket/Tiger lake and then Intel cut the support after one gen as they needed E-cores to save face in Cinebench.

Now if you think Intel thinks that AVX512 is not useful in consumer market then why they are rolling out AVX10? it only purpose is to bring AVX512 features to E cores as AVX10 doesnt require 512b execution units. With AVX10/AVX512 the compiler will spill less often, due to bigger architectural register pool, even for scalar floating point operations. Handling corner cases is easier thanks to masking etc.
 

CouncilorIrissa

Senior member
Jul 28, 2023
619
2,401
96
Yea but how much performance did that really leave on the table for the average consumer?? It'd be nice if those products did include AVX-512 support but once again unless you're doing heavy productivity work that also happens to rely heavily on vector performance, you'd be perfectly fine. It's like how some people are very adamant on Intel releasing a 12-core HEDT product with only P-cores and AVX-512 support but to be perfectly honest most people don't care and it's a tiny segment of the consumer market.
Not a lot, that's correct. I was mostly arguing that Intel didn't leave performance on the table because it was irrelevant, they did because they had no other option.
 
Jul 27, 2020
23,462
16,510
146
I don't believe that an E-core product with only a few tweaks would outright beat the current lineup of Zen5/Arrowlake either. But if you can get 90% of the performance, essentially N-1 performance of Zen4/12-13th gen, and then sell that 16-core product for 200$ it'd already be an insane product for pretty much everyone with a cpu older than 3-4 years or a cpu with 8 or less cores. For example currently I am on a 9600k with 6-cores and around half of the single thread performance of the newest CPUs, not to mention the nT performance. If i could pay 200$ for a 16-core CPU (even without hyperthreading) that has Zen4 performance for most consumer applications and gaming, and much better power efficiency compared to even leading edge I would immediately upgrade no issue.

OFC this product is all theoretical because even though it may be a good product on paper, it may cost lots of engineering time and money for a low-margin budget product that might not sell well due to not being the newest and greatest thing. Also it might mess up Intel's own segmenting of their products and overall would not be good for their product stack. Regardless it is absolutely the case that an E-core only cpu using skymont with a few tweaks/higher clocks would be great for MOST consumers especially since the die cost of such a cpu would be a fraction of other 16-core or 24-core CPUs.
Agree. But given Intel's history, they may release at most an 8-core Skymont die if they actually decide to release an ADL-N successor. They seem to be allergic to making enthusiasts happy.
 

LightningZ71

Platinum Member
Mar 10, 2017
2,077
2,525
136
Intel knows full well what their cores are capable of. They may make poor market decisions, but they aren't completely inept. If Intel really wanted to own the MT market, they are perfectly capable of producing a processor with 2 Lion Cove cores and 8 Skymont quads. It would STILL do just as well in any ST benchmark due to the two LC cores and do notably better in almost every non-main memory bound MT benchmark with 34 total threads and only 10 ring bus processor core/cluster stops. It would probably improve L3/ring latency enough to give each station larger L3 slices.

They didn't do that. Either they know something we don't, or they just made another poor market decision.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |