- Jul 27, 2020
- 23,462
- 16,510
- 146
First Gracemont laptop available for sale.
Anybody got disposable $500 to buy and test this laptop?
That wasn't the case previously though. Now it's close enough.By the way, game fps improving with one Lion Cove and all Skymonts enabled? I think it's coz of reduced number of ring stops.
Or, they screwed up and/or did not meet to their expectations. Which I think is extremely reasonable when you are dealing with a device with tens of billions of transistors. Literally there are tens of thousands of paths that could have gone wrong. You can't distill the problems down to a simple manner on a project as complex as a CPU. Why was Pentium 4 bad? Why was Bulldozer bad?They didn't do that. Either they know something we don't, or they just made another poor market decision.
Zen 2 has 2x FPAdd and 2x FPMul, while Skymont has 4x 128-bit FMA. Zen 2 also has 2x 256-bit Load and 1x 256-bit Store while Skymont has 3x128-bit Loads and 4x128-bit Stores.It is not so simple. Skymont has 4x128b symetrical SIMD units, Zen2 has asymetrical units so for inner loops of things like matrix multiply they are equally matched.
One skm 4c cluster gets less L3 bandwidth than a single Zen 2 core. This might be a disadvantage for Skymont. Plus an entire zen 2 ccx is larger than one skm cluster. Any attempt at a unified atom core likely won't be arranged in 4c clusters in "E core" fashion since area efficiency will be less of a concernI think the worst result was getting beat by Zen 2 with HT, of all things.
ZEN TWO!
SMH.
View attachment 113582
Fair enough.Any attempt at a unified atom core likely won't use arranged in cluster in this fashion
I don't know where you heard Arrow Lake's e-cores use 17.8 billion transistors. Intel via TechPowerUp makes it sounds like all the dies together add up to 17.8 billion. Comparing that number and the die size to Apple's M3, I believe that the compute die is 17.8 billion transistors.The only reliable-ish info we have is that on Arrow lake, the Skymont cores take up about 17.8 billion transistors, giving about 1 billion transistors per core and it's share of the L2. No idea if this includes L3 slices, but from the wording, probably not.
The entire Zen2 CCD, 16MB L3, 2 CCXs of 4 cores each, is 3.8 Billion transistors. Zen 2 is at least half the size of Skymont on a per-core basis.
It is conceivable that AMD could produce a processor with 32 Zen 2 cores and 8 Zen 5 cores that are instruction restricted to Zen 2's capabilities in the same space as Arrow Lake and run rings around it in MT tasks with 48 active threads. This assumes that there are no memory bottlenecks.
Intel says that the die-area of Arrow Lake-S (8P+16E) is 243 mm², and its total transistor count is 17.8 billion.
Why do you think AVX10 is meant for consumers? I doubt Intel is so worried about bringing AVX features to consumers, more likely they are bringing AVX-10 so that their E-core datacenter products have even more of an edge, since they are going to be their only competitive datacenter products going forward. Intel being able to enable AVX-512 on P-cores and E-cores for consumers sounds more like a trickle off effect rather than their main intention.Now if you think Intel thinks that AVX512 is not useful in consumer market then why they are rolling out AVX10? it only purpose is to bring AVX512 features to E cores as AVX10 doesnt require 512b execution units. With AVX10/AVX512 the compiler will spill less often, due to bigger architectural register pool, even for scalar floating point operations. Handling corner cases is easier thanks to masking etc.
I wouldn't think so. Most modern games utilize 8-cores ATLEAST, and it's extremely beneficial to have more than that so that you can have multiple other programs running in the background without significantly effecting your game. Even when just doing general computer work without advanced productivity applications, many people have multiple programs open and multiple chrome windows with dozens of tabs per window open at the same time. I've seen it where when they press alt tab there are like 10-20 windows open to switch between . Although technically you can just be more organized, I think having 12-16 cores to not slowdown when heavily multi-tasking is very nice to have.AVX512 is niche for consumers, but so are 16-core CPUs, it's just a Cinebench benchmarking contest about "which CPU has more raw power".
is this why CWF-AP and future Atom xeons are dead?they are going to be their only competitive datacenter products going forward.
The only scenario in which future Atom xeons are dead is one where Intel themselves are dead.is this why CWF-AP and future Atom xeons are dead?
But they are dead (no, seriously.). At least until Unified Core, lol.The only scenario in which future Atom xeons are dead is one where Intel themselves are dead.
If clearwater forest wasn't Intel's best bet why are they heralding it as such and devoting so much resources to launching it in 2025 with diamond rapids nowhere to be seen? Also why do you think Unified Core is their current plan? If atom is so shit why create an entire architecture surrounding it?But they are dead (no, seriously.). At least until Unified Core, lol.
It's extra funny given how DCAI realigned itself politically.
2025?If clearwater forest wasn't Intel's best bet why are they heralding it as such and devoting so much resources to launching it in 2025
That's a Diamond Rapids issue.with diamond rapids nowhere to be seen
Because it's a good idea. It worked for AMD. Why won't it work for them?Also why do you think Unified Core is their current plan?
It's not shit, just that Atom server products are not competitive enough to get hyperscaler traction. Ergo they're dead.If atom is so shit why create an entire architecture surrounding it?
No I'm not asking why Intel is shifting to a singular core architecture, I'm asking why they are shifting to a singular core architecture based on Atom and not P-core or even Royal Core?Because it's a good idea. It worked for AMD. Why won't it work for them?
SFR uses Crestmont which is not at all comparable to Skymont/Darkmont. Also in general it was a super late product. Crestmont is only an incremental improvement over gracemont which was released in what... 2021? 2021 e-cores in 2024 is likely not going to be a massive success, especially when you consider that the performance of crestmont/gracemont was a lot weaker compared to their P-core counterparts. Skymont/darkmont is a different story. You have a design based on 2024 e-cores which are much more competitive both compared to their contemporary P-cores and compared to competing Zen cores of the same release cadence.It's not shit, just that Atom server products are not competitive enough to get hyperscaler traction. Ergo they're dead.
Like SRF-SP was made for Meta and Meta just bought Bergamo instead. gg no re
I won't argue that Diamond Rapids doesn't have issues, but for the past couple years Intel has been talking about clearwater forest and not diamond rapids, before any major engineering issues might've taken place. I think they simply realized their e-core architecture was going to be their main attraction going forward.That's a Diamond Rapids issue.
Because Atom guys aren't washed and output competitive PPA, and Royal Core was an absolute mess and that team is poof anyway.I'm asking why they are shifting to a singular core architecture based on Atom and not P-core or even Royal Core?
ughhh. Well it's a throughput part first and foremost.SFR uses Crestmont which is not at all comparable to Skymont/Darkmont.
Not really, went from H2'23 to H1'24. Fine.Also in general it was a super late product.
It. Is. a. throughput product. Made for favelas.2021 e-cores in 2024 is likely not going to be a massive success, especially when you consider that the performance of crestmont/gracemont was a lot weaker compared to their P-core counterparts
If you say so.You have a design based on 2024 e-cores which are much more competitive both compared to their contemporary P-cores and compared to competing Zen cores of the same release cadence.
Which is why Atom xeons are dead?I think they simply realized their e-core architecture was going to be their main attraction going forward.
Which is exactly what I'm trying to say?Because Atom guys aren't washed and output competitive PPA, and Royal Core was an absolute mess and that team is poof anyway.
So why are you making the comparison between SRF and CWF? Skymont is more than "good enough things" and in my opinion useful for more than just throughput with good IPC in many workloads, the aforementioned AVX-10 being pushed out for improved SIMD performance, and plus the fact that it is power efficient enough so that its performance isn't as kneecapped in server scenarios making it even most competitive versus other cores.ughhh. Well it's a throughput part first and foremost.
It. Is. a. throughput product. Made for favelas.
144 good enough things on a single die were supposed to sell something but they did, in fact, not.
Gracemont was Skylake (2015) IPC in 2021, Skymont is Zen4 (2022) IPC in 2024.If you say so
My statement that "their e-core architecture was going to be their main attraction going forward" includes e-cores in Xeon, not just their unified architecture.Which is why Atom xeons are dead?
same swimlane. same target customer.So why are you making the comparison between SRF and CWF?
years and years and years away.the aforementioned AVX-10 being pushed out for improved SIMD performance
You forgot the part where CWF is much, much more expensive aka loses half the reason of Atom Xeon even exist (they're cheapo).Even in a situation where CWF is only useful for throughput, it would be a vastly superior product to SRF and therefore much more attractive even if just in smaller market.
These parts are not defined by 1t PPC, they're defined by socket-level throughput ISO power.Gracemont was Skylake (2015) IPC in 2021, Skymont is Zen4 (2022) IPC in 2024.
Well the point is that there are no e-cores in Xeon anymore. Dead.My statement that "their e-core architecture was going to be their main attraction going forward" includes e-cores in Xeon, not just their unified architecture.
Wrong for the many reasons I pointed out.same swimlane. same target customer.
You are right about this, but also darkmont and consequently its server versions most definitely improve on vector performance making it more viable than skymont.years and years and years away.
Maybe because the "Atom" cores are going to perform more like P-cores instead? Why devote all this packaging cost for a core that's "just an e-core" like all previous e-cores.You forgot the part where CWF is much, much more expensive aka loses half the reason of Atom Xeon even exist (they're cheapo).
Not solely defined by 1t PPC, but if all other things are equal, a 1t PPC improvement would translate 1:1 with socket-level throughput ISO-power.These parts are not defined by 1t PPC, they're defined by socket-level throughput ISO power.
If you define e-cores as cores similar in function/perf to atom-cores before skymont sure. I define e-cores as cores designed significantly more area/power efficient than P-cores by the atom team. With your definition sure E-cores are dead in Xeons, since these new e-cores are so much better that they play a role larger than traditional e-cores.Well the point is that there are no e-cores in Xeon anymore. Dead.
It's literally the same product lane. tf are you on?Wrong for the many reasons I pointed out.
It's still a joke.You are right about this, but also darkmont and consequently its server versions most definitely improve on vector performance making it more viable than skymont.
No?Maybe because the "Atom" cores are going to perform more like P-cores instead?
18A is kinda expensive and SRAM scaling there is a joke. It's not "all that packaging cost", it's an attempt to ship an immature node early at a reasonable (reasonable enough) price.Why devote all this packaging cost for a core that's "just an e-core" like all previous e-cores.
I have some major news for you, but IPC isn't free.a 1t PPC improvement would translate 1:1 with socket-level throughput ISO-power.
you haven't seen a single SKT implementation on the same node as GRT.Skymont is ALSO much more performant for its area/power.
They're dead because the products are dead.With your definition sure E-cores are dead in Xeons, since these new e-cores are so much better that they play a role larger than traditional e-cores.
Same product family =/= same customers. Same product lane =/= same competitiveness either.It's literally the same product lane. tf are you on?
Okay then it's a joke and CWF fails, Intel DC/AI takes further losses putting them that much closer to Intel failing.It's still a joke.
Yes? I just compared gracemont and skymont to equivalent level P-cores ipc. Skymont is clearly closer to contemporary P-cores than gracemont meaning it performs more of the functions of a P-core.
Right, the ONLY purpose of CWF is to ship 18a out. Not like it's going to be their most competitive offering in the past decade.18A is kinda expensive and SRAM scaling there is a joke. It's not "all that packaging cost", it's an attempt to ship an immature node early at a reasonable (reasonable enough) price.
I addressed the issue of IPC not being free by saying that even considering area/power increases, Skymont IPC is still massively improved. IPC is indeed not free, but it's 30% ipc gain is not reflected in an equivalent power/area increase.I have some major news for you, but IPC isn't free.
We've seen GLC vs LNC, and GRT vs SKT. So we can extrapolate enough data there so show that SKT is indeed much more performant all things considered.you haven't seen a single SKT implementation on the same node as GRT.
Right... because you said so.They're dead because the products are dead.
CWF-AF is dodo and RRF also seems not there.
Yeah it is. A favela part is a favela part and boy does it have favela customers.Same product family =/= same customers.
Not getting it cap'n.Okay then it's a joke and CWF fails, Intel DC/AI takes further losses putting them that much closer to Intel failing.
I'm sorry to disappoint you but there's more to a big core than SIR2017 1t rate.Skymont is clearly closer to contemporary P-cores than gracemont meaning it performs more of the functions of a P-core.
YES. Pat spent eons talking about i18a and how CWF is their first product on it!Right, the ONLY purpose of CWF is to ship 18a out.
IT'S A SHRINK. THERE IS NO ISO NODE COMPARISON BETWEEN GRACEMONT AND SKYMONT.I addressed the issue of IPC not being free by saying that even considering area/power increases, Skymont IPC is still massively improved. IPC is indeed not free, but it's 30% ipc gain is not reflected in an equivalent power/area increase.
what does that meanWe've seen GLC vs LNC, and SKT vs GRT. So we can extrapolate enough data there so show that SKT is indeed much more performant all things considered.
yeah.Right... because you said so.
We can extrapolate node differences by showing GLC->LNC improvements vs GRT->SKT improvements. Both comparisons are intel 7 vs N3B and have similar size increases(ISO node), SKT gains much more performance than GLC to the effect of triple the amount.what does that mean
you can, in fact, NOT.We can extrapolate node differences by showing GLC->LNC improvements vs GRT->SKT improvements.
That's not what I'm talking about. You say that there is no ISO node comparison between gracemont and skymont which is true.you can, in fact, NOT.
They're not even the same foundry ffs.
you in fact, can not. Not one knows any actual perf/power/area/Cac/whatever difference between i7 and N3b. Just a futile effort.I counter by saying we can estimate ISO node comparisons by comparing Golden cove/Lion cove to gracemont/skymont