Discussion Intel Meteor, Arrow, Lunar & Panther Lakes Discussion Threads

Page 665 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Tigerick

Senior member
Apr 1, 2022
702
632
106






As Hot Chips 34 starting this week, Intel will unveil technical information of upcoming Meteor Lake (MTL) and Arrow Lake (ARL), new generation platform after Raptor Lake. Both MTL and ARL represent new direction which Intel will move to multiple chiplets and combine as one SoC platform.

MTL also represents new compute tile that based on Intel 4 process which is based on EUV lithography, a first from Intel. Intel expects to ship MTL mobile SoC in 2023.

ARL will come after MTL so Intel should be shipping it in 2024, that is what Intel roadmap is telling us. ARL compute tile will be manufactured by Intel 20A process, a first from Intel to use GAA transistors called RibbonFET.



Comparison of upcoming Intel's U-series CPU: Core Ultra 100U, Lunar Lake and Panther Lake

ModelCode-NameDateTDPNodeTilesMain TileCPULP E-CoreLLCGPUXe-cores
Core Ultra 100UMeteor LakeQ4 202315 - 57 WIntel 4 + N5 + N64tCPU2P + 8E212 MBIntel Graphics4
?Lunar LakeQ4 202417 - 30 WN3B + N62CPU + GPU & IMC4P + 4E012 MBArc8
?Panther LakeQ1 2026 ??Intel 18A + N3E3CPU + MC4P + 8E4?Arc12



Comparison of die size of Each Tile of Meteor Lake, Arrow Lake, Lunar Lake and Panther Lake

Meteor LakeArrow Lake (N3B)Lunar LakePanther Lake
PlatformMobile H/U OnlyDesktop & Mobile H&HXMobile U OnlyMobile H
Process NodeIntel 4TSMC N3BTSMC N3BIntel 18A
DateQ4 2023Desktop-Q4-2024
H&HX-Q1-2025
Q4 2024Q1 2026 ?
Full Die6P + 8P8P + 16E4P + 4E4P + 8E
LLC24 MB36 MB ?12 MB?
tCPU66.48
tGPU44.45
SoC96.77
IOE44.45
Total252.15



Intel Core Ultra 100 - Meteor Lake



As mentioned by Tomshardware, TSMC will manufacture the I/O, SoC, and GPU tiles. That means Intel will manufacture only the CPU and Foveros tiles. (Notably, Intel calls the I/O tile an 'I/O Expander,' hence the IOE moniker.)



 

Attachments

  • PantherLake.png
    283.5 KB · Views: 24,014
  • LNL.png
    881.8 KB · Views: 25,501
Last edited:

511

Golden Member
Jul 12, 2024
1,034
894
106
I think ocean was used by the p core team to fool pat and fight against the E core team. Pat was right to reject it. General technology was used on E core, and PPA&PPW took off.
Yes Pat is a former CPU Architect and one of the better one of his time not easy to be fooled
 

dttprofessor

Member
Jun 16, 2022
105
30
71
The original GNR was redwood on Intel4, which was rejected by pat and changed to redwood+ on intel3, using some front-end technologies of lion cove. This adjustment is very meaningful, otherwise GNR will be a disaster.
 
Reactions: yuri69

dttprofessor

Member
Jun 16, 2022
105
30
71
The biggest reason why GNR did not adopt lion cove was that the p core team could not deliver the hyper-threading function on time, and pat could not wait any longer. This may also be an important reason why the p core team was broken up and placed under administration.
 

OneEng2

Senior member
Sep 19, 2022
259
356
106
Anyways RIP Intel in Gaming it has been never been more over 🤣
... but certainly gaming isn't "game-set-and-match" type of a thing.

DC and Laptop are, and Intel are doing
Intel Panther Cove will bring APX, which basically should give x86 similar performance possibilities than todays ARM. x86 is currently lacking about 50% IPC. If Intel doesn't target that with Panther Cove they are just losing their mind. And when they target that kind of high IPC they pretty much have no other option than drop SMT.

If they change their mind and re-introduce SMT they basically confirm that they have failed all their design targets and have gone to damage limitation mode.
In order to maximize ILP in a processor, you always need more execution units than a "normal" load will give in order to keep IPC high. This leaves execution units just sitting idle in many parallel load apps. This is where SMT fills in the gaps. SMT provides higher performance by running another thread through those unused execution units. It has been found that SMT provides up to 40% higher performance with only 5-10% die size penalty and very little increase in power.

I can't think of any reason why SMT would not be a good idea, especially in highly threaded workloads.
I have no idea, but I believe it is impossible that AMD ran exactly the same platform for three consecutive years without updating it.

Now is a bad time for jumping ships, the next year you will be able to choose between a fixed CPU for LGA 1851 or even better CPUs for AM5 than are available today.
AFAIK, AM5 has remained the same and any processor that fits into AM5 since the first, will work in the current AM5 and vise versa. It's very un-Intel like.

I think for AMD, AM5 is getting long in the tooth though and expect to see another socket for Zen 7. I think Zen 6 has already been confirmed for AM5.

Intel .... I wouldn't bet much on their socket stability. It has almost seemed like their strategy is to intentionally produce incompatibility between each new CPU release. It seems rare when they stick with a socket for more than 3 releases of a CPU
 

cannedlake240

Senior member
Jul 4, 2024
207
111
76
The original GNR was redwood on Intel4, which was rejected by pat and changed to redwood+ on intel3, using some front-end technologies of lion cove. This adjustment is very meaningful, otherwise GNR will be a disaster.
RWC+ is a myth, Intel fellow and Xeon architect confirmed this in an interview with Cheese, that the only biggest differences are AMX and AVX512. Unless of course one deems those changes substantial enough to call it a RWC+
 

naukkis

Senior member
Jun 5, 2002
962
829
136
... but certainly gaming isn't "game-set-and-match" type of a thing.

DC and Laptop are, and Intel are doing

In order to maximize ILP in a processor, you always need more execution units than a "normal" load will give in order to keep IPC high. This leaves execution units just sitting idle in many parallel load apps. This is where SMT fills in the gaps. SMT provides higher performance by running another thread through those unused execution units. It has been found that SMT provides up to 40% higher performance with only 5-10% die size penalty and very little increase in power.

I can't think of any reason why SMT would not be a good idea, especially in highly threaded workloads.

Biggest reason is that SMT doesn't give you 40% more performance. It gives you 140% performance of two combined threads vs 100% single thread. So to use SMT you are giving up, in this example, 30% thread performance. And only reason for big cores to exists is to aim high thread performance. Without that need they can have much more performance from same silicon area by using more smaller cores. SMT is useless.
 

511

Golden Member
Jul 12, 2024
1,034
894
106
RWC+ is a myth, Intel fellow and Xeon architect confirmed this in an interview with Cheese, that the only biggest differences are AMX and AVX512. Unless of course one deems those changes substantial enough to call it a RWC+
Well a move from Intel 4 -> 3 is enough to call it a plus
 
Jul 27, 2020
20,898
14,487
146
Biggest reason is that SMT doesn't give you 40% more performance. It gives you 140% performance of two combined threads vs 100% single thread. So to use SMT you are giving up, in this example, 30% thread performance. And only reason for big cores to exists is to aim high thread performance. Without that need they can have much more performance from same silicon area by using more smaller cores. SMT is useless.
So IBM with their Dynamic SMT implementation and AMD killing Intel in y-cruncher nT workloads with just a 9800X3D's SMT vs. Intel's SMT-less cores is all happening somewhere in a galaxy far, far away


 

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
26,389
15,513
136

DavidC1

Golden Member
Dec 29, 2023
1,211
1,932
96
So IBM with their Dynamic SMT implementation and AMD killing Intel in y-cruncher nT workloads with just a 9800X3D's SMT vs. Intel's SMT-less cores is all happening somewhere in a galaxy far, far away
IBM doesn't view SMT as "icing on top of a cake" as AMD/Intel does so they aren't directly comparable. Their SMT adds 20-30% transistors to the core to get lot more out of it. But their POWER lineup is used for Big Iron/Transactional computers so it benefits a lot.
The original GNR was redwood on Intel4, which was rejected by pat and changed to redwood+ on intel3, using some front-end technologies of lion cove. This adjustment is very meaningful, otherwise GNR will be a disaster.
It doesn't look like that's the case, and it's just Redwood Cove with AMX and AVX-512. Some speculated 8-wide is just the output of the uop cache and there hasn't been anything contrary.

Pat's claims of ten plus percent might just refer to clocks.
 

OneEng2

Senior member
Sep 19, 2022
259
356
106
Biggest reason is that SMT doesn't give you 40% more performance. It gives you 140% performance of two combined threads vs 100% single thread. So to use SMT you are giving up, in this example, 30% thread performance. And only reason for big cores to exists is to aim high thread performance. Without that need they can have much more performance from same silicon area by using more smaller cores. SMT is useless.
I call this "verbal diarrhea" or "Talking in circles".

Lets simplify the discussion a bit eh?

Neither SMT or more cores does ANYTHING for single threaded performance. If your use case is single threaded application performance ONLY, you don't need SMT and you only need a single core.

Now, lets leave fantasy land for a minute....

If you need MT performance, you can:
1) Add more cores
2) Make one core handle more than one thread

Now, lets talk about this a minute.

Adding SMT provides up to 40% (in AMD's implementation) and likely much more in IBM's implementation. It does this at the cost of only a tiny amount of power, and 15% more die size per core.

Doubling cores provides almost 100% MT over half the number of cores BUT it cost 100% more die size and 100% more power.

Another thing to consider in your above totally incorrect conclusion on SMT is that many (most) DC software licensing is done on a PER CORE basis, so doubling the amount of cores ALSO doubles your software licensing. FYI, for those of you that have never purchased this level of software, the software is WAY WAY more expensive than the computer you run it on.

If it is Intel's intent to create next gen DC processors by the "more cores" approach ..... wow..... good luck to them and God have mercy on their souls.

In DC, there are two very real limitations.
1) Socket power
2) Licensing cost per core (customer price pain limit)

SMT is VERY important in MT in DC products. AMD has decided that it is important to have a single CPU core design that works for an entire generation of products across all markets. Intel has decided to have 2 different core designs.

While you may be able to get away with lots of little cores in the desktop where per core licensing rarely exists, the same approach will find you totally bankrupt in the DC.

AMD provides Turin and Turin Dense exactly for this reason. While Turin Dense greatly outperforms Turin in overall performance in many applications (more cores with SMT, but each core being less performant than normal Turin), it doesn't work for enterprise applications that charge on a per core basis. In these applications, having maximum performance per core (including MT) is the most important thing since (as I already stated) the licensing cost is MUCH higher (and usually happens on a recurring basis) than the hardware costs.

So to steal a phrase from Star Wars "You are greatly mistaken about a great many things".
 

Abwx

Lifer
Apr 2, 2011
11,612
4,469
136
Biggest reason is that SMT doesn't give you 40% more performance. It gives you 140% performance of two combined threads vs 100% single thread. So to use SMT you are giving up, in this example, 30% thread performance. And only reason for big cores to exists is to aim high thread performance. Without that need they can have much more performance from same silicon area by using more smaller cores. SMT is useless.

Contrary to your sayings SMT is very usefull.

Say a core that has 40% SMT gain and a firm that sell cores usage for VMs.

If there s two users with each a virtual core then it would require that both have their app loading fully a single core to have 70% of the ST perf for each user, but it s unlikely that both will have their app peaking at the same exact time and for same durations, so in practice both will have the equivalent of 100% of the max ST perf while there s actualy only 140% available from the core.
 

alcoholbob

Diamond Member
May 24, 2005
6,338
404
126
Intel ? Stability ? you are in the wrong universe, see the Raptor lake instability threads

I think he said chipset stability. AMD still hasn’t fixed their USB instability problems in AM5 (probably will have to wait for AM6 when they update the USB chipset), and people are still complaining about TPM bugs.
 

cannedlake240

Senior member
Jul 4, 2024
207
111
76
Okay, kinda makes you wonder why, though? Certainly validation costs/delays aren't the reason?
This wouldn't be surprising, wasn't there a rumor about Pat pressuring product teams for shorter time to market? When he took over all Intel products were constantly being pushed back with Sapphire Rapids being the most infamous example. Who knows, maybe if it wasn't for this approach, Granite rapids, SRF, ARL, Lunar all wouldve been Q1-Q2 2025
 

naukkis

Senior member
Jun 5, 2002
962
829
136
Contrary to your sayings SMT is very usefull.

Say a core that has 40% SMT gain and a firm that sell cores usage for VMs.

If there s two users with each a virtual core then it would require that both have their app loading fully a single core to have 70% of the ST perf for each user, but it s unlikely that both will have their app peaking at the same exact time and for same durations, so in practice both will have the equivalent of 100% of the max ST perf while there s actualy only 140% available from the core.

For security reasons single core threads should not be split to different users. As cloud providers usually just disables SMT even AMD does also offer high-core count Epycs without SMT to them.
 

511

Golden Member
Jul 12, 2024
1,034
894
106
This wouldn't be surprising, wasn't there a rumor about Pat pressuring product teams for shorter time to market? When he took over all Intel products were constantly being pushed back with Sapphire Rapids being the most infamous example. Who knows, maybe if it wasn't for this approach, Granite rapids, SRF, ARL, Lunar all wouldve been Q1-Q2 2025
Sapphire was something else it was during Kranzichs time to be delivered and it took till pat to deliver.
Also other things he did like
the design was not getting free tape out
Accurate simulations for metrics
Pressuring to shorter time to market
Design not getting hot lots and many more lol
 

poke01

Platinum Member
Mar 8, 2022
2,581
3,409
106
so what is Intels answer to this?
Nova lake?

Intel dropped SMT at the worst possible time too. M4 Max CPU power consumption is around 60 watts in R23.

And you know what not a SINGLE M4 CPU test regressed. Intel start making your ship straight because if others start achieving this kind of performance in the Windows laptop ecosystem then it will be too late.
 

cannedlake240

Senior member
Jul 4, 2024
207
111
76
so what is Intels answer to this?
Nova lake?

Intel dropped SMT at the worst possible time too. M4 Max CPU power consumption is around 60 watts in R23.

And you know what not a SINGLE M4 CPU test regressed. Intel start making your ship straight because if others start achieving this kind of performance in the Windows laptop ecosystem then it will be too late.
View attachment 111251
This comes up from time lol, the fact is that they have no answer... Royal core was their premiere uarch program aiming high ST perf and that got canned 4-5 years into development just this summer. Their next chance from rumors is the Atom based uarch that's targeting beyond 2028 timeline
 

511

Golden Member
Jul 12, 2024
1,034
894
106
so what is Intels answer to this?
Nova lake?

Intel dropped SMT at the worst possible time too. M4 Max CPU power consumption is around 60 watts in R23.

And you know what not a SINGLE M4 CPU test regressed. Intel start making your ship straight because if others start achieving this kind of performance in the Windows laptop ecosystem then it will be too late.
View attachment 111251
It will be around 20% if you factor the OS but Nova Lake P core should be at least 20-25% ST Improvement considering they will fix their IMC/Mem controller along with APX and additional changes.Also they are doing a Tick and Tock so it is to be expected otherwise I don't know what would happen to x86
I also expect AMD to be around 15-20 better
 

OneEng2

Senior member
Sep 19, 2022
259
356
106
For security reasons single core threads should not be split to different users. As cloud providers usually just disables SMT even AMD does also offer high-core count Epycs without SMT to them.
Do you have a link on this? I have never heard of anything like this before.
Thanks.

Still, no matter how you slice it, or find a corner case where SMT can't be used, the fact still remains that SMT is a very good architectural element for achieving high performance in MT rich applications. Intel made a mistake by removing it as it cripples their ability to compete in DC where the most growth and highest margins exist.
 

511

Golden Member
Jul 12, 2024
1,034
894
106
Do you have a link on this? I have never heard of anything like this before.
Thanks.

Still, no matter how you slice it, or find a corner case where SMT can't be used, the fact still remains that SMT is a very good architectural element for achieving high performance in MT rich applications. Intel made a mistake by removing it as it cripples their ability to compete in DC where the most growth and highest margins exist.
there may be some edge cases otherwise we would not have options to disable SMT on SKUs
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |