Discussion Future ARM Cortex + Neoverse µArchs Discussion

Page 28 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

SarahKerrigan

Senior member
Oct 12, 2014
585
1,403
136
So I'm thinking that the Client CSS stuff we're seeing now is basically what Qualcomm was talking about in their lawsuit a couple of years ago with the whole "you'll have to bundle Cortex with Mali" complaint. Going to be interesting to see what that ends up looking like in practice - I have a hard time believing that Nvidia and Samsung (and, for that matter, Renesas) will just be forced onto Mali for their lineups.
 
Reactions: xpea and SpudLobby

gdansk

Platinum Member
Feb 8, 2011
2,478
3,373
136
About 10% of it seems to be clock rate from N3 but overall very promising.
And I like the "X925 x12" configuration for a laptop. But Mali drivers? I guess there are some of us hoping for MediaTek to combine that with an Nvidia GPU.
 

uzzi38

Platinum Member
Oct 16, 2019
2,698
6,393
146
Damn those slides are hilariously bad. So incredibly misleading. You really have to take a good look at all of the slides to notice as well.

Most of the gains they're touting for CPU outside of the AI stuff is just clock gains - most of which won't be used in smartphones anyway (just like how nobody clocked X4 at 3.6GHz, clocking X925 at 3.8GHz is even less likely).




They're getting those clock gains by comparing X925 at it's best against a relatively gimped X4:



(X4 is capable of hitting 3.6GHz, but nobody shipped this because power is too high for smartphones. X925 peak power consumption looks to be worse, so 3.8GHz comparisons are also equally useless).

IPC uplift looks a touch under 15% at best if you go off this slide:



But this slide also directly contradicts the prior slide showing much smaller gains in GB6.

Oh and the comparison platform also just so happened to have twice the L3 and SLC available to it:




No iso-node comparison (the comparisons provided are using an N4-based test platform vs an N3-based test platform) means power efficiency gains look to be all node as well. And again, this peak power will only be useful when we're talking about laptops in the future - this is too high power for a mobile phone. If the chart is remotely accurate we're looking at a ~20% increase to peak power.



Honestly I wasn't expecting much for X5/X925, but holy damn I was very unimpressed by the slideware.
 

Attachments

  • 1716997066661.png
    224.1 KB · Views: 16

SarahKerrigan

Senior member
Oct 12, 2014
585
1,403
136
Damn those slides are hilariously bad. So incredibly misleading. You really have to take a good look at all of the slides to notice as well.

Most of the gains they're touting for CPU outside of the AI stuff is just clock gains - most of which won't be used in smartphones anyway (just like how nobody clocked X4 at 3.6GHz, clocking X925 at 3.8GHz is even less likely).

View attachment 99914


They're getting those clock gains by comparing X925 at it's best against a relatively gimped X4:

View attachment 99915

(X4 is capable of hitting 3.6GHz, but nobody shipped this because power is too high for smartphones. X925 peak power consumption looks to be worse, so 3.8GHz comparisons are also equally useless).

IPC uplift looks a touch under 15% at best if you go off this slide:

View attachment 99916

But this slide also directly contradicts the prior slide showing much smaller gains in GB6.

Oh and the comparison platform also just so happened to have twice the L3 and SLC available to it:

It really doesn't contradict. Showing ~30% over "Android flagship 2023" lines up reasonably well with a 15% iso-clock win. (I'm pretty sure "3.8GHz Best-in-class" actually refers to A17.)
 

FlameTail

Diamond Member
Dec 15, 2021
3,122
1,786
106
The Cortex-X925 is also designed for and optimized for AI-based workloads, with dedicated AI accelerators and software optimizations that enhance AI processing efficiency. With up to 80 TOPS (trillion operations per second), the core can handle complex AI tasks, from natural language processing to computer vision
What. 80 TOPS in one core??
 

soresu

Platinum Member
Dec 19, 2014
2,934
2,157
136
ARM's Cambridge team continues to drink tea.
A5x and A5xx have typically seen very few updates.

v8-A CPUs saw only 2 - A53 and A55, announced 4 years apart.

So the cadence so far on v9-A while unimpressive is still more than we have seen previously.
 

FlameTail

Diamond Member
Dec 15, 2021
3,122
1,786
106
Cortex X925 seems to be generally underwhelming. It's seemingly not a clean-sheet design, but an evolution of X4. Also

Based on their numbers, X925's GB6 SC should be about ~3000.

Still incapable of of competing with the forerunners- Apple, Intel and AMD.
 
Reactions: Apokalupt0

soresu

Platinum Member
Dec 19, 2014
2,934
2,157
136
What. 80 TOPS in one core??
That's a tricky figure to reason out when 1 TOPS can mean INT4 or FP64 with wildly differing implications.

At 3.8 Ghz w/ 6x 128b SIMD ALUs it doesn't sound impossible for lower precision math.
 

FlameTail

Diamond Member
Dec 15, 2021
3,122
1,786
106
X925 must be having the widest SIMD in an ARM client core.

Even Apple M4 P has 4x128b
 

FlameTail

Diamond Member
Dec 15, 2021
3,122
1,786
106
Cortex X925 seems to be generally underwhelming. It's seemingly not a clean-sheet design, but an evolution of X4. Also

Based on their numbers, X925's GB6 SC should be about ~3000.

Still incapable of of competing with the forerunners- Apple, Intel and AMD.
This is concerning for the future of Windows-on-ARM (except for Qualcomm maybe, because they use their custom Oryon CPU).
 

soresu

Platinum Member
Dec 19, 2014
2,934
2,157
136
Even Apple M4 P has 4x128b
A lot of SIMD grunt is pretty much overkill in a purely mobile core.

Now that they are looking at real desktop/laptop marketshare it doesn't surprise me that they are going for it.

Unless I missed my guess A725 is still just 2x.
 

Nothingness

Platinum Member
Jul 3, 2013
2,719
1,348
136
That's a tricky figure to reason out when 1 TOPS can mean INT4 or FP64 with wildly differing implications.

At 3.8 Ghz w/ 6x 128b SIMD ALUs it doesn't sound impossible for lower precision math.
That doesn't seem possible: even for 4-bit FMA that'd mean 3.8G*2*6*32, which is less than 1.5 TOPS.
 
Reactions: Hitman928

soresu

Platinum Member
Dec 19, 2014
2,934
2,157
136
This is concerning for the future of Windows-on-ARM (except for Qualcomm maybe, because they use their custom Oryon CPU).
ARM typically incrementally upgrade their CPU every year, I wouldn't expect QC to be so prompt.

By the time QC field Pegasus or whatever Oryon2 is the ARM contender will not be in the same place.
 

FlameTail

Diamond Member
Dec 15, 2021
3,122
1,786
106
ARM typically incrementally upgrade their CPU every year, I wouldn't expect QC to be so prompt.

By the time QC field Pegasus or whatever Oryon2 is the ARM contender will not be in the same place.
Yeah but this Cortex A925 will have to compete with Zen5, Lion Cove and M4-P.
 

ikjadoon

Member
Sep 4, 2006
139
222
126
They're getting those clock gains by comparing X925 at it's best against a relatively gimped X4:

View attachment 99915

(X4 is capable of hitting 3.6GHz, but nobody shipped this because power is too high for smartphones. X925 peak power consumption looks to be worse, so 3.8GHz comparisons are also equally useless).

According to that slide, Arm didn't target X4 to hit 3.6 GHz, though: Arm targeted X4 for 3.3 GHz.

Arm's frequency targets—see below the timeline—are usually spot on. Whether ">3.6 GHz" designs actually hit 3.8 GHz or just 3.61 GHz is a fair question when Arm uses 3.8 GHz in their comparisons, but Arm's target frequency is pretty common.

TCS21 = Cortex-X2 = Arm targeted 3.0 GHz, OEMs shipped 3.2 GHz (+6.6% Arm's target)
TCS22 = Cortex-X3 = Arm targeted 3.2 GHz, OEMs shipped 3.4 GHz (+5% Arm's target)
TCS23 = Cortex-X4 = Arm targeted 3.3 GHz, OEMs shipped 3.4 GHz (+3% Arm's target)
CSS24 = Cortex-X925 = Arm targets 3.6 GHz+, OEMs will ship...?

It should be expected for a full node shrink to give headroom for higher clocks.

//

Not to say these slides are anything great. Arm doesn't commit themselves to a precise IPC calculation over X4, from my review, and that's not helpful to not even have a first-party public target.

Then, this Speedometer 2.1 data is just nonsense. Why on Earth would Arm not also provide data from the same Chromium build, to control any "SW uplift"? That is not really a fair comparison at all.



Sadly, this is likely one of the few disclosures Arm Ltd. will ever do on X925, so our window of opportunity to really learn more has unexpectedly closed.
 

ikjadoon

Member
Sep 4, 2006
139
222
126
So I'm thinking that the Client CSS stuff we're seeing now is basically what Qualcomm was talking about in their lawsuit a couple of years ago with the whole "you'll have to bundle Cortex with Mali" complaint. Going to be interesting to see what that ends up looking like in practice - I have a hard time believing that Nvidia and Samsung (and, for that matter, Renesas) will just be forced onto Mali for their lineups.

I believe you're right. XDA goes into only slightly more detail:

Arm is also, for the first time, packaging these cores as a GDSII to be provided to OEMs. This is essentially a file that can be provided to a fabricator like TSMC or Samsung for immediate production, and those GDSII files take into account any specific quirks or features of the fabricator. Arm says that a big perk of this is that it can improve time to market for companies that are using these cores, but that those companies can simply license the designs and do all of the leg work themselves if they wish, just like with previous Arm cores.

On that topic of litigation, from my recent check at Court Listener, Qualcomm added counter-claims against Arm that Arm did not actually destroy / quarantine all of the NUVIA IP that Arm had access to. It seems to be related to fabrics, as it mentions Arm's NCI and CMN, but also the MMU; Qualcomm says they found out about the claimed breaches during the discovery & deposition phase.

From this PDF and this PDF.

Hard to know the real details as so much is sealed & redacted, but Qualcomm's amendments were partly allowed (Qualcomm are the Defendants below) and QC seems likely to pursue this reverse-infringement-by-Arm claim:

[SEALED] Memorandum Order granting-in-part and denying-in-part Defendants' motion to amend answer and counterclaims (DI 260, 272). Signed by Judge Laura D. Hatcher on 3/6/24.This order has been emailed to local counsel. (kjk) (Entered: 03/06/2024)

Trial is still set for December 2024, so a long ways to go (for us, but I imagine Arm & QC are under pressure to either win handily or settle early).
 

FlameTail

Diamond Member
Dec 15, 2021
3,122
1,786
106
On that topic of litigation, from my recent check at Court Listener, Qualcomm added counter-claims against Arm that Arm did not actually destroy / quarantine all of the NUVIA IP that Arm had access to. It seems to be related to fabrics, as it mentions Arm's NCI and CMN, but also the MMU; Qualcomm says they found out about the claimed breaches during the discovery & deposition phase.

From this PDF and this PDF.

Hard to know the real details as so much is sealed & redacted, but Qualcomm's amendments were partly allowed (Qualcomm are the Defendants below) and QC seems likely to pursue this reverse-infringement-by-Arm claim:

Trial is still set for December 2024, so a long ways to go (for us, but I imagine Arm & QC are under pressure to either win handily or settle early).
You should post updates about that court case more often, if possible.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |