Samsung outs Exynos 9 Series 9810

Page 3 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Lodix

Senior member
Jun 24, 2016
340
116
116
It seems their performance targets were not exaggerate as some people implied.
 

french toast

Senior member
Feb 22, 2017
988
825
136
Uhhh what exactly made you come to the conclusion they're in order? They are OOO..
Oh, this quote from your article "At heart the Exynos M1 and M2 microarchitectures are based on a 4-wide in-order stage for decode and dispatch"

I took that as at least partially in order?
 

Andrei.

Senior member
Jan 26, 2015
316
386
136
Oh, this quote from your article "At heart the Exynos M1 and M2 microarchitectures are based on a 4-wide in-order stage for decode and dispatch"

I took that as at least partially in order?
That's how all OOO processors work, the front-end is always in order whilst execution is out of order.
 

krumme

Diamond Member
Oct 9, 2009
5,956
1,595
136
I would have liked a new a73 implementation on 10nm second gen. Seems more sensible to me outside of bragging rights.
As if people need to have an AI server farm in their pocket.
 

Andrei.

Senior member
Jan 26, 2015
316
386
136
What is all the FP performance needed for?
I mean is this more a server cpu tacked to a a55 mobile cpu?
90% of what you do with your phone, web browsing. JavaScript is completely 64bit floating point. Apple's cores are wider than this, nobody seems to bat an eye at them..
 
Reactions: krumme

krumme

Diamond Member
Oct 9, 2009
5,956
1,595
136
90% of what you do with your phone, web browsing. JavaScript is completely 64bit floating point. Apple's cores are wider than this, nobody seems to bat an eye at them..
Damn. I didnt knew that about java script. Okey...lol
 

Hans de Vries

Senior member
May 2, 2008
321
1,018
136
www.chip-architect.com
Performance is dead clear. Now we have to see the efficiency difference.

https://www.anandtech.com/show/12361/samsung-exynos-m3-architecture
Nice article Andrei

Good find, the link really contains a lot of info about the M3 :^)
https://reviews.llvm.org/D42387
Two small remarks:
- The 2 complex ALU also handle simple integer stuff, so the simple integer handling is now 4 wide.
- The Branch units calculate branch addresses, the branch prediction unit is in front, it steers the fetch unit.
 

Thala

Golden Member
Nov 12, 2014
1,355
653
136
I would have liked a new a73 implementation on 10nm second gen. Seems more sensible to me outside of bragging rights.
As if people need to have an AI server farm in their pocket.

The A75 is as efficient as the A73 - e.g. at iso-performance same power. In addition the DSU only supports A75 and A55 but not A73. No need to extend the lifetime of A73.
 
Reactions: pcp7

eek2121

Diamond Member
Aug 2, 2005
3,053
4,281
136
I wouldn't be surprised if they boosted single core performance close to 2x in geekbench. I also wouldn't be surprised to find out that the various ARM SOC vendors are now optimising their designs around getting high geek bench scores at the expense of actual performance. They'd almost be insane not to when this single metric has somehow become the defacto benchmark of processor performance. Imagine what would inevitably happen if the GPU-buying market assessed the speed of GPUs almost entirely based on their 3DMark scores. GPUs would get very good at 3DMark in very short order.

We desperately need a wider variety of benchmarks for mobile that reliably isolate processor performance.

As much crap as people give Geekbench, Geekbench uses open source libraries in order to test performance across a variety of workloads. I would LOVE to see a CPU/SoC that performs great at Geekbench and has shit performance at other benchmarks. Zlib, Jpeg compression, etc...that's all stuff based on standard libraries.
 
Last edited:

french toast

Senior member
Feb 22, 2017
988
825
136
That's nigh-impossible to pull off for them.
Really? Apple apparently did so without buying any major ip or graphics company.. whether you believe they genuinely designed their own gpu from scratch that would appear to run all software and games that was designed for imagination cores is up to you. (Without massive driver overhaul/issues)
If imagination don't bring litigation then I suppose this must be the case, no matter how suspicious it seems to myself.
Nevertheless, they still have a GPU product they call their own.

Intel did something similar, to less success.

Samsung has been rumoured to be working on GPU designs for many years, it is quite possible that after internal projects and millions of dollars that Samsung could have their own design out next year..whether it's competitive or not should it arrive is guess work, but it is possible.

They have motive...they design their own SOCs, custom processor's and fabric, their own CPUs, modems.
Then they fab them on their own process from their own foundry.
It stands to reason they would pine for their own gpu uarch...mated to either heavily skinned/adjusted android to their specifications or even Tizen?...that would give them the holy grail of top to bottom/ vertical integration of their own hardware/software...ie Apple.

Samsung has the motive,engineering talent and the finances to do it...I for sure wouldn't bet against it happening.
 
Last edited:

Qwertilot

Golden Member
Nov 28, 2013
1,604
257
126
Not quite guesswork - if it arrives then it'll be competitive They seem to be far too pragmatic to forcibly use a substandard in house design.
 
Reactions: french toast

Nothingness

Platinum Member
Jul 3, 2013
2,769
1,429
136
@Andrei. As written on RWT:
I might have been missing something but what makes Andrei and others think M3 has
6 decoders?

The IssueWidth from the machine description is not the number of decoders. For instance
the Skylake machine description also sets IssueWidth to 6 though Skylake doesn't have
6 decoders. See this.

So it's possible M3 has less than 6 decoders but has a uop cache that can issue up to
6 uops each cycle.
Did you pick that 6-decode information from IssueWidth? If so then I guess this might be wrong.
 

Andrei.

Senior member
Jan 26, 2015
316
386
136
@Andrei. As written on RWT:

Did you pick that 6-decode information from IssueWidth? If so then I guess this might be wrong.
It's literally written there in cleartext.

// The Exynos-M1 is a traditional superscalar microprocessor with a
// 4-wide in-order stage for decode and dispatch and a wider issue stage.
// The execution units and loads and stores are out-of-order.

// The Exynos-M3 is an advanced superscalar microprocessor with a 6-wide
// in-order stage for decode and dispatch and a wider issue stage.
// The execution units and loads and stores are out-of-order.
 

Nothingness

Platinum Member
Jul 3, 2013
2,769
1,429
136
It's literally written there in cleartext.

// The Exynos-M1 is a traditional superscalar microprocessor with a
// 4-wide in-order stage for decode and dispatch and a wider issue stage.
// The execution units and loads and stores are out-of-order.

// The Exynos-M3 is an advanced superscalar microprocessor with a 6-wide
// in-order stage for decode and dispatch and a wider issue stage.
// The execution units and loads and stores are out-of-order.
Ha thanks for clarifying. So I guess this means either no uop cache or a uop cache with the same width as the decode stage.

I wonder if these decoders are completely identical or if some of them only support AArch64 for instance.
 

Lodix

Senior member
Jun 24, 2016
340
116
116
I tthough maybe their claim about "around two-fold" single core performance would translate to ×1'8-1'95 in really but in this new marketing image on twitter they state a solid x2 performance.

 

Andrei.

Senior member
Jan 26, 2015
316
386
136
You need to understand that SLSI's marketing department is a department that is really small and essentially has no reason to exit as it doesn't *really* benefit the business in any way. When they do talk about any claims or marketing figures they are generally sourced from the technical teams.
 
Reactions: Lodix

french toast

Senior member
Feb 22, 2017
988
825
136
You need to understand that SLSI's marketing department is a department that is really small and essentially has no reason to exit as it doesn't *really* benefit the business in any way. When they do talk about any claims or marketing figures they are generally sourced from the technical teams.
Andrei, Is there a reason you can think of why apple was able to get to such a massive multithreaded performance with A11..with just 2 ultra wide cores and 4 small/medium cores...Vs Samsung with improved A55s and similar width M3 cores..but more of them? (Approx 25% lower MT score projected geekbench?)

I would have though having more cores would have allowed exynos 8910 to spread the load at lower clock speeds... thereby getting higher performance per watt..
Is this a superior memory controller or fabric from apple at play..
 

krumme

Diamond Member
Oct 9, 2009
5,956
1,595
136
Andrei, Is there a reason you can think of why apple was able to get to such a massive multithreaded performance with A11..with just 2 ultra wide cores and 4 small/medium cores...Vs Samsung with improved A55s and similar width M3 cores..but more of them? (Approx 25% lower MT score projected geekbench?)

I would have though having more cores would have allowed exynos 8910 to spread the load at lower clock speeds... thereby getting higher performance per watt..
Is this a superior memory controller or fabric from apple at play..
Didnt apple get two more small cores in a11?
Besides those small cores is probably fairly beefy vs a55 but i cant find specs we need Hans/andrei for this so more like half big and in a powerconstrained situation like mt perf. they are effective vs a very wide design?
 

jt7

Junior Member
Jan 4, 2018
4
1
81
Andrei, Is there a reason you can think of why apple was able to get to such a massive multithreaded performance with A11..with just 2 ultra wide cores and 4 small/medium cores...Vs Samsung with improved A55s and similar width M3 cores..but more of them? (Approx 25% lower MT score projected geekbench?)

I would have though having more cores would have allowed exynos 8910 to spread the load at lower clock speeds... thereby getting higher performance per watt..
Is this a superior memory controller or fabric from apple at play..

I'm guessing the performance per watt on the big cores is not as good as the Monsoon cores. Given that the SOCs are power limited in mutl core mode, the big cores have to be clocked way down compared to the 2.9ghz max. On the A11 the big cores can run closer to their peak performance.

If this is the case, it makes you wonder why they even bothered with 4 big cores. Perhaps it can stretch its legs more in a laptop form factor.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |