Question Post your Geekbench AI scores!

igor_kavinski · Aug 15, 2024

Det0x did the honors here: http://www.portvapes.co.uk/?id=Latest-exam-1Z0-876-Dumps&exid=thread...ranite-ridge-ryzen-9000.2607350/post-41277152

I'll try to post my score as soon as I get home.

igor_kavinski · Aug 15, 2024

Here ya go, folks!

ASRock Z790 PG SONIC vs ASRock Z790 PG SONIC - Geekbench

So what are we looking at here?

Same system. The faster one is just overclocked using Intel XTU (5.1 GHz P-cores 4.2ghz E-cores 275W 320A).

The red scores? That's when the system thermally throttled pretty hard.

Hail's 7950X beating the crap out of my system: https://browser.geekbench.com/ai/v1/compare/4729?baseline=6860

Det0x's 9950X ES taking the wind out of my PC: https://browser.geekbench.com/ai/v1/compare/4408?baseline=6860

gdansk · Aug 15, 2024

I tried running on my M1 Pro NPU and it's been stuck at Pose Estimation (Q) for 5 minutes.
The CPU didn't take long though:

MacBook Pro (16-inch, 2021) - Geekbench

Benchmark results for a MacBook Pro (16-inch, 2021) with an Apple M1 Pro processor.

browser.geekbench.com

poke01 · Aug 15, 2024

Can someone post the Ryzen HX 370 result?
On paper it has the highest, 50 TOPS.

However Apple claimed to the fastest NPU with the M4 with 38 TOPS but that was before HX 370 came out.
True to Apple's claims they do have the fastest NPU from what I seen so far at least in (INT8).

M4 NPU:

iPad16,3 - Geekbench

Benchmark results for an iPad16,3 with an ARM processor.

browser.geekbench.com

8 Gen 3, Qualcomm's best NPU:

Samsung Galaxy S24 - Geekbench

Benchmark results for a Samsung Galaxy S24 with an ARM ARMv8 processor.

browser.geekbench.com

Snapdragon X Elite - X1E78100 NPU:

LENOVO 83ED - Geekbench

Benchmark results for a LENOVO 83ED with a Snapdragon X Elite - X1E78100 - Qualcomm Oryon processor.

browser.geekbench.com

igor_kavinski · Aug 15, 2024

gdansk said:
I tried running on my M1 Pro NPU and it's been stuck at Pose Estimation (Q) for 5 minutes.

Maybe V1.0 bug.

igor_kavinski · Aug 15, 2024

poke01 said:
M4 NPU:

iPad16,3 - Geekbench

Benchmark results for an iPad16,3 with an ARM processor.

browser.geekbench.com

8 Gen 3, Qualcomm's best NPU:

Samsung Galaxy S24 - Geekbench

Benchmark results for a Samsung Galaxy S24 with an ARM ARMv8 processor.

browser.geekbench.com

Snapdragon X Elite - X1E78100 NPU:

LENOVO 83ED - Geekbench

Benchmark results for a LENOVO 83ED with a Snapdragon X Elite - X1E78100 - Qualcomm Oryon processor.

browser.geekbench.com

There is something else to note there.

Correction: The Apple NPU is more accurate than the CPUs!

poke01 · Aug 15, 2024

igor_kavinski said:
There is something else to note there.

The NPUs are more accurate than CPUs!

True, thats why I'm curious to see how Strix point NPU performs.

igor_kavinski · Aug 15, 2024

poke01 said:
True, thats why I'm curious to see how Strix point NPU performs.

I don't think we are gonna see that soon. Either AMD hasn't been able to get the necessary software support ready or no one who has a HX 370 laptop knows enough to test the NPU. I'm kinda leaning towards the former.

igor_kavinski · Aug 15, 2024

AVX-512 on vs. off by MarkPost: http://www.portvapes.co.uk/?id=Latest-exam-1Z0-876-Dumps&exid=thread...ranite-ridge-ryzen-9000.2607350/post-41277318

Tech Junky · Aug 15, 2024

igor_kavinski said:
HX 370 laptop

It's only been a couple of weeks since they were released and only a couple of models from Asus. As for the desktop side those were really released this week for the 9 level and I don't recall if they had AI or not.

Hitman928 · Aug 15, 2024

poke01 said:
Can someone post the Ryzen HX 370 result?
On paper it has the highest, 50 TOPS.

However Apple claimed to the fastest NPU with the M4 with 38 TOPS but that was before HX 370 came out.
True to Apple's claims they do have the fastest NPU from what I seen so far at least in (INT8).

M4 NPU:

iPad16,3 - Geekbench

Benchmark results for an iPad16,3 with an ARM processor.

browser.geekbench.com

8 Gen 3, Qualcomm's best NPU:

Samsung Galaxy S24 - Geekbench

Benchmark results for a Samsung Galaxy S24 with an ARM ARMv8 processor.

browser.geekbench.com

Snapdragon X Elite - X1E78100 NPU:

LENOVO 83ED - Geekbench

Benchmark results for a LENOVO 83ED with a Snapdragon X Elite - X1E78100 - Qualcomm Oryon processor.

browser.geekbench.com

This benchmark doesn’t support the AMD NPU yet.

FlameTail · Aug 15, 2024

Hitman928 said:
This benchmark doesn’t support the AMD NPU yet.

Yes, the signal65 article noted that;

Despite having one of the fastest NPUs on paper, Geekbench AI 1.0 still doesn’t have the ability to measure the AMD Ryzen AI NPU performance. When I asked Primate Labs about this, I was told the reasoning was that it was a representation of where AMD stood today in terms of its consumer AI framework implementations, and that trying to integrate support through Vitis software (carryover from Xilinx) just wasn’t working out. Disappointing for sure, but also is mirrored by the fact that you cannot run the Procyon AI benchmark on AMD NPUs. Hopefully we’ll have a solution from AMD on this soon.

New Geekbench AI 1.0 Benchmark Analysis and Early Results - Signal65

While the ability to run performance analysis of on-device AI computing continues to expand, Geekbench AI 1.0 is a new benchmark that attempts to make sense of it all. Signal65 dives into the benchmark itself, pros and cons, and shares early performance data.

signal65.com

Makaveli · Aug 15, 2024

mikegg · Aug 16, 2024

delete

mikegg · Aug 16, 2024

Device	Single Precision	Half Precision	Quantized
Intel Core Ultra 9 185H (NPU)	7172	7176	11000
Qualcomm Snapdragon X Elite 80-100 (NPU)	2177	11069	21549
M3 (NPU)	2499	13971	14877
M4 (NPU)	4702	32052	40743
NVIDIA GeForce RTX 4090	36800	50531	27568

Source: https://signal65.com/research/ai/new-geekbench-ai-1-0-benchmark-analysis-and-early-results/

Using best framework for each NPU. Added RTX GPU (ONNX DirectML) for reference.

Based on this benchmark, we can clearly see that a GPU is geared towards training (FP32 & FP16) and is not very efficient for inference (INT8/INT4).

Hitman928 · Aug 16, 2024

mikegg said:
Device Single Precision Half Precision Quantized
Intel Core Ultra 9 185H 7172 7176 11000
Qualcomm Snapdragon X Elite 80-100 2177 11069 21549
M3 2499 13971 14877
M4 4702 32052 40743

Can you add the framework used?

itsmydamnation · Aug 17, 2024

mikegg said:
Device Single Precision Half Precision Quantized
Intel Core Ultra 9 185H (NPU) 7172 7176 11000
Qualcomm Snapdragon X Elite 80-100 (NPU) 2177 11069 21549
M3 (NPU) 2499 13971 14877
M4 (NPU) 4702 32052 40743
NVIDIA GeForce RTX 4090 36800 50531 27568

Source: https://signal65.com/research/ai/new-geekbench-ai-1-0-benchmark-analysis-and-early-results/

Using best framework for each NPU. Added RTX GPU (ONNX DirectML) for reference.

Based on this benchmark, we can clearly see that a GPU is geared towards training (FP32 & FP16) and is not very efficient for inference (INT8/INT4).

is it driver restriction ? Nvidia do that alot on allowed throughput rates of "datacentre" formats , Nv should be eating most of the various precision alive.

mikegg · Aug 17, 2024

itsmydamnation said:
is it driver restriction ? Nvidia do that alot on allowed throughput rates of "datacentre" formats , Nv should be eating most of the various precision alive.

No. Highly doubt it is.

Gaming emphasizes FP32.

itsmydamnation · Aug 17, 2024

mikegg said:
No. Highly doubt it is.

Gaming emphasizes FP32.

Not really been using packed math for years

https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.nvidia.com/content/PDF/nvidia-ampere-ga-102-gpu-architecture-whitepaper-v2.pdf&ved=2ahUKEwipxO68wfuHAxWV4jQHHYnlAb8QFnoECBMQAQ&usg=AOvVaw0GxIvUUS-GXgMQrYWoGc_1

The GA10x SM continues to support double-speed FP16 (HFMA) operations which are
supported in Turing. And similar to TU102, TU104, and TU106 Turing GPUs, standard FP16
operations are handled by the Tensor Cores in GA10x GPU

igor_kavinski · Aug 19, 2024

Different AI processing options comparison in the same laptop: http://www.portvapes.co.uk/?id=Latest-exam-1Z0-876-Dumps&exid=threads/qualcomm-snapdragon-thread.2616013/post-41279314

Clean right now but it could become messy when more laptops of the same model run GBAI in future: https://browser.geekbench.com/search?k=ai&q=Inspiron+13+5330+&utf8=✓

igor_kavinski · Aug 19, 2024

Anyone know how one may ascertain the NPU TOPS from these GB scores? Or should one just double the Quantized score to arrive at the TOPS? That would mean the M4 NPU has 80 TOPS!

Doug S · Aug 19, 2024

igor_kavinski said:
Anyone know how one may ascertain the NPU TOPS from these GB scores? Or should one just double the Quantized score to arrive at the TOPS? That would mean the M4 NPU has 80 TOPS!

Since different vendors are reporting different things with "TOPS" (e.g. some may be INT8, some INT4, some FP8) there's no formula for conversion. But we'll be able to see what "TOPS" figures their marketers claim, and compare to GB AI scores, and figure out a "fudge factor" to compare e.g. the TOPS figure for Qualcomm to Intel, or whatever. Obviously that's pointless once GB AI scores are available, but when something new is announced but not yet released, vendor claimed TOPS are all you have to go by.

Det0x · Sep 11, 2024

Did some testruns in preperation for hwbot, found out that this benchmark dont care about threads at all.. Getting pretty much same score with SMT enabled/disabled on my 9950X

16/32 SMT enabled

16/16 SMT disabled

MS_AT · Sep 11, 2024

Det0x said:
Did some testruns in preperation for hwbot, found out that this benchmark dont care about threads at all.. Getting pretty much same score with SMT enabled/disabled on my 9950X

16/32 SMT enabled
View attachment 107381

16/16 SMT disabled
View attachment 107382

Have you observed thread utilization? OpenVino might limit itself to physical cores since HT won't give you lots of benefits in backend bound code. What you might see is noticeable performance scaling with DDR MT/s if the benchmark is using LLMs underneath.

Det0x · Sep 11, 2024

MS_AT said:
Have you observed thread utilization? OpenVino might limit itself to physical cores since HT won't give you lots of benefits in backend bound code.

Yeah thats exacly what it looked like when i had hwinfo open while running

Device	Single Precision	Half Precision	Quantized
Intel Core Ultra 9 185H	7172	7176	11000
Qualcomm Snapdragon X Elite 80-100	2177	11069	21549
M3	2499	13971	14877
M4	4702	32052	40743

Question Post your Geekbench AI scores!

Lifer

Lifer

Diamond Member

Platinum Member

Lifer

Lifer

Platinum Member

Lifer

Lifer

Diamond Member

Diamond Member

Diamond Member

Diamond Member

Golden Member

Golden Member

Diamond Member

Platinum Member

Golden Member

Platinum Member

Lifer

Lifer

Platinum Member

Golden Member

Senior member

Golden Member