Question Geekbench 6 released and calibrated against Core i7-12700

igor_kavinski · Feb 14, 2023

Geekbench Blog

www.geekbench.com

Weird choice of baseline CPU and even weird is that the baseline score is 2500.

i7-12700 does hardly 2000 in GB5 with the fastest DDR5.

okoroezenwa · Thursday at 9:11 AM

igor_kavinski said:
I don't know If AMD has AMX support in their CPUs.

Dunno either, though they got similar uplifts in the object identification subtest as shown in this previous thread by @poke01. Maybe due to AMX? 👀

igor_kavinski said:
He could've held off the SME changes until the end of the year when M4, Zen 5 and Lunar Lake will all be available.

Why? Only one of those is Arm-based so it won't matter. Also I have a very hard time believing people like you would deal with that in any sensible way.

igor_kavinski said:
If there were multiple CPUs with SME being released, I would get that and it wouldn't seem "suspicious".

lol, you'd just claim it was some Arm bias anyway.

Nothingness · Thursday at 9:58 AM

FTR some weeks ago I posted a link comparing w/wo same gen InteL CPU w/wo AMX

Here it is again but reversed to better apprehend the effect of AMX:

ASUS System Product Name vs HP HP Pro Mini 400 G9 Desktop PC - Geekbench

Hitman928 · Thursday at 9:59 AM

okoroezenwa said:
Dunno either, though they got similar uplifts in the object identification subtest as shown in this previous thread by @poke01. Maybe due to AMX? 👀

Why? Only one of those is Arm-based so it won't matter. Also I have a very hard time believing people like you would deal with that in any sensible way.

lol, you'd just claim it was some Arm bias anyway.

AMD CPUs don't support AMX. They get an uplift as object identification has AVX-VNNI support (AVX512-VNNI is also supported on applicable CPUs). AMX isn't supported on client Intel CPUs either, it's a special accelerator that Intel only puts in Xeons.

igor_kavinski · Thursday at 10:01 AM

Nothingness said:
ASUS System Product Name vs HP HP Pro Mini 400 G9 Desktop PC - Geekbench

Great comparison but I don't think AMX is coming to consumer CPUs.

Bencher · Thursday at 10:10 AM

igor_kavinski said:
Because it's being treated as THE benchmark by the fruit users, not A benchmark.

I always felt like Geekbench was trash. Especially gb5, that was insanely, absurdly memory bound. I remember just tuning my memory increased my gb5 score by 40%, lol.

poke01 · Thursday at 10:17 AM

igor_kavinski said:
It has a USEFUL output. It's not just a useless benchmark. The developer is passionate about optimization. He cares about Pi digits and that's what his optimizations make possible in lesser and lesser time.

This is such a double take because it ruins your speculation on why SME was added to GB before retail release. So it’s okay for one developer to add optimisations but not the other before release?

Let’s call it Zen-cruncher from now on then.

poke01 · Thursday at 10:31 AM

some here need to understand that a pure cpu benchmark is not a good a way of measuring performance.

AVX, AVX-512, NEON and SVE/SVE2 etc are all useful. That’s why looking at subtexts is helpful.

igor_kavinski said:
Before Lunar Lake launch?

Well, GB did add VINNI support which Lunar Lake will have.

igor_kavinski · Thursday at 11:21 AM

poke01 said:
This is such a double take because it ruins your speculation on why SME was added to GB before retail release. So it’s okay for one developer to add optimisations but not the other before release?

One is a benchmark developer. Other is an application developer. First one can be bought and it misleads their users. Second one even if bought still benefits their users because their work finishes in LESS time. That's the difference.

A simple "I was sponsored by Apple to include SME" in this benchmark on his blogpost would've been enough.

Or if that were not the case, he could put a disclaimer saying that he was NOT sponsored and did it because he always wanted to add SME.

Right now, we have no idea why SME was added so close to M4.

TwistedAndy · Thursday at 2:11 PM

Bencher said:
I always felt like Geekbench was trash. Especially gb5, that was insanely, absurdly memory bound. I remember just tuning my memory increased my gb5 score by 40%, lol.

Geekbench 6 is not that different here as well. Let's take a closer look on the technical details of the tests included in Geekbench 6:

1. File Compression
Almost useless. Instead of LZ4 and ZSTD, it makes sense to use deflate (gzip/zip), which is used everywhere on the web and system-wide.

2. Navigation
Useless. Both Google and Apple Maps usually do not perform those calculations on the device. Even in the case of offline navigation, that test is not representative because different apps are used.

3. HTML5 Browser
It's not representative. Instead of using the actual headless browser or Node.js, Geekbench decided to use some libraries to parse and render HTML/CSS. Surprisingly, the most compute-heavy part (JS) is not included.

4. PDF Render
The first useful test!

5. Photo Library
It's a very weird test. Instead of measuring some useful things like JPG/PNG/WebP scaling/compression/decompression, they added a lot of other useless steps like running an image classification model and storing tags in the SQL database.

6. Developer workloads

- Clang. It's not representative because Clang is used mostly on Apple OS and some Unix systems. Technically, it's possible to use Clang on Windows or Linux, but it's not a common scenario. Windows uses MSVC by default. Linux - GCC.

In general, this benchmark represents the compile performance in Xcode for Apple devices and is not relevant for other platforms.

- Text Processing. It looks to be a valid benchmark, but it makes sense to process all those files using Node.js, JVM, or PHP. It's a more common scenario.

- Asset Compression. I do not work with 3D assets, but it looks to be valid.

7. Machine Learning Workloads

Despite having ML in the name, it's a useless benchmark. The actual apps that detect objects blur backgrounds, remove objects, etc., usually use GPU or NPU for that. Also, these kinds of workloads are very sensitive to code optimization and used libraries and frameworks.

8. Horizon Detection, Photo Filter, and HDR

These tasks also heavily depend on code optimization and the libraries used. The performance in different apps may be very different from the results in this section.

9. Ray Tracer

Useless. In most cases, the GPU makes the ray tracing calculations and scene rendering. There are some cases when it makes to do that on CPU, but it's an exception.

10. Structure from Motion

It looks to be a valid benchmark, but I'm not sure how frequently it is used.

Summary

Geekbench 6 is not the best benchmark available. Many tests heavily depend on the actual implementation and platform optimization, not to mention SME, AVX-512, etc.

Instead of using open and widely used tools, libraries, and apps like Node.js, Electron, Blender, JVM, etc., it uses some custom implementations for very questionable tasks.

Cinebench R23 and 2024, on the other hand, measure one specific task, but it's based on real commercial software.

igor_kavinski · Thursday at 4:05 PM

I think Andy did a much better job than me.

poke01 · Thursday at 4:40 PM

igor_kavinski said:
A simple "I was sponsored by Apple to include SME" in this benchmark on his blogpost would've been enough.

Thats the thing they weren't?? Geekbench also supports SVE which Apple doesn't use. SME is not made by Apple but by ARM.
Qualcomm/ARM also use geekbench and actively promotes it, it could have been them and they gave feedback to GB.

poke01 · Thursday at 4:41 PM

igor_kavinski said:
Second one even if bought still benefits their users because their work finishes in LESS time. That's the difference.

ehh, your really grasping for the straws

poke01 · Thursday at 4:42 PM

TwistedAndy said:
Cinebench R23 and 2024, on the other hand, measure one specific task, but it's based on real commercial software.

Cinebench R23 and to a certain extent 2024 are very optimized for the Intel CPUs. Not a fair benchmark for ARM and even AMD.

poke01 · Thursday at 4:46 PM

TwistedAndy said:
Instead of using open and widely used tools, libraries, and apps like Node.js, Electron, Blender, JVM, etc., it uses some custom implementations for very questionable tasks.

Thing is Blender also uses AVX-512 in the cycles test. So yeah, you cannot have a popular benchmark that doesn't use CPU extensions.

What I don't get @TwistedAndy you parrot GB for using CPU extensions but the benchmarks you prefer extensivity use AVX2, AVX512, SSE etc?

Doug S · Thursday at 5:15 PM

TwistedAndy said:
Cinebench R23 and 2024, on the other hand, measure one specific task, but it's based on real commercial software.

So what if it is based on commercial software? The one thing it does is something most PC users (let alone smartphone users) never do.

If Microsoft released a benchmark that was based on what Excel does when you recalculate a big spreadsheet would you think that's also great "because it is based on real commercial software"? I bet you'd find some objections to it if it didn't show the results you want to see.

Doug S · Thursday at 5:17 PM

igor_kavinski said:
It has a USEFUL output. It's not just a useless benchmark. The developer is passionate about optimization. He cares about Pi digits and that's what his optimizations make possible in lesser and lesser time.

In what world is the digits of pi "useful" output? You can download the already calculated value of pi to far more digits than your PC could calculate in a year, or your lifetime for that matter.

igor_kavinski · Thursday at 5:21 PM

poke01 said:
Qualcomm/ARM also use geekbench and actively promotes it, it could have been them and they gave feedback to GB.

Could be. At best, this could be the reason and the dev was too naive to think that people might blame him for being in cahoots with Apple if he released it early. Or he didn't know when Apple was gonna release M4. But for me (coz I'm the conspiracy theorist type), the simplest explanation is what seems to be the obvious one to me: Apple's dirty money made someone happy and eager to please.

igor_kavinski · Thursday at 5:22 PM

Doug S said:
In what world is the digits of pi "useful" output? You can download the already calculated value of pi to far more digits than your PC could calculate in a year, or your lifetime for that matter.

And how would the user know for sure that the downloaded output is correct? He would generate it himself to be sure.

igor_kavinski · Thursday at 5:25 PM

Doug S said:
If Microsoft released a benchmark that was based on what Excel does when you recalculate a big spreadsheet would you think that's also great "because it is based on real commercial software"? I bet you'd find some objections to it if it didn't show the results you want to see.

Not Microsoft but I use a benchmark sheet from overclock.net forums. It's good because it engages the calculation engine of a real spreadsheet that I installed on my PC.

Doug S · Thursday at 5:30 PM

igor_kavinski said:
Could be. At best, this could be the reason and the dev was too naive to think that people might blame him for being in cahoots with Apple if he released it early. Or he didn't know when Apple was gonna release M4. But for me (coz I'm the conspiracy theorist type), the simplest explanation is what seems to be the obvious one to me: Apple's dirty money made someone happy and eager to please.

Your fixation with and hatred of Apple has made you lose touch with reality. You think Apple is paying off the developer of a benchmark to make M4 look better? If Apple was paying for it, why wouldn't they get some mileage out of that by highlighting M4's GB6 results when they announced iPad Pro?

I suppose you have an excuse for that too, but you're like a flat earther who will concoct ever more wild fantasy explanations - which is what happens when you start with a conclusion and choose your "facts" to get you there.

igor_kavinski · Thursday at 5:37 PM

Doug S said:
why wouldn't they get some mileage out of that by highlighting M4's GB6 results when they announced iPad Pro?

Because then there would be too much finger pointing: "Oh look, Apple touting the GB6 score of a version that just released a month ago! I bet they made it happen with their wads of cash". By remaining silent, they avoid implicating themselves in a potential scandal.

Doug S said:
I suppose you have an excuse for that too, but you're like a flat earther who will concoct ever more wild fantasy explanations - which is what happens when you start with a conclusion and choose your "facts" to get you there.

It's not my final conclusion. It's a possible conclusion. Pointing out a possibility shouldn't be met with cries of "oh but you are wrong!".

poke01 · Thursday at 6:40 PM

igor_kavinski said:
Could be. At best, this could be the reason and the dev was too naive to think that people might blame him for being in cahoots with Apple if he released it early. Or he didn't know when Apple was gonna release M4. But for me (coz I'm the conspiracy theorist type), the simplest explanation is what seems to be the obvious one to me: Apple's dirty money made someone happy and eager to please.

I can prove this theory wrong with this. GB also added SVE support, something only CPUs from ARM support so far.

ARM also recently published a SME article in their blog site. Apple never uses Geekbench to present performance improvements so until proven otherwise I don’t think Apple payed Primate Labs.

Geekbench Blog

www.geekbench.com

Looking at the 6.1 release did ARM pay primate labs to add SVE support and did AMD pay them too for AVX512-FP16?
It’s funny how you think something that benefits Apple you immediately assume that Apple payed but when something benefits AMD or ARM, @TwistedAndy and you stay quiet and don’t bring it up.

Upgrade to Clang 16 Geekbench 6.1 is built with Clang 16 on all platforms. Geekbench 6.1 also improves the optimization switches used when building Geekbench.
Increase workload gap Geekbench 6.1 increases the workload gap (the pause between workloads) from two seconds to five seconds. The increased workload gap minimizes thermal throttling and reduces run-to-run variability on newer smartphones such as the Samsung Galaxy S23.
Introduce support for SVE instructionsGeekbench 6.1 includes SVE implementations of several image processing and machine learning functions.
Introduce support for AVX512-FP16 instructions Geekbench 6.1 includes AVX512-FP16 implementations of several image processing functions.
Introduce support for fixed-point mathGeekbench 6.1 introduces fixed-point implementations of several image processing functions. Geekbench uses fixed-point math to implement some image processing functions on systems without FP16 instructions.
Improve Multi-Core PerformanceGeekbench 6.1 improves the multi-core implementations of the Background Blur and Horizon Detection workloads, especially on high-end desktop processors such as 12- and 16-core AMD Ryzens, AMD Threadrippers, and Intel Xeons.

“Thanks to these changes, Geekbench 6.1 single-core scores are up to 5% higher, and multi-core scores are up to 10% higher than Geekbench 6.0 scores. As a result of these methodological differences, which have a non-trivial impact on scores, we recommend users not compare Geekbench 6.1 scores against Geekbench 6.0 scores.”

Since these improvements also improved AMD CPUs and ARM I guess it’s fine right but as soon Primate Labs increase Apple’s score with SME extension, it’s bad and Geekbench is corrupt.

igor_kavinski · Thursday at 10:04 PM

poke01 said:
Since these improvements also improved AMD CPUs and ARM I guess it’s fine right but as soon Primate Labs increase Apple’s score with SME extension, it’s bad and Geekbench is corrupt.

OK, you do make good points. So ARM is corrupting GB with their desire to make their phones look better.

okoroezenwa · Thursday at 11:37 PM

igor_kavinski said:
OK, you do make good points. So ARM is corrupting GB with their desire to make their phones look better.

Not AMD though?

poke01 · Friday at 12:19 AM

okoroezenwa said:
Not AMD though?

AMD is also favoured in Geekbench due to GB supporting a number of AVX512 extensions.

Question Geekbench 6 released and calibrated against Core i7-12700

Lifer

Member

Platinum Member

Diamond Member

Lifer

Member

Golden Member

Golden Member

Lifer

Member

Lifer

Golden Member

Golden Member

Golden Member

Golden Member

Platinum Member

Platinum Member

Lifer

Lifer

Lifer

Platinum Member

Lifer

Golden Member

Lifer

Member

Golden Member