Surreptitious Samsung Benchmark Shenanigans

Page 2 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

jfpoole

Member
Jul 11, 2013
43
0
66
Actually, if the benchmark is at all properly written and the DVFS as regulated by the operating system (and user settings) is at all reasonable then this is exactly what should happen when you launch a benchmark. If you're running something that pegs the CPU it should be given as much CPU time as possible within the operating limits of the device.

I've heard (but have been unable to verify) that these shenanigans also raise the thermal limits of the processor. So not only will all CPU cores start out at peak frequency when running benchmarks, they'll run at peak frequency for longer (before throttling down due to thermal limits) when running benchmarks.
 

SlimFan

Member
Jul 5, 2013
91
11
71
http://anandtech.com/comments/7378/samsung-galaxy-note-101-2014-edition-review/331118

Anand Lal Shimpi - Tuesday, October 01, 2013 - link
Yep, I believe so (need to confirm with Brian since I don't have the device in front of me) - all modern non GPe Samsung devices (as well as those from other OEMs) do the same manual DVFS setting upon benchmark detect unfortunately.

(emphasis mine).
From this, it's not clear that this is normal Samsung business practices, but is rather more widespread. Maybe it's specific SOC vendors? I think most of the products that have been found to do this so far are Qualcomm. Maybe it's a Qualcomm problem more than a Samsung problem?
 

sontin

Diamond Member
Sep 12, 2011
3,273
149
106
It's wrong. No other OEM is ignoring the TDP cap in certain applications.
nVidia caps Tegra 4 in Shield to 1,48GHz with 4 threads in every Apps and you can't force an ignore to the TDP limit.

/edit: And Samsung is doing the same with their own SoCs. So it's more a normall Samsung behaviour.
 
Last edited:

Chiropteran

Diamond Member
Nov 14, 2003
9,811
110
106
Sounds a lot like turbo boost, which everyone seemed to be perfectly okay with.

Seems to me that Samsung noticed that several benchmarks don't properly load the CPU, so it underclocks occasionally during execution. Maybe through lack of time, or maybe to simply save money, they made a hard-coded list of apps to keep the CPU at full speed during, instead of creating a complex algorithm that works better than the current.

I'm not familiar with such things, but the default behavior of the CPU, when it's not being forced to run at full speed like this, is that something controlled by android software, or the processor itself, or something else (bios)?


Personally, I think this is a positive change and should be the default behavior for all devices. In the past, I recall a time when I ran a benchmark on my Note 2 and got vastly different results each run. Like a 500 score, a 1730 score, an 800 score, etc. Not consistent at all. I'm guessing that was because of the cores under-clocking themselves because they were not properly detecting the load the benchmark was trying to provide. It sounds like this is a direct response to that sort of issue. If it results in more consistent benchmark scores, it sounds great.
 
Last edited:

SlimFan

Member
Jul 5, 2013
91
11
71
This isn't "turbo boost." This isn't "throttling." This is cheating.

The only applications that are allowed to use the higher frequencies are the benchmarks. You browser, your game, app launch, etc... anything you would do on your phone to actually use it rather than test "how fast it is" does not get the extra frequency.

This is the equivalent of giving the review sites devices that run at a 20% higher frequency, seeing glorious reviews about how fast the device is, and then selling that device to the masses at the lower frequency.

The frequency boosting that is being done on PCs is generally applicable for anything the CPU is doing, as long as there are no thermal issues. This means your bowser and your game can use it. You can argue about "how long" or "how much" it can use it, but it's still available. Some people tell you that you need to turn it off in order to see how fast your PC "really" is, since it's hard to make it stop kicking in. That's a great thing for the user, because it's showing that it's generally useful.

An optimization (including one that raises the frequency) that is only available for specific use cases that only the OEM can specify (in this case, Samsung and things they have identified as benchmarks) is clearly a cheat and has absolutely zero value to the person buying the device, other than to mislead them about how "much faster" it will be than their previous device.

There is no similarity to ICC here. In the case of ICC, at least that's a compiler that will attempt to apply the optimizations generically. It's possible that the vast majority of the code doesn't get the optimization applied since the compiler can't figure out how to apply it. I don't think anyone has ever accused ICC of having a "white list" of source files that it handles differently, maps to hand coded assembly, etc. ICC is potentially shady, but not cheating.

There are certainly real-world examples of ICC making code faster that solves real world problems, unlike this Samsumg optimization.
 

SlimFan

Member
Jul 5, 2013
91
11
71
It's wrong. No other OEM is ignoring the TDP cap in certain applications.
nVidia caps Tegra 4 in Shield to 1,48GHz with 4 threads in every Apps and you can't force an ignore to the TDP limit.

/edit: And Samsung is doing the same with their own SoCs. So it's more a normall Samsung behaviour.

I ask about Qualcomm, and you respond with Nvidia information.
 

monstercameron

Diamond Member
Feb 12, 2013
3,818
1
0
So you think it's okay to get the best performance only in benchmarks but not in every normal workload?! :\
i cant see the cheating bit, all they did was the equivalent of putting your windows laptop in high performance mode so your laptop wont speedstep or c&q down the clocks when idle. the processor is capable, it isnt operating beyond spec nor were there any software hacks to the benchmark itself.
also where is the evidence that the processor cant operate at these performance levels when under stress after x amount of time.
 

SlimFan

Member
Jul 5, 2013
91
11
71
SPersonally, I think this is a positive change and should be the default behavior for all devices. In the past, I recall a time when I ran a benchmark on my Note 2 and got vastly different results each run. Like a 500 score, a 1730 score, an 800 score, etc. Not consistent at all. I'm guessing that was because of the cores under-clocking themselves because they were not properly detecting the load the benchmark was trying to provide. It sounds like this is a direct response to that sort of issue. If it results in more consistent benchmark scores, it sounds great.

If the benchmarks aren't consistent, why do you think your browsing performance is consistent? Your app launch performance? If the frequency control of the device can't keep a repetitive benchmark running at the high frequency, then it has no prayer of doing so in any real world use cases.

Maybe the 500/800 scores were because the device got too hot, and the 1730 score was when it was cold? Doesn't that represent the "real" performance of your device and what you'll see day in and day out?
 

SlimFan

Member
Jul 5, 2013
91
11
71
i cant see the cheating bit, all they did was the equivalent of putting your windows laptop in high performance mode so your laptop wont speedstep or c&q down the clocks when idle. the processor is capable, it isnt operating beyond spec nor were there any software hacks to the benchmark itself.
also where is the evidence that the processor cant operate at these performance levels when under stress after x amount of time.

Except you can put your windows laptop in "high performance mode" whenever you want. You can't edit their white-list, and the white-list doesn't have anything in it that would make using you device actually faster.

It's like windows only allowing you to turn on "high performance mode" when you're on the screen asking it to calculate the windows experience score. Completely misleading, and completely useless.
 

monstercameron

Diamond Member
Feb 12, 2013
3,818
1
0
This isn't "turbo boost." This isn't "throttling." This is cheating.

The only applications that are allowed to use the higher frequencies are the benchmarks. You browser, your game, app launch, etc... anything you would do on your phone to actually use it rather than test "how fast it is" does not get the extra frequency.

This is the equivalent of giving the review sites devices that run at a 20% higher frequency, seeing glorious reviews about how fast the device is, and then selling that device to the masses at the lower frequency.

The frequency boosting that is being done on PCs is generally applicable for anything the CPU is doing, as long as there are no thermal issues. This means your bowser and your game can use it. You can argue about "how long" or "how much" it can use it, but it's still available. Some people tell you that you need to turn it off in order to see how fast your PC "really" is, since it's hard to make it stop kicking in. That's a great thing for the user, because it's showing that it's generally useful.

An optimization (including one that raises the frequency) that is only available for specific use cases that only the OEM can specify (in this case, Samsung and things they have identified as benchmarks) is clearly a cheat and has absolutely zero value to the person buying the device, other than to mislead them about how "much faster" it will be than their previous device.

There is no similarity to ICC here. In the case of ICC, at least that's a compiler that will attempt to apply the optimizations generically. It's possible that the vast majority of the code doesn't get the optimization applied since the compiler can't figure out how to apply it. I don't think anyone has ever accused ICC of having a "white list" of source files that it handles differently, maps to hand coded assembly, etc. ICC is potentially shady, but not cheating.

There are certainly real-world examples of ICC making code faster that solves real world problems, unlike this Samsumg optimization.

I am not sure where you read about higher freqs but the article clearly shows that during benchmark/high performance mode it doesnt overclock the soc, it just doesnt downclock the soc. It booster doesnt allow any of the cores to go to lower power states.
you also seem to be claiming that samsung doesnt allow other software to have access to the fastest clockspd, which isnt true.
 

monstercameron

Diamond Member
Feb 12, 2013
3,818
1
0
Except you can put your windows laptop in "high performance mode" whenever you want. You can't edit their white-list, and the white-list doesn't have anything in it that would make using you device actually faster.

It's like windows only allowing you to turn on "high performance mode" when you're on the screen asking it to calculate the windows experience score. Completely misleading, and completely useless.

WEI itself is kinda useless but the idea is, in an ideal environment, to get the best score as a comparison point between different machines. so getting your WEI score in a less than ideal condition such as having the cpu throttle during testing would throw off such scores, which would throw off the comparison.
 

SlimFan

Member
Jul 5, 2013
91
11
71
I am not sure where you read about higher freqs but the article clearly shows that during benchmark/high performance mode it doesnt overclock the soc, it just doesnt downclock the soc. It booster doesnt allow any of the cores to go to lower power states.
you also seem to be claiming that samsung doesnt allow other software to have access to the fastest clockspd, which isnt true.

They've definitely refused to expose the GPU frequencies in the past:

http://anandtech.com/show/7187/looking-at-cpugpu-benchmark-optimizations-galaxy-s-4

Note that the CPU behavior is different from what we saw on the GPU side however. These CPU frequencies are available for all apps to use, they are simply forced to maximum (and in the case of Snapdragon, all cores are plugged in) in the case of these benchmarks. The 532MHz max GPU frequency on the other hand is only available to these specific benchmarks.

But you're right, this says that the higher frequencies are potentially available for other applications. But is that any better? When you run something like geekbench, it does the same repetetive behavior (all high CPU demanding, high IPC functions) very consistently. If the built-in frequency governor can't keep it running at high frequency, then real apps have no shot.

This is essentially Samsung deciding internally that their frequency governor is so broken, they have to ignore it when people are benchmarking their products.
 

SlimFan

Member
Jul 5, 2013
91
11
71
WEI itself is kinda useless but the idea is, in an ideal environment, to get the best score as a comparison point between different machines. so getting your WEI score in a less than ideal condition such as having the cpu throttle during testing would throw off such scores, which would throw off the comparison.

Sadly WEI is just as useful as all of these Android benchmarks.

If I have two different OEMs, one that actually puts a fan/heat sink on their CPU, and one who doesn't, but the one who doesn't locks the CPU frequency high during benchmarks, is it okay for them to get the same score?

We've gone past the days where the OEM's attention to thermal details can be ignored. You can take the exact same CPU and put it into two different boxes with different thermal solutions, and end up with different performance.

This is much, much worse in the phone space. There's a definite cost to 8mm thick devices vs 10mm thick devices. You should expect a lower thermal limit in the thinner device, and therefore lower overall performance.

Do you really think benchmark overrides to create consistent scores that override these effects is where we want to be? I mean, what's the point of the benchmark then? It's apparently not to say how "fast" the device will feel to the user. The more this goes on, the further away the benchmarks get from being useful or real world.

I want to know how fast my phone will fetch a web page (anandtech, of course! :biggrin page across the LTE or WIFI connection and present the page to me for each device.

My performance is going to be a function of the network, the temperature in the room, the amount of work my device has been doing (its temperature), the quality of the network drivers, the quality of the DVFS governor, the resolution of the display, etc. Having benchmarks that override ALL of this makes the benchmarks useless.
 

Chiropteran

Diamond Member
Nov 14, 2003
9,811
110
106
Doesn't that represent the "real" performance of your device and what you'll see day in and day out?

No, not really, because day in and day out I am not running benchmarks.

This is standard practice with PC benchmarking- you turn off all extraneous software, you make sure CPU is fully clocked, turn off power saving functions, etc.

Why is it suddenly cheating when the same practices are used on a phone?
 

SlimFan

Member
Jul 5, 2013
91
11
71
No, not really, because day in and day out I am not running benchmarks.

This is standard practice with PC benchmarking- you turn off all extraneous software, you make sure CPU is fully clocked, turn off power saving functions, etc.

Why is it suddenly cheating when the same practices are used on a phone?

Day in, day out, you aren't running benchmarks. But hopefully you're using your device?

The reason phones are different than PCs is because they are universally thermally limited, and each form factor is sufficiently different to have different behaviors.

You wouldn't expect a PC to throttle while surfing the web. Phones, on the other hand, are quite likely to. Once you turn on the 3G/LTE chip, plus GPS for the adds to track where you are, the screen so you can read, there's not always enough budget left over to run the CPU at max frequency.

And when you run benchmarks on PCs, you don't disable turbo, and you don't disable thermal throttling. If you go back to jfpoole's post:

I've heard (but have been unable to verify) that these shenanigans also raise the thermal limits of the processor. So not only will all CPU cores start out at peak frequency when running benchmarks, they'll run at peak frequency for longer (before throttling down due to thermal limits) when running benchmarks.

If Intel said "hey, I know I'm running Cinebench" and then disabled thermal throttling and allowed 4 cores to turbo up to the single turbo point, would that be considered cheating within PC benchmarks? Wasn't "quak3" considered cheating? Of course. This is the same, is it not?
 

Nothingness

Platinum Member
Jul 3, 2013
2,757
1,405
136
My performance is going to be a function of the network, the temperature in the room, the amount of work my device has been doing (its temperature), the quality of the network drivers, the quality of the DVFS governor, the resolution of the display, etc. Having benchmarks that override ALL of this makes the benchmarks useless.
Apparently browser speed isn't increased by Samsung trick. So browser benchmark scores as found by AT can be trusted and they are good: http://www.anandtech.com/show/7376/samsung-galaxy-note-3-review/4
 

Chiropteran

Diamond Member
Nov 14, 2003
9,811
110
106
The reason phones are different than PCs is because they are universally thermally limited, and each form factor is sufficiently different to have different behaviors.

You wouldn't expect a PC to throttle while surfing the web.

A PC in sleep mode or with several cores parked will indeed be less responsive at visiting a website than an otherwise identical PC running at full speed.

Phones, on the other hand, are quite likely to. Once you turn on the 3G/LTE chip, plus GPS for the adds to track where you are, the screen so you can read, there's not always enough budget left over to run the CPU at max frequency.

Are you trying to argue that a PC isn't slowed down by running multiple pieces of software? I beg to differ. I work on computers every day, half of them have multiple browser bars, 5 different programs that auto-start on boot, antivirus, chat programs, drop box and/or the multitude of clones, etc. They run like crap.

On the other hand, if you follow normal bench-marking practices, install a fresh drive with clean OS and none of that crap, those exact same PCs will run significantly better. This has been considered okay for years, why is it suddenly cheating when a phone is the hardware?

And when you run benchmarks on PCs, you don't disable turbo, and you don't disable thermal throttling. If you go back to jfpoole's post:

If you go to his post, you see some speculation. We don't know for a fact that thermal throttling is disabled.

However, even if it is, this is nothing new!

Here is Intel's take:

Note: Intel Turbo Boost Technology 2.0 allows the processor to operate at a power level that is higher than its rated upper power limit (TDP)

If Intel said "hey, I know I'm running Cinebench" and then disabled thermal throttling and allowed 4 cores to turbo up to the single turbo point, would that be considered cheating within PC benchmarks?

I hope not, because that is exactly what turbo boost does.

Wasn't "quak3" considered cheating? Of course. This is the same, is it not?

In that instances, image quality was sacrificed for a performance gain. In this case, a performance gain is obtained at no cost. Vastly different.
 

Exophase

Diamond Member
Apr 19, 2012
4,439
9
81
I've heard (but have been unable to verify) that these shenanigans also raise the thermal limits of the processor. So not only will all CPU cores start out at peak frequency when running benchmarks, they'll run at peak frequency for longer (before throttling down due to thermal limits) when running benchmarks.

That figures.

I still don't really understand why this is whitelisted for benchmarks and not part of the standard frequency/power management model, except that their programming capabilities are just lacking. Speaking of which, I heard that the Exynos based Note 3 still uses cluster migration despite (allegedly) fixing the hardware issue preventing better techniques. So being incapable of doing decent power management isn't that far off..

This is also inline with other manufacturers setting heavily conservative thermal limits that don't really make sense. I don't think they've really got this figured out, and it could really be an underlying Android problem.

I've seen the argument that it's only allowed for benchmarks because they know they'll be rarely ran, while if they allowed it for any little burst task it'd eventually wear out the processor from thermal stress. But I don't buy that, in a device like this discomfort to the user will be an issue long before part stress. If running the benchmarks doesn't make your hands too warm then letting it run within these thermal limits for other stuff should be okay.

At any rate, a whitelist that changes anything just for benchmarks is cheating, period. This should not be on phones.
 

SlimFan

Member
Jul 5, 2013
91
11
71
A PC in sleep mode or with several cores parked will indeed be less responsive at visiting a website than an otherwise identical PC running at full speed.

Not in any meaningful way, with modern power management features.

Are you trying to argue that a PC isn't slowed down by running multiple pieces of software? I beg to differ. I work on computers every day, half of them have multiple browser bars, 5 different programs that auto-start on boot, antivirus, chat programs, drop box and/or the multitude of clones, etc. They run like crap.

No, I'm not saying anything about running other software. I was talking about running a single software stack to accomplish a single task, and the impact of that on the thermals of the device.

On the other hand, if you follow normal bench-marking practices, install a fresh drive with clean OS and none of that crap, those exact same PCs will run significantly better. This has been considered okay for years, why is it suddenly cheating when a phone is the hardware?

This isn't a fresh install getting rid of spyware.

If you go to his post, you see some speculation. We don't know for a fact that thermal throttling is disabled.

However, even if it is, this is nothing new!

Here is Intel's take:

Note: Intel Turbo Boost Technology 2.0 allows the processor to operate at a power level that is higher than its rated upper power limit (TDP)



I hope not, because that is exactly what turbo boost does.

No, you're confusing your terms here. I talked about thermal throttling, and you brought up TDP.

1. Thermal design point : This is supposed to be how much thermal energy needs to be removed from the device over a period of time. "My CPU will produce X watts of heat output per second, so you need to remove X watts of heat per second." That sort of thing.

2. Thermal throttling: This is what happens when the skin of the device gets too hot (Tskin), or, if the transistors (junction temperature, Tj) get too hot.

Intel is saying "we may violate the TDP, but if we violate Tskin or Tj, we will stop what we're doing."

So they will opportunistically raise the frequency for all software, until the thermal throttling comes in.

The only people who should ever care about TDP are the OEMs who are building products. This gives them an idea of how much they need to spend on cooling. The result of that work is the only thing that customers should care about. Given a particular cooling solution, you can afford a certain amount of energy production over time. With a poorer cooling solution, you throttle more. With a fancier one, you throttle less. The TDP is a design spec for OEMs, not really a product spec.

It gets even trickier with integrated graphics. What do I specify as my TDP? Is it running a GFX virus + CPU virus? What about the poor guy who adds a discreet graphics card? For those people who don't use integrated graphics, it's likely that with a thermal solution that matches the TDP (say a heatsink and a fan), they will be able to turbo 100% of the time.

That's very, very different from what's happening here. At least from my perspective.

In that instances, image quality was sacrificed for a performance gain. In this case, a performance gain is obtained at no cost. Vastly different.

The performance gained here is only gained in the benchmarks. At least quake ran faster.
 

sontin

Diamond Member
Sep 12, 2011
3,273
149
106
So, nVidia is the only honest OEM/device maker who is not cheating. :hmm:

/Oh and Google...
 

bullzz

Senior member
Jul 12, 2013
405
23
81
if nvidia was cheating on benchmarks even after putting a fan in the shield, that wud be hilarious
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |