Benchmark your computer with Handbrake

Page 5 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

monstercameron

Diamond Member
Feb 12, 2013
3,818
1
0
Genuine Intel® CPU T1300 @ 1.66GHz ubuntu 12.10

x264 [info]: using cpu capabilities: MMX2 SSE Cache64
x264 [info]: profile Main, level 3.0
[01:53:24] reader: done. 1 scr changes
[01:53:38] work: average encoding speed for job is 5.182079 fps
 

mrle

Member
Mar 27, 2009
33
0
0
Intel Atom N270 (single core with HT) @ 1.60 GHz
Arch Linux i686
HandBrake rev5474 (2013062499)

x264 [info]: using cpu capabilities: MMX2 SSE2Fast SSSE3 Cache64 SlowCTZ SlowAtom
x264 [info]: profile Main, level 3.0
[09:39:56] reader: done. 1 scr changes
[09:40:07] work: average encoding speed for job is 7.469536 fps
 

mrle

Member
Mar 27, 2009
33
0
0

JoeRambo

Golden Member
Jun 13, 2013
1,814
2,105
136
"Custom" compilation won't help much, at the heart of handbreak there is x264 that is famous for incredible hand tuned assembly code. Since x264 is where most time is being spent, I don't think even ICC compiled handbreak can achieve substantially faster results.
 

SPBHM

Diamond Member
Sep 12, 2012
5,058
410
126
"Custom" compilation won't help much, at the heart of handbreak there is x264 that is famous for incredible hand tuned assembly code. Since x264 is where most time is being spent, I don't think even ICC compiled handbreak can achieve substantially faster results.

well, it looks like just going for the last version or the "nightly build" gives a 5% gain with all the rest unchanged...



my result should be included on the list
http://forums.anandtech.com/showpost.php?p=35201481&postcount=28

I tested with the version from the OP and Nightly build, I don't know which version should be used to be included, but both are there.
 

blotto

Senior member
Feb 11, 2006
219
4
81
2500k @ 4.6ghz
8GB 1600mhz

0.9.9.5530
199.58 Avg FPS

Nightly
211.73 Avg FPS

Nightly with OpenCL (GTX670)
217.19
 
Last edited:

Edgemeal

Senior member
Dec 8, 2007
211
57
101
UPDATE:
I removed the results because some used a nightly build and it really didn't make much sense for comparison anymore.

See the OP here.
 
Last edited:

Edgemeal

Senior member
Dec 8, 2007
211
57
101
AMD Phenom II X2 550 @ stock (3.1GHz) DDR3-1333 9-9-9-24-33-1T
HandBrake 0.9.9.5530 - 64bit Version
OS: Microsoft Windows NT 6.2.9200.0 (Windows 8.1 Pro Preview)

[14:36:40] starting job
x264 [info]: using cpu capabilities: MMX2 SSE2Fast SSEMisalign LZCNT
x264 [info]: profile Main, level 3.0
[14:41:18] work: average encoding speed for job is 51.558769 fps
 
Last edited:

JoeRambo

Golden Member
Jun 13, 2013
1,814
2,105
136
well, it looks like just going for the last version or the "nightly build" gives a 5% gain with all the rest unchanged...
.


It does, actually it gives even more epic increase for Haswell, but that is due to the fact that nighly build of Handbrake is running newest version of x264 that supports AVX2.

if you look @ http://x264.nl/x264/changelog.txt, you will see that all AVX2 and plenty of other optimizations were done after 0.9.9 handbreak (with x264 r2273) was released.

Ofc "native" compilation can still help, but low hanging fruit is already eaten long ago.
 

gbeirn

Senior member
Sep 27, 2005
450
13
81
HandBrake svn5618 (Nightly Build) - 64bit Version
OS: Microsoft Windows NT 6.1.7601 Service Pack 1
CPU: Intel(R) Core(TM) i7 CPU 960 @ 3.20GHz
Ram: 12278 MB, Screen: 1920x1080

x264 [info]: using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2
x264 [info]: profile Main, level 3.0
[18:23:14] reader: done. 1 scr changes
[18:23:14] work: average encoding speed for job is 170.469330 fps
 

MrPickins

Diamond Member
May 24, 2003
9,022
599
126
Reran using the latest stable version:

HandBrake 0.9.9.5530 - 64bit Version
OS: Microsoft Windows NT 6.1.7601 Service Pack 1
CPU: Intel(R) Core(TM) i5-4670K CPU @ 3.40GHz (really 4.2GHz)
Ram: 8079 MB DDR31600

x264 [info]: using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX AVX2 FMA3 LZCNT BMI1 BMI2
work: average encoding speed for job is 232.463257 fps

Previously (using Nightly build 5616):


using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX AVX2 FMA3 LZCNT BMI2
[14:01:24] work: average encoding speed for job is 253.857376 fps

Nightly build is definitely faster.
 
Last edited:

Edgemeal

Senior member
Dec 8, 2007
211
57
101
Reran using the latest stable version:

Nightly build is definitely faster.

I didn't pay any attention to what versions people were using, some posted using released version and others used night builds!

Hmmm looks like you gained about 9% on a night build! :sneaky:
 

avatar82

Junior Member
Sep 13, 2010
22
0
0
HandBrake svn5618 (Nightly Build) - 64bit Version
OS: Microsoft Windows NT 6.2.9200.0

vintage 2009 D0 i7-920 @ 4.2Ghz
ancient OCZ 3x2=6GB 192bit channel @ 1600Mhz CAS 888 1T
Windows 8-64bit

x264 [info]: using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2
x264 [info]: profile Main, level 3.0
[23:02:38] reader: done. 1 scr changes
[23:02:38] work: average encoding speed for job is 230.800461 fps
 
Last edited:

ZOXXO

Golden Member
Feb 1, 2003
1,281
0
76
AMD Opteron 175
Windows XP

x264 [info]: using cpu capabilities: MMX2 SSE2Slow SlowCTZ
x264 [info]: profile Main, level 3.0
[00:48:42] reader: done. 1 scr changes
[00:48:46] work: average encoding speed for job is 22.679539 fps
 

bbhaag

Diamond Member
Jul 2, 2011
6,754
2,128
146
Bummer my version of Handbrake doesn't have the Android preset. I was looking forward to posting up my results.
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,222
136
Update, downloaded latest Nightly build:

750K @ 4.3Ghz (2 PD modules/4 threads),8GB DDR3 @ 1600Mhz
windows 7 x64

x264 [info]: using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX XOP FMA4 FMA3 SSEMisalign LZCNT BMI1
[12:21:37] work: average encoding speed for job is 126.983131 fps
 

monstercameron

Diamond Member
Feb 12, 2013
3,818
1
0
Update, downloaded latest Nightly build:

750K @ 4.3Ghz (2 PD modules/4 threads),8GB DDR3 @ 1600Mhz
windows 7 x64

x264 [info]: using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX XOP FMA4 FMA3 SSEMisalign LZCNT BMI1
[12:21:37] work: average encoding speed for job is 126.983131 fps

interesting, my 965BE is only clocked at 4GHz, doesnt use [SSSE3 SSE4.2 AVX XOP FMA4 FMA3 BMI1] has slower ram yet, I beat your 750k [although I was using ubuntu].

anyone know why this is? is it just operating environment? or cpu itself?
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,222
136
interesting, my 965BE is only clocked at 4GHz, doesnt use [SSSE3 SSE4.2 AVX XOP FMA4 FMA3 BMI1] has slower ram yet, I beat your 750k [although I was using ubuntu].

anyone know why this is? is it just operating environment? or cpu itself?

Your 965BE has 4 FP units, my PD has 2 that are dual threaded (still doesn't make up for the fact there is 2x the disparity in unit count) . I think its impressive that 2 dual threaded FP units can come so close to 4 dedicated ones. I'd say if workload could potentially benefit from FMA3/4 that my lowly PD would just crush that 965 with 50+% delta (as it would be able to *potentially* execute 2x ops per cycle with FMA). x264 is AVX tuned so in this case peak throughput of PD core remains unchanged (well maybe +5% due to SSE->AVX).
 

SPBHM

Diamond Member
Sep 12, 2012
5,058
410
126
Your 965BE has 4 FP units, my PD has 2 that are dual threaded (still doesn't make up for the fact there is 2x the disparity in unit count) . I think its impressive that 2 dual threaded FP units can come so close to 4 dedicated ones. I'd say if workload could potentially benefit from FMA3/4 that my lowly PD would just crush that 965 with 50+% delta (as it would be able to *potentially* execute 2x ops per cycle with FMA). x264 is AVX tuned so in this case peak throughput of PD core remains unchanged (well maybe +5% due to SSE->AVX).

AMD calls a dual module CPU "Quad Core", and dual module CPUs replaced "native quad cores CPUs" in price (AII X4, PII X4 955)

still, is no surprise that PII and C2Q can do well in this.
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,222
136
I know what they call it, I'm just stating the facts of what's inside of the design . The fact is it's 4 dedicated FP units (128bit add+128bit mul each) Vs 2 shared units that can do 2 threads each ( each having 2 multipurpose pipes that either do muls,adds or FMAs). So effectively in legacy code , with X4 Deneb vs 2M PD you have : 4 mul+4 add 128bit pipes Vs 4 128bit mull/add pipes. The performance in the end is somewhat similar (still Deneb is ~15-20% faster) which tells you that shared SMT unit in PD is much more efficient than the dedicated one in Deneb (~50% more ops/cycle roughly in legacy code and much more in FMA). This is due to the fact that fully loading the dedicated FP unit in "normal" code (non Linpack) is very hard thing and PD takes advantage of this.
 
Last edited:

lamedude

Golden Member
Jan 14, 2011
1,206
10
81
Video encoding is mostly (entirely?) integer though. My guess its the front end that's the bottleneck as a quad core Ph2 can decode 12 instructions compared a 2 module FX's 8.
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,222
136
Video encoding is mostly (entirely?) integer though. My guess its the front end that's the bottleneck as a quad core Ph2 can decode 12 instructions compared a 2 module FX's 8.
There is no bottleneck as FX8xxx just crushes every other AMD chip ( OCed X6 included). You can see in the table that 1st gen BD based FX6100 @ 4.1Ghz scores ~170pts vs ~184pts (average ) of X6 @ 4Ghz. That's basically ~10% difference while having 2x less dedicated FP hardware-an amazing showing by BD/PD .

PS Anything SSE (packed integer) is still done on FlexFP unit.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |