Post your Ryzen Blender Demo Scores! (AMD clarifies Blender Benchmark Confusion, Run @ 150 Samples)

Page 9 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Abwx

Lifer
Apr 2, 2011
11,167
3,862
136
Basically AMD must have been lying their rear ends off as far as throughput (40% my rear). Each Zen core is indeed pushing between 2.5 and 2.7 K10 cores worth on blender... and one Piledriver core at same clock is only worth around .85 of a K10 core (at least on Cinebench).

Apparently a Zen core throughput is 2.94x the one of a Piledriver core at equal Hz, and this with FP32 wich is the precision for Blender and Cinebench 11.5.

It should be quite interesting to see how it will fare with FP64 computation since the ratio with Piledriver is at 3.44x, those ratios are the ones that were available at Sisoftware, and so far for FP32 it has been accurate for Blender.
 
Reactions: rvborgh

Valantar

Golden Member
Aug 26, 2014
1,792
508
136
A clean run at 150 samples on my C2Q @3.52GHz resulted in 2:45.75. A theoretical 16-thread Core 2 chip at the same clocks would then be 41.44 seconds. At 3.4GHz would then be 42.9 seconds. My CPU under full load uses 88W according to HWmonitor - so a 16 core version would be 352W. Zen with 16 threads is 36s, all in 95W (although momentary platform power during the test was ... 160W? My PC under CPU stress (OCCT) hovers around 190W. That adds up to roughly 450W platform power for my theoretical 16-thread Core 2, which still underperforms Zen by 20%. And half of those threads are virtual, of course.

No matter the talk of stagnation in the CPU space, I'd say this is pretty good progress in 8 years.

A theoretical performance increase of just 20% doesn't sound too great. Then again, my chip is clocked way faster than any that existed back then. Also, 4 cores were as high as you got back then. So what we're seeing is a 20% increase in raw IPC including simultaneous threads, while consuming ~1/4 the power, while packing 2x the cores and 4x the threads into the chip. Not to mention new instruction sets and various capabilities that have been added.

I'll be very happy to upgrade to Zen when the time comes.



edit: I ran the test on my A8-7600-based HTPC as well, which ended up at 4:36.15 for 150 samples. The chip is 3.1-3.7GHz (and power limited), but clocks fluctuated between 3.3 and 3.6 during the test. I'd guesstimate an average of ~3.45 or so. That's a very, very solid IPC increase on the part of Zen.
 
Last edited:

bystander36

Diamond Member
Apr 1, 2013
5,154
132
106
A clean run at 150 samples on my C2Q @3.52GHz resulted in 2:45.75. A theoretical 16-thread Core 2 chip at the same clocks would then be 41.44 seconds. At 3.4GHz would then be 42.9 seconds. My CPU under full load uses 88W according to HWmonitor - so a 16 core version would be 352W. Zen with 16 threads is 36s, all in 95W (although momentary platform power during the test was ... 160W? My PC under CPU stress (OCCT) hovers around 190W. That adds up to roughly 450W platform power for my theoretical 16-thread Core 2, which still underperforms Zen by 20%. And half of those threads are virtual, of course.

No matter the talk of stagnation in the CPU space, I'd say this is pretty good progress in 8 years.

A theoretical performance increase of just 20% doesn't sound too great. Then again, my chip is clocked way faster than any that existed back then. Also, 4 cores were as high as you got back then. So what we're seeing is a 20% increase in raw IPC including simultaneous threads, while consuming ~1/4 the power, while packing 2x the cores and 4x the threads into the chip. Not to mention new instruction sets and various capabilities that have been added.

I'll be very happy to upgrade to Zen when the time comes.



edit: I ran the test on my A8-7600-based HTPC as well, which ended up at 4:36.15 for 150 samples. The chip is 3.1-3.7GHz (and power limited), but clocks fluctuated between 3.3 and 3.6 during the test. I'd guesstimate an average of ~3.45 or so. That's a very, very solid IPC increase on the part of Zen.
You are way overestimating what 8 cores and 16 threads would do for the Core 2 chip and what hyperthreading (SMT) would do for it. 8 cores and 16 threads is not 4 times faster than 4 cores.

Edit: Maybe someone could help him here with real world benefits of SMT on Intel platforms. In gaming, HT has at most 10-20% improvements. I think some synthetics may show as much as 30% performance gains from HT, but I have not been following this in quite a while.

Edit2: Using this post as a reference: Post your Ryzen Blender Demo Scores! (AMD clarifies Blender Benchmark Confusion, Run @ 150 Samples)
I found that SMT gave a 26.6% improvement in the scores. Perhaps you could use that as a reference to redo your calculation.

Edit3: Since I already went that far, I figured I'd finish the calculation for you. Using a theoretical improvement of 26.6% with SMT, if your C2Q was converted into a 8c/16t CPU, with similar gains as the post above gained, you'd get a score of 1 minute, 5.77 seconds.
 
Last edited:

TheELF

Diamond Member
Dec 22, 2012
3,993
744
126
Edit: Maybe someone could help him here with real world benefits of SMT on Intel platforms. In gaming, HT has at most 10-20% improvements. I think some synthetics may show as much as 30% performance gains from HT, but I have not been following this in quite a while.
In gaming you get up to 100% boost if there are enough threads.
In game benchmarks the boost is not that big because it's only rendering the graphics and rendering only benefits a little bit by HTT.
Also most tests of this sort are being done on i7 CPUs that are already fast enough even without HTT so the difference is very small.
 

witeken

Diamond Member
Dec 25, 2013
3,899
193
106
I have heard some people say that Zen has higher IPC than Broadwell-E because that chip has a 3.7GHz boost speed.

Someone has done the Blender benchmark at 150 samples with his Xeon E5-2695 V3, which is a 14-core Haswell 120W TDP with a base of 2.2GHz and a boost of 3.3GHz. During the benchmark, his CPU clocks down to 2.6GHz, so we don't know how much BDW-E throttled.

He got 29s, which is consistent with an 8-core BDW-E (5% IPC gain) doing about 37s at 3.4GHz.
 

SPBHM

Diamond Member
Sep 12, 2012
5,058
410
126
sandy bridge i5 3GHz = 2:33
CPU package power during the test is 44-45W

I think there are things that makes this CPU use a lot more power.
 
Last edited:

bystander36

Diamond Member
Apr 1, 2013
5,154
132
106
In gaming you get up to 100% boost if there are enough threads.
In game benchmarks the boost is not that big because it's only rendering the graphics and rendering only benefits a little bit by HTT.
Also most tests of this sort are being done on i7 CPUs that are already fast enough even without HTT so the difference is very small.
Of course that game requires 4 cores to meet the minimums. The i3 mimics it enough. That said, as shown in this thread, 26% is all HT gives a boost to performance.

HT does not normally give remotely 100% boosts, even if this one test did (it's not even clear if they disabled the logic cores in that test).
 

Valantar

Golden Member
Aug 26, 2014
1,792
508
136
Yeah, but he was comparing it to an 8c/16t CPU's score.
Yes, as that is what Zen is. Core 2 never had any form of SMT, and as such making that up would be utterly useless. My point was to show how even when compared to an extrapolated theoretical best-of-he-best chip from way back when there have been huge gains both in IPC and power. Of course a 16c16t Zen chip would grow this lead even more - but there is no such chip. And SMT is one of the innovations that has given us these huge efficiency gains. That Zen can, even with half the "real" cores of my theoretical Core2Sedecim, outperform it by 20% while consuming 1/4 of the power, that's what makes this impressive in my eyes. Of course it would utterly crush a theoretical Core2Octo with 25% or so added for SMT. But beating a "true" 16-core is far more impressive IMHO.
 

bystander36

Diamond Member
Apr 1, 2013
5,154
132
106
Yes, as that is what Zen is. Core 2 never had any form of SMT, and as such making that up would be utterly useless. My point was to show how even when compared to an extrapolated theoretical best-of-he-best chip from way back when there have been huge gains both in IPC and power. Of course a 16c16t Zen chip would grow this lead even more - but there is no such chip. And SMT is one of the innovations that has given us these huge efficiency gains. That Zen can, even with half the "real" cores of my theoretical Core2Sedecim, outperform it by 20% while consuming 1/4 of the power, that's what makes this impressive in my eyes. Of course it would utterly crush a theoretical Core2Octo with 25% or so added for SMT. But beating a "true" 16-core is far more impressive IMHO.
I find it extremely unfair to compare an even theoretical 16 core chip to an 8 core chip with SMT. That just doesn't compute at all as a reasonable comparison, if you are going to say that the 8 core chip is unimpressive for not being a ton faster.
 

Dygaza

Member
Oct 16, 2015
176
34
101
I find it extremely unfair to compare an even theoretical 16 core chip to an 8 core chip with SMT. That just doesn't compute at all as a reasonable comparison, if you are going to say that the 8 core chip is unimpressive for not being a ton faster.

Didn't he just said that he's very impressed that current 8C/16T cpu is so much faster than fictional 16C/16T cpu from older days? That's what I get from his text.
 

Valantar

Golden Member
Aug 26, 2014
1,792
508
136
I find it extremely unfair to compare an even theoretical 16 core chip to an 8 core chip with SMT. That just doesn't compute at all as a reasonable comparison, if you are going to say that the 8 core chip is unimpressive for not being a ton faster.
Sorry, when did I say that? I'm quite sure what I said was the exact opposite, as in that I'm very [bleep]ing impressed by Zen. So please, READ before you answer. Jeez.


And yes, I did say this:
A theoretical performance increase of just 20% doesn't sound too great.
Which was a rhetorical point to underscore that this is 8 cores (not 16!) with SMT (which, as everyone, their dog and their grandmothers have been pointing out adds ~25% perf) beating 16 "real" (theoretical) cores by 20%, all the while consuming 1/4 of the power. How is that not impressive? I think that's pretty damn impressive. That means, in essence, that an 8c8t Zen chip would tie 16 Core 2 cores. 16. That's twice as many! At speeds they never came close to in production. How is that not impressive? I'm sorry if my apparently excessively complex way of writing confuses you. Or, well, I'm really not.

To quote myself:
I'll be very happy to upgrade to Zen when the time comes.
Now can you please stop your straw man argumentation please?
 

daniel1926

Junior Member
Feb 18, 2015
23
1
11
2x E5-2670
192gb mem

Background processes using about 10 to 15% of CPUs and 25gb of mem

100: 18.72
150: 28.10
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,223
136
Since now we have a file to DL and instructions how to run Handbrake test, how about we post our results from Handbrake as well?
We can make a new topic if need be.

Anyway, my Handbrake results on 4690K i5 @ 4.3Ghz, DDR3 @ 2400Mhz 16GB, win10 pro x64 and following instructions summed up here by Dresdenboy and Car Merc:
https://twitter.com/Cat_Merc/status/810515806316560384

Encode time: 1 minute 40 seconds.
I am just about x2.2 times slower than Ryzen at the same clock and about ~2x slower than 6900K at the same clock.
 

Valantar

Golden Member
Aug 26, 2014
1,792
508
136
Encode time: 1 minute 40 seconds.
I am just about x2.2 times slower than Ryzen at the same clock and about ~2x slower than 6900K at the same clock.
At the same clock? You said your i5 runs at 4,3GHz. The Ryzen demo is at 3,4 (6900k is 3,2-3,7 IIRC). That's quite a difference.
 

Stipa

Junior Member
Dec 15, 2016
2
0
1
Since now we have a file to DL and instructions how to run Handbrake test, how about we post our results from Handbrake as well?
We can make a new topic if need be.

Anyway, my Handbrake results on 4690K i5 @ 4.3Ghz, DDR3 @ 2400Mhz 16GB, win10 pro x64 and following instructions summed up here by Dresdenboy and Car Merc:
https://twitter.com/Cat_Merc/status/810515806316560384

Encode time: 1 minute 40 seconds.
I am just about x2.2 times slower than Ryzen at the same clock and about ~2x slower than 6900K at the same clock.
i7-5820k encode 00:01:20
Stock CPU settings but it doesn't even turbo boost, stays at 3.3GHz. Same situation with blender.
 

Lebdnil

Junior Member
Jul 13, 2016
11
8
81
Handbrake 10.5 x64 downloaded from website, using "bunnyshit2.mp4" from Dresdenboy (I don't put that name on file).
Core i7-2630QM - 21.466452fps (169.6s). Again, start at 2.6GHz, go down to 2.4GHz and ends at 2.3GHz. Average 2.35GHz, 42W. Some creative math, divide by 2 (8 cores, 16 threads, lineal scale with cores) and scale by frequency (2.35/3.4) and you got... 58.6s or just about the same time than Broadwell-E in the live demo. Cough.

Core i5-6600 - 36.217285fps (100.6s). 3.63GHz, 40W max package power (39W average). Same creative math as before and... 54s, scaled to double cores (this time not taking into account HT scaling) and 3.4GHz lineal scale in frequency.
Results uploaded here: https://1drv.ms/i/s!AjqAFTn4xM0ZgbABxLObDPuwgWTTdw

And just for fun, Quicksync results. Sandy couldn't run (Sandy video block don't support 4k), but Skylake run very well. To use Quicksync, you go to video tab and change "h.264" to "h.264 Intel QSV". I tested with "best quality" preset (default) and "balanced".
Core i5-6600 driver 4552Beta, best quality - 88.712051fps (41.1s). Max package power 25W. Yeah, Quicksync is fast and power efficient.
Core i5-6600 driver 4552Beta, balanced quality - 131.819839fps (27.6s). Max package power 18W.

Btw, system specs:
Core i7-2630QM - Acer Aspire 5750G notebook, Win10 x64 up to date. 8Gb DDR3-1333 dualchannel, Intel driver 4229 (GT540M via Optimus, driver 364.57).
Core i5-6600 - Asrock Z170Extreme4, Win10 x64 up to date. 8Gb DDR4-2133, Intel driver 4552Beta (screen attached to it).
 

The Stilt

Golden Member
Dec 5, 2015
1,709
3,057
106
Didn't Lisa Su state that they used the "Ryzen intro video" ("the video you just saw") as the test media in those Handbrake tests?
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,223
136
At the same clock? You said your i5 runs at 4,3GHz. The Ryzen demo is at 3,4 (6900k is 3,2-3,7 IIRC). That's quite a difference.
I have already calculated the clock difference in there . So all things accounted for, my i5 (Haswell 4C/4T) is about ~2.2x slower than Ryzen 8C/16T if both were running at the same clock.


Didn't Lisa Su state that they used the "Ryzen intro video" ("the video you just saw") as the test media in those Handbrake tests?

Yeah she did say that but I think I'd go with James guy from AMD team who knew the technical details. Lisa was probably carried away in the speech. The results I got in Handbrake roughly match what 6900K @ stock would get.
 

Valantar

Golden Member
Aug 26, 2014
1,792
508
136
I have already calculated the clock difference in there . So all things accounted for, my i5 (Haswell 4C/4T) is about ~2.2x slower than Ryzen 8C/16T if both were running at the same clock.
Ah. That wasn't clear from the way you wrote it
 
Reactions: inf64

inf64

Diamond Member
Mar 11, 2011
3,764
4,223
136
Ah. That wasn't clear from the way you wrote it

Are you sure?

"I am just about x2.2 times slower than Ryzen at the same clock and about ~2x slower than 6900K at the same clock."
I bolded out the important parts in my original post.
 

Valantar

Golden Member
Aug 26, 2014
1,792
508
136
Are you sure?

"I am just about x2.2 times slower than Ryzen at the same clock and about ~2x slower than 6900K at the same clock."
I bolded out the important parts in my original post.
Yep, there's nothing there indicating that you've done calculations to equalize things. Those statements can just as easily mean that you've simply misread the spec
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |