Post your Ryzen Blender Demo Scores! (AMD clarifies Blender Benchmark Confusion, Run @ 150 Samples)

Page 5 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

BeepBeep2

Member
Dec 14, 2016
86
44
61
I ran my 5960x at 3.375 core/ 3.375 cache and 3.5 core/ 3.5 cache at 150 samples for three runs each.
This is a brand new Windows 10 install with zero background programs. I even waited for all background processes to stop so CPU usage was at 0%.
3.375: 38.81 / 38.15 / 38.15
3.500: 37.41 / 36.68 / 36.61
This still doesn't quite seem right compared to AMD's results, especially given my cache is technically overclocked at both speeds and I have fast memory (3000 MHz).
If the 6900k was running at 3.5 during the test, it sorta seems reasonable it might score 0.8 seconds faster. But this would make it seem AMD is claiming to have higher IPC than even Haswell, at least in this test.
Of course they are, Broadwell-E has 5-10% higher IPC than Haswell-E.
I'm pretty sure we've come to the conclusion at this point that the 25 second press demo was at 100 samples, and the livestreamed 36 second New Horizons demo was at 150.
 

R0H1T

Platinum Member
Jan 12, 2013
2,582
162
106
I don't see any other 10 core 6950X CPUs in that list. I ran the benchmark a few more times. The lowest score I get is 24.48 seconds at 150 samples.
I meant the scores don't correspond to what AMD_james said, then again he contradicted himself with the spreadsheet (unconfirmed) scores.
 

Valantar

Golden Member
Aug 26, 2014
1,792
508
136
Of course they are, Broadwell-E has 5-10% higher IPC than Haswell-E.
I'm pretty sure we've come to the conclusion at this point that the 25 second press demo was at 100 samples, and the livestreamed 36 second New Horizons demo was at 150.
It seems pretty cut and dry from the demos that AMD is aiming for/claiming (in a vague sense, as no claims are very specific) IPC parity with BDW-E. Matching/beating a 3,2-3,7GHz chip with a 3,4GHz chip pretty much makes that a given. Unless the i7 was significantly thermally limited (which seems very unlikely, given that it would be an extremely unpredictable way of messing up a demo), it would run at maximum all-core turbo (although I don't know the exact frequency for that, I'd wager 3,4-3,5GHz).

It'll be very, very interesting to see how this matches up in ST performance, core/thread scaling (as in how the "simultaneous" threads affect performance), and so on. AMD, can you please give AnandTech an exclusive to post an architecture deep-dive review in, say, mid January? That would be nice. (of course, with the recent review cadence of AT, that would mean that it would actually be posted around June, but nvm.)
 

moonbogg

Lifer
Jan 8, 2011
10,637
3,095
136
So, I did some math (check it) after changing my clockspeeds to 3.4 across all cores and reran the test at 150 samples.

Zen's 8 cores@3.4 = 36 seconds
36/8 = 4.5 seconds of work per core
Remove the work of two cores and you get 45 seconds for a hypothetical 6 core zen

Sandy's 6 cores@3.4 = 64 seconds

45s vs 64s = Zen is 30% faster per core than Sandy in this test at the same clocks. If Zen is consistently 30% faster than Sandy, then Zen will be a knockout and I might actually buy one!
 
Last edited:
Reactions: Hoye0017

kraatus77

Senior member
Aug 26, 2015
266
59
101
It's not how it works. because there's SMT involved.

if you get same improvements 6c vs 6c when smt is disabled on both. then maybe yes 57% improvement for zen on single core.

unfortunately we don't have results without smt. so we can't really compare it like that.
 

lixlax

Member
Nov 6, 2014
184
158
116
So, I did some math (check it) after changing my clockspeeds to 3.4 across all cores and reran the test at 150 samples.

Zen's 8 cores@3.4 = 36 seconds
36/8 = 4.5 seconds of work per core
Remove the work of two cores and you get 45 seconds for a hypothetical 6 core zen

Sandy's 6 cores@3.4 = 104 seconds

45s vs 104s = Zen is 57% faster per core than Sandy? I know I MUST have screwed something way the hell up. So what was it?
Maybe it was 1m and 04s = 64s? Happens to me sometimes.
 

moonbogg

Lifer
Jan 8, 2011
10,637
3,095
136
It's not how it works. because there's SMT involved.

if you get same improvements 6c vs 6c when smt is disabled on both. then maybe yes 57% improvement for zen on single core.

unfortunately we don't have results without smt. so we can't really compare it like that.

It works the same if you just factor out a 2 before dividing. Same result.
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,222
136
Of course they are, Broadwell-E has 5-10% higher IPC than Haswell-E.
I'm pretty sure we've come to the conclusion at this point that the 25 second press demo was at 100 samples, and the livestreamed 36 second New Horizons demo was at 150.

Bold above is the key.
 

lolfail9001

Golden Member
Sep 9, 2016
1,056
353
96
So, I did some math (check it) after changing my clockspeeds to 3.4 across all cores and reran the test at 150 samples.

Zen's 8 cores@3.4 = 36 seconds
36/8 = 4.5 seconds of work per core
Remove the work of two cores and you get 45 seconds for a hypothetical 6 core zen

Sandy's 6 cores@3.4 = 64 seconds

45s vs 64s = Zen is 30% faster per core than Sandy in this test at the same clocks. If Zen is consistently 30% faster than Sandy, then Zen will be a knockout and I might actually buy one!
Blender, just like about every real life load, does not scale linearly with core/thread count. So you need a result for 2670 to make any meaningful comparisons. And even then, that's a stretch.
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,222
136
Yes that's what I meant, except the 36s score is supposed to be with 200 samples (according to the spreadsheet linked by AMD_james) & then there is a new file with samples prefixed at 150?

www.reddit.com/r/Amd/comments/5idz87/ryzen_blender_benchmark_comparison_spreadsheet/

It is not that hard to figure out.
We can solve the mystery easily by using AdamK47 result above with his 6950x.
He scores 16.95s at 100 samples and 24.92 at 150 samples using a 10C/20T Broadwell-E at 4.12Ghz.
Since blender scales great with cores and clocks, 8C/16T broadwell-e running at multicore turbo boost clock of 3.5Ghz(I assume that chip AMD used managed to boost to this clock during tests) should score:
100 sample run -> 16.95s x 10/8 x 4.12/3.5=24.9s
150 sample run -> 24.92s x 10/8 x 4.12/3.5=36.66s

Scores above are perfectly in line with what AMD showed as both runs were completed within a fraction of a second from the projected run times I got in the formulas above.
 

R0H1T

Platinum Member
Jan 12, 2013
2,582
162
106
It is not that hard to figure out.
We can solve the mystery easily by using AdamK47 result above with his 6950x.
He scores 16.95s at 100 samples and 24.92 at 150 samples using a 10C/20T Broadwell-E at 4.12Ghz.
Since blender scales great with cores and clocks, 8C/16T broadwell-e running at multicore turbo boost clock of 3.5Ghz(I assume that chip AMD used managed to boost to this clock during tests) should score:
100 sample run -> 16.95s x 10/8 x 4.12/3.5=24.9s
150 sample run -> 24.92s x 10/8 x 4.12/3.5=36.66s

Scores above are perfectly in line with what AMD showed as both runs were completed within a fraction of a second from the projected run times I got in the formulas above.
I've seen a number of blender results, IIRC in this very thread, scale linearly with core count & clock speeds so extrapolating the score from 100 -> 150 samples is fairly easy. The spreadsheet, in the reddit thread, however shows the SR7 score as 36s (unconfirmed according to AMD_james) with 200 samples & then he goes on to say that they'll upload a new file with samples set to 150.

At this point in time I do believe AMD's scores but there shouldn't have been this level of confusion, especially coming from an AMD employee.
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,222
136
I've seen a number of blender results, IIRC in this very thread, scale linearly with core count & clock speeds so extrapolating the score from 100 -> 150 samples is fairly easy. The spreadsheet, in the reddit thread, however shows the SR7 score as 36s (unconfirmed according to AMD_james) with 200 samples & then he goes on to say that they'll upload a new file with samples set to 150.

At this point in time I do believe AMD's scores but there shouldn't have been this level of confusion, especially coming from an AMD employee.
I agree that there was unnecessary confusion but it is obvious now that the scores they got are in line with reality and nothing was "rigged".
 

rvborgh

Member
Apr 16, 2014
195
94
101
Throughput wise when i compare the results with my 48 core Opteron setup... it looks like each Zen core (with SMT) is pushing the same as 2.72 K10 cores worth. Basically making this perform like almost 22 K10 cores at the same Ghz. If that proportionate relationship holds we are talking about a chip that does about 19.5 on Cinebench 11.5 and around 1716 in Cinebench R15. Doggone impressive if you ask me.

i'm not a blender guy, but wouldn't this benchmark be stressing the floating point units more than integer? What says folks here?
 
Last edited:

Abwx

Lifer
Apr 2, 2011
11,167
3,862
136
If that proportionate relationship holds we are talking about a chip that does about 19.5 on Cinebench 11.5. Doggone impressive if you ask me.

At 3.4GHz it should score 17.3 according to the Sisoftware submission...
 

Nothingness

Platinum Member
Jul 3, 2013
2,751
1,397
136
Blender, just like about every real life load, does not scale linearly with core/thread count. So you need a result for 2670 to make any meaningful comparisons. And even then, that's a stretch.
If there's a thing that scales very well with cores, it's rendering, and blender likely is no exception.

EDIT: OTOH for SMT the speedup most likely depends a lot on the way SMT is implemented.
 
Last edited:
Reactions: Phynaz

majord

Senior member
Jul 26, 2015
444
533
136
I think the Sample sizes were chosen to keep the benchmark to a reasonable length for the given presentation.. To match the dialog and not keep people waiting too long.

I still believe 2.77 was used for the press demo and maybe 2.78a for the other as claimed , not sure why the difference.

I Queried James / Robert in that Reddit thread , but they insist it was 2.78a for both.. I'm not convinced. Even though it's hard to read whether it definatly says 2.77 in the video, there's definitely no 'a' on the end of the version No.
 

rvborgh

Member
Apr 16, 2014
195
94
101
17.3 would still be doggone impressive! My conservative best guess would be that single thread CB 11.5 would be around 1.7 to 1.8 then. Seems like they've caught up to Intel in one swoop. Running at 4 Ghz (if they can get the process to cooperate)... CB 11.5 single thread of over 2!
 

inf64

Diamond Member
Mar 11, 2011
3,764
4,222
136
I think the Sample sizes were chosen to keep the benchmark to a reasonable length for the given presentation.. To match the dialog and not keep people waiting too long.

I still believe 2.77 was used for the press demo and maybe 2.78a for the other as claimed , not sure why the difference.

I Queried James / Robert in that Reddit thread , but they insist it was 2.78a for both.. I'm not convinced. Even though it's hard to read whether it definatly says 2.77 in the video, there's definitely no 'a' on the end of the version No.
I think there is basically no difference in results between those two versions, somebody has already tested it in this thread (fraction of a point worth of difference). The sample size is the key difference between the live stream run and the press demo run.
 

R0H1T

Platinum Member
Jan 12, 2013
2,582
162
106
I think the Sample sizes were chosen to keep the benchmark to a reasonable length for the given presentation.. To match the dialog and not keep people waiting too long.

I still believe 2.77 was used for the press demo and maybe 2.78a for the other as claimed , not sure why the difference.

I Queried James / Robert in that Reddit thread , but they insist it was 2.78a for both.. I'm not convinced. Even though it's hard to read whether it definatly says 2.77 in the video, there's definitely no 'a' on the end of the version No.
I saw the pics on sweclockers as well & it's definitely version 2.77 rendering the logo at least once, I guess we'll just have to wait for AMD's official word on this.

 

R0H1T

Platinum Member
Jan 12, 2013
2,582
162
106
I think there is basically no difference in results between those two versions, somebody has already tested it in this thread (fraction of a point worth of difference). The sample size is the key difference between the live stream run and the press demo run.
Yes not much between them, but again unnecessary confusion it seems.
 

majord

Senior member
Jul 26, 2015
444
533
136
Sorry these results are incomplete (see blanks) I ran out of time before having to leave for couple of days.. so I've just compiled what I have.

Excavator vs Skylake



Lots to think about there.. Skylake murders Excavator in throughput on the older, official builds as used in demo (2module, 4 thread, - vs - 2 core, 4 thread) meaning Zen would also , on flipside Exv does OK on AVX2 build - in throughput at least, since Skylake is not getting anywhere near the SMT yield.

So yes, these builds certainly are throughput orientated - but what could be causing such dismal performance for Construction cores on the builds? They still get huge Yield from CMT.
 
Last edited:

itsmydamnation

Platinum Member
Feb 6, 2011
2,863
3,417
136
its probably memory latency bound per thread ( thus good scaling) if you lower your memory latency ( up clock, lower cas etc) do you good perf improvements.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |