AMD Ryzen (Summit Ridge) Benchmarks Thread (use new thread)

Page 253 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.
Status
Not open for further replies.

looncraz

Senior member
Sep 12, 2011
722
1,651
136
The Buddhists are probably pleased more than anything else.

I'd say they're more likely to be content with it than pleased. I mean... they're Buddhists.

Hmm . . . that could work. L3 is per CCX, right? What if they're using something like a modernized version of HT Assist to mitigate inter-CCX communication problems? We might not detect it unless using something with a sufficiently-large data set.

L3 is per CCX, yes.

I know some of how the inter-CCX communications might work, based on the die shots, the CCX design, and some comments by AMD personnel.

I see two possibilities - the first is that there is NO inter-CCX communication and it is all done through the memory controller... this would be the slowest method and would be a very bad idea.. it would result in horrible multi-threaded scaling issues... which we simply aren't seeing at all with the leaks...

The second is an external clone of the internal L3 tags. Each CCX would have an external L3 tag set to which it can write, but not read. When a CCX misses on its own cache it sends a request out on its high-speed, low latency, external command bus to a specific memory controller. The memory controller would scan the L3 tag cache which clones the other CCX's tags, letting it know if the data is on-die, but just in the other L3...

If the tag is in the other L3, it will make a request for that data from the other CCX and that data will be tagged for the other CCX & core and sent on the wide, high speed, inter-CCX data bus (which resides in an upper metal layer).

As the servicing L3 would not need to do an L3 lookup, you save a cycle or two, which pays for some of the round-trip costs. The extra cost would only be a couple nanoseconds of latency - compared to 20+ for going to main memory - with double the bandwidth (100GB/s vs 50GB/s).

In addition to all of that, there needs to be some method of synchronizing shared data reliably. I'm very surprised by the layout of the CCXes - I would have thought the L3 caches butting against each other would have made for easier and faster communications, but it seems each CCX has its own preferred memory channel which may have an attached global L3 tag clone set.

It would be efficient enough, but would cost quite a bit of area outside of the CCX...

It will be REALLY interesting to see what route AMD took (I expect we will get some solid details on this with the launch-day reviews).
 
Reactions: Dresdenboy

looncraz

Senior member
Sep 12, 2011
722
1,651
136
Can you get a dual socket board for $300?

Oh, and the 2620v4 has a base frequency of 2.1GHz. Kinda changes the perspective your putting on things.

Yeah, paying slightly more for a little more than half the performance
 

looncraz

Senior member
Sep 12, 2011
722
1,651
136

Ugh, almost answered in French.

XFR is just a switch that allows Precision Boost's top turbo bin to be ignored. There's nothing really technical about it to say.

Code:
while ( (boostFlags & BOOST_ENABLED) == BOOST_ENABLED ) {
	float turboMaxMult = gTurboMaxMult;
	float maxCorePower = gMaxCorePower;
	float maxTDP = gMaxTDP;
	float maxVolts = gMaxVolts;

	// Extended Frequency Range (XFR)
	if ( (boostFlags & XFR_ENABLED) == XFR_ENABLED ) {
		XFRConfigRef xfr;

		turboMaxMult = xfr.maxMultiplier;
		maxCorePower = xfr.maxCorePower;
		maxTDP = xfr.maxTDP;
		maxVolts = xfr.maxVolts;		
	}

	// ... Precision Boost logic follows ...
}
 

imported_jjj

Senior member
Feb 14, 2009
660
430
136
The problem is as long as engine and games have a DX11 code path the game and engine inherently are limited by DX11 paradigms which is 1 thread for draw calls. Tacking on dx12 on an engine based probably on dx9 (or even older) won't give great benefits especially not scaling well to an 8-core. So the whole tool-chain from engine to game has to be made from scratch. That will take years and then you can upgrade again.

Besides that, some games even in 2017 are 100% ST dependent, prime exampel being starcraft 2. So If you play that alot, 7700k is the way to go.

.

Starcraft 2 is not something that requires higher FPS, you get to over 200FPS with a GTX 1050.
The new CPUs do favor Win 10 so the transition to DX12 will likely be faster than some expect and it's not about when all games transition, it's about when it starts.Once the transitions to games utilizing more cores starts, it begins to matter and some would argue that it has started.
Old game are a fav line for some and that's fine if you consider Crysis 3 as old but lets not be concerned about old games where you get 100-200FPS and even large differences in percentage would have no relevant impact on experience.
In high end it will matter first as ,in some games folks could not get best perf anymore out of their 2000$ GPU with quads.
Going form 2 cores to 4 wasn't that different, it's just that today folks have been conditioned by marketing and poor reviews to want quad cores high clocks.
 

realibrad

Lifer
Oct 18, 2013
12,337
898
126
Media "guru" bullshit.


By-and-by large, the clueless will buy:
- a Ryzen because their mate who knows something about computers said its the one to get* // there is someone in the shop who actually knows a bit and cares about getting their customers' the best bang for buck*.
- an Intel because that is what they've always done // that is what the even more gormless idiot in the shop said to get // 'cos they've heard of Intel.

The educated will buy:
- the CPU that best meets their needs and can properly evaluate their needs.

None will buy based on the colour of the box.


*Assuming Zen delivers the performance we are kinda expecting at this point at the prices we hope for.

Media guru bullshit eh. Edward Bernays
Media "guru" bullshit.


By-and-by large, the clueless will buy:
- a Ryzen because their mate who knows something about computers said its the one to get* // there is someone in the shop who actually knows a bit and cares about getting their customers' the best bang for buck*.
- an Intel because that is what they've always done // that is what the even more gormless idiot in the shop said to get // 'cos they've heard of Intel.

The educated will buy:
- the CPU that best meets their needs and can properly evaluate their needs.

None will buy based on the colour of the box.


*Assuming Zen delivers the performance we are kinda expecting at this point at the prices we hope for.

You are free to believe what you want, but marketing is not BS. There is a reason companies spend money on marketing, and why some marketing is more productive than another. In this case, color might represent maybe .1% in terms of sales, but it does build a brand image that people internalize. When people think of your brand, you want them to think positive things, and not mediocre or bad things.
 
Reactions: Ryzenenthusiast

Magic Hate Ball

Senior member
Feb 2, 2017
290
250
96
I'd say they're more likely to be content with it than pleased. I mean... they're Buddhists.



L3 is per CCX, yes.

I know some of how the inter-CCX communications might work, based on the die shots, the CCX design, and some comments by AMD personnel.

I see two possibilities - the first is that there is NO inter-CCX communication and it is all done through the memory controller... this would be the slowest method and would be a very bad idea.. it would result in horrible multi-threaded scaling issues... which we simply aren't seeing at all with the leaks...

The second is an external clone of the internal L3 tags. Each CCX would have an external L3 tag set to which it can write, but not read. When a CCX misses on its own cache it sends a request out on its high-speed, low latency, external command bus to a specific memory controller. The memory controller would scan the L3 tag cache which clones the other CCX's tags, letting it know if the data is on-die, but just in the other L3...

If the tag is in the other L3, it will make a request for that data from the other CCX and that data will be tagged for the other CCX & core and sent on the wide, high speed, inter-CCX data bus (which resides in an upper metal layer).

As the servicing L3 would not need to do an L3 lookup, you save a cycle or two, which pays for some of the round-trip costs. The extra cost would only be a couple nanoseconds of latency - compared to 20+ for going to main memory - with double the bandwidth (100GB/s vs 50GB/s).

In addition to all of that, there needs to be some method of synchronizing shared data reliably. I'm very surprised by the layout of the CCXes - I would have thought the L3 caches butting against each other would have made for easier and faster communications, but it seems each CCX has its own preferred memory channel which may have an attached global L3 tag clone set.

It would be efficient enough, but would cost quite a bit of area outside of the CCX...

It will be REALLY interesting to see what route AMD took (I expect we will get some solid details on this with the launch-day reviews).

I wonder how much the Infinity Fabric interconnect can affect inter-CCX communication. It would make sense for them to develop it to handle such communications to further scale into the 32-core server beasts they're talking about later on, why invent the wheel twice?
 

looncraz

Senior member
Sep 12, 2011
722
1,651
136
I wonder how much the Infinity Fabric interconnect can affect inter-CCX communication. It would make sense for them to develop it to handle such communications to further scale into the 32-core server beasts they're talking about later on, why invent the wheel twice?

Infinity fabric is the command bus and data bus in my post

It just extends off the chip. In theory, it should allow only minimal extra penalty for going to the other die over going to the adjacent CCX on the same die.

Jim Keller was basically the king of high-performance NUMA back in the day... only makes sense he's advanced since then and included that in Zen.

What's more interesting is that I think this same tech is also now in Vega ;-)
 

Magic Hate Ball

Senior member
Feb 2, 2017
290
250
96
Infinity fabric is the command bus and data bus in my post

It just extends off the chip. In theory, it should allow only minimal extra penalty for going to the other die over going to the adjacent CCX on the same die.

Jim Keller was basically the king of high-performance NUMA back in the day... only makes sense he's advanced since then and included that in Zen.

What's more interesting is that I think this same tech is also now in Vega ;-)
Yeah, it being in Vega is probably half the reason why we're not seeing Raven Ridge until after Vega launch.

If they're using a Vega based design for the APU, then the IF being present will potentially be a huge performance boon.
 

Atari2600

Golden Member
Nov 22, 2016
1,409
1,655
136
You are free to believe what you want, but marketing is not BS.

I never said marketing was BS. Its extremely important.

But the colour of the box of a retail CPU is sooo irrelevant its unreal.


Q: Who buys CPUs in retail boxes?
A: Someone building a system.

Q: Where do the majority get their CPUs?
A: Online. Where you don't even see the box.

Q: Does someone who is building their own system (or a system for others) tend to know what they are doing or get advice from those that do?
A: Yes.

Q: Will that person then be walking into the store knowing what is compatible with their current system, or walking in with a few options in mind which will have been considered beforehand?
A: Very likely. Too many possible incompatibilities and permutations to risk a buy without some forethought.

Q: Will the clueless who may be enticed by a "oohhh shiny" buy a CPU in a fancy retail box?
A: No, they are infinitely more likely to be buying a finished box instead.
 
Reactions: Space Tyrant

IllogicalGlory

Senior member
Mar 8, 2013
934
346
136
AFAIK we've got a 250-page thread of spin with nothing more than a set of benchmarks from AMD that can't yet be verified by the review sites and anyone who chooses to. If I'm correct, no amount of guesswork, wishful thinking, logic and rationalisation changes that. "Leaks" aren't verifiable either so they have even less credibility than AMD's own benchmarks.

I sincerely hope that AMD has produced a viable set of competitors to Intel's current range (as it would shake up the market), even if it wasn't at all price points, however until we have some publicly verifiable benchmarks, speculation is simply a waste of time, and criticising speculation that doesn't agree with your own opinions an even bigger waste of time.
The poster I quoted stated that even if Ryzen has Haswell IPC, it's not impressive because it's on the level of a 4-year old chip, rather than dealing with performance metrics, the only thing that matters, now it's age. It's textbook spin.

There are numerous leaks, multiple impressive demos from AMD and a review from an accountable tech magazine, with the prices and SKUs all but confirmed. Every one of them looks like good news. I'm confident that Ryzen is going to be great at this point, and I'll accept however much crow is necessary if it's not.

I didn't feel the same for say, the RX 480, either because the leaks and demos were unimpressive (or at the very least, not in line with the hype we saw at the time). What we got was pretty much in line with what was demoed and leaked, too. Everything except the lack of power efficiency, which took me by surprise.

I don't think (w.r.t. Ryzen) it's rationalization or wishful thinking at this point. I think most of that is going on with the side that predicted Bulldozer 2.0.
 
Last edited:

looncraz

Senior member
Sep 12, 2011
722
1,651
136
Yeah, it being in Vega is probably half the reason why we're not seeing Raven Ridge until after Vega launch.

If they're using a Vega based design for the APU, then the IF being present will potentially be a huge performance boon.

I think the formula for Raven Ridge is as follows:

SR - CCX + GPU = RR

Just drop the GPU right in place of a CCX. The same external synchronization can be used and everything - the CCXes, for example, don't need to 'know' about each other.

But a 44mm^2 GPU would be epic... maybe a nice high bandwidth cache with 6 CUs could fit in that area at 14nm... and could operate at 1Ghz+... would certainly perform better than 8 CUs and no HBC.
 
Reactions: Space Tyrant

looncraz

Senior member
Sep 12, 2011
722
1,651
136
I didn't feel the same for say, the RX 480, either because the leaks and demos were unimpressive (or at the very least, not in line with the hype we saw at the time). What we got was pretty much in line with what was demoed and leaked, too. Everything except the lack of power efficiency, which took me by surprise.

It's actually very efficient. AMD overvolted the efficiency right out of it.

At stock voltage my GPU pulls 110W and throttles. Stock is at 1.15V... I can use 1.05V completely safely - zero issues to date (it can actually go to 1035V, but I like headroom). It now uses only 80W and stays solid at 1288MHz (XFX RX 480 GTR 8GB stock turbo).

In fact, at 1.05V, I can hit a little over 1.3Ghz. At 1.15V I can get nearly to 1.4GHz, where I hit a wall even 1.3V can't get me past.

I would be surprised if Zen behaved any differently - and is any more relatively efficient... should be a tuner's dream
 

Dresdenboy

Golden Member
Jul 28, 2003
1,730
554
136
citavia.blog.de
The second is an external clone of the internal L3 tags. Each CCX would have an external L3 tag set to which it can write, but not read. When a CCX misses on its own cache it sends a request out on its high-speed, low latency, external command bus to a specific memory controller. The memory controller would scan the L3 tag cache which clones the other CCX's tags, letting it know if the data is on-die, but just in the other L3...
Did you catch the term "directory based" (like good old HT Assist) somewhere? It's been mentioned on several LinkedIn profiles.

And CCXs + DMCs + FCH might all simply be connected via the NOC mesh. How could it look like with 4 remaining links going off the die?

Vega certainly has it, at least as Greenland on Snowy Owl:
 

Magic Hate Ball

Senior member
Feb 2, 2017
290
250
96
It's actually very efficient. AMD overvolted the efficiency right out of it.

At stock voltage my GPU pulls 110W and throttles. Stock is at 1.15V... I can use 1.05V completely safely - zero issues to date (it can actually go to 1035V, but I like headroom). It now uses only 80W and stays solid at 1288MHz (XFX RX 480 GTR 8GB stock turbo).

In fact, at 1.05V, I can hit a little over 1.3Ghz. At 1.15V I can get nearly to 1.4GHz, where I hit a wall even 1.3V can't get me past.

I would be surprised if Zen behaved any differently - and is any more relatively efficient... should be a tuner's dream

This exactly, I have an RX470 I run at stock speeds I can actually run at 983mv or so (weird rounding by Wattman) solidly, power usage goes down dramatically.

I'm actually fairly hopeful that the initial learning process of 14nm for GloFo were figured out with the Polaris chips and that they've taken what they've learned while using the process to improve work on the Ryzen production.
 
Reactions: looncraz

realibrad

Lifer
Oct 18, 2013
12,337
898
126
I never said marketing was BS. Its extremely important.

But the colour of the box of a retail CPU is sooo irrelevant its unreal.


Q: Who buys CPUs in retail boxes?
A: Someone building a system.

Q: Where do the majority get their CPUs?
A: Online. Where you don't even see the box.

Q: Does someone who is building their own system (or a system for others) tend to know what they are doing or get advice from those that do?
A: Yes.

Q: Will that person then be walking into the store knowing what is compatible with their current system, or walking in with a few options in mind which will have been considered beforehand?
A: Very likely. Too many possible incompatibilities and permutations to risk a buy without some forethought.

Q: Will the clueless who may be enticed by a "oohhh shiny" buy a CPU in a fancy retail box?
A: No, they are infinitely more likely to be buying a finished box instead.

Again, you might feel that way, but its brand management, which happens to be something AMD needs a lot.

There is a reason why Coca Cola has a bunch of different versions of its logo on its collateral. To you, the logo is a logo. To marketing people, the logo is an emotional trigger.

I would assume you have no experience in CPG marketing. You personally may not be influenced by it, but many are. To build a brand, you need marketing that gives people more than cost/benefit of the individual product they are looking for. There is a reason Apple is Apple fyi. You might not like it, and you might think its dumb, but it works.
 
Reactions: Ryzenenthusiast

Glo.

Diamond Member
Apr 25, 2015
5,763
4,667
136
Did you catch the term "directory based" (like good old HT Assist) somewhere? It's been mentioned on several LinkedIn profiles.

And CCXs + DMCs + FCH might all simply be connected via the NOC mesh. How could it look like with 4 remaining links going off the die?

Vega certainly has it, at least as Greenland on Snowy Owl:
Every GPU that is slated for 2017 release, not re-release is going to have features from Greenland GPU.
 
Reactions: Drazick

lolfail9001

Golden Member
Sep 9, 2016
1,056
353
96
It would be efficient enough, but would cost quite a bit of area outside of the CCX...
Well, don't we sort of know that Zen's die is quite a bit larger than just 2 CCXs?
It's actually very efficient. AMD overvolted the efficiency right out of it.
Making a statements based on the binned top tier rx480 does not help your cause, though. What we really know is that at least first batches of rx480 had ridiculous variance, ranging from bearable to madly throttling.
XFR is just a switch that allows Precision Boost's top turbo bin to be ignored. There's nothing really technical about it to say.
Okay, that is the viable explanation, i am dumping my pessimist version.
But a 44mm^2 GPU would be epic...
Should fit about 10CUs and 8 RoPs... In line with last RR rumors, too.
 
Reactions: CatMerc

looncraz

Senior member
Sep 12, 2011
722
1,651
136
Again, you might feel that way, but its brand management, which happens to be something AMD needs a lot.

There is a reason why Coca Cola has a bunch of different versions of its logo on its collateral. To you, the logo is a logo. To marketing people, the logo is an emotional trigger.

I would assume you have no experience in CPG marketing. You personally may not be influenced by it, but many are. To build a brand, you need marketing that gives people more than cost/benefit of the individual product they are looking for. There is a reason Apple is Apple fyi. You might not like it, and you might think its dumb, but it works.

AMD is underplaying their relationship to Ryzen so that Ryzen can stand on its own, marketing-wise.

I would also like to say that the box, sitting on a shelf next Intel CPUs, will draw your attention through the use of negative space. And a little gloss can go a long way on top of that
 
Reactions: CatMerc

realibrad

Lifer
Oct 18, 2013
12,337
898
126
AMD is underplaying their relationship to Ryzen so that Ryzen can stand on its own, marketing-wise.

I would also like to say that the box, sitting on a shelf next Intel CPUs, will draw your attention through the use of negative space. And a little gloss can go a long way on top of that

Its letting Ryzen become known, and then will associate AMD to it. Its not unusual for products to drive the image of a company. Java is a great example of this. Java was a big deal for Sun Microsystems to the point where they decided to change their name to their product because it was so well known. I would be a very smart move for AMD to let Ryzen pull up AMD.

AMD wont change their name like Sun did, but same principle.
 
Status
Not open for further replies.
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |