Anandtech 6850 comparison is up

Page 2 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

nyker96

Diamond Member
Apr 19, 2005
5,630
2
81
OP you might add in title, OC comparison. since it's really about overclocking performances.

I must say most impressive OC result from 6850, once OCed can get to 6870 results. excellent.
 

WelshBloke

Lifer
Jan 12, 2005
32,428
10,553
136
Good article, but no OC NVIDIA cards there?

In 1 article where AT review stock cards and don't even approach OC of that product at all you compare them to OCed competition cards.

In 1 article where AT review OC cards and goes an OC cards some more, AT sticks with NVIDIA stock cards.

Go figure.


Yeah this.

I could see (kinda) why people were upset at the OC cards in the first article, but to leave them out of this one just seems daft.
 

Petey!

Senior member
May 28, 2010
250
0
0
Yeah seems like the 6850 will be a good OC'er the way the 5850 was.

But like everyone else is saying, i wanna see a review with both the 6850 and 460 1GB Oc'd, and I wanna see that the proper IQ setting is used for the AMD drivers.
 

Kuzi

Senior member
Sep 16, 2007
572
0
0
So you can buy a 6850 for less than $190 and once OCed you get ~GTX470 performance. Sweet deal
 

blastingcap

Diamond Member
Sep 16, 2010
6,654
5
76
?
The 768MB owns its price segment, and there's a 1GB on Newegg at $197 with $30 MIR and free HAWX 2.
6850 looks to be nicely priced and a nice little overclocker, but it's Nvidia that's been putting on the pressure since release with the GTX 460.

I'm sure NV dropped the prices for GTX460s and 470s overnight out of the kindness of their hearts in late October.
 

HurleyBird

Platinum Member
Apr 22, 2003
2,791
1,512
136
Asking what settings the card was running at is a fair question, not sure why anyone would claim Keyes is trolling....

Performance seems very similar (but perhaps slightly slower) than a heavily overclocked GTX 460. Of course HD 6850 is a newer product than GTX 460, so it might be able to clock better with time.

I'm very surprised that no one has commented on Anand's memory clock findings. I don't think anyone was expecting an 850/1150 6850 to beat a stock 950/1050 6870. That's *insane*. Barts is between three to four times more sensitive to memory bandwidth than it is to compute power. The implications are mind boggling -- for example, it's now obvious that there is a lot more to Barts than simply being a better balanced Cypress (Barts actually appears to be the least balanced ATI GPU since R600), and it's also quite easy to imagine a hypothetical 512-bit Barts having GTX480/580 level performance. In terms of what this means for Cayman, if Barts is this memory starved on 256-bit, then Cayman will be completely hamstrung on 256-bit if the same principles apply to it's tweaked architecture. On the other hand, if Cayman is 512-bit performance could be in a whole other world.

I can still hardly believe Anand's results. But wow, this could change everything.
 
Last edited:

Termie

Diamond Member
Aug 17, 2005
7,949
48
91
www.techbuyersguru.com
I'm very surprised that no one has commented on Anand's memory clock findings. I don't think anyone was expecting an 850/1150 6850 to beat a stock 950/1050 6870. That's *insane*. Barts is between three to four times more sensitive to memory bandwidth than it is to compute power. The implications are mind boggling -- for example, it's now obvious that there is a lot more to Barts than simply being a better balanced Cypress (Barts actually appears to be the least balanced ATI GPU since R600), and it's also quite easy to imagine a hypothetical 512-bit Barts having GTX480/580 level performance. In terms of what this means for Cayman, if Barts is this memory starved on 256-bit, then Cayman will be completely hamstrung on 256-bit if the same principles apply to it's tweaked architecture. On the other hand, if Cayman is 512-bit performance could be in a whole other world.

I can still hardly believe Anand's results. But wow, this could change everything.

Interesting observation, but the 6850 at 850/1150 only beats the stock 6870 in ME2 and Civ5, so it's not universal. Also, note that the 6870 stock is 900, not 950. So, yes, in a sense, the 1150 memory clock on the 6850 comes close to making up for a 50Mhz clock deficit and fewer SMUs.

I agree that AMD went the wrong way cutting back on the memory controller of Barts. It's a shame that the 6850/6870 can't hit the 1200 memory clock that 5870s had as standard and most 5850s could attain through overclocking. I think it's the reason that a heavily overclocked 6870 almost always loses to a stock 5870 and only matches an overclocked 5850.

And I don't quite agree that the 256-bit Cayman "will be completely hamstrung" on a 256-bit bus. The problem is not the width, it's the quality. If Cayman's memory bus is at least as good as the 5870s (allowing ~1250-1300Mhz clocks on the memory), it probably won't be a bottleneck).
 
Last edited:

Arkadrel

Diamond Member
Oct 19, 2010
3,681
2
0
@HurleyBird,

Yeah the barts are appearntly memory bandwidth limited... at only 134.4 Gb/s.

The 6970 is rumored to be 256bit bus width and have up to 6Ghz memory speeds.
(6ghz / 4 (GDDR5) = 1500mhz).

256bit bus width x 1.500 / 8 bits pr byte x 4 = 192.0 Gb/s memory bandwidth.
192 Gb/s is 43% more memory bandwidth than a 6870 has.

The 580 is also rumored to have 192.0 Gb/s memory bandwidth (it uses a bigger bus, but slower memory speeds).
 

GaiaHunter

Diamond Member
Jul 13, 2008
3,682
329
126
I'm very surprised that no one has commented on Anand's memory clock findings. I don't think anyone was expecting an 850/1150 6850 to beat a stock 950/1050 6870. That's *insane*. Barts is between three to four times more sensitive to memory bandwidth than it is to compute power. The implications are mind boggling -- for example, it's now obvious that there is a lot more to Barts than simply being a better balanced Cypress (Barts actually appears to be the least balanced ATI GPU since R600), and it's also quite easy to imagine a hypothetical 512-bit Barts having GTX480/580 level performance. In terms of what this means for Cayman, if Barts is this memory starved on 256-bit, then Cayman will be completely hamstrung on 256-bit if the same principles apply to it's tweaked architecture. On the other hand, if Cayman is 512-bit performance could be in a whole other world.

I can still hardly believe Anand's results. But wow, this could change everything.

I think you've reached the wrong conclusion.

Cayman, as other people said will be use a more complex 256-bit memory controller (not every memory controller with the same width are created equal) with faster memory on top.

I think the more interesting conclusion is that the 6870 should still have some OC performance room despite the core only being able to OC 50-100MHz more.
 

HurleyBird

Platinum Member
Apr 22, 2003
2,791
1,512
136
Interesting observation, but the 6850 at 850/1150 only beats the stock 6870 in ME2 and Civ5, so it's not universal. Also, note that the 6870 stock is 900, not 950. So, yes, in a sense, the 1150 memory clock on the 6850 comes close to making up for a 50Mhz clock deficit and fewer SMUs.

Yeah, my bad on the clocks thing. Not sure what I was thinking on that. And you're right, there is some error on the Anandtech commentary where they said the 850/1150 6850 beats the 6870 in 3/5. In reality, it wins 2, ties 1, and loses 2. Seeing as the margins are all fairly close, it's reasonable to call them the same speed.

Also keep in mind that besides the 50Mhz deficit and fewer SPs 6870 also has proportionally fewer TMUs.

So roughly, in terms of compute (and texturing I suppose), 6870 has a:
(900/850) * (1120/960) = 1.235x advantage over our 850/1150 6850.

In terms of memory bandwidth our 6850 has a:
1150/1050 = 1.095x advantage over a stock 6870.

Putting that all together, and assuming both configurations perform equally as per the Anand results, Barts is more sensitive to memory clock than it is to core by:

0.235/0.095 = 2.47 = ~2.5x

So, yeah, I was a bit off due to idiotically saying 6870 has a 950MHz core clock, but 2.5x more sensitive to memory clock is still very significant. Especially when you take into account that previous ATI designs have traditionally been much more sensitive to core clock, and that Barts already has a much lower compute/bandwidth ratio than Cypress. You're still looking at a hypothetical 512-bit Barts competing at a GTX 480/580 level assuming that kind of scaling doesn't quickly fall off.

And I don't quite agree that the 256-bit Cayman "will be completely hamstrung" on a 256-bit bus. The problem is not the width, it's the quality. If Cayman's memory bus is at least as good as the 5870s (allowing ~1250-1300Mhz clocks on the memory), it probably won't be a bottleneck).

That Cayman will be hamstrung by a 256-bit bus is the most reasonable assumption. Obviously, with Cayman looking to be a different or at least highly modified architecture anything can happen, but:

a. The most obvious reason for Barts being so bandwidth limited is that the shaders are much more efficient at game code than Cypress (which itself is horribly inefficient at game code, losing to GTX 480 despite having more texturing power and theoretically over twice as much compute) and thus have higher bandwidth requirements.

b. Even at 1400MHz, a 256-bit Cayman has only a third more bandwidth than Barts.

c. Cayman is going to have quite a few more shaders than Barts. The exact number still being up in the air. More shaders to feed = more bandwidth is needed to feed them.

d. If Cayman in indeed VLIW4, which is looking to be the case right now, then it's going to be even more efficient at game code than Barts is, somewhere much closer to Fermi in efficiency as opposed to Cypress. This is going to increase bandwidth needs per shader to keep the chip fed.

e. No matter what you clock the memory at, it's probable that a 256-bit Cayman is going to see way more than the 2.5x more memory sensitivity we currently have with Barts. Anything greater than 4x can easily be definable as "hamstrung". Heck, depending on how you would define hamstrung one might even go as far as to use that word to describe Barts.

Cayman, as other people said will be use a more complex 256-bit memory controller (not every memory controller with the same width are created equal) with faster memory on top.)

It will probably use a more complex controller, and yes, admittedly there is a good chance that it will be 256-bit. However, the reason AMD used a redwood-derived controller in Barts in the first place was because the memory controller in Cypress was already highly expensive, and that one only got them to 1.2GHz. Given the signals we've been getting from AMD, and the path the Nvidia has taken, wider, less complex memory controllers may be the way to go if you have the die size to support it.

@HurleyBird,
The 6970 is rumored to be 256bit bus width and have up to 6Ghz memory speeds.
(6ghz / 4 (GDDR5) = 1500mhz).

256bit bus width x 1.500 / 8 bits pr byte x 4 = 192.0 Gb/s memory bandwidth.
192 Gb/s is 43% more memory bandwidth than a 6870 has.

Firstly, it's unlikely in the extreme that even if 6GHz rated chips are used that the memory controller will be up to the task of running them at those speeds. And this is going by, well, pretty much every GDDR5 using graphics card that has ever been released. But even then, 43% more memory bandwidth might not even be enough to balance *Barts*, and does anyone here really expect Cayman to be less than 43% more powerful than Barts when it comes to compute/core? anyone?
 
Last edited:

Arkadrel

Diamond Member
Oct 19, 2010
3,681
2
0
Cayman from rumors is supposed to have:

1536 shaders (4D) = 384 shader units (20% more than 5870 @ 320 5d units)

Shader wise it goes down from 1600 --> 1536, but its supposed to be 20% more units overall.


If the 6970 just has 20% more memory bandwidth (than a 5870) wont that mean its not limited by it any more than the 5870 is?



Firstly, it's unlikely in the extreme that even if 6GHz rated chips are used that the memory controller will be up to the task of running them at those speeds. And this is going by, well, pretty much every GDDR5 using graphics card that has ever been released. But even then, 43% more memory bandwidth might not even be enough to balance *Barts*, and does anyone here really expect Cayman to be less than 43% more powerful than Barts when it comes to compute/core? anyone?


I just convinced me... AMD is loseing to much performance by makeing small dies, they should make bigger chips if that truely means their performance will just shoot straight up into the air.

Does the off chip buffering for Tessellation help? does tessellation use any memory bandwidth?
 
Last edited:

Skurge

Diamond Member
Aug 17, 2009
5,195
1
71
Cayman from rumors is supposed to have:

1536 shaders (4D) = 384 shader units (20% more than 5870 @ 320 5d units)

Shader wise it goes down from 1600 --> 1536, but its supposed to be 20% more units overall.


If the 6970 just has 20% more memory bandwidth (than a 5870) wont that mean its not limited by it any more than the 5870 is?

The 5870 had MUCH more shader power than it could really put to good use. Shown by the fact that a 5850 is just 2-3% slower at the same clocks.
 

HurleyBird

Platinum Member
Apr 22, 2003
2,791
1,512
136
The 5870 had MUCH more shader power than it could really put to good use. Shown by the fact that a 5850 is just 2-3% slower at the same clocks.

yet responded around equally to increased memory and core speeds. While Barts has way fewer shaders and responds to memory clock 2.5x more than it responds to core. Now isn't that interesting...
 

Kuzi

Senior member
Sep 16, 2007
572
0
0
Firstly, it's unlikely in the extreme that even if 6GHz rated chips are used that the memory controller will be up to the task of running them at those speeds. And this is going by, well, pretty much every GDDR5 using graphics card that has ever been released. But even then, 43% more memory bandwidth might not even be enough to balance *Barts*, and does anyone here really expect Cayman to be less than 43% more powerful than Barts when it comes to compute/core? anyone?

6GHz memory won't be needed even for Cayman and actually the 43% more bandwidth would be overkill. I don't see Cayman being more than 30% faster than Barts in normal gaming situations, so in that case even memory running at 5.6GHz (179.2GB/s) would be fine, because that's already 33% more bandwidth than Barts.

I believe Cayman Pro will release with 1.3GHz (5.2GHz effective) memory, and Cayman XT with 1.4GHz (5.6GHz effective) but we'll have to wait and see.
 
Last edited:

GaiaHunter

Diamond Member
Jul 13, 2008
3,682
329
126
It will probably use a more complex controller, and yes, admittedly there is a good chance that it will be 256-bit. However, the reason AMD used a redwood-derived controller in Barts in the first place was because the memory controller in Cypress was already highly expensive, and that one only got them to 1.2GHz. Given the signals we've been getting from AMD, and the path the Nvidia has taken, wider, less complex memory controllers may be the way to go if you have the die size to support it.

The controller in Cypress was too expensive to be a part sitting at $200 range. If it is a part sitting at $350+ range things can be different.

Additionally, while NVIDIA uses wider less complex memory controllers, NVIDIA uses the same 5GHz rated GDDR5 as Cypress and Barts. Cypress comes at stock 4.8 GHz while the GTX 480 comes at 3.7 GHz!

For example, the MSI GTX480 Lightning comes at 4GHz and in this review, http://www.guru3d.com/article/msi-geforce-gtx-480-lightning-review/18 , the max it reached was 4.4GHz. This MSI 5870 Lightning on the other hand, http://www.guru3d.com/article/msi-r5870-lightning-review/24, broke the specs (!!!!) and reached 5488MHz!!!

But look at even the 5870 at launch day, http://www.guru3d.com/article/radeon-hd-5870-review-test/26 , 5.4GHz.

And while the Cypress controller might be highly expensive it is highly efficient.

And what is more expensive, a highly efficient controller or a die size in the GF100 die size range?

Firstly, it's unlikely in the extreme that even if 6GHz rated chips are used that the memory controller will be up to the task of running them at those speeds. And this is going by, well, pretty much every GDDR5 using graphics card that has ever been released.

Considering that Cypress can break the specs of 5GHz GDDR5 I'll have to disagree with you.
 
Last edited:

Throckmorton

Lifer
Aug 23, 2007
16,829
3
0
I like the XFX's shorter length and vapor chamber cooling. If I upgrade my 8800GT to this card, I'll gain not only performance, but a couple SATA ports that my current card blocks!
 

Ryan Smith

The New Boss
Staff member
Oct 22, 2005
537
117
116
www.anandtech.com
Apparently, the high noise level of the XFX may have been a glitch. XFX issued a fix. See this article: http://www.hardwarecanucks.com/news/video/xfx-issues-fix-hd-6850-high-idle-fan-speed/

I wonder if this had been applied in the Anandtech article, and if not, would it have changed the rankings?

More and more, I think it's hard to compare various noise measurements, because any power user will go in and tailor a fan profile that optimizes noise and temperature...I think some sites have started to measure CPU heatsinks at identical decibels to test the effectiveness of the designs. I wonder if the same should be done for GPUs. It's obviously a lot more work.
Yes, it has been applied. XFX provided that fix to us before the 6850 even launched. There's nothing "wrong" with the XFX 6850 - its fan scales up and down correctly. It's just tuned for cooling over noise.
 

HurleyBird

Platinum Member
Apr 22, 2003
2,791
1,512
136
Additionally, while NVIDIA uses wider less complex memory controllers, NVIDIA uses the same 5GHz rated GDDR5 as Cypress and Barts. Cypress comes at stock 4.8 GHz while the GTX 480 comes at 3.7 GHz!

Expensive in terms of die size investment, not in terms of paying for the memory itself. AMD used a Redwood-based MC in Barts instead of a Cypress-based one to cut down on manufacturing costs. Given Barts' affinity for bandwidth that was either a tragic mistake or building a memory controller capable of running GDDR5 at lightning speeds takes up a disproportionately large amount of the die vs. a less powerful one.

Another thing is that even if AMD is running higher rated memory chips in Cayman that doesn't mean that they have beefed up the memory controller to compensate. For example, even though AMD was only able to get 1200MHz out of chips rated for 1250MHz in Cypress, if they were to put chips rated for, say, 1600MHz on the same card they definitely wouldn't be running at 1600MHz, but they might be able to get 1250-1450MHz out of them.

Considering that Cypress can break the specs of 5GHz GDDR5 I'll have to disagree with you.

Did you seriously just make the argument that despite Cypress (and every other high end GDDR5 based card on the market) being shipped with memory below the rated speed, that Cayman will be different simply because those previous chips can overclock? Really? Seriously? For real?

...

Again, did you really just say that?

Just step back and think for a second. And just so you know you're talking to a guy with a 5870 in his system that can't run 1250MHz memory stable
 
Last edited:

GaiaHunter

Diamond Member
Jul 13, 2008
3,682
329
126
Expensive in terms of die size investment, not in terms of paying for the memory itself. AMD used a Redwood-based MC in Barts instead of a Cypress-based one to cut down on manufacturing costs. Given Barts' affinity for bandwidth that was either a tragic mistake or building a memory controller capable of running GDDR5 at lightning speeds takes up a disproportionately large amount of the die vs. a less powerful one.

Another thing is that even if AMD is running higher rated memory chips in Cayman that doesn't mean that they have beefed up the memory controller to compensate. For example, even though AMD was only able to get 1200MHz out of chips rated for 1250MHz in Cypress, if they were to put chips rated for, say, 1600MHz on the same card they definitely wouldn't be running at 1600MHz, but they might be able to get 1250-1450MHz out of them.



Did you seriously just make the argument that despite Cypress (and every other high end GDDR5 based card on the market) not being shipped with memory below the rated speed, that Cayman will be different simply because those previous chips can overclock? Really? Seriously? For real?

...

Again, did you really just say that?

Just step back and think for a second. And just so you know you're talking to a guy with a 5870 in his system that can't run 1250MHz memory stable

I'm saying that some don't need wider buses to get higher bandwidth. If you believe that 96% at stock is not efficient, well...

And of course the Gigabyte 5870 Super Overclock comes with 5GHz from the factory, but I guess they don't run stable either.

And you can't disregard other ways to reduce bandwidth requirements, like cache.

Look at the GTX480 - it only has 11% more bandwidth than the GTX285.
 
Last edited:
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |