Vega/Navi Rumors (Updated)

Page 237 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.
Status
Not open for further replies.

Elixer

Lifer
May 7, 2002
10,376
762
126
Looks like HBM2 won't budge at all, it is maxed out already.
*edit, these are reference cards, no tweaks have been done to them at all.

https://sapphirenation.net/sapphire-radeon-vega/?utm_source=newsletter
SAPPHIRE Technology has announced the launch of the much-anticipated SAPPHIRE Radeon Vega 64 enthusiast graphics cards. The Vega architecture boasts significant improvements focused on maximizing the performance. Vega cards are designed for enthusiasts seeking top-of-the-chart framerates in games of today and tomorrow – in Ultra details and VR.
 
Last edited:

ozzy702

Golden Member
Nov 1, 2011
1,151
530
136
Looks like HBM2 won't budge at all, it is maxed out already.

https://sapphirenation.net/sapphire-radeon-vega/?utm_source=newsletter

Not surprising given the voltage that has been reported. Vega is the 290 all over again. Pushed to it's limit, hot, loud, guzzling power. Over a year late to market and still rushed out the door... I've said it before and I'll say it again, AMD better have used some secret sauce on Vega to make it mine like a champ because mining is it's only hope to sell anywhere near MSRP.

Anyone know/educated guess if the memory on Vega 56 can be overclocked to 1890mhz?
 

fleshconsumed

Diamond Member
Feb 21, 2002
6,485
2,362
136
Not surprising given the voltage that has been reported. Vega is the 290 all over again. Pushed to it's limit, hot, loud, guzzling power. Over a year late to market and still rushed out the door... I've said it before and I'll say it again, AMD better have used some secret sauce on Vega to make it mine like a champ because mining is it's only hope to sell anywhere near MSRP.

Anyone know/educated guess if the memory on Vega 56 can be overclocked to 1890mhz?
Well, here's to hoping that just like 290 we'll be able to pick them up for $200 6 months to a year after release
 

Elixer

Lifer
May 7, 2002
10,376
762
126
Likely it is maxed out but using those Sapphire details to prove it is irrelevant as those are the reference cards re-branded, not custom AIB versions.
True, but I was hoping they would tweak it a bit, but, these are just vanilla reference cards.
 

Tup3x

Golden Member
Dec 31, 2016
1,011
1,001
136
I have asked simple question, how much do you guys think GPU will be faster in situation when software implements Vega Features. I did not implied it will be 2 times faster.

Yes, AMD have said that in CURRENT slack of software 150W Vega has similar performance per watt that GTX 1080.

Does any of current slack of software use any of Vega features that have most meaningful impact on performance?


Doubling the Geometry registered each clock by GCN is solving biggest problem it had in geometry throughput. And then on top of it, you have a culling technique that supposedly can cull at least 50% of geometry not used in scenes.

About the last paragraph. What if Vega already beats GTX 1080 by some margin? What if you will add on top of it hardware features, and Primitive Shaders to software? Is it impossible for Vega to gain 40% of performance with it?
It's impossible to say how much because that will vary case by case (or should I say frame by frame). Proper lods and middlewares like Umbra already help with reducing useless geometry in games so I don't think the gains are going to be huge in actual games.

Also, who knows how much perf per clock they lost in order to gain higher clocks. Maybe the new stuff just help to gain the lost speed. To me it looks like they tried to achieve higher clocks but hit the wall (combination of GCN and manufacturing process problems).

I'm pretty certain that AMD has done the predictions already. They would have been way more bullish if they except Vega to beat big Pascal. If that ever happens, Volta will be out and it all is irrelevant. Also how come that there has never been talks about "wait for GPU feature implementations" if the GPU performance is lacking at launch? Massive majority of devs are not going bother. Hardware that requires extra work is just a bad idea.

Vega has a lot of compute power but that doesn't translate to massive gaming performance. It feels like they are making computing cards for miners and not graphics cards for gamers. NVIDIA made correct corrections after Fermi. AMD should have done the same after Hawaii but it looks like they have problems doing that.
 

Glo.

Diamond Member
Apr 25, 2015
5,761
4,666
136
It's impossible to say how much because that will vary case by case (or should I say frame by frame). Also, who knows how much perf per clock they lost in order to gain higher clocks. Maybe the new stuff just help to gain the lost speed. To me it looks like they tried to achieve higher clocks but hit the wall (combination of GCN and manufacturing process problems).

I'm pretty certain that AMD has done the predictions already. They would have been way more bullish if they except Vega to beat big Pascal. If that ever happens, Volta will be out and it all is irrelevant. Also how come that there has never been talks about "wait for GPU feature implementations" if the GPU performance is lacking at launch? Massive majority of devs are not going bother. Hardware that requires extra work is just a bad idea.

Vega has a lot of compute power but that doesn't translate to massive gaming performance. It feels like they are making computing cards for miners and not graphics cards for gamers. NVIDIA made correct corrections after Fermi. AMD should have done the same after Hawaii but it looks like they have problems doing that.
What do you think defines performance of GPUs, per clock? Its a serious question. Why do you think they had to lose performance per clock to gain higher clock speeds?
 

JDG1980

Golden Member
Jul 18, 2013
1,663
570
136
Looks like HBM2 won't budge at all, it is maxed out already.

HBM2 speeds are one of Vega's biggest problems. The design goal was 512 GB/sec, but they only got the bandwidth up to 480 GB/sec, and only did that by overvolting and overclocking the chips (thus driving up power consumption and heat). In other words, it has only ~94% of the memory bandwidth of Fury X despite having >1.5x the raw shader power (due to higher core clocks). Combine that with the fact that it can still only do 4 triangles/clock (compared to 6 for GP102) and it's not hard to see why it has such trouble competing with Nvidia's best cards in gaming.

They might be able to get better results next year if they have access to HBM2 that's actually running at the speeds it was originally supposed to. That, combined with a respin (process tweaks, like on 2nd gen Polaris), might provide marginally better performance and perf/watt, but it still will not be particularly competitive with Pascal for gaming.

It will be interesting to see how Raven Ridge fares - unlike Vega, it should have as much or more memory bandwidth than its predecessor. It will still be bottlenecked - even with DDR4 3200, it will have only 51.2 GB/sec of bandwidth, and that shared - but should still be able to beat any other iGPU out there. I wonder if the HBCC can borrow part of the CPU's L3 cache? If so, that could help. Roughly speaking, I'm expecting Raven Ridge GPU performance to be about on par with Cape Verde (HD 7770).
 
Reactions: ozzy702

Elixer

Lifer
May 7, 2002
10,376
762
126
They might be able to get better results next year if they have access to HBM2 that's actually running at the speeds it was originally supposed to. That, combined with a respin (process tweaks, like on 2nd gen Polaris), might provide marginally better performance and perf/watt, but it still will not be particularly competitive with Pascal for gaming.
Well, it was already mentioned by this guy...
Industry sources...


HBM2 is at version 3 currently, "Perfected Yields", version 4 allows for half-channel => 512-bit, and 2.4 GHz op which is 307.2 gigabyte per sec or 153.6 gigabyte per sec,"Low Cost".

Vega 12 w/ 512-bit HBM2... yum.

So, it seems we could yet see faster HBM2, next year (or sooner?).

Nobody is really saying what the exact problem is with HBM2, and why the fabs can't do 2 Gbps before. It was in the JEDEC specs, so, where was the big issue that caused more volts and yet, less speed?
Was it just yield issues, or does it go deeper than that?
 

utahraptor

Golden Member
Apr 26, 2004
1,053
199
106
When I first saw the specs of the vega the first thing I wanted to see was how fast the new HBM2 was. When I saw it was twice as fast, but they cut the bus in half I was kind of shocked.
 
Reactions: tonyfreak215

Tup3x

Golden Member
Dec 31, 2016
1,011
1,001
136
What do you think defines performance of GPUs, per clock? Its a serious question. Why do you think they had to lose performance per clock to gain higher clock speeds?
If losing tiny bit performance pre clock means much higher clocks then it's a worthwhile trade-off. Especially if you can gain back that with new features (that would mean no PPC decrease in practice). Why I think that might be the case is at least so far Vega has performed pretty much like higher clocked Fiji. Though, performance per clock doesn't matter but the actual outcome. There are numerous ways to do things.

In general it tends to be low PPC -> high frequency. High PPC -> low frequency. It's usually pretty darn hard to get both up. With Pascal NVIDIA managed to get the clock speed up a lot. AMD didn't succeed as well as NVIDIA. Also since the core config is same between Vega and Fiji (4096:256:64) it means that the main performance increase should come from higher clock speed and the modifications they made since Fiji. They hit the power and frequency wall, thus the results are what they are.

It's also possible that the new features they implemented just do not work as well as they thought. If that happens to top teams in F1, I'm sure it can happen to GPU developers too.
 
Last edited:

Glo.

Diamond Member
Apr 25, 2015
5,761
4,666
136
If losing tiny bit performance pre clock means much higher clocks then it's a worthwhile trade-off. Especially if you can gain back that with new features (that would mean no PPC decrease in practice). Why I think that might be the case is at least so far Vega has performed pretty much like higher clocked Fiji. Though, performance per clock doesn't matter but the actual outcome. There are numerous ways to do things.

In general it tends to be low PPC -> high frequency. High PPC -> low frequency. It's usually pretty darn hard to get both up. With Pascal NVIDIA managed to get the clock speed up a lot. AMD didn't succeed as well as NVIDIA. Also since the core config is same between Vega and Fiji (4096:256:64) it means that the main performance increase should come from higher clock speed and the modifications they made since Fiji. They hit the power and frequency wall, thus the results are what they are.
You have not answered my question.

What do you think affects performance per clock in GPUs? Why do you think they had to lose performance per clock in order to gain clock speeds?
 

Tup3x

Golden Member
Dec 31, 2016
1,011
1,001
136
You have not answered my question.

What do you think affects performance per clock in GPUs? Why do you think they had to lose performance per clock in order to gain clock speeds?
I did answer why I though they could have done it. In order to get the clocks higher. If losing a bit of performance per clock means much higher clock speeds and higher performance then that's one design reason.

Besides, what kind of questions is that? If we take GTX 1070 and GTX 1080 then the answer would be: GTX 1080 has more hardware so that it can do more stuff at the same clock speed (on high level).

Shorter, wider... Longer, narrower... Pretty much the same reason why Athlon 64 smoked Pentium 4.
 

raghu78

Diamond Member
Aug 23, 2012
4,093
1,475
136
Vega is a unmitigated disaster in terms of perf/watt and perf/sq mm. AMD marketing tried their best to hype this dud. I doubt AMD can do anything about this massive perf and efficiency gap against Pascal which will get even worse against Volta without a clean sheet redesign. I think its time AMD moved to software scheduler like Nvidia introduced with Kepler. The die and power cost of a hardware scheduler is too much of a handicap to compete with Nvidia's lean and efficient designs. With the advent of Ryzen and the pipeline of improvements coming over the next few years - higher clocks , higher IPC and more cores this is the right time for AMD to make that transition. Their CPUs are going to powerful enough to handle that transition especially by the time 7nm Zen 2 arrives. One thing is sure AMD needs a rethink of their GPU architecture going forward.
 

Glo.

Diamond Member
Apr 25, 2015
5,761
4,666
136
I did answer why I though they could have done it. In order to get the clocks higher. If losing a bit of performance per clock means much higher clock speeds and higher performance then that's one design reason.

Besides, what kind of questions is that? If we take GTX 1070 and GTX 1080 then the answer would be: GTX 1080 has more hardware so that it can do more stuff at the same clock speed (on high level).

Shorter, wider... Longer, narrower... Pretty much the same reason why Athlon 64 smoked Pentium 4.
You have answered why, but you have not answered what is defining performance per clock for GPUs.

And it is core throughput, and graphics capabilities, of each architecture.


How do you explain that in fact, Nvidia GPUs have longer pipeline that can clock higher but can do less work each cycle than AMD GPUs can?

Nvidia has 32 KB Warp, that is executed by 128 cores/256 KB Register File size, in Maxwell and Pascal architectures.
AMD has 64 KB wavefront that is executed by 64 cores/256 KB Register file size in every single GCN generation since GCN1. All what has changed in GCN since that time, is graphics capabilities. Nvidia changed core throughput, by reducing the number of cores per 256 KB Register File size, which made them less starved for resources. This is why we have seen zero improvement in core for core performance in compute applicatins, but we have seen improvement core for core in both gaming and compute with shift from Kepler to Maxwell.

Where bottleneck is for GCN in graphics throughput - Registering triangles. Each cycle you can register 4 triangles with 4 geometry engines. GP102 can register 6 triangles each clock with 6 GPC's(Each GPC has 1 Geometry Engine). Vega is lifting this bottleneck with Programmable Geometry Pipeline, which can register up to 11 triangles each clock with 4 geometry engines.

AMD lifted a lot of bottlenecks in Vega. But it will take time, before the software will mature.
 

Crumpet

Senior member
Jan 15, 2017
745
539
96
http://www.3dmark.com/compare/spy/2205628/spy/2177830/spy/1984947

Buildzoid comment on /AMD/

  • "It's fake. Vega has a bug where you can set very very high core clocks but the performance ends up being worse than stock. I first ran into when testing my FE on LN2 but I also managed to make my card run at "2000MHz" core in later testing on air."
Which makes sense considering the performance shown here is roughly equal to 2x Rx470 crossfire. (thanks to Jaeger for pointing that out)

So what we can take from this is reference Vega performance @stock clocks, or worse.
 
Status
Not open for further replies.
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |