[Videocardz rumour] Vega pushed forward to October

Page 9 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Yakk

Golden Member
May 28, 2016
1,574
275
81
What could AMD possibly do to increase throughput of their GPUs? Next gen. rasterization? Next gen. schedulers?

Is there anything that would make 4096 GCN core, and 1TB/s of bandwidth fully utilized?

I know I may sound a little excited on this topic, but I want to know what might appear in next gen. Mac Pro .

I would think AMD already have their next gen schedulers with HWS units being tested on the 480. Then they will probably be replacing standard ACE units completely. AMD will have to have calculated the right balance of HWS units needed to saturate the entire GPU while at the same time making it easier to program for game engine developers to use fully with next gen multi-core CPUs.
 

linkgoron

Platinum Member
Mar 9, 2005
2,335
857
136
If Polaris was a major disappointment to anyone performance wise, that is only because of the above mentioned hype machine getting out of control and setting expectations way too high. AMD has been absolutely consistent about the 480 offering 390 level performance at reduced power consumption and that's exactly what we got. I honestly thought they would get more praise for being upfront about what Polaris is since people are always keen to jump on a company for over-promising and under-delivering.

Sorry, my wording was a bit off. I meant that perf/watt was a major disappointment. Maybe the whole throttling+spec problems (6-pin decision) made it more severe than it actually was, but I still think that 120 watts is what people expected. Let's face it, the fact that the 28nm 980 has better perf/watt (according to TPU) is disappointing.

Performance, I expected a touch higher (i.e. consistently beat the 390, as the 390 does beat it in some games), but my original post made it look a bit differently.
 

krumme

Diamond Member
Oct 9, 2009
5,956
1,595
136
Rx480 is a mainstream part targeting 200 and 240 usd. A mainstream part is meant to be sold in droves yet this card is hardly available in any serious numbers.

Its a desktop card so its of minor importance if its using 30w to much and lacking 100mhz. Its performs super fine in dx 12. The basic problem the 480 8g is nowhere in sight.

Its so huge an issue Lisa even mentioned gf yields. And it means its bad. Like complaining about your wife at the next family dinner when she is sitting next to you.

For once amd actually launches a product where there is nost need and then they can not make it.

I was surpriced gf could launch at same time as tsmc but damn they were not ready. Tons of process variantion and leaking stuff and bad yield as by far the worst part. Sad for compettition and prices.
 

RussianSensation

Elite Member
Sep 5, 2003
19,458
765
126
I am 'hopeful' for Vega, but almost think we will just get 2x Polaris for 2x the price. AMD hopefully has something up their sleeve....and they will need to be providing some solid performance leaks and (accurate) prerelease hype to keep people from still buying 1080s/1080Tis. I really wanted to go AMD this year, but nothing on the mid high-end to choose from. Hoping Vega is a home-run.

2X the performance of an RX 480 would put it faster than an AIB 1080 and all that for $499 vs. $650-700 that NV charges? That's how hype gets started without any basis for it. You made your prediction sound disappointing but to me that would be a huge win if true. It would essentially mean a card 25% faster than the reference GTX1070 for only $50 more or similar performance to the GTX1080 for $150-200 less. I doubt Vega 10 will be that competitive. AMD is going to need to fill in $329-499 price levels with cards between RX 480 and the GTX1080. This is why AMD already confirmed they will have Vega 10 and Vega 11. The confusing part about these 2 chips is that AMD has Vega with HBM2 on the roadmap, which doesn't align well with some speculation that Vega 10 may be GDDR5X and Vega 11 (larger chip) would be HBM2.

What could AMD possibly do to increase throughput of their GPUs? Next gen. rasterization? Next gen. schedulers?

Is there anything that would make 4096 GCN core, and 1TB/s of bandwidth fully utilized?

I know I may sound a little excited on this topic, but I want to know what might appear in next gen. Mac Pro .

At the same GPU clocks and on paper memory bandwidth, Polaris 10 (4th gen) is just 18% faster on average against Tahiti (7970) but is only 7% faster against Tonga (R9 380X). That means in nearly 5 years AMD has improved IPC by just about half of what NV did in 2 years moving from Kepler to Maxwell (35-40%). Given AMD's current track record in improving IPC for GCN over 5 years, I would not expect any major improvements in architecture with Vega (0-10%). Chances are AMD will use the wide core approach (close to double everything) because it's unlikely they will use the brute force (GPU clock speeds) approach of their competitor. Most 'reasonable' estimates for Vega 10 are 4096 SPs, 256 TMUs, 64 ROPs, HBM2.

Computerbase shows only a 4.3% advantage for 2304 SP RX 480 against 2048 SP RX 480 at the same GPU/memory clocks. This suggests a severe memory bandwidth or some other bottleneck (ROP?). Despite 12.5% higher shaders and textures, the RX 480 is unable to translate this on paper advantage to real world games. This means AMD is going to need the fastest GDDR5X they can get their hands on or have no choice but to wait for HBM2. In order to improve perf/watt, as their road-map suggests, they are going for HBM2 for maximum efficiency. This is where things get tricky for AMD. SK-Hynix shows only 2 options on their road-map for Q3' 2016:

H5VR32ESM4H-12C 4GB 4Hi = 204GB/sec 1.6Gbps
H5VR32ESM4H-20C 4GB 4Hi = 256GB/sec 2.0Gbps

If they choose the former, they will likely need 4x4Hi = 16GB 800GB/sec. 16GB seems massive overkill for this generation of gaming. You end up with wasted VRAM, higher cost, higher power usage.

If they choose the latter, I am not sure you can do 2x4Hi = 8GB 512GB/sec. 8GB is a good spot to be in but 512GB/sec memory bandwidth seems borderline low for a 4096 SP, 64 ROP GCN 4.0 because RX 480 is already memory bandwidth bottlenecked with 256GB/sec.

Here is another issue: if Vega 10 is only 4096 SP, 64 ROP, 256 TMU, 512GB/sec 250W TPD design, if linear scaling was 2X over RX 470, the card would only end up slightly beating the GTX1070. Since Polaris 10 is only a 5.7B chip, the Vega 10 with these specs would fit at ~ 11B, but the rumours have been whispering that flagship Vega would be ~ 15-18B transistors. Not adding up.

On the surface it seems we could see a 3500-4000 Vega 10 with 10-11B transistors but they could also be a much larger 5000-6000 shader Vega 11 with 15-18B (or the rumours are just that - unsubstantiated dreams). What makes it so difficult to estimate is AMD's vagueness and the gargantuan gap that exists in performance between the RX 480 and GTX1080/Titan XP. It doesn't seem realistic for a 1.2-1.3Ghz Vega 10 to be both a competitor to GP104 and GP102.

IMHO, AMD's best bet to improve performance will be going wider (more functional units), and maximizing perf/watt at the transistor level just like NV mentioned they spent months trying to maximizing clock speeds on 16nm node. AMD's Polaris 10 clocks were a disappointment for a 14nm node. Whatever I typed is pretty much nothing new though.
 
Last edited:
Reactions: crisium

thilanliyan

Lifer
Jun 21, 2005
11,912
2,130
126
If they get 2x Polaris performance, at maybe 1.5x the power consumption, and even if it's 2x the price I'd buy!! Need a beast mining card lol. If in mining it's relatively close to 2x290 at much better power consumption, I'd definitely buy.
 

Glo.

Diamond Member
Apr 25, 2015
5,765
4,670
136
At the same GPU clocks and on paper memory bandwidth, Polaris 10 (4th gen) is just 18% faster on average against Tahiti (7970) but is only 7% faster against Tonga (R9 380X). That means in nearly 5 years AMD has improved IPC by just about half of what NV did in 2 years moving from Kepler to Maxwell (35-40%). Given AMD's current track record in improving IPC for GCN over 5 years, I would not expect any major improvements in architecture with Vega (0-10%). Chances are AMD will use the wide core approach (close to double everything) because it's unlikely they will use the brute force (GPU clock speeds) approach of their competitor. Most 'reasonable' estimates for Vega 10 are 4096 SPs, 256 TMUs, 64 ROPs, HBM2.
No. I will show you something.

Tahiti was Graphics IPv7.
Tonga was Graphics IPv8.
Fiji was Graphics IPv8.1.
Polaris was Graphics IPv8.2.
https://translate.google.com/translate?sl=auto&tl=en&js=y&prev=_t&hl=cs&ie=UTF-8&u=https://www.computerbase.de/2016-08/amd-radeon-polaris-architektur-performance/&edit-text=
Here you have direct comparison of Improvements of performance of different versions of the same architecture, clock for clock, core count for core count.
Vega is Graphics IPv9.

Secondly, we do not know what will Vega have. Graphics IPv9 may mean next generation rasterizer, for example. What slides from AMD are indicating? 3 times higher performance per watt comparing to 28 nm GPUs from 2015. What technology would bring increase in performance per watt on the same node as Polaris? Tile-Based Rasterization. Next generation Rasterization in desktop graphics. The very reason why there is absolutely no difference for Nvidia when they think about desktop and mobile.

Thirdly. We don't know the core counts of Vega 10. Your estimation may be "logical". What, however, if bigger Vega is that 4096 core GPU? What if Bigger Vega will have not 64 but 96 ROPs? What if Smaller Vega will have 3072 GCN cores, 64 ROPs and 8 GB of HBM and will be sold this year? You may say that 4096 GCN core GPU as RX Fury would be completely stupid. Not when you consider Rasterizing technology in this big picture. How much more performance you would be able to squeeze from this technology and 3072 GCN core GPU?

Those are only my thoughts. Are exactly as valid as yours, RS about upcoming lineup.
 
Last edited:

Krteq

Senior member
May 22, 2015
993
672
136
No. I will show you something.

Tahiti was Graphics IPv7.
Tonga was Graphics IPv8.
Fiji was Graphics IPv8.1.
Polaris was Graphics IPv8.
Fixed.

According to AMDGPU sources, Polaris is still GFX IPv8.

Code:
       kaveri bonaire kabini hawaii topaz  tonga carrizo fiji Polaris
gmc      7.0    7.0    7.0    7.0    8.0    8.0    8.0    8.5    8.1
ih       2.0    2.0    2.0    2.0    2.4    3.0    3.0    3.0    3.1
smc      7.0    7.0    7.0    7.0    7.1    7.1    8.0    7.1    7.2
dce      8.1    8.2    8.3    8.5    N/A   10.0   11.0   10.1   11.2
gfx      7.1    7.2    7.2    7.3    8.0    8.0    8.0    8.0    8.0
sdma     2.0    2.0    2.0    2.0    2.4    3.0    3.0    3.0    3.1
uvd      4.2    4.2    4.2    4.2    N/A    5.0    6.0    6.0    6.3
vce      2.0    2.0    2.0    2.0    N/A    3.0    3.0    3.0    3.4
Courtesy of Beyond3D forums
 
Last edited:
Reactions: prtskg

Headfoot

Diamond Member
Feb 28, 2008
4,444
641
126
I just cant imagine that the architecture will be radically different than Polaris. It doesn't make sense. Why would you invent a bunch of new uArch level improvements and then intentionally withhold them from your bread and butter midrange GPU? I think its far more likely Vega has a couple more enhancements, the things necessary to get it to scale to such large shader counts, and of course HBM2 for at least one if not both the Vega dies. I very much doubt there are major architecture enhancements that we haven't already seen on Polaris.
 

Krteq

Senior member
May 22, 2015
993
672
136
Hmm, they should enhance ROPs/RBEs for example. They are practicaly untouched since first GCN incarnation and rasterizer changes are one of the key elements for Maxwell/Pascall Perf/watt success.
 

swilli89

Golden Member
Mar 23, 2010
1,558
1,181
136
I just cant imagine that the architecture will be radically different than Polaris. It doesn't make sense. Why would you invent a bunch of new uArch level improvements and then intentionally withhold them from your bread and butter midrange GPU? I think its far more likely Vega has a couple more enhancements, the things necessary to get it to scale to such large shader counts, and of course HBM2 for at least one if not both the Vega dies. I very much doubt there are major architecture enhancements that we haven't already seen on Polaris.

Vega is the LONG awaited "Greenland" GPU architecture. I really have a strong a feeling that this will be a total revamp to GCN. Now in hindsight, it looks like AMD chose to direct its limited engineering bandwidth to getting the first chip out on 14nm LPP that satisfied criteria that Apple, Sony, M$ all had for their OEM designs. They ended up getting forced to sell Polaris 10 at <$250 due to the market. Vega could may very well be in development since before Polaris even began, indicating a truly new architecture.
 
Mar 10, 2006
11,715
2,012
126
Hmm, they should enhance ROPs/RBEs for example. They are practicaly untouched since first GCN incarnation and rasterizer changes are one of the key elements for Maxwell/Pascall Perf/watt success.

Actually, the changes to the SMs were by far the biggest drivers of the nice perf/watt in Maxwell/Pascal.
 

Glo.

Diamond Member
Apr 25, 2015
5,765
4,670
136
Vega is the LONG awaited "Greenland" GPU architecture. I really have a strong a feeling that this will be a total revamp to GCN. Now in hindsight, it looks like AMD chose to direct its limited engineering bandwidth to getting the first chip out on 14nm LPP that satisfied criteria that Apple, Sony, M$ all had for their OEM designs. They ended up getting forced to sell Polaris 10 at <$250 due to the market. Vega could may very well be in development since before Polaris even began, indicating a truly new architecture.
Greenland is the name of the GPU. Vega is the name of architecture. Lets stick to this and do not mix them up. Greenland can be completely different chip than Vega "whatever it will be called" turn out to be.
 

Krteq

Senior member
May 22, 2015
993
672
136
Actually, the changes to the SMs were by far the biggest drivers of the nice perf/watt in Maxwell/Pascal.
Well... nope.

It's a discussion for another section/thread anyway. This one is about Vega, not about a nV uarchs.
 

swilli89

Golden Member
Mar 23, 2010
1,558
1,181
136
Greenland is the name of the GPU. Vega is the name of architecture. Lets stick to this and do not mix them up. Greenland can be completely different chip than Vega "whatever it will be called" turn out to be.
Oh I thought Greenland was simply renamed to Vega to follow their new naming scheme? Regardless I know it doesn't really matter; the point I was making is that its been in development for a very long time, which gives me a little hope.
 

Glo.

Diamond Member
Apr 25, 2015
5,765
4,670
136
Oh I thought Greenland was simply renamed to Vega to follow their new naming scheme? Regardless I know it doesn't really matter; the point I was making is that its been in development for a very long time, which gives me a little hope.
Be cautious here. Overhyping the architecture can be painful in the end, as we have seen with Polaris.
 
Reactions: Arachnotronic
Mar 10, 2006
11,715
2,012
126
Well... nope.

It's a discussion for another section/thread anyway. This one is about Vega, not about a nV uarchs.

Just going to finish off with this, then I'm done:

The primary contributor to Maxwell’s improved efficiency is the new Maxwell SM architecture, SMM. This new SM architecture achieves much higher power efficiency and delivers 35% more performance per CUDA Core on shader-limited workloads. Achieving these results required a number of major changes to the architecture. The SM scheduler architecture and algorithms have been rewritten to be more intelligent and avoid unnecessary stalls, while further reducing the energy per instruction required for scheduling.

http://international.download.nvidi...tional/pdfs/GeForce-GTX-750-Ti-Whitepaper.pdf
 
Reactions: happy medium

lyssword

Diamond Member
Dec 15, 2005
5,761
25
91
Pretty much guaranteed out of stock for 1/2- 1 year after release because of miners....
 

Pork Sausage

Junior Member
Aug 14, 2016
4
1
21
Vega arriving this year would be nice. My GTX 660 Ti is obnoxiously slow in a few games and while the RX 480 would be a dramatic improvement, I don't think it will be quite enough to let my build coast through the next few years of useful life without graphics performance becoming an issue down the line again.
 

Zstream

Diamond Member
Oct 24, 2005
3,396
277
136
Fixed.

According to AMDGPU sources, Polaris is still GFX IPv8.

Code:
       kaveri bonaire kabini hawaii topaz  tonga carrizo fiji Polaris
gmc      7.0    7.0    7.0    7.0    8.0    8.0    8.0    8.5    8.1
ih       2.0    2.0    2.0    2.0    2.4    3.0    3.0    3.0    3.1
smc      7.0    7.0    7.0    7.0    7.1    7.1    8.0    7.1    7.2
dce      8.1    8.2    8.3    8.5    N/A   10.0   11.0   10.1   11.2
gfx      7.1    7.2    7.2    7.3    8.0    8.0    8.0    8.0    8.0
sdma     2.0    2.0    2.0    2.0    2.4    3.0    3.0    3.0    3.1
uvd      4.2    4.2    4.2    4.2    N/A    5.0    6.0    6.0    6.3
vce      2.0    2.0    2.0    2.0    N/A    3.0    3.0    3.0    3.4
Courtesy of Beyond3D forums
So Fiji was a test?
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |