My god, you'd think with all the official information from AMD people would not resort to conjecture and empty hype, but here we are again! AMD says it's product is 1080 level, but one dude says it's 1080Ti with time! LOL!
So you are saying that AMD architecture, having higher throughput of the cores, compared to consumer Pascal GPUs, and having higher geometry throughput than Pascal GPUs and just acknowledging this, is creating hype?
GCN architecture is made from 4, 16-wide SIMDs in each CU, accounting for 64 cores, and are tied by 256 KB Register File size per 64 cores.
Each wavefront in GCN has 64 KB, and each warp in Nvidia is 32 KB.
Kepler architecture was bound by 256 KB RFS per 192 cores.
Maxwell was bound by 256 KB RFS per 128 cores
Consumer Pascal is bound by 256 KB RFS per 128 cores.
GP100 - 64 cores/ 256 KB Register File Size.
GV100 - 64 cores/256 KB RFS.
What this means is that 128 cores in Maxwell architecure had the same throughput, and performance of those 192 cores, from Kepler architecture.
Now compare notes. 64 KB wavefront processed by architecture, that has 256 KB register File size per 64 cores, vs architecture that has 32 KB warp, processed by architecture that has 256 KB RFS per 128 cores. Which one will be quicker?
Previous generations of GCN were bound by Geometry performance - 1 triangle per clock, per shader engine.
Same for Nvidia 1 triangle per clock, per SM. This is where the differences start to go in Nvidia way.
If Nvidia has 6 SM's it can register 6 Triangles each clock, and is able to achieve higher clocks, so the pipeline can be filled much more often than GCN.
GCN can process more work each cycle, but cannot achieve as high clocks.
Here is where GCN5 comes. It changed everything on architecture level, to mitigate its downsides. Primitive Shader, and Programmable Geometry Pipeline allows GCN to register 10 triangles each clock with 4 shader engines, and the architecture is able to clock itself up to 1.6 GHz. There is no reason to believe that GPU architecture that has higher overall throughput, in properly optimized software to be behind latest Nvidia architectures, including Volta.
4096 GCN core chip, with 64 KB wavefront, processed by 64 cores, that are fed with 256 KB register file size, will be faster per clock, than 3840 CUDA core chip, with 32 KB warp, processed by 128 core, that are fed by the same 256 KB RFS. The only reason why its not faster: software developers did not implemented Primitive Shaders, and not used Programmable Geometry Pipeline, and drivers of said GPU are not ready.
FineWine technology has nothing to do here. Biggest features of Vega architecture, that have hghest impact on its performance are not implemented in the software.
To sum this wall of text. In DX11 games, Vega will be at best around GTX 1080 performance. With properly optimized software in DX12 and Vulkan, it will be 30% faster than GTX 1080 Ti.