Question 'Ampere'/Next-gen gaming uarch speculation thread

Page 52 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Ottonomous

Senior member
May 15, 2014
559
292
136
How much is the Samsung 7nm EUV process expected to provide in terms of gains?
How will the RTX components be scaled/developed?
Any major architectural enhancements expected?
Will VRAM be bumped to 16/12/12 for the top three?
Will there be further fragmentation in the lineup? (Keeping turing at cheaper prices, while offering 'beefed up RTX' options at the top?)
Will the top card be capable of >4K60, at least 90?
Would Nvidia ever consider an HBM implementation in the gaming lineup?
Will Nvidia introduce new proprietary technologies again?

Sorry if imprudent/uncalled for, just interested in the forum member's thoughts.
 

Mopetar

Diamond Member
Jan 31, 2011
7,992
6,407
136
Depends on the configuration. Lots of small memory chips will give significantly more bandwidth than fewer, but larger chips. If the rumors of 512bit memory interface are true, that would mean something like sixteen 1.5GB chips.

That's true, but why not use 1 GB chips that can clock a little faster as a trade off?

We've seen that in a lot of the more recent cards that bumping the memory speed is often a better use extra voltage. That may not be true in all cases or for all titles, but if it's a trade between faster memory that will be sufficient across the life of the card or having more memory than you would expect to utilize I think the choice would be obvious.

Consider going the other direction where NVidia uses 2 GB chips for 32 GB total VRAM. Obviously that uses more power that can't be used elsewhere.

My question is why is 24 GB the best choice?
 

kurosaki

Senior member
Feb 7, 2019
258
250
86
So I had searched earlier, and found something saying Micron offered 12Gb chips. But looking at both Micron's and Samsung's parts lists, they only offer 8Gb and 16Gb. So apparently I am the stupid one

But there are 12gb chips. I belive sweclockers.com talked about it in their latest video. It's in Swedish though. If you want an odd configuration there is a possibility it will be tailor made. Not that far fetched. Now where is that stupid color?!
 
Reactions: Konan and Stuka87

kurosaki

Senior member
Feb 7, 2019
258
250
86
Bogus. There are 2 PCBs, where the power MOSFETs are on the second. It is not going to be another chip on the back of the gpu. Mayyybeee, a separate acceleration card.
 

Krteq

Senior member
May 22, 2015
993
672
136
Lol, nope, that's just someone's "representation" of those leaks. Look at those heatsinks, there is no room for PCIe power connectors or any other PCB. There are fins everywhere on that end, no room for any "second PCB" and even PCIe power connectors can't fit there.
 

Konan

Senior member
Jul 28, 2017
360
291
106
a note on the leaked cooler/heat sink design -
Videocardz
The cooler design that was leaked this week is not actually the final design, which let’s be honest was to be expected. What Igor learned is that this design is just one of the two currently being evaluated by NVIDIA. The decision which design will end up in the actual product will be made in July, during the Design Validation Test.

We could have something quite different still. I think this is amplified in the Coreteks video where he zooms in and shows the fins aren’t aligned and different to what one would expect.
So basically not a confirmed cooler choice....
 

kurosaki

Senior member
Feb 7, 2019
258
250
86
Lol, nope, that's just someone's "representation" of those leaks. Look at those heatsinks, there is no room for PCIe power connectors or any other PCB. There are fins everywhere on that end, no room for any "second PCB" and even PCIe power connectors can't fit there.

Just look at it.
 
Reactions: Krteq

Krteq

Senior member
May 22, 2015
993
672
136
That's another flawed unofficial interpretation by Reddit user.

As you can see on that "heatsink" photo, there are no holes allowing airflow through those fins on sides in the middle of the heatsink.

Another thing is that fins on second part of a heatsink are in one piece, there is no space for any PCB fitted with SMDs. Anyway, IF there will be any "second PCB" in the middle of second part of that heatsink, that PCB will be blocking airflow through those fins... however, there will be no space left because there is fan already fitted
 

DooKey

Golden Member
Nov 9, 2005
1,811
458
136
It's going to be what it's going to be. No need to argue about it. As long as it keeps the beast cool enough to pump out frames without sounding like an FX or 290 we'll be fine.
 
Reactions: Krteq

IntelUser2000

Elite Member
Oct 14, 2003
8,686
3,785
136
Speculation from Coreteks - Traversal Coprocessor
Going out on a limb a bit but nevertheless interesting insights

I have to say, I have doubts on the video, but his videos are interesting nonetheless.

I particularly like 19:20 where he graphs out the performance gains and also compares to uarch gains at iso-process. According to that graph, except Maxwell generation, despite what AMD/Nvidia has claimed, all the gains came from process not uarch.
 

Glo.

Diamond Member
Apr 25, 2015
5,758
4,666
136
I have to say, I have doubts on the video, but his videos are interesting nonetheless.

I particularly like 19:20 where he graphs out the performance gains and also compares to uarch gains at iso-process. According to that graph, except Maxwell generation, despite what AMD/Nvidia has claimed, all the gains came from process not uarch.
Nvidia GPUs scale way better with clock speeds, than AMD GPUs do. AMD GPUs scale better with CU counts.

This is the reason why there is so small difference in performance between RTX 2080 Ti and RTX 2080 Super, despite RTX 2080 Ti having 40-45% more ALUs.
 

HurleyBird

Platinum Member
Apr 22, 2003
2,725
1,342
136
Speculation from Coreteks - Traversal Coprocessor

This is extremely shoddy. Argument is:

The weird heatsink design is for cooling the back of the pcb. Therefore there's a chip there. And because transversal would make sense, that's what the chip is.

Except he doesn't show how the heatsink does this, and in fact just looking at it we can see that it doesn't help cool the back of the pcb. It's profoundly stupid conjecture.
 
Reactions: FaaR and kurosaki

kurosaki

Senior member
Feb 7, 2019
258
250
86
"
That's another flawed unofficial interpretation by Reddit user.

As you can see on that "heatsink" photo, there are no holes allowing airflow through those fins on sides in the middle of the heatsink.

Another thing is that fins on second part of a heatsink are in one piece, there is no space for any PCB fitted with SMDs. Anyway, IF there will be any "second PCB" in the middle of second part of that heatsink, that PCB will be blocking airflow through those fins... however, there will be no space left because there is fan already fitted
"


Let's just wait and see. As gentlemen i would prefer if the one wrong in this issue just posted a picture of oneself eating a piece of, or the whole: Hat.
I would like to see you have a bite of that hat, i suppose you would like too see me as well, if it came up to my faulty predictions?

Hats off bloke!

PS. Salt is allowed.

.DS
 
Last edited:

DisEnchantment

Golden Member
Mar 3, 2017
1,672
6,150
136
The interpretation of that patent makes no sense for the context of RTRT in current Windows DirectX environment.
Imagine doing inline RT and your shader launch a ray query, it goes offchip ..

No amount of power and BW will save this.

Talk about people complaining that RT being BW intensive and being very divergent with bigger scenes when tons of boxes needed to be tested for intersection on the ray path.
Don't even know what this shader which launch the ray query will do. Who will resync the result.

It is just wild.

A more realistic interpretation is that the co-processor sits aside the SM or along side the other ALUs and has an idea of the context which shader is launching the ray query so that results can be passed back. ( quite similar to something ...)

Check the DXR code sample , this context has to be correct so that the shader running this code can proceed.
Code:
RaytracingAccelerationStructure myAccelerationStructure : register(t3);

float4 MyPixelShader(float2 uv : TEXCOORD) : SV_Target0
{
    ...
    // Instantiate ray query object.
    // Template parameter allows driver to generate a specialized
    // implementation.
    RayQuery<RAY_FLAG_CULL_NON_OPAQUE |
             RAY_FLAG_SKIP_PROCEDURAL_PRIMITIVES |
             RAY_FLAG_ACCEPT_FIRST_HIT_AND_END_SEARCH> q;

    // Set up a trace.  No work is done yet.
    q.TraceRayInline(
        myAccelerationStructure,
        myRayFlags, // OR'd with flags above
        myInstanceMask,
        myRay);

    // Proceed() below is where behind-the-scenes traversal happens,
    // including the heaviest of any driver inlined code.
    // In this simplest of scenarios, Proceed() only needs
    // to be called once rather than a loop:
    // Based on the template specialization above,
    // traversal completion is guaranteed.
    q.Proceed();

    // Examine and act on the result of the traversal.
    // Was a hit committed?
    if(q.CommittedStatus()) == COMMITTED_TRIANGLE_HIT)
    {
        ShadeMyTriangleHit(
            q.CommittedInstanceIndex(),
            q.CommittedPrimitiveIndex(),
            q.CommittedGeometryIndex(),
            q.CommittedRayT(),
            q.CommittedTriangleBarycentrics(),
            q.CommittedTriangleFrontFace() );
    }
    else // COMMITTED_NOTHING
         // From template specialization,
         // COMMITTED_PROCEDURAL_PRIMITIVE can't happen.
    {
        // Do miss shading
        MyMissColorCalculation(
            q.WorldRayOrigin(),
            q.WorldRayDirection());
    }
    ...
}

For each query, it matters which is the origin, the ray direction, current frame and so on.
 
Last edited:
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |