Discussion Ada/'Lovelace'? Next gen Nvidia gaming architecture speculation

Saylick · Jan 29, 2022

Wasn't sure where to put this, but according to Greymon, Hopper is monolithic and there's currently not an MCM variant.

https://twitter.com/x/status/1487253688280240129

https://twitter.com/x/status/1487259506723594243

Saylick · Jan 29, 2022

Quick little update. GH100, the biggest die, may not be in an MCM product but the MCM product may consist of two smaller Hopper family dies? I'm not sure why Nvidia wouldn't MCM the big die if it's possible to MCM the smaller dies. Power consumption limits? If that's the case, why have the big die to begin with?

https://twitter.com/x/status/1487354924052586496

jpiniero · Jan 29, 2022

Saylick said:
Quick little update. GH100, the biggest die, may not be in an MCM product but the MCM product may consist of two smaller Hopper family dies? I'm not sure why Nvidia wouldn't MCM the big die if it's possible to MCM the smaller dies. Power consumption limits? If that's the case, why have the big die to begin with?

I suppose it could be similar dies but the MCM die has additional logic for coherency. That sounds like a total waste but perhaps they were unsure they would be able to get it working correctly?

If the single die really is ~33% more FP32 shaders, ~20 bigger die AND 25% more power draw, that doesn't sound so great if you factor in the shrink.

Ajay · Jan 29, 2022

jpiniero said:
I suppose it could be similar dies but the MCM die has additional logic for coherency. That sounds like a total waste but perhaps they were unsure they would be able to get it working correctly?

If the single die really is ~33% more FP32 shaders, ~20 bigger die AND 25% more power draw, that doesn't sound so great if you factor in the shrink.

That sounds just plain wrong, especially coming from Samsung 8LPP. Hopefully, more info comes along at GTC.

CakeMonster · Jan 29, 2022

I assume we are talking in context of the high end cards here, else it doesn't make much sense. Given that, it could be that NV have the dual chip technology nailed down, but are just weighing their options?

1) Two lower clocked and more efficient chips but connected that makes it more complex? 2) Or just one chip that pushes 5nm to its max and have no additional complexity, but needs to nail the cooling along with hand picked dies that can reach the target frequency and have very few physical errors?

jpiniero · Jan 29, 2022

Ajay said:
That sounds just plain wrong, especially coming from Samsung 8LPP. Hopefully, more info comes along at GTC.

GA100 is N7, so it's from N7 -> N5.

Ajay · Jan 29, 2022

jpiniero said:
GA100 is N7, so it's from N7 -> N5.

Oh, duh, yeah. Still, seems wrong - unless GH100 is physically smaller or has some major new functional unit included (or >> cache).

jpiniero · Jan 29, 2022

Ajay said:
Oh, duh, yeah. Still, seems wrong - unless GH100 is physically smaller or has some major new functional unit included (or >> cache).

I'm sure they are doubling down on AI/ML performance. Problem is the MI250 is likely to be far faster in FP64.

gdansk · Jan 30, 2022

How can it be slightly less than 1000mm2 and monolithic?

CP5670 · Feb 26, 2022

NVIDIA's next-gen GeForce RTX 4090 could use up to an insane 850W+

NVIDIA's next-gen Ada Lovelace GPU uses a chunk of power in new rumors: GeForce RTX 4090 could use up to 850W of power... wow.

www.tweaktown.com

The latest rumor is that the top card will draw 850W. I wonder how that would even work with current PC case designs. Maybe the stock FE cooler will be an AIO.

Ajay · Feb 26, 2022

gdansk said:
How can it be slightly less than 1000mm2 and monolithic?

Fabs work with NV to push the reticle limits to the max. Yields are probably poor, but for GPUs that expensive, it does't matter as much.

Frenetic Pony · Feb 28, 2022

CP5670 said:
NVIDIA's next-gen GeForce RTX 4090 could use up to an insane 850W+

NVIDIA's next-gen Ada Lovelace GPU uses a chunk of power in new rumors: GeForce RTX 4090 could use up to 850W of power... wow.

www.tweaktown.com

The latest rumor is that the top card will draw 850W. I wonder how that would even work with current PC case designs. Maybe the stock FE cooler will be an AIO.

That's beyond even an OAM socket rating. How big would that AIO cooler have to be? 360mm might not be enough...

Tup3x · Feb 28, 2022

250W is about the max I consider reasonable, maybe 300W. Anything higher than that is just too much. It's not practical.

Borealis7 · Feb 28, 2022

Tup3x said:
250W is about the max I consider reasonable, maybe 300W. Anything higher than that is just too much. It's not practical.

Thats is what we, as consumers, were taught for the past 16-17 years by NV & AMD (remember the GTX8800 Ultra 370W?), but the truth is technology has advanced since then and PSUs are able to pull much more power these days and output it quite reliably over the power rails to whatever hungry GPU might be connected to them. Advances in cooling and fan technology allow to dissipate more heat away from the hardware and out of the case, and silicon technology can produce densely packed chips with billions of transistors that together amount to the 300-400W operating power.
Just as we'll have to adjust to the new price norms, we will need to adjust ourselves to accept more power hungry hardware, because the number of transistors per mm2 is not going to go down any time soon.

Panino Manino · Feb 28, 2022

Nvidia was hacked and the data stolen is out.
This should give us plenty of information about future architectures.

maddogmcgee · Feb 28, 2022

Borealis7 said:
Thats is what we, as consumers, were taught for the past 16-17 years by NV & AMD (remember the GTX8800 Ultra 370W?), but the truth is technology has advanced since then and PSUs are able to pull much more power these days and output it quite reliably over the power rails to whatever hungry GPU might be connected to them. Advances in cooling and fan technology allow to dissipate more heat away from the hardware and out of the case, and silicon technology can produce densely packed chips with billions of transistors that together amount to the 300-400W operating power.
Just as we'll have to adjust to the new price norms, we will need to adjust ourselves to accept more power hungry hardware, because the number of transistors per mm2 is not going to go down any time soon.

I mean at 400 watts and two hours a day use, it would be costing me $75 AUD a year in power just for gaming. Then throw in the rest of the system, psu inefficiency, and the fact the video card would likely still use more power on the desktop (where it would easily run for another 10 hours a day) and you are starting to get into a pretty high cost per year.....especially when you could turn down the resolution and use upscaling to get similar performance from a much cheaper card.

OscaAndShintjee · Mar 1, 2022

Panino Manino said:
Nvidia was hacked and the data stolen is out.
This should give us plenty of information about future architectures.

That's assuming that the stolen data is legitimate. The only details we've seen are threats to release drivers and firmware, evidence being likely fake leaked "code files" which tell us nothing we don't already know, written in a markup language that I can't even identify. This is all by people who can hardly put two sentences together.
Take what the jittery writers at TPU or TH write with salt, the evidence isn't there.

Edit: Looks like I was wrong in relevance to the leaked files being available to torrent-download.

DooKey · Mar 1, 2022

maddogmcgee said:
I mean at 400 watts and two hours a day use, it would be costing me $75 AUD a year in power just for gaming. Then throw in the rest of the system, psu inefficiency, and the fact the video card would likely still use more power on the desktop (where it would easily run for another 10 hours a day) and you are starting to get into a pretty high cost per year.....especially when you could turn down the resolution and use upscaling to get similar performance from a much cheaper card.

I would suggest that if a person is worried about the cost of electricity to game and use their computer that they shouldn't do either and concentrate on the true necessities of life.

Mopetar · Mar 1, 2022

Ajay said:
Fabs work with NV to push the reticle limits to the max. Yields are probably poor, but for GPUs that expensive, it does't matter as much.

Yield doesn't matter as much when you have something as massively parallel as a GPU like this. Even if a massive die like this has multiple defects they're most likely in areas that are highly redundant and those parts of the hardware can be fused off.

Even if the node isn't mature or just had an abnormally high defect rate, most dies could still be sold. Even the ones that don't have any defects still might just have the weakest performing units turned off.

Saylick · Mar 1, 2022

LOL, looks like Lovelace configurations might have leaked too:

NVIDIA GeForce RTX 40 "Ada" GPU architecture specs allegedly leaked, up to 144 Streaming Multiprocessors - VideoCardz.com

NVIDIA Ada GPU specs leaked? Following a cyberattack on NVIDIA servers, more and more confidential data leaks online. NVIDIA is not having a good time right now. The hacking group that has managed to infiltrate the company’s internal servers has begun leaking the confidential data. First, it was...

videocardz.com

If I remember correctly, that SM count for the top die has been leaked out by the usual Twitter suspects already.

https://twitter.com/x/status/1457931961796743168

Looks like at best we'll see a doubling in performance from AD102 over GA102, which is in line with all of the previous leaks. ~1.7x SM counts and some clock increases as well.

jpiniero · Mar 1, 2022

Saylick said:
Looks like at best we'll see a doubling in performance from AD102 over GA102, which is in line with all of the previous leaks. ~1.7x SM counts and some clock increases as well.

92 TF FP32 would be like 2.3-2.5x compute power. What's interesting is that the lower tier parts don't get that much of an SM increase so their performance increases won't be anywhere near as dramatic.

Saylick · Mar 1, 2022

jpiniero said:
92 TF FP32 would be like 2.3-2.5x compute power. What's interesting is that the lower tier parts don't get that much of an SM increase so their performance increases won't be anywhere near as dramatic.

Right, but as you're aware, performance won't scale linearly with TFLOPS. Rumormill says 2x performance increase which I think is realistic.
The clocks might get ramped up for the lower end parts to make up for the smaller increase in SM counts. Just a guess.

jpiniero · Mar 2, 2022

GeForce RTX 40 "Ada" GPUs to feature very large L2 Caches, NVIDIA's own "Infinity Cache"? - VideoCardz.com

NVIDIA Leaks: GeForce RTX 40 to have large L2 cache While the company is still trying to figure out how the hackers gained so much information, the data that was already posted provides significant information of NVIDIA’s next architecture codenamed “Ada”. NVIDIA RTX 40 GPUs will be equipped...

videocardz.com

They are giving a big increase in L2.

JoeRambo · Mar 2, 2022

Yay to cache wars. Jump to this gen will be as large as AMD got moving to "Infinity cache". Probably even more impact on perf and power than for AMD, cause NV is so shader / RT / Tensor heavy and these are hungry for mem bw.

Saylick · Mar 2, 2022

I guess Nvidia finally realized that it too has to add a big ol' block of LLC to keep up with bandwidth demands without scaling up the memory bus to ludicrous levels. Hindsight will tell us whether Nvidia's approach of using a large, traditional memory bus supplemented with a smaller LLC is better than going all out on cache with a smaller memory bus a la AMD's Infinity Cache. If in the generation following Lovelace we see Nvidia sticking with the same bus width or even reducing it, but adding even more cache, we'd know that AMD's approach won out.

Discussion Ada/'Lovelace'? Next gen Nvidia gaming architecture speculation

Diamond Member

Diamond Member

Lifer

Lifer

Golden Member

Lifer

Lifer

Lifer

Platinum Member

Diamond Member

Lifer

Senior member

Golden Member

Platinum Member

Senior member

Senior member

Junior Member

Golden Member

Diamond Member

Diamond Member

Lifer

Diamond Member

Lifer

Golden Member

Diamond Member