Question Speculation: RDNA3 + CDNA2 Architectures Thread

Page 25 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

uzzi38

Platinum Member
Oct 16, 2019
2,702
6,405
146

Ajay

Lifer
Jan 8, 2001
16,094
8,106
136
I doubt N6 is cheaper than N7. You might get more wafers out of it. The density bump would make it cheaper per transistor all other things considered.
Apparently there are cost savings due to the number of steps (few masks). The density difference is pretty small, but the N7->N6 is supposedly a 'cheap' transition engineering wise, so that's a plus. TSMC, reportedly, was trying to push customers to N6 - that just wouldn't work out unless there was a net cost advantage in good dies per wafer.
 

maddie

Diamond Member
Jul 18, 2010
4,787
4,771
136
I doubt N6 is cheaper than N7. You might get more wafers out of it. The density bump would make it cheaper per transistor all other things considered.

Say you took Navi 21, shrank it to N6, cut the lanes down to 8 and the bus to 128 bit... you would think that would still be bigger than Navi 22. Maybe you could get close. Throw RDNA3 as well and it'd be tough for the die to not bloat. RDNA3 would have to be pretty die efficient to still be ~6900 XT performance.
Some rumors to reconcile. Not even considering N6 is at least equivalent to N7 and most probably cheaper in cost.

N33 has 128 bus
Higher clocks and "IPC" than N2x series
Higher perf/W than N2x series
Better RT performance

This leads to, for equal performance, smaller die, cheaper board and cooling.

How do you see a 6900XT as being cheaper to make?
 
Reactions: Tlh97 and Glo.

jpiniero

Lifer
Oct 1, 2010
14,831
5,444
136
How do you see a 6900XT as being cheaper to make?

Perhaps I should rephrase - people would absolutely buy a used 6900 XT for $500 rather than this. That's the fear.

I'm unconvinced AMD would spend the effort to do the entire RDNA3 IP on N6 if Navi 33 is the only product that will launch with it. AFAIK Phoenix is N5 monolithic.
 

maddie

Diamond Member
Jul 18, 2010
4,787
4,771
136
Perhaps I should rephrase - people would absolutely buy a used 6900 XT for $500 rather than this. That's the fear.

I'm unconvinced AMD would spend the effort to do the entire RDNA3 IP on N6 if Navi 33 is the only product that will launch with it. AFAIK Phoenix is N5 monolithic.
Why? 8G vs 16G, higher power, lower vs higher RT? What would decide the issue? What if N33 is around $500, then what?
You also appear to think no RDNA3 GPUs for mid-lower end range? Where is this coming from?

Your arguments also appear to be contradictory. N3x is high cost & price, and yet, we're expected to believe that used GPUs will be cheap. In most scenarios, used GPUs and new GPUs correlate in price.
 
Reactions: Tlh97

jpiniero

Lifer
Oct 1, 2010
14,831
5,444
136
Why? 8G vs 16G, higher power, lower vs higher RT? What would decide the issue? What if N33 is around $500, then what?
You also appear to think no RDNA3 GPUs for mid-lower end range? Where is this coming from?

Yep. No mid range and below. Maybe later I guess.

Your arguments also appear to be contradictory. N3x is high cost & price, and yet, we're expected to believe that used GPUs will be cheap. In most scenarios, used GPUs and new GPUs correlate in price.

In a theoretical mining collapse, the market would be flooded with used GPUs. AMD's been burnt on this (twice?). I don't think they want to get burnt again.
 

maddie

Diamond Member
Jul 18, 2010
4,787
4,771
136
Yep. No mid range and below. Maybe later I guess.



In a theoretical mining collapse, the market would be flooded with used GPUs. AMD's been burnt on this (twice?). I don't think they want to get burnt again.
Do you plan for a mining collapse and refuse to design whole classes of GPUs. This, in effect, is what you're saying. What if it doesn't happen, do you forgo the largest part of the market? Unrealistic, in my view.
 

beginner99

Diamond Member
Jun 2, 2009
5,223
1,598
136
In a theoretical mining collapse, the market would be flooded with used GPUs
This isn't theoretical at all. Ethereum will move to proof-of-stake soon, like June/July soon. Wonder why used market suddenly has a lot more GPUs for reasonable prices? Yeah, from miners, usually smaller ones. They bet on getting out earlier to get back some of the cards costs. In 2-3 months? used prices will be down by 50%.
 

maddie

Diamond Member
Jul 18, 2010
4,787
4,771
136
You rebrand Navi 2x instead.
N6 less than N7, production cost wise. You keep ignoring this. Simpler N3x cards also have less BOM than equivalent performance N2x cards. The amortization cost of new N3x designs on N6 is the factor, and if you do one, all the blocks, then the rest is a marginal cost. Where is the crossover point in total sales? I don't know.

You either do all or none on N6.
 

jpiniero

Lifer
Oct 1, 2010
14,831
5,444
136
You either do all or none on N6.

That's what I mean. Doing RDNA3 on N6 makes no sense. A shrink of RDNA2 to N6 with some gutting would make more sense (and that doesn't appear to be what they are doing). But either of these isn't going to work if mining collapses.
 

Glo.

Diamond Member
Apr 25, 2015
5,761
4,666
136
This isn't theoretical at all. Ethereum will move to proof-of-stake soon, like June/July soon. Wonder why used market suddenly has a lot more GPUs for reasonable prices? Yeah, from miners, usually smaller ones. They bet on getting out earlier to get back some of the cards costs. In 2-3 months? used prices will be down by 50%.
I believe quite some hefty portion of miners will move to Ergo, Ethereum Classic mining. Smaller miners which will not get immediate ROI on their investments will sell their GPUs.
 

maddie

Diamond Member
Jul 18, 2010
4,787
4,771
136
That's what I mean. Doing RDNA3 on N6 makes no sense. A shrink of RDNA2 to N6 with some gutting would make more sense (and that doesn't appear to be what they are doing). But either of these isn't going to work if mining collapses.
Well then, we will have to disagree until events prove one position wrong. If AMD operated in a sole supplier position, this could work, but they don't.
 
Reactions: Tlh97 and soresu

jpiniero

Lifer
Oct 1, 2010
14,831
5,444
136
Well then, we will have to disagree until events prove one position wrong. If AMD operated in a sole supplier position, this could work, but they don't.

nVidia has the same problem. Ada costs are going way up too. They are however better positioned to deal with a mining collapse because marketing and OEM deals.

And it's only possibly a problem if this thing isn't faster than the 6900 XT.
 

Frenetic Pony

Senior member
May 1, 2012
218
179
116
None of these rumors seem very logical. Denser nodes like N5 and chiplets are both there to shrink giant monolithic chips and so save money. A 520mm chip is quite big, exactly the sort of target you'd want for chiplets and denser nodes both. And how would this possibly be a mid range chip, none of this looks like the BOM would be much lower than a 6900xt, which is probably $300 or more. Unless AMD has suddenly decided to drop almost its entire profit margin this doesn't make a lot of sense.

Nor does the tiny bus. I still don't see how that works. Some sort of magical prefetch based on static code analysis into a giant prefetch cache is the only thing I can think of, but that would require giant bubbles of available bandwidth.

I get the N6 idea at least a little, which seems accurate based on the LinkdIn profile I guess? (Unless that was a plant, which sounds ludicrous but I'll not underestimate people with too much time on the internet). Either way the only possible use case for one chip being on N6 is if it's tiny enough to justify monolithic in the first place.

It'd make way more sense if AMD had managed to design a single GPU compute chiplet with like 40CUs on N5, and then linked more and more of them together (possible with a separate IO die) like they did with their CPU chiplets. You'd only have to design one GPU chiplet, and just like with CPUs they could scale that chiplet throughout the entire lineup by just linking more together.
 

maddie

Diamond Member
Jul 18, 2010
4,787
4,771
136
None of these rumors seem very logical. Denser nodes like N5 and chiplets are both there to shrink giant monolithic chips and so save money. A 520mm chip is quite big, exactly the sort of target you'd want for chiplets and denser nodes both. And how would this possibly be a mid range chip, none of this looks like the BOM would be much lower than a 6900xt, which is probably $300 or more. Unless AMD has suddenly decided to drop almost its entire profit margin this doesn't make a lot of sense.

Nor does the tiny bus. I still don't see how that works. Some sort of magical prefetch based on static code analysis into a giant prefetch cache is the only thing I can think of, but that would require giant bubbles of available bandwidth.

I get the N6 idea at least a little, which seems accurate based on the LinkdIn profile I guess? (Unless that was a plant, which sounds ludicrous but I'll not underestimate people with too much time on the internet). Either way the only possible use case for one chip being on N6 is if it's tiny enough to justify monolithic in the first place.

It'd make way more sense if AMD had managed to design a single GPU compute chiplet with like 40CUs on N5, and then linked more and more of them together (possible with a separate IO die) like they did with their CPU chiplets. You'd only have to design one GPU chiplet, and just like with CPUs they could scale that chiplet throughout the entire lineup by just linking more together.
I'm curious.What were you thinking when the rumors of a 256bit bus for N21 first arose?

As to lower costs, some possible BOM reductions in comparison to the 6900XT

Die itself = smaller die.
N6 vs N7
Less die area due to less memory controllers
Less shaders due to higher clocks and perf/clock

Card.
1/2 memory chips needed
lower capacity cooling system
less power circuitry on card
Simpler PCB (X8 PCie)

I can only imagine that the SoIC tech is more difficult/expensive than I assumed. CPUs are a lot easier to connect from an efficiency standpoint. Data movement is an order or more of magnitude greater. Check out the bandwidth of the internal connections in a GPU.
 
Reactions: Tlh97 and Kepler_L2

Frenetic Pony

Senior member
May 1, 2012
218
179
116
I'm curious.What were you thinking when the rumors of a 256bit bus for N21 first arose?

As to lower costs, some possible BOM reductions in comparison to the 6900XT

Die itself = smaller die.
N6 vs N7
Less die area due to less memory controllers
Less shaders due to higher clocks and perf/clock

Card.
1/2 memory chips needed
lower capacity cooling system
less power circuitry on card
Simpler PCB (X8 PCie)

I can only imagine that the SoIC tech is more difficult/expensive than I assumed. CPUs are a lot easier to connect from an efficiency standpoint. Data movement is an order or more of magnitude greater. Check out the bandwidth of the internal connections in a GPU.

I was thinking "Oh that's cool, they solved something". I get why the LLC works, you store your current frame buffers in there, that's the most commonly accessed data for graphics in a GPU. You need to read and write to that a ton. But you also need to read out a lot of your main memory data every frame. Developers don't fill those multiple gigs of memory up with nothing, and this is especially true of deferred rendering (which is most engines today). They need to read then copy out uncompressed g-buffers every frame, and while developers are solving this GPU makers can't assume they have.

LLC and cache structure was, and is, a known commodity. The bottleneck between memory and processor is well understood and has been an optimization target for everything for quite a while now. But there's not much left to do there, everything is compressed, everything is cached up to the eyeballs, standard pre-fetch has been around for decades now on the GPU. Unlike LLC, which was similar to something first shown off elsewhere, or delta compression, another topic that had papers out before anyone implemented it, there's no publicly known solution to how AMD could squeeze yet more out of their bus that I'm aware of. And all I can come up with is the assumption, possibly a dangerous one, that there's bubbles of bandwidth availability in those buses on all titles. Not just some, not just most, all titles, that could be used to prefetch assets such that the bubbles are smoothed out. That is literally the only physical way this could work, and it's a very overarching assumption.

The cost doesn't go down almost at all either. Your assumptions are wrong, the rumor directly points to this card being nigh as big as a 6900xt. The only real savings are less ram, and that doesn't matter. What matters is the point that it would cost less if this card were on N5 and a chiplet based card. There is no "well it's good enough" here, if AMD can just make more money in a really obvious way like putting the card on a better node for it then they are nigh guaranteed to do that.
 

maddie

Diamond Member
Jul 18, 2010
4,787
4,771
136
I was thinking "Oh that's cool, they solved something". I get why the LLC works, you store your current frame buffers in there, that's the most commonly accessed data for graphics in a GPU. You need to read and write to that a ton. But you also need to read out a lot of your main memory data every frame. Developers don't fill those multiple gigs of memory up with nothing, and this is especially true of deferred rendering (which is most engines today). They need to read then copy out uncompressed g-buffers every frame, and while developers are solving this GPU makers can't assume they have.

LLC and cache structure was, and is, a known commodity. The bottleneck between memory and processor is well understood and has been an optimization target for everything for quite a while now. But there's not much left to do there, everything is compressed, everything is cached up to the eyeballs, standard pre-fetch has been around for decades now on the GPU. Unlike LLC, which was similar to something first shown off elsewhere, or delta compression, another topic that had papers out before anyone implemented it, there's no publicly known solution to how AMD could squeeze yet more out of their bus that I'm aware of. And all I can come up with is the assumption, possibly a dangerous one, that there's bubbles of bandwidth availability in those buses on all titles. Not just some, not just most, all titles, that could be used to prefetch assets such that the bubbles are smoothed out. That is literally the only physical way this could work, and it's a very overarching assumption.

The cost doesn't go down almost at all either. Your assumptions are wrong, the rumor directly points to this card being nigh as big as a 6900xt. The only real savings are less ram, and that doesn't matter. What matters is the point that it would cost less if this card were on N5 and a chiplet based card. There is no "well it's good enough" here, if AMD can just make more money in a really obvious way like putting the card on a better node for it then they are nigh guaranteed to do that.
I asked what did you think before we knew it's performance on a 256 bit bus. Rumors of?

News to me. What rumors indicate N33 = 6900XT card size?

16 vs 8 GB doesn't save money?
Less power doesn't save money?
8X PCie doesn't save money?
Smaller die doesn't save money?

I can't write what I think.
 
Reactions: Tlh97

Timorous

Golden Member
Oct 27, 2008
1,727
3,152
136
That's what I mean. Doing RDNA3 on N6 makes no sense. A shrink of RDNA2 to N6 with some gutting would make more sense (and that doesn't appear to be what they are doing). But either of these isn't going to work if mining collapses.

Supply.

Splitting across nodes means more supply for the chip that will likely be the most popular desktop SKU and used in mobile.

If AMD made all RDNA3 on N5 then it would constrain supply due to Genoa and Ryzen being the better margin products.

Having mainstream N33 on N6 means there is less juggling so AMD can manufacture more units.
 

xpea

Senior member
Feb 14, 2014
447
142
116
Supply.

Splitting across nodes means more supply for the chip that will likely be the most popular desktop SKU and used in mobile.

If AMD made all RDNA3 on N5 then it would constrain supply due to Genoa and Ryzen being the better margin products.

Having mainstream N33 on N6 means there is less juggling so AMD can manufacture more units.
As a reminder, TSMC N7 and N6 use same equipment and same production lines.
N5 use different line that is shared with N4
 

Mopetar

Diamond Member
Jan 31, 2011
8,004
6,446
136
As a reminder, TSMC N7 and N6 use same equipment and same production lines.
N5 use different line that is shared with N4

N6 uses some EUV equipment I believe, but it's otherwise compatible with N7 designs even though it requires new masks. It has fewer production steps so there's greater throughput.

There's no reason to make anything new on N7 since you'd save money just moving to N6 with mask cost. However for anything that's close to the end of production it's probably cheaper just to keep it on N7.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |