The power efficiency drop has a lot to do with distances - ie trips between the CCD's and the IOD.
Stacking the CCD's on the IOD as an active interposer with TSV's could lessen the efficiency impact somewhat - even more so if they could get around the thermal problems with stacking multiple CCD's on the IOD, but that would require some radical physical architectural changes to stacking (ala ICEcool) before they can eliminate the thermal problem at the transistor/wire level with more advanced process nodes causing less thermal dissipating electron leakage ala spintronics for logic and data transmission (or photonics/plasmonics for data).
Zen 4 Epyc Genoa is almost certainly going to be stacked die. TSMC has a large number of different die stacking options available now or coming out in the next year or two. I have already posted this several times, but there is a quick overview here:
I am thinking Genoa may use the SoIC die stacking to stack multiple cpu die and/or cache chips. The SoIC tech is die stacking without micro-bumps. It has better thermal conductivity than the other technologies using micro-bumps. For super high core count Genoa processors, we expect the clock speed to be lower anyway, so they can keep the power consumption down for stacking and staying within package power limits. The stacking will reduce package area significantly so core count could be huge. They really need to go to chip stacking for Zen 4. Doubling the bandwidth again using pci-e 5 style based links will consume too much power. The stacked and/or interposer links could easily be 1024-bits wide, much higher bandwidth than even pci-e 5 links, and a lot less power.
We have no idea what AMD is going to do in the desktop market for Zen 4 and/or some kind of Zen 3 based AM5 part. I am kind of thinking that we get monolithic 8-core Zen 3 APUs for AM5 in mid 2021. It is unclear how they would do an AM5 based “Warhol” with Zen 3. With Zen 4 chiplets probably being stacked, they may not be able to just use a Zen 4 IO die (possibly interposer or other stacked tech) with a Zen 3 BGA cpu die. We may actually see AMD switch to having different die for Epyc and Ryzen. They could relatively easily make a monolithic 16 core cpu at 5 nm. It would probably still be two CCX on one die if such a thing exists. TSMC has some stacking tech is much cheaper than a full interposer, so we can’t really rule out chip stacking across the full product stack.
Intel seems like they have talked a lot about upcoming technologies while AMD has been almost silent. This makes Intel appear to be in the weaker position, trying to keep people interested in their upcoming products even though they will not be out for quite some time. Intel could make a comeback with a 7 nm, stacked die cpu, but it doesn’t seem like it will be in time. TSMC seems like they are ahead in stacking technology in addition to the actual process tech, although both companies have been making some types of stacked packages for a while. I don’t know if Intel has anything to compete with the SoIC bumpless stacked die. TSMC has good reason to strongly support AMD since they can grab a chunk of the x86 enterprise market through AMD that would have been fabbed by Intel otherwise. TSMC can get a share of the server market through ARM, but who knows how that is going to go with the Nvidia purchase. A lot of companies do not like working with Nvidia. I could see Apple dumping ARM for an open architecture very quickly if they wanted to. It probably isn’t that hard to switch between more RISC-like architectures.