This is where I think you don't understand EPYC and the IO die. In Zen and Zen + the memory controllers were on the normal dies, like the ones we use in Ryzen.
That is gone. The IO die in EPYC contains all of the Memory controllers, PCI-E lanes and so on. This means AMD doesn't need all 8 dies or even the 4 (but for cooling reasons I expect only 4 and 8 Chiplet CPU's to exist) to get all 128 PCI-e Lanes and 8 memory channels.
I know how the memory architecture works.
The reason that the uncore of Naples takes up so much power is that it is 8 channels worth of DDR4 memory controller in operation.
Comparing (from
https://www.anandtech.com/show/13124/the-amd-threadripper-2990wx-and-2950x-review/4):
(At 32 threads load)
2950X ------- cores: 135W ---- uncore: 43W
2990WX----- cores: 130W ---- uncore: 62W
E7601 -------- cores: 61W------ uncore: 90W
Now, this gives us context for what is IF, and what is DDR controller driven. The extra dies on the 2990WX are hooked up via IF. So that 20W difference must be almost entirely due to IF.
Furthermore, the only difference between 2990WX and 7601 is that the 7601 has the extra DDR controllers and PCIe linked. Which is the additional 30W of power consumed.
So your looking at ~19W/DDR controller and ~4W /IF link.
2950X = 2x DDR + 1x IF = 42W
2990WX = 2x DDR + 6x IF = 62W
7601 = 4xDDR + 6xIF = 100W
Obviously correlation is not exact, the 7601 is 10% out. But it should still give you a good idea of where the power is actually being consumed, and its mostly in the DDR4 controllers, not the IF links.
So if that were to carry over to Zen2, then you'd still be spending 40W on the DDR/PCIe controllers within the I/O die, and deciding between spending 15W or 30W on the internal IF links between the 4 or 8 chiplets.
My point is - yes, 4 chiplets will use less power - but its not as much power as you might think and it definitely isn't the slam dunk solution you are assuming. Particularly when you consider the L3 cache reduction.