I actually think that the 512 bit confusion is because GB102 will be 512 bit, but that GB202 will still be 384 bit.
The rumor is that Nvidia is splitting up the gaming and pro-chips, a bit similar to how AMD has RDNA for gaming and CDNA for compute. Although AMD limits CDNA to their data center stuff, but GB10x would then be aimed more at everything from AI workstations to cheaper data center stuff.
So then GB10x would be MCM, perhaps with a separate IO-chip like AMD uses. It would also probably be fully GDDR7, going up to 64 GB for the GB102 clamshell cards. HBM would still only go into the highend B100/B200/etc MCM chips.
GB20x would then still be monolithic, but excising the professional stuff from the chips, that is on the chip, but mostly disabled. Of course, ideally Nvidia would make it so that these cards don't work that well for AI, so people would be forced to the more expensive professional cards for that.