The Ampere uarch isn't inefficient at all, the problem is desktop GPUs are pushed to the absolute limit of freq/voltage scaling and GDDR6X is a disaster.
I never considered Ampere inefficient, but RDNA2 should be more efficient thanks to IF and better process and no GDDR6x.
GDDR6x is not very good, true, but RTX 3060, Ti and 3070 don't have It and mobile variants are not so much lower depending on TGP.
RTX 3060 mobile
TGP specs:
60-65W RTX 3060 -> 1382Mhz(Boost)
80-85W RTX 3060 -> 1525Mhz(Boost)
80-95W RTX 3060 -> 1525Mhz(Boost)
90-95W RTX 3060 -> 1630Mhz(Boost)
115-130W RTX 3060 -> 1802Mhz(Boost)
170W RTX 3060 desktop -> 1780Mhz(boost)
Official data
So the 95W mobile version has only 8.5% lower boost than the desktop model, that's not a big difference.
The weird thing is that from rumors, N23 should have lower TBP(135-150W) than RTX 3060(170W) and be clocked at ~2.4-2.5GHz.
Now RX 6600M is a cut-down 28CU N23 with 2177Mhz boost, which would mean It's ~9-13% lower than the not yet released desktop part.
Why would cutting down 35-50W mean bigger relative decrease in clockspeed for RDNA2 than cutting 75W from RTX 3060? This doesn't make any sense to me.
P.S. Let's not forget TGP(GPU+MEMORY) on a desktop card is lower than TDP(NVIDIA) or TBP(AMD).