Now how is amd going to improve on that performance of the distributed l3?
Its a disadvantage for them and shows the backside of the design.
A way to look at it is simply. They cant. But so what. Where that matters its a lost market.
Imo the best strategy forward is to entrench and build on your strenght. Not try to make the ocean boil.
7nm high perf with just tuned core and uncore and they are in a very good position. It looks balanced as it is and can do. No need to invent a new bd.
They have PLENTY of options in fact:
1) Continue playing into strong L2 performance by increasing it to 1MB per core. Obvious path with time proven gains, reduced misses, less L3 traffic. ( just keep those morons who designed previous AMD caches away please )
2) Increase L3 per CCX to 12 or 16MB to help keep traffic local to CCX.
4) Keep pumping clock to interconnects and keep rising memory clocks / lowering latency.
3) Increase core count per CCX to 6 ( or one can dream - to 8 ).