What is sad is Intel has pretty much hit a IPC wall. If AMD can find a way to get 2011 Intel IPC at 4+GHz they are golden.
Again what is with this belief?
People need to reexamine conroe and the CPUs in the last 10 years.
IPC improvement is pretty much constant since core 2. There was no increase, there was no decrease in the rate. What did happen was that clockspeeds crept up only to reach a wall with SB; HT was also added.
You can find a variety of tests that show similar (take care to allow for integration of the memory controller in Nehalem - why clarkdale does poorly).
Look at the wolfdale tick for instance, its the same as IVB or BW at ~5%.
http://www.anandtech.com/show/2306/3
Intel then hit a bunch of trouble with 32 nm delays (like 14 nm). Nehalem dramatically improved in some areas but was a bit of a mixed bag.
The Core i7's general purpose performance is solid, you're looking at a 5 - 10% increase in general application performance at the same clock speeds as Penryn. Where Nehalem really succeeds however is in anything involving video encoding or 3D rendering, the performance gains there are easily in the 20 - 40% range. Part of the performance boost here is due to Hyper Threading, but the on-die memory controller and architectural tweaks are just as responsible for driving Intel's performance through the roof.
http://www.anandtech.com/show/2658/20
People think rates stalled because clockspeeds stopped going up (the addition of HT also increases the perception of slowing down). SB was solid as well but a lot of the performance increases are due to the significantly increased clockspeed.
IMO I feel that intel could be doing more for IPC (skylake really should have doubled caches across the board - the cache system is due for a rehaul) but their rate of core IPC improvements are pretty much constant without any underlying trend.
Edit:
I thought I read somewhere that Kaveri already matched Nehalem's IPC - and that is without an L3$. I could be remembering incorrectly, since I don't have any data in front of me. Someone has figured this out already, I just need to find the data again. Please feel free to correct me if I am wrong.
Not true at all. Just look at games for instance. A i7-920 generally does very well against a 8350. PD/Kaveri is around Conroe levels in legacy (disregarding newer instructions) code and conroe doesn't have an IMC. Not sure why people keep insisting on L3. Kaveri has the same amount of total cache as an i3 (without the burden of an inclusive cache system). Look at nehalem vs. Core 2. IPC was a mixed bag due to the greatly reduced L2 (if the much improved memory controller wasn't there things could be ugly in some applications).
You can also compare the A6-7400k vs. the A8-7800 (same clocks and everything). 1 MB L2 vs. 4 MB L2. Single thread performance hit is around 5% and that is a cache cut of 75%. Same with ST IPC between an i3, i7, and i7E with 4, 8, and 20 MB cache. L3 cache really doesn't help much (and AMD stated as much when they launched trinity/llano without L3).