Full SSE3 implementation
* Improved hardware data prefetch mechanism
* Increased number of writing combine buffers (D0 stepping A64's can now combine up to four non-cacheable streams compared to 2 on the C0 and CG stepping A64's)
* Improved on-die memory controller with more advanced open page policy
* On-die thermal throttling
* Black Diamond Low-K technology (slower less power hungry transistors in less used sections and faster and more power hungry transistors in frequently used sections of the cpu)