You may want to use Intel's newest compile of linpack or else you just aren't stressing the core anywhere near the maximum.
http://software.intel.com/en-us/articles/intel-math-kernel-library-linpack-download
I tested again with newest Intel Math Kernal Library Linpack (w_lpk_p_11.0.5.009) and I get:
Ambient Temp: 72F
Maximum Temps:
Core #0: 91C
Core #1: 100C
Core #2: 100C
Core #2: 92C
Package: 101C
3570k@4.6ghz@1.296v|Thermalright True Spirit 140
4x8gB Crucial Ballistix Sport LP 1.3v@2000mhz 9-9-9-24-2N@1.635v
I would guess that Haswell will get much hotter in this since it has single clock FMA + exactly the bandwidth to take advantage of it, along with the widening of the front end and various other increases in resources.
Code:
Intel(R) Optimized LINPACK Benchmark data
Current date/time: Fri Jul 05 01:53:34 2013
CPU frequency: 4.599 GHz
Number of CPUs: 1
Number of cores: 4
Number of threads: 4
Parameters are set to:
Number of tests: 12
Number of equations to solve (problem size) : 1000 2000 3000 4000 5000 10000 15000 20000 25000 30000 35000 40000
Leading dimension of array : 1000 2000 3000 4000 5000 10000 15000 20000 25000 30000 35000 40000
Number of trials to run : 4 4 4 4 4 2 2 2 2 1 1 1
Data alignment value (in Kbytes) : 4 4 4 4 4 4 4 4 4 4 4 4
Maximum memory requested that can be used=4210869504, at the size=40000
=================== Timing linear equation system solver ===================
Size LDA Align. Time(s) GFlops Residual Residual(norm) Check
1000 1000 4 0.008 80.1420 1.029343e-012 3.510325e-002 pass
1000 1000 4 0.009 78.4243 1.029343e-012 3.510325e-002 pass
1000 1000 4 0.010 66.1790 1.029343e-012 3.510325e-002 pass
1000 1000 4 0.008 85.3792 1.029343e-012 3.510325e-002 pass
2000 2000 4 0.068 78.6838 4.298950e-012 3.739560e-002 pass
2000 2000 4 0.069 76.9367 4.298950e-012 3.739560e-002 pass
2000 2000 4 0.067 79.9057 4.298950e-012 3.739560e-002 pass
2000 2000 4 0.071 75.7190 4.298950e-012 3.739560e-002 pass
3000 3000 4 0.213 84.5168 8.755385e-012 3.371489e-002 pass
3000 3000 4 0.223 80.6831 8.755385e-012 3.371489e-002 pass
3000 3000 4 0.220 82.0125 8.755385e-012 3.371489e-002 pass
3000 3000 4 0.223 80.7084 8.755385e-012 3.371489e-002 pass
4000 4000 4 0.407 104.9645 1.896949e-011 4.134580e-002 pass
4000 4000 4 0.406 105.2312 1.896949e-011 4.134580e-002 pass
4000 4000 4 0.428 99.8058 1.896949e-011 4.134580e-002 pass
4000 4000 4 0.409 104.3529 1.896949e-011 4.134580e-002 pass
5000 5000 4 0.787 105.9639 2.581643e-011 3.599893e-002 pass
5000 5000 4 0.767 108.6443 2.581643e-011 3.599893e-002 pass
5000 5000 4 0.768 108.6290 2.581643e-011 3.599893e-002 pass
5000 5000 4 0.812 102.6540 2.581643e-011 3.599893e-002 pass
10000 10000 4 5.899 113.0406 9.603002e-011 3.386116e-002 pass
10000 10000 4 5.706 116.8768 9.603002e-011 3.386116e-002 pass
15000 15000 4 18.656 120.6298 2.042799e-010 3.217442e-002 pass
15000 15000 4 18.671 120.5335 2.042799e-010 3.217442e-002 pass
20000 20000 4 43.234 123.3770 4.097986e-010 3.627616e-002 pass
20000 20000 4 43.113 123.7243 4.097986e-010 3.627616e-002 pass
25000 25000 4 83.771 124.3612 6.089565e-010 3.462917e-002 pass
25000 25000 4 84.781 122.8808 6.089565e-010 3.462917e-002 pass
30000 30000 4 143.948 125.0578 8.421348e-010 3.319704e-002 pass
Your flops are crazy low!
Intel(R) Optimized LINPACK Benchmark data
Current date/time: Mon Jun 24 19:14:37 2013
CPU frequency: 4.597 GHz
Number of CPUs: 1
Number of cores: 4
Number of threads: 4
Parameters are set to:
Number of tests: 9
Number of equations to solve (problem size) : 1000 2000 3000 4000 5000 10000 15000 20000 25000
Leading dimension of array : 1000 2000 3000 4000 5000 10000 15000 20000 25000
Number of trials to run : 4 4 4 4 4 2 2 2 2
Data alignment value (in Kbytes) : 4 4 4 4 4 4 4 4 4
Maximum memory requested that can be used=705536800, at the size=25000
=================== Timing linear equation system solver ===================
Size LDA Align. Time(s) GFlops Residual Residual(norm) Check
1000 1000 4 0.007 100.0239 1.002198e-012 3.417754e-002 pass
1000 1000 4 0.007 98.9791 1.002198e-012 3.417754e-002 pass
1000 1000 4 0.007 94.5137 1.002198e-012 3.417754e-002 pass
1000 1000 4 0.007 101.0553 1.002198e-012 3.417754e-002 pass
2000 2000 4 0.035 152.2219 4.327261e-012 3.764187e-002 pass
2000 2000 4 0.036 146.6798 4.327261e-012 3.764187e-002 pass
2000 2000 4 0.036 146.4302 4.327261e-012 3.764187e-002 pass
2000 2000 4 0.037 143.0490 4.327261e-012 3.764187e-002 pass
3000 3000 4 0.114 157.6002 9.653528e-012 3.717342e-002 pass
3000 3000 4 0.112 160.2734 9.653528e-012 3.717342e-002 pass
3000 3000 4 0.108 166.9275 9.653528e-012 3.717342e-002 pass
3000 3000 4 0.110 163.3613 9.653528e-012 3.717342e-002 pass
4000 4000 4 0.238 179.5631 1.565487e-011 3.412126e-002 pass
4000 4000 4 0.246 173.8406 1.565487e-011 3.412126e-002 pass
4000 4000 4 0.237 179.9607 1.565487e-011 3.412126e-002 pass
4000 4000 4 0.237 180.3108 1.565487e-011 3.412126e-002 pass
5000 5000 4 0.450 185.1852 2.512907e-011 3.504046e-002 pass
5000 5000 4 0.451 184.7307 2.512907e-011 3.504046e-002 pass
5000 5000 4 0.449 185.6883 2.512907e-011 3.504046e-002 pass
5000 5000 4 0.451 185.0557 2.512907e-011 3.504046e-002 pass
10000 10000 4 3.358 198.6099 1.012627e-010 3.570624e-002 pass
10000 10000 4 3.278 203.4404 1.012627e-010 3.570624e-002 pass
15000 15000 4 12.120 185.6833 1.979849e-010 3.118294e-002 pass
15000 15000 4 12.153 185.1834 1.979849e-010 3.118294e-002 pass
20000 20000 4 24.397 218.6350 3.564028e-010 3.154945e-002 pass
20000 20000 4 24.390 218.6981 3.564028e-010 3.154945e-002 pass
25000 25000 4 48.645 214.1629 7.503647e-009 4.267055e-001 pass
25000 25000 4 48.662 214.0885 6.397731e-010 3.638160e-002 pass
Performance Summary (GFlops)
Size LDA Align. Average Maximal
1000 1000 4 98.6430 101.0553
2000 2000 4 147.0953 152.2219
3000 3000 4 162.0406 166.9275
4000 4000 4 178.4188 180.3108
5000 5000 4 185.1650 185.6883
10000 10000 4 201.0252 203.4404
15000 15000 4 185.4334 185.6833
20000 20000 4 218.6666 218.6981
25000 25000 4 214.1257 214.1629
Residual checks PASSED
End of tests
Mon 06/24/2013
07:19 PM