CPU gurus? Enlighten me as to optimal CPU cache associativity.

yllus

Elite Member & Lifer
Aug 20, 2000
20,577
432
126
Okay, been doing some extremely light reading on CPU cache properties recently with all the new articles on the design of the X-Box and other consoles out. I'm not mistaken, CPUs almost universally lean towards using the set associatively mapped caching system. Now, in Anand's Hardware Behind the Consoles - Part I: Microsoft's Xbox, he writes:

"The CPU that powers the Xbox is a Coppermine based Pentium III with only 128KB L2 cache. While this would make many think that the processor is indeed a Celeron, one of the key performance factors of the Pentium III that is lost in the Celeron core was left intact for this core. The Coppermine core was left with an 8-way set associative L2 cache instead of the 4-way set associative cache of the Celeron. Based on what we've seen with the Coppermine and Coppermine128 (Celeron) cores we estimate that the 8-way set associative L2 cache gives this particular core a 10% performance advantage over the Coppermine128 core of the Celeron."

I can't help but wonder what that could imply for my pitiful Celeron 466. I had always figured 4-way associativity was the optimal setup for a mere 128KB of L2 cache - but this X-Box neutering has put doubts in that. Is there any other significant difference between the X-Box's Cumine CPU and the Celeron128 I've got that would make 4-way better to use than 8-way?

I also vaguely remember an option in software (in wCPUid, I think) that allowed a user to edit the block associativity of his CPU, am I just imagining that or is that possible? Hell, if it's in my best interests to go to 8-way associative mode and actually possible to do, I'm there.

Thanks in advance.
 

kylef

Golden Member
Jan 25, 2000
1,430
0
0


<< Is there any other significant difference between the X-Box's Cumine CPU and the Celeron128 I've got that would make 4-way better to use than 8-way? >>



Doubtful. Aside from the extremely rare (some say mythical) beast known as an "anti-conflict miss", MORE associativity is always a better idea IF cache latency is not a concern. The reason for this is simple: lower associativity causes more conflict misses.

In the real world, however, CPU designers have to trade-off the higher hit rates of large associativities and the better speed (and smaller transistor count) of direct-mapped and 1 or 2-way set-associative caches. Generally, they have used the former for L2 caches (which generally have pretty poor hit rates when taken alone but have more latency to play with -> ideal for a higher set associativity) and the latter for L1 caches (who get high hit rates but must be very fast, to return the hit within one or two clock cycles).

More probably in your case, Intel needed to differentiate the Celeron product line from the PIII. And, it is probable that the die size of the 4-way set associative cache is smaller, saving some $$ by improving the die yields.

The AMD Thunderbird, which seems to have a speed advantage to the Coppermine, uses a 16-way set associative L2 cache instead of an 8-way.
 

MustPost

Golden Member
May 30, 2001
1,923
0
0
<<More probably in your case, Intel needed to differentiate the Celeron product line from the PIII. And, it is probable that the die size of the 4-way set associative cache is smaller, saving some $$ by improving the die yields.>>

From what ive read they just use a laser to disable half of the L2 cache on a P3 to make a Celeron. This reduces the 8-way set to 4-way.

Doesn't the X-box use a pretty much normal mobile P3.
 

JDJ

Member
Dec 10, 2001
28
0
0


<< From what ive read they just use a laser to disable half of the L2 cache on a P3 to make a Celeron. This reduces the 8-way set to 4-way. >>



I had also heard that in some cases there were P3 with crippled L2 cache, ie a portion of the 256k cache didnt work. To save $$ they took a laser to half the L2 as mentioned above and were able to 'recycle' chips that would have gone to waste.

- John
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |