L2 Cache Latency

 






Comment #1

Hello,

I read with great interest your article on L2 Cache Latency testing. I have two questions. First, do you think that the results might be somewhat different if the tested processor were say a PII or PIII instead of a Celeron? Is there not some significant differences in the construction and operation of L2 Cache between the PIIs and PIIIs as compared to the Celerons?

Second, I recently had use of a SOYO 6BA+III, which supposedly had L2 Latency Adjustability, however any time I changed this setting from the designated selection, CPUID indicated the L2 had been disabled and overall machine performance suffered as if it had. Do you know if there is a problem with SOYO's implementation of L2 Latency Adjustability or was something else going on? During my period of use of the SOYO board I had both a PII-333 running at 450 and a PIII-450 running at 558 and saw the same results with both processors. Memory although fleeting, seems to bring up something I had read once about SOYO having a problem in this area but I failed to be able to find any reference.

Thanks for your time and efforts,

Ronald Keller


Hello Ronald,

I think there will be differences between the L2 cache latency performances of the Celeron, the Pentium II and Pentium III. Because the L2 caches of the Pentium II and III are external and run slower than the Celeron, changes in the performance of the L2 cache is more likely to bring about a bigger effect on overall system performance.

If the L2 cache has really been disabled by your adjustment, you will be able to feel the drop in performance. A simple processor benchmark can verify if the L2 cache was really disabled or not. If the processor did not suffer any drastic drop in performance, you can be assured that the L2 cache has not been disabled.

Adrian Wong
Adrian's Rojak Pot
http://www.rojakpot.com/
http://www.adriansrojakpot.com/


Comment #2

Adrian:

May I suggest doing the L2 cache latency tests again.... this time with a different overclocked processor (how about the popular 366@550?) and different RAM (the popular 128MB PC133?) to see if the results are specific to that processor and RAM combination.

Alan R. Mege


Hello Alan,

Unfortunately, I do not have the resources to buy another processor and more RAM just for such tests. However, I believe the results are not processor-speed or RAM-grade specific. No matter how fast the processor run, the effect of different cache latencies should remain the same for the same processor as nothing fundamental has changed, merely its speed.

As for PC133 SDRAM, it will have virtually no effect at all if used with the Celeron 366 @ 550MHz. That's because the SDRAM will run synchronously with the system bus which is 100MHz for the Celeron 366 @ 550MHz. So, it will be like running standard PC100 SDRAM. Also, because we are testing only the performance differences between different L2 cache latencies, RAM performance won't affect that. Whatever performance boost from running the SDRAM at a higher speed will affect all the L2 cache latencies. So, while the test results may differ, the difference between the performance of the different L2 cache latencies will remain the same.

Adrian Wong
Adrian's Rojak Pot
http://www.rojakpot.com/
http://www.adriansrojakpot.com/


Comment #3

Hello,

What does the "default" L2 cache latency setting in the BIOS set the latency to, for Celerons (300A)?
Any help is much appreciated. Keep up the good job on your site :)

Thanks,

Arpad Sooky


As I mentioned in the Dissection section, the default L2 cache latency for the Celeron is 5.

Hope that helps you some! :)

Adrian Wong
Adrian's Rojak Pot
http://www.rojakpot.com/
http://www.adriansrojakpot.com/


Comment #4

You may want to try the tests on a P3 or P2, since the cache is actually separated from the processor core and that is where the real problem lies in overclockers. Celerons rarely have the same hassles as P3 or P2's. If you don't have the processor to do it, let me know, I have a P3 450 and a BE6, PC133 SDRAM, Quantum HD, Viper 770 32MB non-Ultra, SBlive! to do the same tests over for you.... just send me exact procedures you used to assure validity.

Rick Dumont
Former writer for The Tech-squad (but EverQuest killed my time)


Wow! Thanks for the offer of help, Rick! That would be great since I doubt I'll ever save up enough money to buy a Pentium II processor, much less a Pentium III.

Overall, the test procedures involve keeping all system variables equal except for the L2 cache latency. So, that means no changes to the system must happen during the entire benchmarking process. You will have to fully optimize your hard disk and then defragment all empty space. Then after each benchmark run, you will have to defragment the empty space on your hard disk again to ensure that the next test will have the same environment.

All in all, you will need to run all benchmarks three times for each and every L2 cache latency. Needless to say, that's going to take a long, long time! ;-) The benchmarks I ran were WinBench 99 1.1's CPUMark 99 and FPU WinMark 99 tests, 3D WinBench 99 1.1's 3D Transformation and Lighting and 3D Transformation tests and all 3DMark 99 Max tests. If possible, try to get benchmark results from CacheChk 6.0 as well. I ran all those benchmarks but I still have yet to evaluate the data from CacheChk 6.0.

I would recommend running one round of all the benchmarks before going to the second round. That's better than running the same benchmark three times before going to the next one because this method will ensure that the conditions will be equal for the tests. Hmm... I guess that's about it. Hope you will have some results for us soon! :)

Adrian Wong
Adrian's Rojak Pot
http://www.rojakpot.com/
http://www.adriansrojakpot.com/


Comment #5

Regarding your article on L2 cache latencies.. I think you've left out two important topics. Addressing these would make your benchmarks much more significant. Now that we know changing the L2 cache doesn't have a significant performance impact...

(1) Does altering the L2 cache latency really improve your ability to get a stable overclock? And if so, how much of a difference does it make?

(2) Since you tested with a Celeron (which has a different cache arrangement then the P2/P3) do the same results apply similarly to the P2/P3?

Overall I was glad that someone wrote concerning this topic --- it had been something I was curious about. I have a P3-450 that overclocks really well, but only with my L2 cache latency tweaked up. I was curious how this was affecting my performance, and now I know. THAT IS, assuming your findings carry over to the P2/P3 with its large half-speed cache :)

Thanks,

Darren Shepard


Hello Darren,

It's hard to say if changing the L2 cache latency will improve your ability to get a stable overclock. Theoretically, it's certainly possible but just like you saw in this article, theory isn't everything it's made out to be. There's no way to find out without more tests which would need to involve multiple "stubborn" processors which I, most unfortunately, do not have. :(

Again, I believe the results will be different for the Pentium II and III. I suspect the performance differences with those two processors will be much more significant that it is with the Celeron.

Adrian Wong
Adrian's Rojak Pot
http://www.rojakpot.com/
http://www.adriansrojakpot.com/


Comment #6

Hello,

I have read your article on L2 cache latency. The results are very strange to my opinion. I don't doubt them however! Maybe this is the solution for this problem: at lower latency settings more errors occur. These error however are corrected by the ECC-codes. This way we are not aware of them. But we get a speed-penalty if we have a ECC-correction. I don't know of this all is true.... but it is an interesting thought.

Roel Vliegen
PS English is not my first language... 8)


Hello Roel,

That's certainly a very interesting thought there! I didn't even think of it. Thanks for giving me that knock on the head. I can certainly visualise an update in the future which will include test results for the L2 cache latencies with ECC correction disabled. Then we will know if ECC correction really affects the performance of the various L2 cache latencies.

Adrian Wong
Adrian's Rojak Pot
http://www.rojakpot.com/
http://www.adriansrojakpot.com/


Comment #7

Hi,

Just a question regarding the L-2 latency timing. Do you reach the same results when running your CPU at default clockspeed?

Regards,

Philip Leenen


Hello Philip,

I have not tested the L2 cache latencies at the default clockspeed because the benchmarking of 15 different L2 cache latencies using 5 different tests (each with 3 runs) requires an extraordinarily long amount of time. However, I'm confident that the performance differences between the various L2 cache latencies will be the same, irrespective of the processor clockspeed.

Adrian Wong
Adrian's Rojak Pot
http://www.rojakpot.com/
http://www.adriansrojakpot.com/


Comment #8

Good article, great idea. Just a few comments, when you're talking about 0.2% differences in a comprehensive benchmark like 3D Mark and Winbench, you really should be doing at least 3 runs and average the results. Heck, I run Winbench multiple times with the same system same latency and I get different results every time.

Other than that, I think you should explain what exactly it means to have a latency of "14" or "2". Also, I get the feeling that the latency will be different for everyone. What you would probably want is latency that corresponds to the CPU clock cycles, so that they are in sync. I suspect that "13" is the worst and just one more, "14" is the best, is because 14 is exactly sync'd whereas 13 is off by one and thus the CPU has to actually wait "26" before it gets to read from the cache. Anyways.. good article, hope you can make more improvements in the future too!

Bryant Fong


Hello Bryant,

As per standard benchmarking procedure, I ran all test three times and the results were averaged before being presented in this article. I also checked all benchmark results to ensure that they were consistent. So, I'm quite sure the results are as accurate as they can get.

To be frank, I, too, suspected that it was a synchronization issue as I mentioned in my Conclusion. But as another reader has pointed out, there may yet be another answer for the benchmark results - ECC Error Correction.

Adrian Wong
Adrian's Rojak Pot
http://www.rojakpot.com/
http://www.adriansrojakpot.com/


Comment #9

Hi Adrian,

Hey, thanks for running those L2 latency tests, I'm sure it was something everyone was wondering about. However, I think it would've been useful if a bit of the scientific method were introduced into the tests regarding the benchmark programs used.

If you simply ran 3DMark 15 times at the SAME L2 setting, I have a suspicion that you would find a variance of around 1.7%. This would imply that the benchmarks aren't even statistically significant! But then that would mean that L2 cache settings don't matter at all, which seems counterintuitive, doesn't it?

Mark Kundinger


Hello Mark,

I don't really think 3DMark 99 Max is a particularly consistent benchmark. But it's useful as an alternative benchmark which we can all use to verify results from other benchmarks. To minimize the variance, I have already benchmarked each cache latency using 3DMark 99 (and other benchmarks too) three times and averaged the results before using them in this article. So, the variance would be considerably less than the 1.7% you mentioned.

Adrian Wong
Adrian's Rojak Pot
http://www.rojakpot.com/
http://www.adriansrojakpot.com/


Comment #10

Hello!

I liked the article; what it reminded me of was the bad old days where one had to worry about HD interleave (pre386). I have an Athlon 500 with SD11 with no BIOS adjustment for L2 cache latency. I'm looking into overclocking cards and will see if I can get one that will allow me to set this parameter. Thanks again for writing this article!

Fred Bosick


No problem at all, Fred! :)

Adrian Wong
Adrian's Rojak Pot
http://www.rojakpot.com/
http://www.adriansrojakpot.com/

 

Comments?

If you have a comment or two about this article, please post them here.

Thanks for your time and I hope you enjoyed the article! :)

 

 
 

 

 
     
   

 

 
   

 
     
 

                   

 
   

 

 
 
Last Updated 20-02-2001

All trademarks used are properties of their respective owners.
Copyright © 1998-2000 Adrian Wong. All rights reserved.

 
Visit the new Tech ARP @ http://www.techarp.com/ !