strange CPU speedups with SMP on Athlon 64 X2
From: Nathan Becker (nbecker_at_physics.ucsb.edu)
Date: 08/30/05
- Previous message: Roman Zippel: "Re: [Linux-fbdev-devel] [PATCH 1/1 2.6.13] framebuffer: bit_putcs() optimization for 8x* fonts"
- Next in thread: Willy Tarreau: "Re: strange CPU speedups with SMP on Athlon 64 X2"
- Reply: Willy Tarreau: "Re: strange CPU speedups with SMP on Athlon 64 X2"
- Reply: Nick Piggin: "Re: strange CPU speedups with SMP on Athlon 64 X2"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Date: Tue, 30 Aug 2005 12:16:04 -0700 (PDT) To: linux-kernel@vger.kernel.org
Hi,
I'm having a strange problem when I benchmark some of my physics
simulation code on my new Athlon 64 X2 4800 machine. It occurs on all
current kernels that I have tested including 2.6.12.5 and 2.6.13.
If I run my benchmark single threaded, so that one of the two CPU cores is
just idling then the calculation goes pretty fast. But if I load both CPU
cores simultaneously but with INDEPENDENT calculations, then each
calculation runs about 12-15% faster than when running alone. I have
found this to be always reproducible. There is no disk access involved in
the calculation and RAM usage is fairly minimal so this is not caused by
caching. Also, if I compile the kernel to disable SMP then the machine
runs a single calculation at the same speed as when running alone when SMP
is enabled.
I am aware of the timing issues on these machines (especially since I
reported the bug http://bugzilla.kernel.org/show_bug.cgi?id=5105 ).
However, I double-checked my benchmark with a stop-watch, so this is
independent of something strange happening in the timer.
I also checked the cpufreq governor and according to the logs, my CPU is
holding steady at the maximum setting of 2.4GHz. I set the governor to
"performance" mode which should prevent unintended downclocking.
I would be happy to post my exact C source that I use to do the benchmark,
but I wanted to get some feedback first in case I'm just doing something
stupid. Also, since I'm not subscribed to this list, please cc me
directly regarding this topic.
Thanks very much,
Nathan
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
- Previous message: Roman Zippel: "Re: [Linux-fbdev-devel] [PATCH 1/1 2.6.13] framebuffer: bit_putcs() optimization for 8x* fonts"
- Next in thread: Willy Tarreau: "Re: strange CPU speedups with SMP on Athlon 64 X2"
- Reply: Willy Tarreau: "Re: strange CPU speedups with SMP on Athlon 64 X2"
- Reply: Nick Piggin: "Re: strange CPU speedups with SMP on Athlon 64 X2"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Relevant Pages
- Re: Alternative SZ IntToStrB&V v0.04
... Dennis, please do not write that I never wrote. ... on calculation for
one element deviation. ... > When we use the benchmark, we do one run on each of the
target PC's. ... That is reasonable argument against showing maximum spreading value. ...
(borland.public.delphi.language.basm) - Re: Fastcode IntToStr 0.15 B&V
... If the benchmark would relly test with a uniform distribution the small ...
> 20 times and calculates the spread. ... It uuses the same calculation of spread.
... Most important is that the sequence is long enough to ... (borland.public.delphi.language.basm) - [ANNOUNCE] Interbench v0.20 - Interactivity benchmark
... This benchmark application is designed to benchmark interactivity in Linux.
... whereas interactivity would allow you to play audio ... It is designed to emulate
the cpu scheduling behaviour of interactive tasks ... (Linux-Kernel) - Some initial postmark numbers from a dual-PIII+ATA, 4.x and 6.x
... I pulled down the postmark benchmark, and gave it a spin on a dual PIII ...
the "alone" measurements, associated with the setup and tear-down of the ... While I didn't
attempt to measure precise CPU usage as yet, ... (freebsd-performance) - [ANNOUNCE] kernbench-0.20
... I set out to make this benchmark a portable and easy to use script for anyone ...
It cleans and primes a kernel tree with a make defconfig. ... optimal load: make
-j ... Percent CPU 101 ... (Linux-Kernel)