Re: strange CPU speedups with SMP on Athlon 64 X2

From: Nick Piggin (nickpiggin_at_yahoo.com.au)
Date: 08/31/05

  • Next message: Juli: "Re[5]:How do you do"
    Date:	Wed, 31 Aug 2005 21:59:40 +1000
    To: Nathan Becker <nbecker@physics.ucsb.edu>
    
    

    Nathan Becker wrote:
    >
    > I would be happy to post my exact C source that I use to do the
    > benchmark, but I wanted to get some feedback first in case I'm just
    > doing something stupid. Also, since I'm not subscribed to this list,
    > please cc me directly regarding this topic.
    >

    Hi Nathan,

    Cache issues may explain this. When 2 processes are allocating
    memory in parallel they'll be given different interleavings of
    pages which could explain the speedup.

    Start one process, get it to memset all its memory, then pause
    it and do the same thing. Then set them both running at the same
    time (ie. after they've each touched their memory in turn), what
    do you see?

    (By memset()ing the memory, you'll cause the kernel to actually
    allocate a physical page. By doing that one after the other, we
    hope to eliminate interleaving issues.)

    -- 
    SUSE Labs, Novell Inc.
    Send instant messages to your online friends http://au.messenger.yahoo.com 
    -
    To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
    the body of a message to majordomo@vger.kernel.org
    More majordomo info at  http://vger.kernel.org/majordomo-info.html
    Please read the FAQ at  http://www.tux.org/lkml/
    

  • Next message: Juli: "Re[5]:How do you do"

    Relevant Pages

    • Re: [Lhms-devel] [PATCH 0/7] Fragmentation Avoidance V19
      ... the kernel spends nearly 1/3rd of its time in the page allocator when ... > It seems inevitable that we need both physically contiguous memory ... Send instant messages to your online friends http://au.messenger.yahoo.com ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)
    • Re: [rfc] lockless pagecache
      ... However I think for Oracle and others that use shared memory like ... speculative get_page may be useful for more code than just pagecache ... Send instant messages to your online friends http://au.messenger.yahoo.com ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)
    • Re: Avoiding external fragmentation with a placement policy Version 12
      ... memory block on a loaded machine. ... It does make MAX_ORDER allocations _possible_ when previously ... Send instant messages to your online friends http://au.messenger.yahoo.com ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)
    • Re: [RFC] x86-64: Use SSE for copy_page and clear_page
      ... Then in that case you have doubled your memory bandwidth ... Clearly some things benefit from not going through the cache. ... Send instant messages to your online friends http://au.messenger.yahoo.com ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)
    • Re: 2.4.23aa2 (bugfixes and important VM improvements for the high end)
      ... > triggering the zone-normal shortage in 32G, ... Also bear in mind that as memory gets tight, ... Without shared pagetables, ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)