[RFC] buddy allocator withou bitmap(2) [3/3]

From: Hiroyuki KAMEZAWA (kamezawa.hiroyu_at_jp.fujitsu.com)
Date: 08/31/04

  • Next message: Oliver Kiddle: "kernel panic (probably XFS related)"
    Date:	Tue, 31 Aug 2004 19:47:27 +0900
    To: Linux Kernel ML <linux-kernel@vger.kernel.org>
    
    

    This is the last.
    Implements __free_pages_bulk() stuff.

    -- Kame

    ------------------------------------------------

    This patch removes bitmap operation from free_pages()

    Instead of using bitmap, this patch records page's order in
    page struct itself, page->private field.

    "Does a page's buddy page exist or not ?" is checked by following.
    ------------------------
    if ((address of buddy is smaller than that of page) &&
        (page->flags & PG_buddyend))
        this page has no buddy in this order.
    ------------------------

    ---
     linux-2.6.9-rc1-mm1-k-kamezawa/mm/page_alloc.c |   87 +++++++++++++++++--------
     1 files changed, 60 insertions(+), 27 deletions(-)
    diff -puN mm/page_alloc.c~eliminate-bitmap-free mm/page_alloc.c
    --- linux-2.6.9-rc1-mm1-k/mm/page_alloc.c~eliminate-bitmap-free	2004-08-31 18:37:21.428480424 +0900
    +++ linux-2.6.9-rc1-mm1-k-kamezawa/mm/page_alloc.c	2004-08-31 18:37:21.434479512 +0900
    @@ -157,6 +157,27 @@ static void destroy_compound_page(struct
     #endif		/* CONFIG_HUGETLB_PAGE */
     /*
    + * This function checks whether a page is free && is the buddy
    + * we can do coalesce if
    + * (a) the buddy is free and
    + * (b) the buddy is on the buddy system
    + * (c) the buddy has the same order.
    + * for recording page's order, we use private field and PG_private.
    + *
    + * Because page_count(page) == 0, and zone->lock is aquired.
    + * Atomic page->flags operation is needless here.
    + */
    +static inline int page_is_buddy(struct page *page, int order)
    +{
    +	if (PagePrivate(page) &&
    +	    (page_order(page) == order) &&
    +	    !(page->flags & (1 << PG_reserved)) &&
    +            page_count(page) == 0)
    +		return 1;
    +	return 0;
    +}
    +
    +/*
      * Freeing function for a buddy system allocator.
      *
      * The concept of a buddy system is to maintain direct-mapped table
    @@ -168,9 +189,12 @@ static void destroy_compound_page(struct
      * at the bottom level available, and propagating the changes upward
      * as necessary, plus some accounting needed to play nicely with other
      * parts of the VM system.
    - * At each level, we keep one bit for each pair of blocks, which
    - * is set to 1 iff only one of the pair is allocated.  So when we
    - * are allocating or freeing one, we can derive the state of the
    + *
    + * At each level, we keep a list of pages, which are head of chunk of
    + * pages at the level. A page, which is a head of chunks, has its order
    + * in page structure itself and PG_private flag is set. we can get an
    + * order of a page by calling  page_order().
    + * So we are allocating or freeing one, we can derive the state of the
      * other.  That is, if we allocate a small block, and both were
      * free, the remainder of the region must be split into blocks.
      * If a block is freed, and its buddy is also free, then this
    @@ -180,42 +204,53 @@ static void destroy_compound_page(struct
      */
     static inline void __free_pages_bulk (struct page *page, struct page *base,
    -		struct zone *zone, struct free_area *area, unsigned int order)
    +		struct zone *zone, unsigned int order)
     {
    -	unsigned long page_idx, index, mask;
    -
    +	unsigned long page_idx, mask;
    +	struct page *coalesced_page;
    +	
     	if (order)
     		destroy_compound_page(page, order);
    +
     	mask = (~0UL) << order;
     	page_idx = page - base;
     	if (page_idx & ~mask)
     		BUG();
    -	index = page_idx >> (1 + order);
    -
     	zone->free_pages += 1 << order;
    -	while (order < MAX_ORDER-1) {
    -		struct page *buddy1, *buddy2;
    +	BUG_ON(bad_range(zone,page));
    -		BUG_ON(area >= zone->free_area + MAX_ORDER);
    -		if (!__test_and_change_bit(index, area->map))
    +	while (order < MAX_ORDER-1) {
    +		struct page *buddy;
    +		int buddy_idx;
    +		
    +		buddy_idx = (page_idx ^ (1 << order));
    +		
    +		if ((buddy_idx < page_idx) &&
    +		    PageBuddyend(base + page_idx))
    +			/*
    +			 * this page is lower end of mem_map
    +			 * there is no valid buddy.
    +			 */
    +			break;
    +		
    +		buddy = base + buddy_idx;
    +		if (!page_is_buddy(buddy, order))
     			/*
     			 * the buddy page is still allocated.
     			 */
     			break;
    -
    -		/* Move the buddy up one level. */
    -		buddy1 = base + (page_idx ^ (1 << order));
    -		buddy2 = base + page_idx;
    -		BUG_ON(bad_range(zone, buddy1));
    -		BUG_ON(bad_range(zone, buddy2));
    -		list_del(&buddy1->lru);
    -		mask <<= 1;
     		order++;
    -		area++;
    -		index >>= 1;
    +		mask <<= 1;
     		page_idx &= mask;
    -	}
    -	list_add(&(base + page_idx)->lru, &area->free_list);
    +		list_del(&buddy->lru);
    +		/* for propriety of PG_private bit, we clear it */
    +		ClearPagePrivate(buddy);
    +	}
    +	/* record the final order of the page */
    +	coalesced_page = base + page_idx;
    +	SetPagePrivate(coalesced_page);
    +	set_page_order(coalesced_page,order);
    +	list_add(&coalesced_page->lru, &zone->free_area[order].free_list);
     }
     static inline void free_pages_check(const char *function, struct page *page)
    @@ -253,12 +288,10 @@ free_pages_bulk(struct zone *zone, int c
     		struct list_head *list, unsigned int order)
     {
     	unsigned long flags;
    -	struct free_area *area;
     	struct page *base, *page = NULL;
     	int ret = 0;
     	base = zone->zone_mem_map;
    -	area = zone->free_area + order;
     	spin_lock_irqsave(&zone->lock, flags);
     	zone->all_unreclaimable = 0;
     	zone->pages_scanned = 0;
    @@ -266,7 +299,7 @@ free_pages_bulk(struct zone *zone, int c
     		page = list_entry(list->prev, struct page, lru);
     		/* have to delete it as __free_pages_bulk list manipulates */
     		list_del(&page->lru);
    -		__free_pages_bulk(page, base, zone, area, order);
    +		__free_pages_bulk(page, base, zone, order);
     		ret++;
     	}
     	spin_unlock_irqrestore(&zone->lock, flags);
    _
    -
    To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
    the body of a message to majordomo@vger.kernel.org
    More majordomo info at  http://vger.kernel.org/majordomo-info.html
    Please read the FAQ at  http://www.tux.org/lkml/
    

  • Next message: Oliver Kiddle: "kernel panic (probably XFS related)"

    Relevant Pages

    • [PATCH] no buddy bitmap patch : alloc/free [1/2]
      ... This buddy system uses page->private field to record free page's order ... Freeing function for a buddy system allocator. ... } static inline void free_pages_check(const char *function, struct page *page) ...
      (Linux-Kernel)
    • [RFC] buddy allocator without bitmap(3) [3/3]
      ... It is guaranteed that there is no invalid "buddy" in the buddy system, ... * So we are allocating or freeing one, we can derive the state of the * other. ... } static inline void free_pages_check(const char *function, struct page *page) ...
      (Linux-Kernel)
    • [PATCH] no buddy bitmap patch revisit : for mm/page_alloc.c [1/2]
      ... This patch removes bitmaps from page allocator in mm/page_alloc.c. ... The algorithm of the buddy system is unchanged. ... } static inline void free_pages_check(const char *function, struct page *page) ...
      (Linux-Kernel)
    • [RFC][PATCH] no bitmap buddy allocator : modifies __free_pages_bulk() (3/4)
      ... * This function checks whether a page is free && is the buddy ... +static inline int page_is_buddy(struct page *page, int order) +{ ... * So we are allocating or freeing one, we can derive the state of the * other. ... } static inline void free_pages_check(const char *function, struct page *page) ...
      (Linux-Kernel)
    • [RFC] buddy allocator without bitmap [3/4]
      ... As a side effect of removing a bitmap, we must test whether a buddy of a page ... 1-1) loop 1st ... Freeing function for a buddy system allocator. ... * So we are allocating or freeing one, we can derive the state of the * other. ...
      (Linux-Kernel)