On Monday 04 June 2012 16:22:51 Michal Nazarewicz wrote:
On Mon, 04 Jun 2012 15:43:56 +0200, Bartlomiej Zolnierkiewicz <b.zolnierkie@xxxxxxxxxxx> wrote:
> +/*
> + * Returns true if MIGRATE_UNMOVABLE pageblock can be successfully
> + * converted to MIGRATE_MOVABLE type, false otherwise.
> + */
> +static bool can_rescue_unmovable_pageblock(struct page *page, bool locked)
> +{
> + unsigned long pfn, start_pfn, end_pfn;
> + struct page *start_page, *end_page, *cursor_page;
> +
> + pfn = page_to_pfn(page);
> + start_pfn = pfn & ~(pageblock_nr_pages - 1);
> + end_pfn = start_pfn + pageblock_nr_pages - 1;
> +
> + start_page = pfn_to_page(start_pfn);
> + end_page = pfn_to_page(end_pfn);
> +
> + for (cursor_page = start_page, pfn = start_pfn; cursor_page <= end_page;
> + pfn++, cursor_page++) {
> + struct zone *zone = page_zone(start_page);
> + unsigned long flags;
> +
> + if (!pfn_valid_within(pfn))
> + continue;
> +
> + /* Do not deal with pageblocks that overlap zones */
> + if (page_zone(cursor_page) != zone)
> + return false;
> +
> + if (!locked)
> + spin_lock_irqsave(&zone->lock, flags);
> +
> + if (PageBuddy(cursor_page)) {
> + int order = page_order(cursor_page);
> -/* Returns true if the page is within a block suitable for migration to */
> -static bool suitable_migration_target(struct page *page)
> + pfn += (1 << order) - 1;
> + cursor_page += (1 << order) - 1;
> +
> + if (!locked)
> + spin_unlock_irqrestore(&zone->lock, flags);
> + continue;
> + } else if (page_count(cursor_page) == 0 ||
> + PageLRU(cursor_page)) {
> + if (!locked)
> + spin_unlock_irqrestore(&zone->lock, flags);
> + continue;
> + }
> +
> + if (!locked)
> + spin_unlock_irqrestore(&zone->lock, flags);
spin_unlock in three places is ugly. How about adding a flag that holds the
result of the function, using it as part of the for loop condition, and
setting it to false inside an additional else clause? E.g.:
	bool result = true;
	for (...; result && cursor_page <= end_page; ...) {
		...
		if (!pfn_valid_within(pfn)) continue;
		if (page_zone(cursor_page) != zone) return false;
		if (!locked) spin_lock_irqsave(...);
		if (PageBuddy(...)) {
			...
		} else if (page_count(cursor_page) == 0 ||
			   PageLRU(cursor_page)) {
			...
		} else {
			result = false;
		}
		if (!locked) spin_unlock_irqrestore(...);
	}
	return result;
Thanks, I'll use the hint (if still applicable) in the next patch version.
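For illustration only, the single-unlock loop suggested above can be modelled
in userspace C. This is a sketch under stated assumptions, not kernel code:
struct fake_page, struct fake_lock, enum page_kind and block_is_rescuable()
are hypothetical stand-ins for struct page, zone->lock, the
PageBuddy()/page_count()/PageLRU() tests and
can_rescue_unmovable_pageblock(). The lock is just a counter, so the
lock/unlock balance can be checked after the call.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for kernel state; none of these are kernel types. */
enum page_kind { PAGE_BUDDY, PAGE_FREE_OR_LRU, PAGE_PINNED };

struct fake_page {
	enum page_kind kind;
	unsigned int order;	/* only meaningful for PAGE_BUDDY */
};

struct fake_lock {
	int depth;		/* number of acquisitions not yet released */
};

static void fake_lock_acquire(struct fake_lock *l)
{
	l->depth++;
}

static void fake_lock_release(struct fake_lock *l)
{
	assert(l->depth > 0);	/* releasing an unheld lock is a bug */
	l->depth--;
}

/*
 * Model of the restructured loop: the result flag is part of the loop
 * condition, so every iteration reaches the single unlock at the bottom.
 */
static bool block_is_rescuable(const struct fake_page *pages, size_t nr,
			       struct fake_lock *lock, bool locked)
{
	bool result = true;
	size_t i;

	for (i = 0; result && i < nr; i++) {
		if (!locked)
			fake_lock_acquire(lock);

		if (pages[i].kind == PAGE_BUDDY) {
			/* Skip the remaining pages of the free buddy chunk. */
			i += ((size_t)1 << pages[i].order) - 1;
		} else if (pages[i].kind == PAGE_FREE_OR_LRU) {
			/* This page does not block the conversion. */
		} else {
			result = false;
		}

		if (!locked)
			fake_lock_release(lock);
	}

	return result;
}
```

With this shape the unlock appears once, and a failing page ends the loop
through the condition rather than through an early return.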
> + return false;
> + }
> +
> + return true;
> +}
How do you make sure that a page is not allocated while this runs? Or do you
just not care? Note that even with the zone lock held, a page may be allocated
from a pcp list on (another) CPU.
OK, I see the issue (i.e. a pcp page can be returned by rmqueue_bulk() in
buffered_rmqueue() and its page count will be increased in prep_new_page()
a bit later with the zone lock dropped, so while we may not see the page as
a "bad" one in can_rescue_unmovable_pageblock(), it may end up as an
unmovable one in a pageblock that was just changed to MIGRATE_MOVABLE type).

It is basically a similar problem to the page allocation vs
alloc_contig_range() races present in CMA, so we may deal with it in a
similar manner as CMA: isolate the pageblock so no new allocations will be
allowed from it, check if we can do the pageblock transition to
MIGRATE_MOVABLE type and do it if so, drain pcp lists, check if the
transition was successful, and if there are some pages that slipped through,
just revert the operation.
[*] BTW please see http://marc.info/?l=linux-mm&m=133775797022645&w=2
for CMA related fixes
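
The isolate / check / convert / drain / recheck / revert sequence described
above can be sketched as a small userspace model. Everything here is an
assumption made for illustration: struct fake_block, its two counters,
scan_ok(), settle_in_flight() and try_rescue() are hypothetical names, not
kernel APIs. settle_in_flight() stands in for draining the pcp lists and for
racing allocations completing, which the model collapses into one step.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

enum block_type { BLOCK_UNMOVABLE, BLOCK_MOVABLE, BLOCK_ISOLATE };

/* Hypothetical model of one pageblock; not a kernel structure. */
struct fake_block {
	enum block_type type;
	size_t visible_unmovable;	/* pages a scan would reject now */
	size_t in_flight_unmovable;	/* handed out, not yet visible to a scan */
};

/* Models the scan: succeeds if no unmovable page is visible. */
static bool scan_ok(const struct fake_block *b)
{
	return b->visible_unmovable == 0;
}

/* Models the drain: pages that slipped through become visible. */
static void settle_in_flight(struct fake_block *b)
{
	b->visible_unmovable += b->in_flight_unmovable;
	b->in_flight_unmovable = 0;
}

static bool try_rescue(struct fake_block *b)
{
	enum block_type saved = b->type;

	assert(saved == BLOCK_UNMOVABLE);	/* only unmovable blocks qualify */

	b->type = BLOCK_ISOLATE;	/* 1. no new allocations from the block */
	if (!scan_ok(b)) {		/* 2. first check */
		b->type = saved;
		return false;
	}

	b->type = BLOCK_MOVABLE;	/* 3. tentative transition */
	settle_in_flight(b);		/* 4. drain pcp lists */
	if (!scan_ok(b)) {		/* 5. recheck after the drain */
		b->type = saved;	/* 6. a page slipped through: revert */
		return false;
	}

	return true;
}
```

In this model a block with a page in flight passes the first check, fails the
second one, and ends up with its original type, which is the revert path the
mail describes.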