Re: [PATCH RFC] hotplug-memory: refactor online_pages to separate zone growth from page onlining
- From: Yasunori Goto <y-goto@xxxxxxxxxxxxxx>
- Date: Tue, 01 Apr 2008 16:17:38 +0900
Dave Hansen wrote:
On Sat, 2008-03-29 at 16:53 -0700, Jeremy Fitzhardinge wrote:
Dave Hansen wrote:
On Fri, 2008-03-28 at 19:08 -0700, Jeremy Fitzhardinge wrote:I had no intention of globally disabling it. I just need to disable it
My big remaining problem is how to disable the sysfs interface for thisI've been thinking about this some more, and I wish that you wouldn't
memory. I need to prevent any onlining via /sys/device/system/memory.
just throw this interface away or completely disable it.
for my use case.
Right, but by disabling it for your case, you have given up all of the
testing that others have done on it. Let's try and see if we can get
the interface to work for you.
I suppose, but I'm not sure I see the point. What are the benefits of
using this interface? You mentioned that the interface exists so that
its possible to defer using a newly added piece of memory to avoid
fragmentation. I suppose I can see the point of that
Not only to avoid fragmentation, but also for notification
to user level for preparing memory add event.
When memory is added, there is a notification via udev for each memory
device.
In our box, one node which includes some DIMMs and CPUs can be added by
hot-add, and there is another notification for 1 node by ACPI's
container device.
After user level check for preparing, user(or shell script) can
online memory.
IIRC, some of user level application would require this notification.
ex) resource manager over physical/logical partitioning.
But in the xen-balloon case, the memory is added on-demand precisely
when its about to be used, and then onlined in pieces as needed.
Extending the usermode interface to allow partial onlining/offlining
doesn't seem very useful for the case of physical hotplug memory, and
its not at all clear how to do it in a useful way for the xen-balloon
case. Particularly for offlining, since you'd need to guarantee that
any page chosen for offlining isn't currently in use.
Basically, I hope there is no change for user level interface between
physical hotplug and Xen as much as possible.
So, I would like to make sense why memory is added "on-demand" on Xen.
I thought the hypervisor gathers a section's memory and moves all of them
from one guest to another at a time. Its gathering time may be long time.
But, each per page moving may cause of fragmentation, if my understanding
is correct....
To me, it sounds like the only different thing that you want is to makeWell, yes and no.
sure that only partial sections are onlined. So, shall we work with the
existing interfaces to online partial sections, or will we just disable
it entirely when we see Xen?
For the current balloon driver, it doesn't make much sense. It would
add a fair amount of complexity without any real gain. It's currently
based around alloc_page/free_page. When it wants to shrink the domain
and give memory back to the host, it allocates pages, adds the page
structures to a ballooned pages list, and strips off the backing memory
and gives it to the host. Growing the domain is the converse: it gets
pages from the host, pulls page structures off the list, binds them
together and frees them back to the kernel. If it runs out of ballooned
page structures, it hotplugs in some memory to add more.
How does this deal with things like present_pages in the zones? Does
the total ram just grow with each hot-add, or does it grow on a per-page
basis from the ballooning?
Well, there are two ways of looking at it:
either hot-plugging memory immediately adds pages, but they're also
all immediately allocated and therefore unavailable for general use, or
the pages are notionally physically added as they're populated by
the host
In principle they're equivalent, but I could imagine the former has the
potential to make the VM waste time scanning unfreeable pages.
I'm not sure the patches I've posted are doing this stuff correctly
either way.
I don't make sense both your idea yet. Could you tell me more?
One of them may be same to my understanding. But I'm not sure.
Thanks.
--
Yasunori Goto
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
- Prev by Date: linux-next: Tree for April 1
- Next by Date: Re: nobody cared about IRQ 19 (firewire, on a HP 2510p notebook)
- Previous by thread: linux-next: Tree for April 1
- Next by thread: Re: [PATCH RFC] hotplug-memory: refactor online_pages to separate zone growth from page onlining
- Index(es):
Relevant Pages
|
Loading