Hi Mel,
On Mon, 2006-11-27 at 13:18 +0000, Mel Gorman wrote:On Wed, 22 Nov 2006, Rohit Seth wrote:
This patch provides a IO hole size in a given address range.
Hi,
This patch reintroduces a function that doubles up what
absent_pages_in_range(start_pfn, end_pfn). I recognise you do this because
you are interested in hole sizes before add_active_range() is called.
Right.
However, what is not clear is why these patches are so specific to x86_64.
Specifically in the fake numa case, we want to make sure that we don't
carve fake nodes that only have IO holes in it. Unlike the real NUMA
case, here we don't have SRAT etc. to know the memory layout beforehand.
It looks possible to do the work of functions like split_nodes_equal() in
an architecture-independent manner using early_node_map rather than
dealing with the arch-specific nodes array. That would open the
possibility of providing fake nodes on more than one architecture in the
future.
The functions like splti_nodes_equal etc. can be abstracted out to arch
independent part. I think the only API it needs from arch dependent
part is to find out how much real RAM is present in range without have
to first do add_active_range.
Though as a first step, let us fix the x86_64 (as it doesn't boot when
you have sizeable chunk of IO hole and nodes > 4).
I'm also not sure if other archs actually want to have this
functionality.
What I think can be done is that you register memory as normal and then
split up the nodes into fake nodes. This would remove the need for having
e820_hole_size() reintroduced.
Are you saying first let the system find out real numa topology and then
build fake numa on top of it?