Re: [PATCH v1 00/10] mm: online/offline 4MB chunks controlled by device driver
From: Michal Hocko
Date: Wed Jul 18 2018 - 09:19:16 EST
[got back to this really late. Sorry about that]
On Thu 24-05-18 23:07:23, David Hildenbrand wrote:
> On 24.05.2018 16:22, Michal Hocko wrote:
> > I will go over the rest of the email later I just wanted to make this
> > point clear because I suspect we are talking past each other.
>
> It sounds like we are now talking about how to solve the problem. I like
> that :)
>
> >
> > On Thu 24-05-18 16:04:38, David Hildenbrand wrote:
> > [...]
> >> The point I was making is: I cannot allocate 8MB/128MB using the buddy
> >> allocator. All I want to do is manage the memory a virtio-mem device
> >> provides as flexible as possible.
> >
> > I didn't mean to use the page allocator to isolate pages from it. We do
> > have other means. Have a look at the page isolation framework and have a
> > look how the current memory hotplug (ab)uses it. In short you mark the
> > desired physical memory range as isolated (nobody can allocate from it)
> > and then simply remove it from the page allocator. And you are done with
> > it. Your particular range is gone, nobody will ever use it. If you mark
> > those struct pages reserved then pfn walkers should already ignore them.
> > If you keep those pages with ref count 0 then even hotplug should work
> > seemlessly (I would have to double check).
> >
> > So all I am arguing is that whatever your driver wants to do can be
> > handled without touching the hotplug code much. You would still need
> > to add new ranges in the mem section units and manage on top of that.
> > You need to do that anyway to keep track of what parts are in use or
> > offlined anyway right? Now the mem sections. You have to do that anyway
> > for memmaps. Our sparse memory model simply works in those units. Even
> > if you make a part of that range unavailable then the section will still
> > be there.
> >
> > Do I make at least some sense or I am completely missing your point?
> >
>
> I think we're heading somewhere. I understand that you want to separate
> this "semi" offline part from the general offlining code. If so, we
> should definitely enforce segment alignment for online_pages/offline_pages.
>
> Importantly, what I need is:
>
> 1. Indicate and prepare memory sections to be used for adding memory
> chunks (right now add_memory())
Yes, this is section based. So you will always get memmap (struct page)
for the whole section.
> 2. Make memory chunks of a section available to the system (right now
> online_pages())
Yes, this doesn't have to be section based. All you need is to mark
remaining pages as offline. They are reserved at this moment so nobody
should touch tehem.
> 3. Remove memory chunks of a section from the system (right now
> offline_pages())
Yes. All we need is to note that those reserved pages are actually good
to offline. I have mentioned that reserved pages are yours at this stage
so you can note the special state without an additional page flag.
The generic hotplug code just have to learn about this new state.
has_unmovable_pages sounds like a proper place to do that. You simply
clear the offline state and the PageReserved and you are done with the
page.
> 4. Remove memory sections from the system (right now remove_memory())
no change needed
> 5. Hinder dumping tools from reading memory chunks that are logically
> offline (right now PageOffline())
I still fail to see why do we even care about some dumping tools. Pages
are reserved so they simply shouldn't touch that memory at all.
--
Michal Hocko
SUSE Labs