Re: [PATCH 0/2] Kexec jump: The first step to kexec base hibernation

From: Rafael J. Wysocki
Date: Fri Jul 13 2007 - 07:34:30 EST


On Friday, 13 July 2007 11:25, david@xxxxxxx wrote:
> On Fri, 13 Jul 2007, Rafael J. Wysocki wrote:
>
> > Date: Fri, 13 Jul 2007 11:17:37 +0200
> > From: Rafael J. Wysocki <rjw@xxxxxxx>
> > To: david@xxxxxxx
> > Cc: Jeremy Maitin-Shepard <jbms@xxxxxxx>,
> > "Huang, Ying" <ying.huang@xxxxxxxxx>,
> > Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx>, Pavel Machek <pavel@xxxxxx>,
> > nigel@xxxxxxxxxxxxxxxxxx, linux-kernel@xxxxxxxxxxxxxxx,
> > linux-pm@xxxxxxxxxxxxxxxxxxxxxxxxxx
> > Subject: Re: [PATCH 0/2] Kexec jump: The first step to kexec base hibernation
> >
> > On Friday, 13 July 2007 05:12, david@xxxxxxx wrote:
> >> On Thu, 12 Jul 2007, Jeremy Maitin-Shepard wrote:
> >>
> >>> "Rafael J. Wysocki" <rjw@xxxxxxx> writes:
> >>>>>>> 3. Support the in-place kexec? The relocatable kernel is not necessary
> >>>>>>> if this can be implemented.
> >>>>>>> 4. Image writing/reading. (Only user space application is needed).
> >>>>>>
> >>>>>> And a kernel interface for that application.
> >>>>>
> >>>>> I do't understand this statement, this application is just useing the
> >>>>> standard kernel interfaces (block devices to read/write to disk, network
> >>>>> devices to read/write to a server, etc). no new interfaces needed.
> >>>
> >>>> Yes, but it will have to know _what_ to save, no?
> >>>
> >>> I agree that a kernel interface would be important; something like
> >>> /dev/snapshot that can be read by the "save image" kernel, and written
> >>> to by the "restore image" kernel. Note that similarly, kdump provides a
> >>> kernel interface to an ELF image of the old kernel.
> >>
> >> I thought that the idea was to save the entire contents of ram so that
> >> caches, etc remain populated.
> >>
> >> having the system kernel free up ram and then making a sg list of what
> >> memory needs to be backed up would be a nice enhancement, but let's let
> >> that remain a future enhancement until everyone agrees that the basic
> >> approach works.
> >
> > It's not that easy. :-)
> >
> > First, there are memory regions that we don't want to save, because the
> > restoration of them may cause problems (generally all of the reserved pages
> > fall into this category).
> >
> > We also don't want to save free RAM and we don't want to save the memory
> > occupied by the hibernation kernel (ie. the "new" one).
>
> free ram is useually a pretty small number of pages (unless you free up
> ram before suspend). avoiding the ram reserved for the new kernel should
> be pretty simple (actually, it doesn't hurt much to save that ram, it just
> hurts if you try to restore it)
>
> > Also, please note that we can't restore 100% of RAM, even if we save it.
>
> Ok, now we need a data channel from the old kernel to the hibernate
> kernel, to the restore kernel. and the messier the memory layout the
> larger this data channel needs to be (hmm, what's the status on the memory
> defrag patches being proposed?) if this list can be made small enough it
> would work to just have the old kernel put the data in a known location in
> ram, and let the other two parts find it (in ram for the hibernate kernel,
> in the hibernate image for the wakeup kernel).

I think the hibernation kernel should mmap() the "old" kernel's (and it's
processes') memory available for saving, so that the image-saving process
can read its contents from the original locations.

> how do the existing hibernate processes store this?

There are two approaches. In the first of them (used in the mainline) we just
create copies of all pages eligible for saving (hence we can't create images
larger than 50% of RAM) atomically and then we save the contents of these
copies (either directly from the kernel or through a user space process). This
way we don't need to worry that they may be modified before we can save
them.

The other approach is the Nigel's one, in which all LRU pages are first saved
and then used as additional storage for copying the rest of memory contents.
This has a drawback that we are not 100% sure if the LRU won't be modified
after we've used them to store the copies of the other pages.

> since people are complaining about the amount of ram that a kexec kernel
> would take up I'm assuiming it's somethingmore complex then just a bitmap
> of all possible pages.

No, it's just bitmaps, AFAICS, and the complaints are a bit overstated, IMO. ;-)

> most of the conversation so far has been around the process of makeing the
> snapshot and storing it. what are the processes and tools available to
> restore images?

We have quite an efficient restoration code in the kernel right now. It's
able to upload big images (something like total RAM minus the size of the
boot kernel, initrd and, optionally, the resume application), which is much
more than we're able to save. :-)

It can work with images uploaded via /dev/snapshot from the user space
(specific image format is required, but that can be changed easily).

Geetings,
Rafael


--
"Premature optimization is the root of all evil." - Donald Knuth
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/