Re: [RFC] [PATCH 00/13] Introduce task_pid api

From: Paul Jackson
Date: Mon Nov 14 2005 - 22:37:06 EST


Serge wrote:
> the vserver model

What's that?

> Unfortunately that would not work for checkpoints across boots,

That could be made to work, by an early init script that looked in the
"checkedpointed tasks storage locker" on disk, and reserved any pid's
used therein, marking them as frozen. A little care can ensure that
no task with such a pid is already running.

> or, more importantly, for process (set) migration.

Migration to a system that has already been up a while, where no
reservation of pid's was made ahead of time, hence where pid's overlap,
would not work with my EFROZEN scheme - you are right there.

How large is our numeric pid space (on 64 bit systems, anyway)? If
large enough, then reservation of pid ranges becomes an easy task. If
say we had 10 bits to spare, then a server farm could pre-ordain say a
thousand virtual servers, which come and go on various hardware
systems, each virtual server with its own hostname, pid-range, and
other such paraphernalia.

However there is an elephant in the room, or a camel outside the tent
or some such. Yes, the camel's nose may well fit inside the tent,
but before we invite his nose, we should have some idea if the rest
of the camel will fit.

In other words, since I don't see any compelling reason for this
virtualization of pids -except- for checkpoint/restart sorts of
features, the usefulness of pid virtualization would seem to rest on
the acceptability of the rest of the checkpoint/restart proposal.

For all I know now (not much) the amount of effort required to
sufficiently virtualize all the elements of the Linux kernel-user
interface enough to enable robust job migration across machines and
reboots may well make virtualizing the kernel's address space look easy.
Linux is not VM.

Hence, until sold on the larger direction, I am skeptical of this
first step.

Though, I will grant, I am interested too. A good Linux
checkpoint/restart solution would be valuable.

--
I won't rest till it's the best ...
Programmer, Linux Scalability
Paul Jackson <pj@xxxxxxx> 1.925.600.0401
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/