ftp://ftp.cs.wisc.edu/condor
AFAIK, Condor does "checkpointing". It is a pure user-space solution and portable: it runs on many platforms.
(The README says that it worked with gcc-2.6.2, kernel 1.1.75, and libc-4.6.27.)
Cheers, -Harald