Re: RE2: [OKS] Module removal

From: Keith Owens (kaos@ocs.com.au)
Date: Mon Jul 01 2002 - 21:25:45 EST


On Mon, 1 Jul 2002 22:40:34 -0300,
Werner Almesberger <wa@almesberger.net> wrote:
>If I remember right, the main arguments why module removal can
>race with references were:
>....
> - removal happening immediately after module usage count is
> decremented to zero may unload module before module has
> executed "return" instruction
>For the removal-before-return problem, I thought a bit about it
>on my return flight. It would seem to me that an "atomic"
>"decrement_module_count_and_return" function would do the trick.

This is just one symptom of the overall problem, which is module code
that adjusts its use count by executing code that belongs to the
module. The same problem exists on entry to a module function, the
module can be removed before MOD_INC_USE_COUNT is reached.

Apart from abandoning module removal, there are only two viable fixes:

1) Do the reference counting outside the module, before it is entered.

   This is why Al Viro added the owner: __THIS_MODULE; line to various
   structures. The problem is that it spreads like a cancer. Every
   structure that contains function pointers needs an owner field.
   Every bit of code that dereferences a function pointer must first
   bump the owner's use count (using atomic ops) and must cope with the
   owner no longer existing.

   Not only does this pollute all structures that contain function
   pointers, it introduces overhead on every function dereference. All
   of this just to cope with the relatively low possibility that a
   module will be removed.

2) Introduce a delay after unregistering a module's services and before
   removing the code from memory.

   This puts all the penalty and complexity where it should be, in the
   unload path. However it requires a two stage rmmod process (check
   use count, unregister, delay, recheck use count, remove if safe)
   so all module cleanup routines need to be split into unregister and
   final remove routines.

   This is relatively easy to do without preemption, it is
   significantly harder with preempt. There are also unsolved problems
   with long running device commands with callbacks (e.g. CD-R fixate)
   and with kernel threads started from a module (must wait until
   zombies have been reaped).

Rusty and I agree that option (2) is the only sane way to do module
unload, assuming that we retain module unloading. First decide if the
extra work is justified.

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Sun Jul 07 2002 - 22:00:08 EST