Re: [PATCH 2/6] mm: mark all calls into the vmalloc subsystem as potentially sleeping

From: Andy Lutomirski
Date: Wed Oct 19 2016 - 11:35:08 EST


On Wed, Oct 19, 2016 at 6:05 AM, Christoph Hellwig <hch@xxxxxx> wrote:
> On Wed, Oct 19, 2016 at 12:15:41PM +0100, Chris Wilson wrote:
>> On Tue, Oct 18, 2016 at 08:56:07AM +0200, Christoph Hellwig wrote:
>> > This is how everyone seems to already use them, but let's make that
>> > explicit.
>>
>> Ah, found an exception, vmapped stacks:
>
> Oh, fun. So if we can't require vfree to be called from process context
> we also can't use a mutex to wait for the vmap flushing. Given that we
> free stacks from the scheduler context switch I also fear there is no
> good way to get a sleepable context there.
>
> The only other idea I had was to use vmap_area_lock for the protection
> that purge_lock currently provides, but that would require some serious
> refactoring to avoid recursive locking first.

It would be quite awkward for a task stack to get freed from a
sleepable context, because the obvious sleepable context is the task
itself, and it still needs its stack. This was true even in the old
regime when task stacks were freed from RCU context.

But vfree has a magic automatic deferral mechanism. Couldn't you make
the non-deferred case might_sleep()?

--
Andy Lutomirski
AMA Capital Management, LLC