Re: [GPIO]implement sleeping GPIO chip removal

From: Maciej Szmigiero
Date: Wed Nov 10 2010 - 10:28:16 EST


W dniu 10.11.2010 06:09, Grant Likely pisze:
> On Sun, Nov 07, 2010 at 07:30:33PM +0100, Maciej Szmigiero wrote:
>> [GPIO]implement sleeping GPIO chip removal
>>
>> Existing GPIO chip removal code is only of "non-blocking" type: if the chip is currently
>> requested it just returns -EBUSY.
>> This is bad for devices which disappear and reappear, like those on hot pluggable buses,
>> because it forces the driver to call gpiochip_remove() in loop until it returns 0.
>>
>> This patch implements a new function which sleeps until device is free instead of
>> returning -EBUSY like gpiochip_remove().
>>
>> Signed-off-by: Maciej Szmigiero <mhej@xxxxx>
>
> This patch makes me uncomfortable, but I'm not entirely sure why. Is
> there a reason that the process is manipulated directly instead of
> using a completion? Perhaps I'm bother by the joint use of
> ->dead + ->removing_task that is bothering me. I need to mull on this
> one some more.
>
> Also, comments below...
>

>> @@ -95,6 +95,10 @@ static int gpio_ensure_requested(struct gpio_desc *desc, unsigned offset)
>> const struct gpio_chip *chip = desc->chip;
>> const int gpio = chip->base + offset;
>>
>> + /* no new requests if chip is being deregistered */
>> + if ((chip->dead) && (test_bit(FLAG_REQUESTED, &desc->flags) == 0))
>> + return -ENODEV;
>> +
>
> Not holding spin lock. Race condition.

Every call to gpio_ensure_requested() in gpiolib.c is done while holding gpio_lock and this function is not exported.

>> +int gpiochip_remove_sleeping(struct gpio_chip *chip, int interruptible)
>> +{
>> + unsigned id;
>> + unsigned long flags;
>> +
>> + /* prevent new requests */
>> + chip->dead = 1;
>
> race, grab spinlock first.

Race with ...?
The only place where chip->dead is changed to zero is later in gpiochip_remove_sleeping() (which cannot be called at the same time for the same
chip more than once anyway) or gpiochip_add() which is new chip registration function.

>> + busy = 1;
>> + break;
>> + }
>> + }
>
(..)
> In fact, gpiochip_remove could be called directly here (with some
> spin_lock refactoring) and exit the loop if it doesn't return -EBUSY.
>
(..)
> Don't open code this. Generalize the code in gpiochip_remove() instead.

I agree with you on this.

W dniu 10.11.2010 10:49, Thomas Gleixner pisze:
> Maybe because it open codes a sloppy refcounting with a loop and magic
> sleeps instead of converting the code to kobjects and proper
> refcounting ?
>

The only way to do GPIO chip removal in the current code is to busy-loop.
"Sloppy" (as you called it) waiting is still more CPU-friendly than looping
in hope that somebody will finally release the chip.
If you would like to implement it as kobject then go ahead and post the code
so it can be used in drivers.

Best regards and thanks for comments,
Maciej Szmigiero
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/