Re: Sysfs and suicidal attributes
From: Alan Stern
Date: Tue Jul 10 2007 - 13:18:48 EST
On Wed, 11 Jul 2007, Tejun Heo wrote:
> > Addition during resume is _not_ safe. The new device can easily get
> > added to the already-processed list in the wrong position, so that the
> > next time the system is suspended the PM core tries to suspend the new
> > device's parent before suspending the new device.
>
> When a bus device is being woken up and rescanning the bus, all
> prerequisites for the bus should have been resumed already. As long as
> the newly added device is placed after the bus device itself, things
> should be safe. Well, that's the theory at least.
What actually happened in practice was that a device send a wakeup
request and hence was resumed without the PM core's knowledge. As a
result it was still on the "suspended" list when its new child was
added to the end of the "resumed" list. Then when the PM core got
around to the parent, the parent was placed after the child. Since the
"resumed" list is iterated in reverse order for suspending, you see the
problem.
If the PM core had been holding the parent's semaphore this wouldn't
have occurred. But it does illustrate the pitfalls of adding a device
during resume.
> > I had envisioned using SRCU to synchronize existing device additions
> > with the beginning of a suspend. down_read_trylock should work just as
> > well (aside from cacheline bouncing).
>
> How high performance device addition/removal should be? Wouldn't rwlock
> be enough?
rwlock is a type of spinlock, right? Hence it can't sleep and can't be
used for device addition/removal. Also rwlocks aren't fair and are
subject to livelock.
> > Device removal isn't as much of a problem as addition. We could allow
> > it with no difficulty. Failing it isn't really an option because
> > device_del() returns void. And anyway, my idea was to have the PM core
> > acquire all the device semaphores at the start of the suspend -- this
> > would automatically block any attempted removal until the semaphore was
> > released.
>
> Hmmm... locking all device mutexes sounds scary to me but I tend to be a
> chicken and lockdep is great, so it might just work. :-)
These aren't mutexes; they are semaphores (although they are used as
mutexes). I speculate that part of the reason they have never been
converted to mutexes is because of potential lockdep issues.
The order of locking isn't a problem. The list itself defines the
correct order.
> > What with all the sysfs changes, I wouldn't know where to begin
> > looking. The idea occurred to me mainly because I was considering
> > blocking all access to sysfs during a suspend -- the problem being that
> > a suspend is itself triggered by a write to a sysfs attribute!
> > Somehow that write would have to wait for all _other_ existing accesses
> > to complete and block new ones.
>
> Why is that necessary? What happens if there's some long-running
> read/write operation using bin attr?
That is certainly a possible problem. I don't know if there are any
such long-running operations.
Is it necessary? No. But it would make things easier if suspend
handling could be localized in one place (the sysfs core) instead of
spread out among a bunch of attribute methods.
Using a freezable workqueue for the callbacks would also localize the
suspend handling.
Alan Stern
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/