Re: [PATCH 1/8] iommu: Add I/O ASID allocator
From: Jacob Pan
Date: Tue Jun 11 2019 - 13:12:17 EST
On Tue, 11 Jun 2019 15:37:42 +0100
Jean-Philippe Brucker <jean-philippe.brucker@xxxxxxx> wrote:
> On 11/06/2019 13:26, Jacob Pan wrote:
> >> +/**
> >> + * ioasid_set_data - Set private data for an allocated ioasid
> >> + * @ioasid: the ID to set data
> >> + * @data: the private data
> >> + *
> >> + * For IOASID that is already allocated, private data can be set
> >> + * via this API. Future lookup can be done via ioasid_find.
> >> + */
> >> +int ioasid_set_data(ioasid_t ioasid, void *data)
> >> +{
> >> + struct ioasid_data *ioasid_data;
> >> + int ret = 0;
> >> +
> >> + xa_lock(&ioasid_xa);
> > Just wondering if this is necessary, since xa_load is under
> > rcu_read_lock and we are not changing anything internal to xa. For
> > custom allocator I still need to have the mutex against allocator
> > removal.
>
> I think we do need this because of a possible race with ioasid_free():
>
> CPU1 CPU2
> ioasid_free(ioasid) ioasid_set_data(ioasid, foo)
> data = xa_load(...)
> xa_erase(...)
> kfree_rcu(data) (no RCU lock held)
> ...free(data)
> data->private = foo;
>
make sense, thanks for explaining.
> The issue is theoretical at the moment because no users do this, but
> I'd be more comfortable taking the xa_lock, which prevents a
> concurrent xa_erase()+free(). (I commented on your v3 but you might
> have missed it)
>
Did you reply to my v3? I did not see it. I only saw your comments about
v3 in your commit message.
> >> + ioasid_data = xa_load(&ioasid_xa, ioasid);
> >> + if (ioasid_data)
> >> + rcu_assign_pointer(ioasid_data->private, data);
> > it is good to publish and have barrier here. But I just wonder even
> > for weakly ordered machine, this pointer update is quite far away
> > from its data update.
>
> I don't know, it could be right before calling ioasid_set_data():
>
> mydata = kzalloc(sizeof(*mydata));
> mydata->ops = &my_ops; (1)
> ioasid_set_data(ioasid, mydata);
> ... /* no write barrier here */
> data->private = mydata; (2)
>
> And then another thread calls ioasid_find():
>
> mydata = ioasid_find(ioasid);
> if (mydata)
> mydata->ops->do_something();
>
> On a weakly ordered machine, this thread could observe the pointer
> assignment (2) before the ops assignment (1), and dereference NULL.
> Using rcu_assign_pointer() should fix that
>
I agree it is better to have the barrier. Just thought there is already
a rcu_read_lock() in xa_load() in between. rcu_read_lock() may have
barrier in some case but better not count on it. No issues here. I will
integrate this in the next version.
> Thanks,
> Jean
[Jacob Pan]