Re: [PATCH 1/3] rculist: Fix list_entry_rcu to read ptr with rcu_dereference_raw

From: Patrick Marlier
Date: Wed Mar 25 2015 - 11:01:36 EST


On 03/25/2015 03:30 PM, Paul E. McKenney wrote:
On Tue, Mar 24, 2015 at 11:31:38AM +0100, Patrick Marlier wrote:
Change to read effectively ptr with rcu_dereference_raw and not the
__ptr variable on the stack.

Signed-off-by: Patrick Marlier <patrick.marlier@xxxxxxxxx>
Avoiding an extra load could be worthwhile in a number of situations,
agreed.
Not only a load. It adds a store and a load on the stack and I think this creates a dependency in the processor pipeline.

However, won't this change cause sparse to complain if invoked on a
non-RCU-protected pointer? The ability to use list-RCU API
members on both RCU and non-RCU pointers was one of the points
of the previous commit, right?
Probably we can put back the cast but I am not familiar enough with the RCU API.

Also, the problem here is that you probably want ACCESS_ONCE to happen on the content of 'ptr' and not on the stack variable '__ptr'.

(you have to follow this chain: rcu_dereference_raw -> rcu_dereference_check -> __rcu_dereference_check -> lockless_dereference -> ACCESS_ONCE)

#define lockless_dereference(p) \
({ \
typeof(p) _________p1 = ACCESS_ONCE(p); \
smp_read_barrier_depends(); /* Dependency order vs. p above. */ \
(_________p1); \
})

#define __ACCESS_ONCE(x) ({ \
__maybe_unused typeof(x) __var = (__force typeof(x)) 0; \
(volatile typeof(x) *)&(x); })
#define ACCESS_ONCE(x) (*__ACCESS_ONCE(x))

Note that ACCESS_ONCE is doing "&" on x.

IMHO, I would prefer saving some useless instructions for better performance rather than giving too much flexibility on the API (also pretty sure the cast can be still done).
--
Patrick Marlier
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/