Re: [PATCH v3 03/12] rust: xarray: add `contains_index` method

From: Andreas Hindborg

Date: Thu Feb 12 2026 - 05:21:59 EST

"Liam R. Howlett" <Liam.Howlett@xxxxxxxxxx> writes:

> * Andreas Hindborg <a.hindborg@xxxxxxxxxx> [260211 02:41]:
>> "Liam R. Howlett" <Liam.Howlett@xxxxxxxxxx> writes:
>>
>> > * Andreas Hindborg <a.hindborg@xxxxxxxxxx> [260209 14:38]:
>> >> Add a convenience method `contains_index` to check whether an element
>> >> exists at a given index in the XArray. This method provides a more
>> >> ergonomic API compared to calling `get` and checking for `Some`.
>> >
>> > I think this is going to result in less efficient code for most uses.
>> >
>> > Most users use the results returned, not just checking if there is or is
>> > not a value. So if you find the value an xarray state and then just
>> > throw it away and find it again, it'll be less efficient.
>> >
>> > If there are users that do use the xarray to just check if something
>> > exists or not (which there probably are?), then it should be in a
>> > wrapper for that code and not the generic API. Otherwise we will have
>> > users pop up to use this method when they should not.
>>
>> I agree in that I would prefer matching on the result of a lookup and
>> using that result. This is not always possible due to a limitation in
>> the current implementation of the borrow checker. Please see my response
>> to Tamir [1], it has all the pointers.
>>
>> Since we cannot use a match statement in certain situations, we have to
>> fall back to something that does not borrow from the collections, like
>> `array.get(index).is_some()`. I would argue that
>>
>> if array.contains_index(foo) { ... }
>>
>> is easier to read than
>>
>> if array.get(foo).is_some() { ... }
>>
>> And I don't think `array.contains()` is going to have worse codegen than
>> `array.get(index).is_some()`.
>
> This is probably my lack of rust knowledge...
>
>
> My concern is around API usage. I am concerned people will use xas as a
> throw-away lookup with this API and cause more walks of the xarray to
> the same location.
>
> In the normal API, we have lookups like this; you take a lock, look
> something up, drop the lock and return it. Since the life cycle of the
> stored information is outside the scope of the xarray, the user is
> dependent on the entry being stable by some other means after the xarray
> lock is dropped.

As discussed elsewhere, the current Rust XArray API only has fully
exclusive access, no reader/writer separation.

After a lookup the returned object is bound to the lifetime of lock. So
you cannot drop the lock while holding a mutable reference to something
in the collection, and the "other means" part of your paragraph is
automatically handled in Rust.

> In the advanced API, we do more within the locked area, usually.
>
> Usually, applications don't just print out there is a value, they do
> something with it. So I would expect a real example to be something
> like (this horrible psudo-c/rust mangled mess):
>
> let entry = array.get_mut(foo);
>
> if (entry.is_some()) {
> /* do something with entry */
> send_to_party(entry);
> } else {
> /* deal with it not existing */
> }
>
> What I don't want to do:
>
> if (array.contains_index(foo)) {
> entry = array.get_mut(foo);
> } else {
> ...
> }
>
> Where contains_index(foo) sets up an xas, walks to the location, returns
> the entry (or not) and then translates into a boolean.. then if it's
> true we set up another xas to walk to the same location.
>
> That is, the worst code gen would come from this:
> if (array.get(foo).is_some()) { array.get_mut(foo).. }

You are completely right, this causes two walks to the same location,
which is not great.

> From what you said here and the link, you are saying we need to do this
> in certain situations due to rust's borrow checker and the lifetime, but
> I cannot see why we would need to walk the xarray twice from the example
> provided.

Let me try to explain with an example using user space Rust. Consider a
situation where we have two two key-value mapping data structures. Since
we are in user space, I'll go with BTreeMap from the standard library,
but it could be XArray in the kernel.

If we want to implement a transaction across these two KV maps, we
probably need to take a lock to get mutable access to them. Let's
represent this with a `struct Maps` that has two fields `a` and `b` for
these two mutable references:

use std::collections::{
BTreeMap,
btree_map::{Entry, OccupiedEntry},
};

type MapType = BTreeMap<u32, u32>;

struct Maps<'a> {
a: &'a mut MapType,
b: &'a mut MapType,
}

Now we have two locked data structures and we can implement our
transaction over them. Let's say the algorithm is the following:

- Given a key, if that key exists in map A, return a mutable reference
to the value for that key in map A.
- Else, move the key and value for the lowest ordered key from map A to
map B and return a mutable reference the that value in map B.

My first attempt at implementing this algorithm would be something like
this:

fn transaction_impl1<'a>(maps: &'a mut Maps, key: u32) -> &'a mut u32 {
if let Entry::Occupied(o) = maps.a.entry(key) {
o.into_mut()
} else {
let value = maps.a.first_entry().expect("Not empty").remove();
maps.b.entry(key).or_insert(value)
}
}

However, this does not compile.

error[E0499]: cannot borrow `*maps.a` as mutable more than once at a time
--> src/main.rs:51:21
|
47 | fn transaction_impl1<'a>(maps: &'a mut Maps, key: u32) -> &'a mut u32 {
| -- lifetime `'a` defined here
48 | if let Entry::Occupied(o) = maps.a.entry(key) {
| - ------ first mutable borrow occurs here
| _____|
| |
49 | | o.into_mut()
50 | | } else {
51 | | let value = maps.a.first_entry().expect("Not empty").remove();
| | ^^^^^^ second mutable borrow occurs here
52 | | maps.b.entry(key).or_insert(value)
53 | | }
| |_____- returning this value requires that `*maps.a` is borrowed for `'a`

The `into_mut()` on line 49 captures the lifetime of `o`, and because the
value is returned from the function, it must live for `'a`, which is
till the end of the function.

As far as I understand, this is a borrow checker limitation. It is easy
for us to look at this code and decide that the borrow on line 51 will
never alias with the borrow on line 49.

If we change the code so that we do not capture the lifetime of the
returned object across the if/else, the code will compile:

fn transaction_impl2<'a>(maps: &'a mut Maps, key: u32) -> &'a mut u32 {
if maps.a.contains_key(&key) {
maps.a.get_mut(&key).expect("Key is present")
} else {
let value = maps.a.first_entry().expect("Not empty").remove();
maps.b.entry(key).or_insert(value)
}
}

But now we do two walks in `maps.a` in the taken path, which is not efficient.

>
> And making it easier to do this could result in a lot more users
> doubling xarray walks without realising that it's a bad idea (unless
> it's this special case).
>
> ...
>>
>> [1] https://lore.kernel.org/rust-for-linux/20260209-xarray-entry-send-v3-0-f777c65b8ae2@xxxxxxxxxx/T/#m95fb90870c511491f4f487dbf852c689cd0733f4
>>
>
> I have trouble following 'the taken arm' in your link. I think you mean
> one of the branches based on the existence of the entry, but I don't
> know which is the 'taken' and how 'self' is out of scope.
>
> Other links off the above link seem to indicate it is a problem with the
> rust borrow checking hitting a false positive.
>
> It seems we need to look up things twice to work around the false
> positive - or implement something like get_or_insert()?
>
> Or, wait for the new checker to be released - but that doesn't fix all
> the false positives, just this one?
>
> So, do all users of the xarray suffer from this false positive?

Not all users will suffer from this. The following code compiles fine:

fn transaction_impl3<'a>(maps: &'a mut Maps, key: u32, value: u32) -> &'a mut u32 {
if let Entry::Occupied(o) = maps.a.entry(key) {
o.into_mut()
} else {
maps.b.entry(key).or_insert(value)
}
}

This is valid because the lifetime captured by `o.into_mut()` is not
used on the other match arm.

I hope this helps clarify the situation.

Best regards,
Andreas Hindborg