Re: [RFC PATCH] nvmet: Make blksize_shift configurable

From: Damien Le Moal
Date: Fri Apr 18 2025 - 06:23:23 EST


On 4/18/25 18:56, Richard Weinberger wrote:
> On Freitag, 18. April 2025 11:37 'Damien Le Moal' via upstream wrote:
>>> + if (!ns->blksize_shift)
>>> + ns->blksize_shift = blksize_bits(bdev_logical_block_size(ns->bdev));
>>
>> If the user set logical block size is smaller than the block dev logical block
>> size, this is not going to work... No ? Am I missing something ?
>
> Likely, yes.
> TBH, I'm not sure whether it makes actually sense for the bdev case to make
> blksize_shift configurable.

Probably not... I do understand the value for the file case though.

> The case I see most benefit is the backing file case.
>
>>> + if (!ns->blksize_shift) {
>>> + /*
>>> + * i_blkbits can be greater than the universally accepted
>>> + * upper bound, so make sure we export a sane namespace
>>> + * lba_shift.
>>> + */
>>> + ns->blksize_shift = min_t(u8,
>>> + file_inode(ns->file)->i_blkbits, 12);
>>
>> This will work for any block size, regardless of the FS block size, but only if
>> ns->buffered_io is true. Doesn't this require some more checks with regards to
>> O_DIRECT (!ns->buffered_io case) ?
>
> Good catch. I'll add a check.

And by the way, you need to check for STATX_DIOALIGN since some FS (e.g. xfs)
can handle direct IOs that are not aligned to the FS block size. See the recent
changes in drivers/block/loop.c to improve direct IO handling, specifically, the
function loop_query_min_dio_size().

--
Damien Le Moal
Western Digital Research