[PATCH v7 00/14] large atomic writes for xfs

From: John Garry
Date: Tue Apr 15 2025 - 08:15:51 EST


Currently atomic write support for xfs is limited to writing a single
block as we have no way to guarantee alignment and that the write covers
a single extent.

This series introduces a method to issue atomic writes via a
software-based method.

The software-based method is used as a fallback for when attempting to
issue an atomic write over misaligned or multiple extents.

For xfs, this support is based on reflink CoW support.

The basic idea of this CoW method is to alloc a range in the CoW fork,
write the data, and atomically update the mapping.

Initial mysql performance testing has shown this method to perform ok.
However, there we are only using 16K atomic writes (and 4K block size),
so typically - and thankfully - this software fallback method won't be
used often.

For other FSes which want large atomics writes and don't support CoW, I
think that they can follow the example in [0].

Catherine is currently working on further xfstests for this feature,
which we hope to share soon.

Based on 8ffd015db85f (tag: v6.15-rc2, xfs/xfs-6.16-merge,
xfs/xfs-6.15-fixes, xfs/for-next) Linux 6.15-rc2

[0] https://lore.kernel.org/linux-xfs/20250310183946.932054-1-john.g.garry@xxxxxxxxxx/

Differences to v6:
- log item sizes updates (Darrick)
- rtvol support (Darrick)
- mount option for atomic writes (Darrick)
- Add RB tags from Darrick and Christoph (Thanks!)

Differences to v5:
- Add statx unit_max_opt (Christoph, me)
- Add xfs_atomic_write_cow_iomap_begin() (Christoph)
- drop old mechanical changes
- limit atomic write max according to CoW-based atomic write max (Christoph)
- Add xfs_compute_atomic_write_unit_max()
- this contains changes for limiting awu max according to max
transaction log items (Darrick)
- use -ENOPROTOOPT for fallback (Christoph)
- rename xfs_inode_can_atomicwrite() -> xfs_inode_can_hw_atomicwrite()
- rework varoious code comments (Christoph)
- limit CoW-based atomic write to log size and add helpers (Darrick)
- drop IOMAP_DIO_FORCE_WAIT usage in xfs_file_dio_write_atomic()
- Add RB tags from Christoph (thanks!)

Darrick J. Wong (3):
xfs: add helpers to compute log item overhead
xfs: add helpers to compute transaction reservation for finishing
intent items
xfs: allow sysadmins to specify a maximum atomic write limit at mount
time

John Garry (11):
fs: add atomic write unit max opt to statx
xfs: rename xfs_inode_can_atomicwrite() ->
xfs_inode_can_hw_atomicwrite()
xfs: allow block allocator to take an alignment hint
xfs: refactor xfs_reflink_end_cow_extent()
xfs: refine atomic write size check in xfs_file_write_iter()
xfs: add xfs_atomic_write_cow_iomap_begin()
xfs: add large atomic writes checks in xfs_direct_write_iomap_begin()
xfs: commit CoW-based atomic writes atomically
xfs: add xfs_file_dio_write_atomic()
xfs: add xfs_compute_atomic_write_unit_max()
xfs: update atomic write limits

Documentation/admin-guide/xfs.rst | 8 +
block/bdev.c | 3 +-
fs/ext4/inode.c | 2 +-
fs/stat.c | 6 +-
fs/xfs/libxfs/xfs_bmap.c | 5 +
fs/xfs/libxfs/xfs_bmap.h | 6 +-
fs/xfs/libxfs/xfs_trans_resv.c | 315 +++++++++++++++++++++++++++---
fs/xfs/libxfs/xfs_trans_resv.h | 22 +++
fs/xfs/xfs_bmap_item.c | 10 +
fs/xfs/xfs_bmap_item.h | 3 +
fs/xfs/xfs_buf_item.c | 19 ++
fs/xfs/xfs_buf_item.h | 3 +
fs/xfs/xfs_extfree_item.c | 10 +
fs/xfs/xfs_extfree_item.h | 3 +
fs/xfs/xfs_file.c | 87 ++++++++-
fs/xfs/xfs_inode.h | 2 +-
fs/xfs/xfs_iomap.c | 191 +++++++++++++++++-
fs/xfs/xfs_iomap.h | 1 +
fs/xfs/xfs_iops.c | 77 +++++++-
fs/xfs/xfs_iops.h | 3 +
fs/xfs/xfs_log_cil.c | 4 +-
fs/xfs/xfs_log_priv.h | 13 ++
fs/xfs/xfs_mount.c | 86 ++++++++
fs/xfs/xfs_mount.h | 11 ++
fs/xfs/xfs_refcount_item.c | 10 +
fs/xfs/xfs_refcount_item.h | 3 +
fs/xfs/xfs_reflink.c | 143 +++++++++++---
fs/xfs/xfs_reflink.h | 6 +
fs/xfs/xfs_rmap_item.c | 10 +
fs/xfs/xfs_rmap_item.h | 3 +
fs/xfs/xfs_super.c | 28 ++-
fs/xfs/xfs_trace.h | 115 +++++++++++
include/linux/fs.h | 3 +-
include/linux/stat.h | 1 +
include/uapi/linux/stat.h | 8 +-
35 files changed, 1130 insertions(+), 90 deletions(-)

--
2.31.1