Re: [PATCH 2/4] mm: add atomic flush guard for IOCB_DONTCACHE writeback

Next message: K Prateek Nayak: "Re: [PATCH v2 5/7] sched/fair: Increase weight bits for avg_vruntime"
Previous message: Damien Le Moal: "Re: [PATCH] ata: libata-eh: Do not retry reset if the device is gone"
In reply to: Jeff Layton: "[PATCH 2/4] mm: add atomic flush guard for IOCB_DONTCACHE writeback"
Next in thread: Jeff Layton: "Re: [PATCH 2/4] mm: add atomic flush guard for IOCB_DONTCACHE writeback"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

From: Christoph Hellwig

Date: Thu Apr 02 2026 - 01:28:02 EST

On Wed, Apr 01, 2026 at 03:10:59PM -0400, Jeff Layton wrote:
> When the PAGECACHE_TAG_WRITEBACK tag clears after a round of writeback
> completes, all concurrent IOCB_DONTCACHE writers see the tag clear
> simultaneously and submit proportional flushes at once — a thundering
> herd that causes p99.9 tail latency spikes.
>
> Add an AS_DONTCACHE_FLUSHING flag to the address_space and use
> test_and_set_bit() to ensure at most one IOCB_DONTCACHE writer
> flushes at a time. Other writers that find the bit set skip their
> flush entirely. The bit is cleared when the flush completes.

This sounds like a bad reimplementation of the single writeback thread
:)

Have you considered stopping to do in-caller writeback for
IOCB_DONTCACHE vs just leaving it to the writeback daeon?

Either by totally disabling the writeback and just leaving the
dropbehind bit, or by queuing up wb_writeback_work instances for
the ranges, or by just increasing the pressure for the writeback
daemon. Note that with all schemes including the one in this patch
we might eventually run into writeback scalability limits, which
will require multiple writeback workers.