Re: IO queuing and complete affinity with threads (was Re: [PATCH 0/8] IO queuing and complete affinity)

From: Jeremy Higdon
Date: Tue Feb 12 2008 - 03:28:50 EST

Next message: Tvrtko A. Ursulin: "Re: One minute delay when booting 2.6.24.1"
Previous message: Jean Delvare: "Re: [PATCH] hwmon: (adm1026) Properly terminate sysfs groups (Was:panic about sysfs with adm1026)"
In reply to: David Chinner: "Re: IO queuing and complete affinity with threads (was Re: [PATCH 0/8] IO queuing and complete affinity)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Mon, Feb 11, 2008 at 04:22:11PM +1100, David Chinner wrote:
>
> What I think Nick is referring to is the comments I made that at a
> higher layer (e.g. filesystems) migrating completions to the
> submitter CPU may be exactly the wrong thing to do. I don't recall
> making any comments on migrating submitters - I think others have
> already commented on that so I'll ignore that for the moment and
> try to explain why completion on submitter CPU /may/ be bad.
>
> For example, in the case of XFS it is fine for data I/O but it is
> wrong for transaction I/O completion. We want to direct all
> transaction completions to as few CPUs as possible (one, ideally) so
> that all the completion processing happens on the same CPU, rather
> than bouncing global cachelines and locks between all the CPUs
> taking completion interrupts.

So what you want is all XFS processing (for a given filesystem,
presumably) on a limited set of cores (ideally 1) and all block
and SCSI processing (for a given device) on a similarly limited
set.

On Altix, that was far more important than having the interrupt
and issue CPU be close to the hardware -- at least with typical
LSI or Qlogic controllers where there are only one or two MMIO
reads per command issued, and completions can be stacked up.

There is still an advantage to being close to the hardware, but
a much bigger advantage to not bouncing cachelines.

Maybe what you want is a multistage completion mechanism where
each stage can run on a different CPU, if thread context switches
are cheaper than bouncing data structures around....

jeremy
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Tvrtko A. Ursulin: "Re: One minute delay when booting 2.6.24.1"
Previous message: Jean Delvare: "Re: [PATCH] hwmon: (adm1026) Properly terminate sysfs groups (Was:panic about sysfs with adm1026)"
In reply to: David Chinner: "Re: IO queuing and complete affinity with threads (was Re: [PATCH 0/8] IO queuing and complete affinity)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]