Re: Bandwidth Allocations under CFQ I/O Scheduler

From: Jens Axboe
Date: Thu Oct 19 2006 - 08:21:36 EST


On Wed, Oct 18 2006, Paulo Marques wrote:
> Jens Axboe wrote:
> >[...]
> >Precisely, hence CFQ is now based on the time metric. Given larger
> >slices, you can mostly eliminate the impact of other applications in the
> >system.
>
> Just one thought: we can't predict reliably how much time a request will
> take to be serviced, but we can account the time it _took_ to service a
> request.
>
> If we account the time it took to service requests for each process, and
> we have several processes with requests pending, we can use the same
> algorithm we would use for a large time slice algorithm to select the
> process to service.
>
> This should make it as fair over time as a large time slice algorithm
> and doesn't need large time slices, so latencies can be kept as low as
> required.

Two problems:

- You can't chop things down to single request times. A cost of a
request greatly varies depending on what preceeded it, hence you need
to account batches of requests from a process - this is what the time
slice currently accomplishes.

- Whether a process has requests pending or not varies a lot. The
typical bandwidth problem is due to processes doing sync or dependent
io where you only get io in pieces over time.

A request based approach only works over processes that always (or
almost always) have work left to do. You absolutely need the time slice
or some other waiting mechanism to help those that don't.

> However, having a small time slice will probably help the hardware
> coalesce several request from the same process that are more likely to
> be to nearby sectors, and thus improve performance.

Either the process is submittinger larger amounts of io and you'll get
the merging anyways, or it isn't. There's a large difference in time
scales here.

--
Jens Axboe

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/