On Fri, Nov 03 2006, Brent Baccala wrote:Hello -
I'm running 2.6.15 (Debian) on a Pentium M laptop, PCI attached ext3
filesystem.
I'm writing my first asynchronous I/O program, and for a while I
thought I was really doing something wrong, but more and more I'm
starting to conclude that the problem might be in the kernel.
Basically, I've narrowed things down to a test program which opens a
large (700 MB) file in O_DIRECT mode and fires off 100 one MB async
reads for the first 100 MB of data. The enqueues take about 5 seconds
to complete, which is also about the amount of time this disk needs to
read 100 MB, so I suspect that it's blocking.
I've gotten the POSIX AIO interface at least tolerably running using
the GLIBC thread-based implementation, but I really want the native
interface working.
I whittled the test program down to use system calls instead of the
POSIX AIO library, and I'm attaching a copy. You put a big file at
'testfile' (it just reads it) and run the program:
baccala@debian ~/src/endgame$ time ./testaio
Enqueues starting
Enqueues complete
real 0m5.327s
user 0m0.004s
sys 0m0.740s
baccala@debian ~/src/endgame$
Of that five seconds, it's almost all spent between the two "enqueues"
messages.
You don't mention what hardware you are running this on (the disk sub
system). io_submit() will block, if you run out of block layer requests.
We have 128 of those by default, but if your io ends up getting chopped
into somewhat smaller bits than 1MiB each, then you end up having to
block on allocation of those. So lets say your /src is mounted on
/dev/sdaX, try:
# echo 512 > /sys/block/sda/queue/nr_requests
(substitute sda for whatever device your /src is on)
and re-test. The time between starting and complete should be a lot
smaller, now that you are not blocking on blkdev request allocation. You
may also want to look at the max_sectors_kb in the queue/ directory,
that'll tell you how large a single io will be at most once it reaches
the driver.
--
Jens Axboe