Re: [RFC PATCH] block: Fix bio merge induced high I/O latency

From: Mathieu Desnoyers
Date: Sun Feb 01 2009 - 21:08:39 EST


Hi Jens,

I tried your patch at

http://bugzilla.kernel.org/attachment.cgi?id=20001

On a 2.6.29-rc3 kernel. I get the following OOPS just after I start
running the fio test. It happens after a few

cfq: moving ffff88043d4b42e0 to dispatch
cfq: moving ffff88043d4b4170 to dispatch

messages (~20).

Here is the oops :

------------[ cut here ]------------
kernel BUG at block/cfq-iosched.c:650!
invalid opcode: 0000 [#1] PREEMPT SMP
LTT NESTING LEVEL : 0
last sysfs file: /sys/block/sda/stat
CPU 2
Modules linked in: loop ltt_tracer ltt_trace_control ltt_userspa]
Pid: 2934, comm: kjournald Not tainted 2.6.29-rc3 #3
RIP: 0010:[<ffffffff80419c2b>] [<ffffffff80419c2b>] cfq_remove_0
RSP: 0018:ffff88043b167c20 EFLAGS: 00010046
RAX: 0000000000000000 RBX: ffff88043fd9e088 RCX: 0000000000000001
RDX: 0000000000000010 RSI: ffff88043887b590 RDI: ffff88043887b590
RBP: ffff88043b167c50 R08: 0000000000000002 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: ffff88043fd9e088
R13: ffff88043887b590 R14: ffff88043fc40200 R15: ffff88043fd9e088
FS: 0000000000000000(0000) GS:ffff88043e81a080(0000) knlGS:00000
CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 00007f2a5f98b8c0 CR3: 000000043e8c4000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process kjournald (pid: 2934, threadinfo ffff88043b166000, task )
Stack:
000000000000003b ffff88043887b590 ffff88043fd9e088 ffff88043e5a0
ffff88043fc40200 ffff88002809ed50 ffff88043b167c80 ffffffff8041d
0000000000000001 ffff88043887b590 ffffe2001b805138 ffff88043e5a0
Call Trace:
[<ffffffff80419e4d>] cfq_dispatch_insert+0x3d/0x70
[<ffffffff80419f2f>] cfq_wait_on_page+0xaf/0xc0
[<ffffffff804098ed>] elv_wait_on_page+0x1d/0x20
[<ffffffff8040d207>] blk_backing_dev_wop+0x17/0x50
[<ffffffff80301872>] sync_buffer+0x52/0x80
[<ffffffff806a33b2>] __wait_on_bit+0x62/0x90
[<ffffffff80301820>] ? sync_buffer+0x0/0x80
[<ffffffff80301820>] ? sync_buffer+0x0/0x80
[<ffffffff806a3459>] out_of_line_wait_on_bit+0x79/0x90
[<ffffffff8025a8a0>] ? wake_bit_function+0x0/0x50
[<ffffffff80301769>] __wait_on_buffer+0xf9/0x130
[<ffffffff80379acd>] journal_commit_transaction+0x72d/0x1650
[<ffffffff806a5c87>] ? _spin_unlock_irqrestore+0x47/0x80
[<ffffffff8024dd2f>] ? try_to_del_timer_sync+0x5f/0x70
[<ffffffff8037e488>] kjournald+0xe8/0x250
[<ffffffff8025a860>] ? autoremove_wake_function+0x0/0x40
[<ffffffff8037e3a0>] ? kjournald+0x0/0x250
[<ffffffff8025a38e>] kthread+0x4e/0x90
[<ffffffff8025a340>] ? kthread+0x0/0x90
[<ffffffff8020db2a>] child_rip+0xa/0x20
[<ffffffff8020d480>] ? restore_args+0x0/0x30
[<ffffffff8025a340>] ? kthread+0x0/0x90
[<ffffffff8020db20>] ? child_rip+0x0/0x20
Code: 4d 89 6d 00 49 8b 9d c0 00 00 00 41 8b 45 48 4c 8b 73 08 2
RIP [<ffffffff80419c2b>] cfq_remove_request+0x6b/0x250
RSP <ffff88043b167c20>
---[ end trace eab134a8bd405d05 ]---

It seems that the cfqq->queued[sync] counter should either be
incremented/decremented in the new cfq_wait_on_page, or that the fact
that the type of request (sync vs !sync) changes would not be taken care
of correctly. I have not looked at the code enough to find out exactly
what is happening, but I though you might have an idea of the cause.

Thanks,

Mathieu

--
Mathieu Desnoyers
OpenPGP key fingerprint: 8CD5 52C3 8E3C 4140 715F BA06 3F25 A8FE 3BAE 9A68
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/