Re: Reiserfs deadlock in 2.6.36

From: Bastien ROUCARIES
Date: Fri Nov 19 2010 - 05:04:31 EST


Le jeudi 18 novembre 2010 17:30:51, Frederic Weisbecker a écrit :
> On Thu, Nov 18, 2010 at 04:49:56PM +0100, Bastien ROUCARIES wrote:
> > Hi,
>
> > It seems that reiserfs deadlock like in bug #4850 on my machine:
> Hi Bastien,
>
> This really looks like a hung task detector report.
> Several tasks are stuck in queue_log_writer(), waiting
> to be woken up on the "journal->j_join_wait" event and
> that never happens because the waker is also stuck.
> The problem is your report doesn't show where the waker
> is stuck, but the hung task detector reports it, it just
> did before or after the chunk you've posted.
>
> If you could provide me the entire report, I could fix this
> easily.

there is something after in my logs
Nov 18 15:27:08 portablebastien kernel: [20040.668744] kded4 D 00000001004a8412 0 2869 1 0x00000000
Nov 18 15:27:08 portablebastien kernel: [20040.668752] ffff8801114adf60 0000000000000086 ffff880100000000 ffffffff81632020
Nov 18 15:27:08 portablebastien kernel: [20040.668760] 0000000000014940 0000000000014940 ffff880037ad7fd8 0000000000014940
Nov 18 15:27:08 portablebastien kernel: [20040.668770] 0000000000014940 ffff8801114ae238 ffff8801114ae240 ffff8801114adf60
Nov 18 15:27:08 portablebastien kernel: [20040.668780] Call Trace:
Nov 18 15:27:08 portablebastien kernel: [20040.668815] [<ffffffffa01aab69>] ? queue_log_writer+0x7e/0xac [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.668825] [<ffffffff8103f7c9>] ? default_wake_function+0x0/0xf
Nov 18 15:27:08 portablebastien kernel: [20040.668840] [<ffffffffa01ae70a>] ? do_journal_begin_r+0x1d3/0x2c0 [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.668848] [<ffffffff8112c164>] ? __dquot_initialize+0x20/0x16d
Nov 18 15:27:08 portablebastien kernel: [20040.668862] [<ffffffffa01ae8b4>] ? journal_begin+0xbd/0xfa [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.668875] [<ffffffffa0196bff>] ? reiserfs_create+0xe6/0x205 [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.668883] [<ffffffff810f4712>] ? generic_permission+0xe/0x8a
Nov 18 15:27:08 portablebastien kernel: [20040.668892] [<ffffffff810f54b7>] ? vfs_create+0x66/0x88
Nov 18 15:27:08 portablebastien kernel: [20040.668898] [<ffffffff810f49c5>] ? d_alloc_and_lookup+0x4a/0x67
Nov 18 15:27:08 portablebastien kernel: [20040.668904] [<ffffffff810f5e6a>] ? do_last+0x268/0x536
Nov 18 15:27:08 portablebastien kernel: [20040.668911] [<ffffffff810f7aa9>] ? do_filp_open+0x1e4/0x53b
Nov 18 15:27:08 portablebastien kernel: [20040.668919] [<ffffffff810eb2e0>] ? do_sys_open+0x56/0xe4
Nov 18 15:27:08 portablebastien kernel: [20040.668926] [<ffffffff81008a02>] ? system_call_fastpath+0x16/0x1b
Nov 18 15:27:08 portablebastien kernel: [20040.673550] kalarm D 00000001004a5de3 0 2995 1 0x00000000
Nov 18 15:27:08 portablebastien kernel: [20040.673557] ffff88011cfced00 0000000000000082 ffff880000000000 ffffffff81632020
Nov 18 15:27:08 portablebastien kernel: [20040.673563] 0000000000014940 0000000000014940 ffff8800cabfffd8 0000000000014940
Nov 18 15:27:08 portablebastien kernel: [20040.673571] 0000000000014940 ffff88011cfcefd8 ffff88011cfcefe0 ffff88011cfced00
Nov 18 15:27:08 portablebastien kernel: [20040.673580] Call Trace:
Nov 18 15:27:08 portablebastien kernel: [20040.673598] [<ffffffffa01aab69>] ? queue_log_writer+0x7e/0xac [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.673606] [<ffffffff8103f7c9>] ? default_wake_function+0x0/0xf
Nov 18 15:27:08 portablebastien kernel: [20040.673620] [<ffffffffa01ae70a>] ? do_journal_begin_r+0x1d3/0x2c0 [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.673633] [<ffffffffa01b1c65>] ? xattr_lookup+0x17/0xef [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.673647] [<ffffffffa01ae8b4>] ? journal_begin+0xbd/0xfa [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.673661] [<ffffffffa01a0516>] ? reiserfs_dirty_inode+0x56/0x95 [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.673668] [<ffffffff81048a76>] ? current_fs_time+0x1e/0x24
Nov 18 15:27:08 portablebastien kernel: [20040.673676] [<ffffffff81107d03>] ? __mark_inode_dirty+0x27/0x1ac
Nov 18 15:27:08 portablebastien kernel: [20040.673683] [<ffffffff810fdcf4>] ? file_update_time+0xf6/0x121
Nov 18 15:27:08 portablebastien kernel: [20040.673690] [<ffffffff810b0703>] ? __generic_file_aio_write+0x164/0x280
Nov 18 15:27:08 portablebastien kernel: [20040.673698] [<ffffffff8124e41e>] ? sock_aio_read+0xb6/0xc5
Nov 18 15:27:08 portablebastien kernel: [20040.673704] [<ffffffff810b0874>] ? generic_file_aio_write+0x55/0x9f
Nov 18 15:27:08 portablebastien kernel: [20040.673711] [<ffffffff810ec718>] ? do_sync_write+0xb1/0xea
Nov 18 15:27:08 portablebastien kernel: [20040.673719] [<ffffffff811533f3>] ? security_file_permission+0x18/0x2e
Nov 18 15:27:08 portablebastien kernel: [20040.673726] [<ffffffff810ecc4b>] ? vfs_write+0xa4/0x100
Nov 18 15:27:08 portablebastien kernel: [20040.673732] [<ffffffff810ecd5a>] ? sys_write+0x45/0x6b
Nov 18 15:27:08 portablebastien kernel: [20040.673738] [<ffffffff81008a02>] ? system_call_fastpath+0x16/0x1b
Nov 18 15:27:08 portablebastien kernel: [20040.678435] akregator D ffffc900054360e0 0 2996 1 0x00000000
Nov 18 15:27:08 portablebastien kernel: [20040.678441] ffff880037bf2210 0000000000000082 ffff8800cab77aa8 ffff88011f6d6630
Nov 18 15:27:08 portablebastien kernel: [20040.678448] 0000000000014940 0000000000014940 ffff8800cab77fd8 0000000000014940
Nov 18 15:27:08 portablebastien kernel: [20040.678454] 0000000000014940 ffff880037bf24e8 ffff880037bf24f0 ffff880037bf2210
Nov 18 15:27:08 portablebastien kernel: [20040.678460] Call Trace:
Nov 18 15:27:08 portablebastien kernel: [20040.678467] [<ffffffff8130e223>] ? schedule_hrtimeout_range_clock+0xc4/0x125
Nov 18 15:27:08 portablebastien kernel: [20040.678482] [<ffffffffa01aab69>] ? queue_log_writer+0x7e/0xac [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.678488] [<ffffffff8103f7c9>] ? default_wake_function+0x0/0xf
Nov 18 15:27:08 portablebastien kernel: [20040.678501] [<ffffffffa01ae70a>] ? do_journal_begin_r+0x1d3/0x2c0 [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.678514] [<ffffffffa01b1c65>] ? xattr_lookup+0x17/0xef [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.678526] [<ffffffffa01ae8b4>] ? journal_begin+0xbd/0xfa [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.678539] [<ffffffffa01a0516>] ? reiserfs_dirty_inode+0x56/0x95 [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.678544] [<ffffffff81048a76>] ? current_fs_time+0x1e/0x24
Nov 18 15:27:08 portablebastien kernel: [20040.678550] [<ffffffff81107d03>] ? __mark_inode_dirty+0x27/0x1ac
Nov 18 15:27:08 portablebastien kernel: [20040.678555] [<ffffffff810fdcf4>] ? file_update_time+0xf6/0x121
Nov 18 15:27:08 portablebastien kernel: [20040.678560] [<ffffffff810b0703>] ? __generic_file_aio_write+0x164/0x280
Nov 18 15:27:08 portablebastien kernel: [20040.678566] [<ffffffff810b16f0>] ? generic_file_aio_read+0x51e/0x576
Nov 18 15:27:08 portablebastien kernel: [20040.678571] [<ffffffff810b0874>] ? generic_file_aio_write+0x55/0x9f
Nov 18 15:27:08 portablebastien kernel: [20040.678577] [<ffffffff810ec718>] ? do_sync_write+0xb1/0xea
Nov 18 15:27:08 portablebastien kernel: [20040.678583] [<ffffffff811533f3>] ? security_file_permission+0x18/0x2e
Nov 18 15:27:08 portablebastien kernel: [20040.678588] [<ffffffff810ecc4b>] ? vfs_write+0xa4/0x100
Nov 18 15:27:08 portablebastien kernel: [20040.678593] [<ffffffff810ecd5a>] ? sys_write+0x45/0x6b
Nov 18 15:27:08 portablebastien kernel: [20040.678598] [<ffffffff81008a02>] ? system_call_fastpath+0x16/0x1b
Nov 18 15:27:08 portablebastien kernel: [20040.683411] pidgin D 00000001004abe7c 0 7472 2942 0x00000000
Nov 18 15:27:08 portablebastien kernel: [20040.683417] ffff8800b83cc420 0000000000000082 0000000000000000 ffffffff81632020
Nov 18 15:27:08 portablebastien kernel: [20040.683424] 0000000000014940 0000000000014940 ffff88003d1d7fd8 0000000000014940
Nov 18 15:27:08 portablebastien kernel: [20040.683430] 0000000000014940 ffff8800b83cc6f8 ffff8800b83cc700 ffff8800b83cc420
Nov 18 15:27:08 portablebastien kernel: [20040.683436] Call Trace:
Nov 18 15:27:08 portablebastien kernel: [20040.683451] [<ffffffffa01aab69>] ? queue_log_writer+0x7e/0xac [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.683457] [<ffffffff8103f7c9>] ? default_wake_function+0x0/0xf
Nov 18 15:27:08 portablebastien kernel: [20040.683470] [<ffffffffa01ae70a>] ? do_journal_begin_r+0x1d3/0x2c0 [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.683476] [<ffffffff8112c164>] ? __dquot_initialize+0x20/0x16d
Nov 18 15:27:08 portablebastien kernel: [20040.683489] [<ffffffffa01ae8b4>] ? journal_begin+0xbd/0xfa [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.683500] [<ffffffffa0196bff>] ? reiserfs_create+0xe6/0x205 [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.683506] [<ffffffff810f4712>] ? generic_permission+0xe/0x8a
Nov 18 15:27:08 portablebastien kernel: [20040.683512] [<ffffffff810f54b7>] ? vfs_create+0x66/0x88
Nov 18 15:27:08 portablebastien kernel: [20040.683517] [<ffffffff810f5e6a>] ? do_last+0x268/0x536
Nov 18 15:27:08 portablebastien kernel: [20040.683523] [<ffffffff810f7aa9>] ? do_filp_open+0x1e4/0x53b
Nov 18 15:27:08 portablebastien kernel: [20040.683529] [<ffffffff810eb2e0>] ? do_sys_open+0x56/0xe4
Nov 18 15:27:08 portablebastien kernel: [20040.683534] [<ffffffff81008a02>] ? system_call_fastpath+0x16/0x1b
Nov 18 15:27:08 portablebastien kernel: [20040.688478] okular D 00000001004a9d20 0 19668 19569 0x00000000
Nov 18 15:27:08 portablebastien kernel: [20040.688485] ffff8800b813caf0 0000000000000082 0000000000000000 ffff88011f65caf0
Nov 18 15:27:08 portablebastien kernel: [20040.688491] 0000000000014940 0000000000014940 ffff8800b82f9fd8 0000000000014940
Nov 18 15:27:08 portablebastien kernel: [20040.688497] 0000000000014940 ffff8800b813cdc8 ffff8800b813cdd0 ffff8800b813caf0
Nov 18 15:27:08 portablebastien kernel: [20040.688506] Call Trace:
Nov 18 15:27:08 portablebastien kernel: [20040.688522] [<ffffffffa01aab69>] ? queue_log_writer+0x7e/0xac [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.688530] [<ffffffff8103f7c9>] ? default_wake_function+0x0/0xf
Nov 18 15:27:08 portablebastien kernel: [20040.688543] [<ffffffffa01ae70a>] ? do_journal_begin_r+0x1d3/0x2c0 [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.688558] [<ffffffffa01ae8b4>] ? journal_begin+0xbd/0xfa [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.688571] [<ffffffffa01997a4>] ? reiserfs_truncate_file+0x175/0x279 [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.688579] [<ffffffff810c728f>] ? unmap_mapping_range+0x59/0xf5
Nov 18 15:27:08 portablebastien kernel: [20040.688592] [<ffffffffa019b6ae>] ? reiserfs_vfs_truncate_file+0x22/0x2d [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.688599] [<ffffffff810b9269>] ? vmtruncate+0x36/0x41
Nov 18 15:27:08 portablebastien kernel: [20040.688612] [<ffffffffa0197c20>] ? reiserfs_setattr+0x26f/0x2f4 [reiserfs]
Nov 18 15:27:08 portablebastien kernel: [20040.688619] [<ffffffff81048a76>] ? current_fs_time+0x1e/0x24
Nov 18 15:27:08 portablebastien kernel: [20040.688626] [<ffffffff810ff2e1>] ? notify_change+0x195/0x27e
Nov 18 15:27:08 portablebastien kernel: [20040.688633] [<ffffffff810ec0ac>] ? do_truncate+0x68/0x86
Nov 18 15:27:08 portablebastien kernel: [20040.688639] [<ffffffff810eb531>] ? __dentry_open+0x186/0x291
Nov 18 15:27:08 portablebastien kernel: [20040.688645] [<ffffffff810f45e8>] ? get_write_access+0x10/0x3a
Nov 18 15:27:08 portablebastien kernel: [20040.688652] [<ffffffff810f605f>] ? do_last+0x45d/0x536
Nov 18 15:27:08 portablebastien kernel: [20040.688658] [<ffffffff810f7aa9>] ? do_filp_open+0x1e4/0x53b
Nov 18 15:27:08 portablebastien kernel: [20040.688665] [<ffffffff810ec802>] ? do_sync_read+0xb1/0xea
Nov 18 15:27:08 portablebastien kernel: [20040.688672] [<ffffffff810eb2e0>] ? do_sys_open+0x56/0xe4
Nov 18 15:27:08 portablebastien kernel: [20040.688678] [<ffffffff81008a02>] ? system_call_fastpath+0x16/0x1b
Nov 18 15:29:08 portablebastien kernel: [20160.691920] kdeinit4 D 00000001004b2d6e 0 2866 1 0x00000000
Nov 18 15:29:08 portablebastien kernel: [20160.691928] ffff88011cfc8da0 0000000000000086 0000004000000000 ffffffff81632020
Nov 18 15:29:08 portablebastien kernel: [20160.691935] 0000000000014940 0000000000014940 ffff8800379f3fd8 0000000000014940
Nov 18 15:29:08 portablebastien kernel: [20160.691941] 0000000000014940 ffff88011cfc9078 ffff88011cfc9080 ffff88011cfc8da0
Nov 18 15:29:08 portablebastien kernel: [20160.691947] Call Trace:
Nov 18 15:29:08 portablebastien kernel: [20160.691958] [<ffffffff8130e00e>] ? __mutex_lock_common+0x127/0x193
Nov 18 15:29:08 portablebastien kernel: [20160.691965] [<ffffffff81030624>] ? flush_tlb_page+0x56/0x73
Nov 18 15:29:08 portablebastien kernel: [20160.691970] [<ffffffff8130e146>] ? mutex_lock+0x1a/0x33
Nov 18 15:29:08 portablebastien kernel: [20160.691976] [<ffffffff810c9358>] ? do_wp_page+0x309/0x70d
Nov 18 15:29:08 portablebastien kernel: [20160.691982] [<ffffffff810b085f>] ? generic_file_aio_write+0x40/0x9f
Nov 18 15:29:08 portablebastien kernel: [20160.691989] [<ffffffff810ec718>] ? do_sync_write+0xb1/0xea
Nov 18 15:29:08 portablebastien kernel: [20160.691996] [<ffffffff811533f3>] ? security_file_permission+0x18/0x2e
Nov 18 15:29:08 portablebastien kernel: [20160.692027] [<ffffffff810ecc4b>] ? vfs_write+0xa4/0x100
Nov 18 15:29:08 portablebastien kernel: [20160.692032] [<ffffffff810ecd5a>] ? sys_write+0x45/0x6b
Nov 18 15:29:08 portablebastien kernel: [20160.692038] [<ffffffff81008a02>] ? system_call_fastpath+0x16/0x1b
Nov 18 15:29:08 portablebastien kernel: [20160.695820] kded4 D 00000001004a8412 0 2869 1 0x00000000
Nov 18 15:29:08 portablebastien kernel: [20160.695826] ffff8801114adf60 0000000000000086 ffff880100000000 ffffffff81632020
Nov 18 15:29:08 portablebastien kernel: [20160.695832] 0000000000014940 0000000000014940 ffff880037ad7fd8 0000000000014940
Nov 18 15:29:08 portablebastien kernel: [20160.695838] 0000000000014940 ffff8801114ae238 ffff8801114ae240 ffff8801114adf60
Nov 18 15:29:08 portablebastien kernel: [20160.695844] Call Trace:
Nov 18 15:29:08 portablebastien kernel: [20160.695876] [<ffffffffa01aab69>] ? queue_log_writer+0x7e/0xac [reiserfs]
Nov 18 15:29:08 portablebastien kernel: [20160.695884] [<ffffffff8103f7c9>] ? default_wake_function+0x0/0xf
Nov 18 15:29:08 portablebastien kernel: [20160.695897] [<ffffffffa01ae70a>] ? do_journal_begin_r+0x1d3/0x2c0 [reiserfs]
Nov 18 15:29:08 portablebastien kernel: [20160.695904] [<ffffffff8112c164>] ? __dquot_initialize+0x20/0x16d
Nov 18 15:29:08 portablebastien kernel: [20160.695917] [<ffffffffa01ae8b4>] ? journal_begin+0xbd/0xfa [reiserfs]
Nov 18 15:29:08 portablebastien kernel: [20160.695928] [<ffffffffa0196bff>] ? reiserfs_create+0xe6/0x205 [reiserfs]
Nov 18 15:29:08 portablebastien kernel: [20160.695935] [<ffffffff810f4712>] ? generic_permission+0xe/0x8a
Nov 18 15:29:08 portablebastien kernel: [20160.695941] [<ffffffff810f54b7>] ? vfs_create+0x66/0x88
Nov 18 15:29:08 portablebastien kernel: [20160.695946] [<ffffffff810f49c5>] ? d_alloc_and_lookup+0x4a/0x67
Nov 18 15:29:08 portablebastien kernel: [20160.695951] [<ffffffff810f5e6a>] ? do_last+0x268/0x536
Nov 18 15:29:08 portablebastien kernel: [20160.695957] [<ffffffff810f7aa9>] ? do_filp_open+0x1e4/0x53b
Nov 18 15:29:08 portablebastien kernel: [20160.695963] [<ffffffff810eb2e0>] ? do_sys_open+0x56/0xe4
Nov 18 15:29:08 portablebastien kernel: [20160.695968] [<ffffffff81008a02>] ? system_call_fastpath+0x16/0x1b
Nov 18 15:29:08 portablebastien kernel: [20160.699768] ksmserver D 00000001004b2f71 0 2917 2866 0x00000004
Nov 18 15:29:08 portablebastien kernel: [20160.699774] ffff8800d70428e0 0000000000000082 0000000000000000 ffffffff81632020
Nov 18 15:29:08 portablebastien kernel: [20160.699781] 0000000000014940 0000000000014940 ffff8800d706bfd8 0000000000014940
Nov 18 15:29:08 portablebastien kernel: [20160.699787] 0000000000014940 ffff8800d7042bb8 ffff8800d7042bc0 ffff8800d70428e0
Nov 18 15:29:08 portablebastien kernel: [20160.699793] Call Trace:
Nov 18 15:29:08 portablebastien kernel: [20160.699799] [<ffffffff8130e00e>] ? __mutex_lock_common+0x127/0x193
Nov 18 15:29:08 portablebastien kernel: [20160.699804] [<ffffffff8130e146>] ? mutex_lock+0x1a/0x33
Nov 18 15:29:08 portablebastien kernel: [20160.699809] [<ffffffff8130dd12>] ? __wait_on_bit_lock+0x76/0x85
Nov 18 15:29:08 portablebastien kernel: [20160.699815] [<ffffffff810b085f>] ? generic_file_aio_write+0x40/0x9f
Nov 18 15:29:08 portablebastien kernel: [20160.699820] [<ffffffff810ec718>] ? do_sync_write+0xb1/0xea
Nov 18 15:29:08 portablebastien kernel: [20160.699826] [<ffffffff810d011f>] ? do_page_add_anon_rmap+0x82/0x8f
Nov 18 15:29:08 portablebastien kernel: [20160.699832] [<ffffffff811533f3>] ? security_file_permission+0x18/0x2e
Nov 18 15:29:08 portablebastien kernel: [20160.699837] [<ffffffff810ecc4b>] ? vfs_write+0xa4/0x100
Nov 18 15:29:08 portablebastien kernel: [20160.699843] [<ffffffff810ecd5a>] ? sys_write+0x45/0x6b
Nov 18 15:29:08 portablebastien kernel: [20160.699848] [<ffffffff81008a02>] ? system_call_fastpath+0x16/0x1b
Nov 18 15:29:13 portablebastien shutdown[20760]: shutting down for system reboot

>
> There are three places that can do this wakeup:
>
> - reiserfs_allow_writes()
> - wake_queued_writers()
> - do_journal_end()
>
> So it would be really great if you could select the following
> config options:
>
> CONFIG_REISERFS_CHECK
> CONFIG_PROVE_LOCKING
> CONFIG_DEBUG_SPINLOCK_SLEEP
> CONFIG_DEBUG_MUTEXES
> CONFIG_DEBUG_SPINLOCK
> CONFIG_FRAME_POINTER
> CONFIG_DETECT_HUNG_TASK
>
> and try to reproduce the problem. You need to wait 3 minutes after
> the lockup. And then, dmesg must contain everything I need to
> fix this.

Unfortunatly it was the only trace I could get, will try to reproduce it however.

Bastien

> Thanks a lot in advance!
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/