I keep getting the following error message on my server, and each time it leaves the server unresponsive:<br />
<br />
<br />
Dec 24 12:00:32 kernel: INFO: task kjournald:476 blocked for more than 120 seconds.<br />
Dec 24 12:06:03 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.<br />
Dec 24 12:06:07 kernel: kjournald D 00000487 2660 476 19 501 475 (L-TLB)<br />
Dec 24 12:06:07 kernel: f2d09ed4 00000046 b8154ff9 00000487 00000002 00000000 08eb58b0 0000000a<br />
Dec 24 12:06:08 kernel: f7fcb550 b81566f7 00000487 000016fe 00000002 f7fcb65c c3420788 f2fb5ac0<br />
Dec 24 12:06:09 kernel: 00000000 00000000 f2d09ecc c041f0c8 0098462e c042d86b c350957c ffffffff<br />
Dec 24 12:06:12 kernel: Call Trace:<br />
Dec 24 12:06:15 kernel: [<c041f0c8>] __wake_up+0x2a/0x3d<br />
Dec 24 12:06:17 kernel: [<c042d86b>] getnstimeofday+0x30/0xb6<br />
Dec 24 12:06:18 kernel: [<c06242f8>] io_schedule+0x36/0x59<br />
Dec 24 12:06:19 kernel: [<c04796d7>] sync_buffer+0x30/0x33<br />
Dec 24 12:06:21 kernel: [<c06244cf>] __wait_on_bit+0x33/0x58<br />
Dec 24 12:06:25 kernel: [<c04796a7>] sync_buffer+0x0/0x33<br />
Dec 24 12:06:28 kernel: [<c04796a7>] sync_buffer+0x0/0x33<br />
Dec 24 12:06:29 kernel: [<c0624556>] out_of_line_wait_on_bit+0x62/0x6a<br />
Dec 24 12:06:29 kernel: [<c0437420>] wake_bit_function+0x0/0x3c<br />
Dec 24 12:06:29 kernel: [<c0479654>] __wait_on_buffer+0x1c/0x1f<br />
Dec 24 12:06:29 kernel: [<f885c4c6>] journal_commit_transaction+0x4d6/0xf60 [jbd]<br />
Dec 24 12:06:29 kernel: [<c042e6c5>] lock_timer_base+0x15/0x2f<br />
Dec 24 12:06:29 kernel: [<c042e744>] try_to_del_timer_sync+0x65/0x6c<br />
Dec 24 12:06:29 kernel: [<f885fd38>] kjournald+0xa1/0x1c2 [jbd]<br />
Dec 24 12:06:29 kernel: [<c04373f3>] autoremove_wake_function+0x0/0x2d<br />
Dec 24 12:06:29 kernel: [<f885fc97>] kjournald+0x0/0x1c2 [jbd]<br />
Dec 24 12:06:29 kernel: [<c043732e>] kthread+0xc0/0xee<br />
Dec 24 12:06:29 kernel: [<c043726e>] kthread+0x0/0xee<br />
Dec 24 12:06:29 kernel: [<c0405c87>] kernel_thread_helper+0x7/0x10<br />
Dec 24 12:06:29 kernel: =======================<br />
Dec 24 12:06:29 kernel: INFO: task auditd:2205 blocked for more than 120 seconds.<br />
Dec 24 12:06:29 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.<br />
Dec 24 12:06:29 kernel: auditd D 00000484 2288 2205 1 2237 2204 (NOTLB)<br />
Dec 24 12:06:29 kernel: f2f76ed0 00000086 a6e809e1 00000484 00000473 0000000e 00000000 00000009<br />
<br />
<br />
The server has a software RAID:<br />
<br />
# cat /proc/mdstat <br />
Personalities : [raid1] <br />
md0 : active raid1 sda1[1]<br />
120384 blocks [2/1] [_U]<br />
<br />
md1 : active raid1 sda3[1]<br />
486215168 blocks [2/1] [_U]<br />
<br />
unused devices: <none><br />
<br />
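In case it helps to read the output above: in `[2/1] [_U]`, the first number is how many members the array expects and the second is how many are currently active, so both arrays are running degraded on a single disk. Below is a rough sketch of how that check could be scripted. The function name `degraded_arrays` and the `SAMPLE` constant are my own, and the parsing assumes the plain layout shown above rather than every format `/proc/mdstat` can produce.

```python
import re


def degraded_arrays(mdstat_text):
    """Return {array_name: (expected, active)} for arrays missing members."""
    degraded = {}
    current = None
    for line in mdstat_text.splitlines():
        # Array header lines look like "md0 : active raid1 sda1[1]".
        header = re.match(r"^(md\d+)\s*:", line)
        if header:
            current = header.group(1)
            continue
        # Status lines look like "120384 blocks [2/1] [_U]".
        status = re.search(r"\[(\d+)/(\d+)\]", line)
        if status and current:
            expected, active = int(status.group(1)), int(status.group(2))
            if active < expected:
                degraded[current] = (expected, active)
            current = None
    return degraded


# Copy of the /proc/mdstat output from this post, used as sample input.
SAMPLE = """Personalities : [raid1]
md0 : active raid1 sda1[1]
120384 blocks [2/1] [_U]

md1 : active raid1 sda3[1]
486215168 blocks [2/1] [_U]

unused devices: <none>
"""

if __name__ == "__main__":
    print(degraded_arrays(SAMPLE))
```

On a live system the same function could be fed the contents of `/proc/mdstat` instead of `SAMPLE`.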
Initially I thought the RAID might be the cause, so I removed the second disk from the array, but I still get this error from time to time. I don't know what is causing it. When it happens, the CPU sits in I/O wait. Please let me know if you need more information, and how I can fix this.
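
Both tasks in the log are in state `D` (uninterruptible sleep), and the trace passes through `io_schedule`, which suggests they are waiting on disk I/O. If more information is needed, something like the following sketch could be used to list which processes are in that state while the hang is happening. The helper names `parse_stat` and `blocked_tasks` are mine, and it only looks at `/proc/<pid>/stat` for each process, not at individual threads.

```python
import os


def parse_stat(stat_line):
    """Split a /proc/<pid>/stat line into (pid, comm, state)."""
    # comm is wrapped in parentheses and may contain spaces or ')',
    # so locate the first '(' and the last ')' instead of splitting
    # the whole line on whitespace.
    open_paren = stat_line.index("(")
    close_paren = stat_line.rindex(")")
    pid = int(stat_line[:open_paren])
    comm = stat_line[open_paren + 1:close_paren]
    state = stat_line[close_paren + 1:].split()[0]
    return pid, comm, state


def blocked_tasks(proc_root="/proc"):
    """Return [(pid, comm)] for processes currently in state 'D'."""
    blocked = []
    try:
        entries = os.listdir(proc_root)
    except OSError:
        return blocked  # no procfs at this path
    for entry in entries:
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "stat")) as fh:
                pid, comm, state = parse_stat(fh.read())
        except (OSError, ValueError):
            continue  # process exited or the line was unreadable
        if state == "D":
            blocked.append((pid, comm))
    return blocked


if __name__ == "__main__":
    for pid, comm in blocked_tasks():
        print(pid, comm)
```

Running it a few times during a hang and comparing the output with the task names in the kernel log would show whether the same processes stay blocked.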