On random occasions, on several servers running CentOS 6.3, the filesystem becomes unwritable and applications hang and produce a stack dump. Only a reboot resolves the problem.<br />
<br />
I suspect a spinlock in NFS, but I'm not sure.<br />
<br />
We are using EXT4 and NFS (version 3) on these systems (VMware ESX 4.1 VMs), and the problem occurs at random times on random servers.<br />
The system is:<br />
<br />
Linux wcliwb108 2.6.32-279.2.1.el6.x86_64 #1 SMP Fri Jul 20 01:55:29 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux<br />
<br />
INFO: task tail:28122 blocked for more than 120 seconds.<br />
Feb 14 17:15:13 wcliwb108 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.<br />
Feb 14 17:15:13 wcliwb108 kernel: tail D 0000000000000002 0 28122 28094 0x00000084<br />
Feb 14 17:15:13 wcliwb108 kernel: ffff88013ba6fe38 0000000000000082 0000000000000000 0000000000000024<br />
Feb 14 17:15:13 wcliwb108 kernel: ffff88013ba6fe28 ffffffffffffffe9 ffff88013ba6fdc8 ffffffff81178d24<br />
Feb 14 17:15:13 wcliwb108 kernel: ffff8801139a5af8 ffff88013ba6ffd8 000000000000fb88 ffff8801139a5af8<br />
Feb 14 17:15:13 wcliwb108 kernel: Call Trace:<br />
Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff81178d24>] ? nameidata_to_filp+0x54/0x70<br />
Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff814fefbe>] __mutex_lock_slowpath+0x13e/0x180<br />
Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff814fee5b>] mutex_lock+0x2b/0x50<br />
Feb 14 17:15:13 wcliwb108 kernel: [<ffffffffa00e6f60>] ext4_llseek+0x60/0x110 [ext4]<br />
Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff81179d5a>] vfs_llseek+0x3a/0x40<br />
Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff8117b516>] sys_lseek+0x66/0x80<br />
Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff8100b0f2>] system_call_fastpath+0x16/0x1b<br />
Feb 14 17:17:13 wcliwb108 kernel: INFO: task tail:28122 blocked for more than 120 seconds.<br />
Feb 14 17:17:13 wcliwb108 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.<br />
Feb 14 17:17:13 wcliwb108 kernel: tail D 0000000000000002 0 28122 28094 0x00000084<br />
Feb 14 17:17:13 wcliwb108 kernel: ffff88013ba6fe38 0000000000000082 0000000000000000 0000000000000024<br />
Feb 14 17:17:13 wcliwb108 kernel: ffff88013ba6fe28 ffffffffffffffe9 ffff88013ba6fdc8 ffffffff81178d24<br />
Feb 14 17:17:13 wcliwb108 kernel: ffff8801139a5af8 ffff88013ba6ffd8 000000000000fb88 ffff8801139a5af8
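<br />
For anyone comparing these reports across servers, here is a minimal Python sketch that pulls the task name, PID, and reliable call-trace frames out of hung-task messages like the ones above. The function name `parse_hung_tasks` and the regular expressions are my own assumptions, written against the log format shown in this report; they are not part of any kernel or CentOS tooling. In the trace above, the frames it keeps would show `tail` waiting in `mutex_lock`, called from `ext4_llseek`.

```python
import re

# Header printed by the kernel's hung-task detector, e.g.
# "INFO: task tail:28122 blocked for more than 120 seconds."
HUNG_RE = re.compile(
    r"INFO: task (?P<comm>.+):(?P<pid>\d+) blocked for more than (?P<secs>\d+) seconds"
)

# One call-trace frame, e.g.
# "[<ffffffffa00e6f60>] ext4_llseek+0x60/0x110 [ext4]"
# A leading "? " marks a frame the kernel could not verify.
FRAME_RE = re.compile(
    r"\[<[0-9a-f]+>\]\s+(?P<guess>\? )?(?P<sym>[\w.]+)"
    r"\+0x[0-9a-f]+/0x[0-9a-f]+(?: \[(?P<mod>\w+)\])?"
)


def parse_hung_tasks(lines):
    """Return one dict per hung-task report found in an iterable of log lines."""
    reports = []
    current = None
    for line in lines:
        header = HUNG_RE.search(line)
        if header:
            current = {
                "comm": header["comm"],
                "pid": int(header["pid"]),
                "secs": int(header["secs"]),
                "frames": [],  # list of (symbol, module or None)
            }
            reports.append(current)
            continue
        frame = FRAME_RE.search(line)
        if frame and current is not None:
            if frame["guess"]:
                continue  # skip unverified "?" frames
            current["frames"].append((frame["sym"], frame["mod"]))
    return reports
```

One way to use it, assuming the kernel messages are written to a syslog file such as `/var/log/messages`: open that file, pass the file object to `parse_hung_tasks`, and print each report's `comm`, `pid`, and `frames` to see which lock path the blocked tasks share.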