Quantcast
Viewing all articles
Browse latest Browse all 19115

0006254: Filesystems became un-writable and apps doing IO are blocks

In random occasions, on several servers running CentOS 6.3, the FS becomes un-writable and applications hung and create a stack dump, only reboot can solve the problem.<br /> <br /> I suspect a spinlock in NFS but I'm not sure.<br /> <br /> We are using EXT4 and NFS (version 3) on these systems (VMware ESX4.1 vms) and it happens on random times on random servers.<br /> The system is:<br /> <br /> Linux wcliwb108 2.6.32-279.2.1.el6.x86_64 <a href="http://bugs.centos.org/view.php?id=1">0000001</a> SMP Fri Jul 20 01:55:29 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux<br /> <br /> INFO: task tail:28122 blocked for more than 120 seconds.<br /> Feb 14 17:15:13 wcliwb108 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.<br /> Feb 14 17:15:13 wcliwb108 kernel: tail D 0000000000000002 0 28122 28094 0x00000084<br /> Feb 14 17:15:13 wcliwb108 kernel: ffff88013ba6fe38 0000000000000082 0000000000000000 0000000000000024<br /> Feb 14 17:15:13 wcliwb108 kernel: ffff88013ba6fe28 ffffffffffffffe9 ffff88013ba6fdc8 ffffffff81178d24<br /> Feb 14 17:15:13 wcliwb108 kernel: ffff8801139a5af8 ffff88013ba6ffd8 000000000000fb88 ffff8801139a5af8<br /> Feb 14 17:15:13 wcliwb108 kernel: Call Trace:<br /> Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff81178d24>] ? nameidata_to_filp+0x54/0x70<br /> Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff814fefbe>] __mutex_lock_slowpath+0x13e/0x180<br /> Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff814fee5b>] mutex_lock+0x2b/0x50<br /> Feb 14 17:15:13 wcliwb108 kernel: [<ffffffffa00e6f60>] ext4_llseek+0x60/0x110 [ext4]<br /> Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff81179d5a>] vfs_llseek+0x3a/0x40<br /> Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff8117b516>] sys_lseek+0x66/0x80<br /> Feb 14 17:15:13 wcliwb108 kernel: [<ffffffff8100b0f2>] system_call_fastpath+0x16/0x1b<br /> Feb 14 17:17:13 wcliwb108 kernel: INFO: task tail:28122 blocked for more than 120 seconds.<br /> Feb 14 17:17:13 wcliwb108 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.<br /> Feb 14 17:17:13 wcliwb108 kernel: tail D 0000000000000002 0 28122 28094 0x00000084<br /> Feb 14 17:17:13 wcliwb108 kernel: ffff88013ba6fe38 0000000000000082 0000000000000000 0000000000000024<br /> Feb 14 17:17:13 wcliwb108 kernel: ffff88013ba6fe28 ffffffffffffffe9 ffff88013ba6fdc8 ffffffff81178d24<br /> Feb 14 17:17:13 wcliwb108 kernel: ffff8801139a5af8 ffff88013ba6ffd8 000000000000fb88 ffff8801139a5af8

Viewing all articles
Browse latest Browse all 19115

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>