[Linux-cluster] Freeze with cluster-2.03.11

Kadlecsik Jozsef kadlec at mail.kfki.hu
Mon Apr 6 11:09:35 UTC 2009


On Sun, 5 Apr 2009, Wendy Cheng wrote:

> Based on code reading ...
> 1. iput() gets inode_lock (a spin lock)
> 2. iput() calls iput_final()
> 3. iput_final() calls gfs_drop_inode() that calls
>    generic_drop_inode()
> 4. generic_drop_inode() unlocks inode_lock.
> 
> In theory, this logic violates the intended use of a spin lock, which is 
> expected to be held only for a short period of time, but gfs_drop_inode() 
> could take a while to finish. It does a blocking page write that needs to 
> make sure the data gets synced to storage before it can return. Making 
> matters worse, inode_lock is a global lock that could block non-GFS 
> threads. One would think a quick fix is to drop inode_lock at the 
> beginning of gfs_drop_inode() and then re-acquire it after GFS syncs the 
> page. Unfortunately, inode_lock is not an exported symbol and GFS is an 
> out-of-tree filesystem that has to be compiled as a kernel module. So 
> this trick won't work for GFS.

Actually, it can work. inode_lock is not private: it is declared in 
<linux/writeback.h>, so gfs can unlock and re-lock it:

--- gfs-orig/ops_super.c	2009-01-22 13:33:51.000000000 +0100
+++ gfs/ops_super.c	2009-04-06 13:07:06.000000000 +0200
@@ -9,6 +9,7 @@
 #include <linux/statfs.h>
 #include <linux/seq_file.h>
 #include <linux/mount.h>
+#include <linux/writeback.h>
 
 #include "gfs.h"
 #include "dio.h"
@@ -68,8 +69,11 @@
 	if (ip &&
 	    !inode->i_nlink &&
 	    S_ISREG(inode->i_mode) &&
-	    !sdp->sd_args.ar_localcaching)
+	    !sdp->sd_args.ar_localcaching) {
+		spin_unlock(&inode_lock);
 		gfs_sync_page_i(inode, DIO_START | DIO_WAIT);
+		spin_lock(&inode_lock);
+	}
 	generic_drop_inode(inode);
 }

I'll give it a try tomorrow; there's no time to test it today.
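
For illustration only, here is a minimal userspace sketch of the same 
drop-and-retake pattern. It is not kernel code: a pthread mutex stands in 
for the inode_lock spinlock, and every name in it (fake_inode_lock, 
struct fake_inode, slow_sync, fake_generic_drop, fake_drop_inode, 
fake_iput) is a made-up stand-in, not a GFS or VFS symbol.

```c
#include <pthread.h>

/* Hypothetical stand-in for the global inode_lock
 * (a mutex here, a spinlock in the kernel). */
static pthread_mutex_t fake_inode_lock = PTHREAD_MUTEX_INITIALIZER;

/* Hypothetical stand-in for an inode: only what the sketch needs. */
struct fake_inode {
	int nlink;	/* link count, playing the role of i_nlink */
	int synced;	/* set once the slow sync has run */
};

/* Stand-in for gfs_sync_page_i(): in the real code this may block,
 * which is why it should not run with the lock held. */
static void slow_sync(struct fake_inode *inode)
{
	inode->synced = 1;
}

/* Stand-in for generic_drop_inode(): entered with the lock held,
 * releases it before returning. */
static void fake_generic_drop(struct fake_inode *inode)
{
	(void)inode;
	pthread_mutex_unlock(&fake_inode_lock);
}

/* Entered with fake_inode_lock held, like gfs_drop_inode(). */
static void fake_drop_inode(struct fake_inode *inode)
{
	if (inode->nlink == 0) {
		/* Drop the lock around the slow part ... */
		pthread_mutex_unlock(&fake_inode_lock);
		slow_sync(inode);
		/* ... and retake it so the callee's contract still holds. */
		pthread_mutex_lock(&fake_inode_lock);
	}
	fake_generic_drop(inode);
}

/* Mirrors the iput() entry: take the lock, then hand off.
 * Returns whether the slow sync ran. */
static int fake_iput(struct fake_inode *inode)
{
	pthread_mutex_lock(&fake_inode_lock);
	fake_drop_inode(inode);
	return inode->synced;
}
```

One general caveat with this pattern: anything protected by the lock may 
change while the lock is dropped, so state read before the unlock may need 
to be re-checked after the lock is retaken. Whether that matters for the 
patch above is one of the things testing should show.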

> With a flight to catch tomorrow and a flu-infected body, I lose the will 
> to think over what the correct fix should and/or will be.

Have a speedy recovery! And thank you for all your efforts!

Best regards,
Jozsef
--
E-mail : kadlec at mail.kfki.hu, kadlec at blackhole.kfki.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: KFKI Research Institute for Particle and Nuclear Physics
         H-1525 Budapest 114, POB. 49, Hungary



