[Linux-cluster] GFS strange behavior and mount hang on 2.6.9 - 3 nodes

Patrick Caulfield pcaulfie at redhat.com
Tue Nov 9 13:15:16 UTC 2004


On Fri, Nov 05, 2004 at 02:33:57PM -0800, Daniel McNeil wrote:
> I been testing 3-node GFS file system on shared fibre channel
> storage, and run into a couple of strange things.  The 3 nodes
> are cl030, cl031, and cl032.
> 
> 1. After running tar tests on 3 nodes for about a day,
>    I wanted to try out the patch to get rid of the might_sleep()
>    warning.  I umounted the GFS file system on cl031 and then
>    tried to rmmod the lock_dlm module, but couldn't because of
>    the use count on the modules:
> 
> 
> # rmmod lock_dlm
> ERROR: Module lock_dlm is in use
> 
> ---->
> At this point, the lock_dlm module would not unload because 
> it still had a use count of 2.

That will be because of the previous oops I imagine.

 
> The "got connection" messages after the umount look strange.
> What do those messages mean?

They mean that a connection was made to the node that has just dismounted the
filesystem. This is wrong and alomst certainly the cause of what happened below.
I'm looking into it at the moment.


-- 

patrick




More information about the Linux-cluster mailing list