[Linux-cluster] GFS mounting fails when rejoining the cluster

gordan at bobich.net gordan at bobich.net
Tue Nov 13 13:22:08 UTC 2007


Hi,

I seem to have a weird problem. I have a 5 node (actually 4 with 
room to put another node in) cluster, if all the nodes 
come up at the same time, everything works fine. If I reboot one of the 
nodes, it doesn't re-connect the GFS share. It says:

Trying to join cluster "lock_dlm" "mycluster:myshare"

dlm: connecting to 1
dlm: connecting to 2
dlm: connection to 3

Joined cluster. Now mounting FS...

dlm: Got connection from 3

and then it just sits there.

The other nodes also lose access to the shared file system. How do I 
troubleshoot this? Everything works OK when the nodes all come up at the 
same time, but the re-joining seems to break the whole cluster.

I'm using CentOS 5 with the latest updates.

Gordan




More information about the Linux-cluster mailing list