[Linux-cluster] Clvm over gnbd + rgmanager

Mon Mar 7 15:27:39 UTC 2005

Ion Alberdi wrote:

> Hi everybody,
>
> I'm now trying to use the cluster logical volume manager.
>                (/dev/hdb)
> debian--------------------------->buba (/dev/gnbd/dd)
>  |             --GNBD---->
>  |
>  |________(/dev/hdb)____>gump(/dev/gnbd/dd)
>
> I create on buba or gump a logical volume of 1G
> (After launching the cluster and gnbd):
> buba#pvcreate /dev/gnbd/dd
> buba#vgcreate vg1 /dev/gnbd/dd
> buba#lvcreate -L1024 -n lv1 vg1
> and
> #vgchange -a y  on the three nodes,
> now the three nodes have /dev/vg1/lv1.
>
> On one of the nodes I create an ext3 fs:
> #mkfs.ext3 -j /dev/vg1/lv1
>
> I launch the rgmanager, which has to run a basic script that writes 
> the name of the node running the script to a file in the ext3 fs.
> All works well until the syslog from the node running the script shows:
>
> Mar  7 15:19:53 gump clurgmgrd[3978]: <notice> status on fs "my fs" 
> returned 1 (generic error)
> /*This is where the problem starts: I don't know why status (isMounted 
> in /usr/share/cluster/fs.sh) returns a failure code.*/
> Mar  7 15:19:53 gump clurgmgrd[3978]: <notice> Stopping resource group 
> hello
> Mar  7 15:19:55 gump clurgmgrd[3978]: <notice> Resource group hello is 
> recovering
> Mar  7 15:19:55 gump clurgmgrd[3978]: <notice> Recovering failed 
> resource group hello
> Mar  7 15:19:55 gump clurgmgrd[3978]: <notice> start on fs "my fs" 
> returned 2 (invalid argument(s))
> /*Syslog is wrong here because fs.sh is not OCF compliant: in 
> fs.sh, exit 2 does not mean wrong argument, but FAIL*/
> Mar  7 15:19:55 gump clurgmgrd[3978]: <warning> #68: Failed to start 
> hello; return value: 1
> Mar  7 15:19:55 gump clurgmgrd[3978]: <notice> Stopping resource group 
> hello
> Mar  7 15:19:57 gump clurgmgrd[3978]: <notice> Resource group hello is 
> recovering
> Mar  7 15:19:57 gump clurgmgrd[3978]: <warning> #71: Relocating failed 
> resource group hello
>
>
> and on the other node:
> Mar  7 15:23:14 buba clurgmgrd[5205]: <notice> start on script "Hello 
> Script" returned 1 (generic error)
> Mar  7 15:23:14 buba clurgmgrd[5205]: <warning> #68: Failed to start 
> hello; return value: 1
> Mar  7 15:23:14 buba clurgmgrd[5205]: <notice> Stopping resource group 
> hello
> Mar  7 15:23:16 buba clurgmgrd[5205]: <notice> Resource group hello is 
> recovering

This is indeed a problem with fs.sh.  It attempts to umount with the 
device name rather than the mount point, which is not guaranteed to work 
in all situations.  From umount(8):

"A file system is specified by giving the directory where it has been
mounted. Giving the special device on which the file system lives may
also work, but is obsolete, mainly because it will fail in case this
device was mounted on more than one directory."

So that's why it is failing.  The solution is probably to umount with 
the mount point.  I'll open a bug and let Lon decide.
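
To illustrate the idea, here is a minimal sketch of unmounting by mount
point instead of by device.  It is not the actual fs.sh code, and it may
not be the fix that ends up being applied.  The function names
mount_points_for and umount_by_mount_point are hypothetical, and the
sketch assumes a /proc/mounts-style table whose first two fields are the
device and the mount point:

```shell
# Hypothetical helper: print every mount point recorded for a device.
# Reads a /proc/mounts-style table (default: /proc/mounts).
mount_points_for() {
    dev=$1
    table=${2:-/proc/mounts}
    awk -v dev="$dev" '$1 == dev { print $2 }' "$table"
}

# Hypothetical helper: unmount a device by each of its mount points
# rather than by the device name.  Returns non-zero if any umount fails.
# Assumption: mount points contain no whitespace (the kernel escapes
# spaces as \040 in /proc/mounts, which this sketch does not decode).
umount_by_mount_point() {
    status=0
    for mp in $(mount_points_for "$1" "${2:-/proc/mounts}"); do
        umount "$mp" || status=1
    done
    return $status
}
```

For example, "mount_points_for /dev/vg1/lv1" lists where the logical
volume is mounted, and "umount_by_mount_point /dev/vg1/lv1" (as root)
unmounts each of those directories in turn, so a device mounted on more
than one directory is still handled.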

Thank you for pointing this out.