
Re: [Linux-cluster] Clvm over gnbd + rgmanager



Ion Alberdi wrote:

Hi everybody,

I'm now trying to use the cluster logical volume manager.
               (/dev/hdb)
debian--------------------------->buba (/dev/gnbd/dd)
 |             --GNBD---->
 |
 |________(/dev/hdb)____>gump(/dev/gnbd/dd)

After starting the cluster and gnbd, I create a 1G logical volume on buba or gump:
buba#pvcreate /dev/gnbd/dd
buba#vgcreate vg1 /dev/gnbd/dd
buba#lvcreate -L1024 -n lv1 vg1
Then I run
#vgchange -a y
on all three nodes, so that each of them now has /dev/vg1/lv1.

On one of the nodes I create an ext3 fs:
#mkfs.ext3 -j /dev/vg1/lv1

I start rgmanager, which has to run a basic script that writes the name of the node running it to a file in the ext3 fs.
Everything works well until the syslog on the node running the script shows:


Mar 7 15:19:53 gump clurgmgrd[3978]: <notice> status on fs "my fs" returned 1 (generic error)
/* The problem starts here: I don't know why status (isMounted in /usr/share/cluster/fs.sh) returns a failure code. */
Mar 7 15:19:53 gump clurgmgrd[3978]: <notice> Stopping resource group hello
Mar 7 15:19:55 gump clurgmgrd[3978]: <notice> Resource group hello is recovering
Mar 7 15:19:55 gump clurgmgrd[3978]: <notice> Recovering failed resource group hello
Mar 7 15:19:55 gump clurgmgrd[3978]: <notice> start on fs "my fs" returned 2 (invalid argument(s))
/* The syslog message is misleading here because fs.sh is not OCF compliant: in fs.sh, exit 2 does not mean invalid argument(s), but FAIL. */
Mar 7 15:19:55 gump clurgmgrd[3978]: <warning> #68: Failed to start hello; return value: 1
Mar 7 15:19:55 gump clurgmgrd[3978]: <notice> Stopping resource group hello
Mar 7 15:19:57 gump clurgmgrd[3978]: <notice> Resource group hello is recovering
Mar 7 15:19:57 gump clurgmgrd[3978]: <warning> #71: Relocating failed resource group hello



and on the other node:
Mar 7 15:23:14 buba clurgmgrd[5205]: <notice> start on script "Hello Script" returned 1 (generic error)
Mar 7 15:23:14 buba clurgmgrd[5205]: <warning> #68: Failed to start hello; return value: 1
Mar 7 15:23:14 buba clurgmgrd[5205]: <notice> Stopping resource group hello
Mar 7 15:23:16 buba clurgmgrd[5205]: <notice> Resource group hello is recovering

This is indeed a problem with fs.sh. It attempts to umount using the device name rather than the mount point, which is not guaranteed to work in all situations. From umount(8):


"A file system is specified by giving the directory where
it has been mounted. Giving the special device on which the file
system lives may also work, but is obsolete, mainly because it will fail
in case this device was mounted on more than one directory."


So that's why it is failing. The solution is probably to umount with the mount point. I'll open a bug and let Lon decide.
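
To illustrate the idea of unmounting by mount point, here is a minimal shell sketch. It is not the actual fs.sh code and not necessarily the fix that will go in; the function names mount_points_for and umount_device are made up for this example. It assumes the device appears in /proc/mounts under exactly the name passed in, which may not hold for LVM volumes (they can be listed under a /dev/mapper name instead).

```shell
#!/bin/sh
# Hypothetical helpers, not taken from fs.sh.

# Print every mount point on which the given device is mounted.
# $1: device path as it appears in the mounts table
# $2: mounts table to read (defaults to /proc/mounts)
mount_points_for() {
    dev=$1
    table=${2:-/proc/mounts}
    # In the mounts table, field 1 is the device and field 2 the mount point.
    awk -v dev="$dev" '$1 == dev { print $2 }' "$table"
}

# Unmount a device by mount point instead of by device name, so it
# still works when the device is mounted on more than one directory.
# Note: mount points containing spaces are escaped (\040) in
# /proc/mounts and would need decoding first; this sketch ignores that.
umount_device() {
    for mp in $(mount_points_for "$1"); do
        umount "$mp" || return 1
    done
}
```

For example, "mount_points_for /dev/vg1/lv1" would list each directory the logical volume is mounted on, and "umount_device /dev/vg1/lv1" (as root) would unmount each of them in turn.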

Thank you for pointing this out.

