[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

RE: [Linux-cluster] clurgmgrd - <err> #48: Unable to obtain cluster lock: Connec




hiii, for running a service by rgmanager, a lock mechanism is run. this lock mechanism is lock_dlm. You don't lock_nolock. As first, you should load the lock_dlm module. after than you should start the "clurgmgrd".

From: rhurst bidmc harvard edu
Reply-To: linux clustering <linux-cluster redhat com>
To: linux-cluster redhat com
Subject: [Linux-cluster] clurgmgrd - <err> #48: Unable to obtain cluster lock: Connectiontimed out
Date: Mon, 7 May 2007 13:54:56 -0400

What could cause clurgmgrd fail like this?  If clurgmgrd has a hiccup
like this, is it supposed to shutdown its services?  Is there something
in our implementation that could have prevented this from shutting down?

For unexplained reasons, we just had our CS service (WATSON) go down on
its own, and the syslog entry details the event as:

May  7 13:18:39 db1 clurgmgrd[17888]: <err> #48: Unable to obtain
cluster lock: Connection timed out
May  7 13:18:41 db1 kernel: dlm: Magma: reply from 2 no lock
May  7 13:18:41 db1 kernel: dlm: reply
May  7 13:18:41 db1 kernel: rh_cmd 5
May  7 13:18:41 db1 kernel: rh_lkid 200242
May  7 13:18:41 db1 kernel: lockstate 2
May  7 13:18:41 db1 kernel: nodeid 0
May  7 13:18:41 db1 kernel: status 0
May  7 13:18:41 db1 kernel: lkid ee0388
May  7 13:18:41 db1 clurgmgrd[17888]: <notice> Stopping service WATSON

... and its service entry looks like this:

<service autostart="0" domain="DB" exclusive="1" name="WATSON"
recovery="disable">
    <ip address="192.168.3.111" monitor_link="1"/>
    <fs device="/dev/VGWATSON/lvoldata" force_fsck="0" force_unmount="1"
fsid="53188" fstype="ext3" mountpoint="/watson-data"
name="WATSON-lvoldata" options="" self_fence="0">
        <fs device="/dev/VGWATSON/lvoldb1" force_fsck="0"
force_unmount="1" fsid="29524" fstype="ext3"
mountpoint="/watson-data/sys/db1" name="WATSON-lvoldb1" options=""
self_fence="0"/>
        <script file="/etc/init.d/WATSON" name="WATSON RC"/>
    </fs>
    <clusterfs ref="WATSON-lvol0">
        <clusterfs ref="WATSON-lvol1"/>
    </clusterfs>
</service>


Robert Hurst, Sr. Caché Administrator
Beth Israel Deaconess Medical Center
1135 Tremont Street, REN-7
Boston, Massachusetts   02120-2140
617-754-8754 ∙ Fax: 617-754-8730 ∙ Cell: 401-787-3154
Any technology distinguishable from magic is insufficiently advanced.


<< smime.p7s >>




--
Linux-cluster mailing list
Linux-cluster redhat com
https://www.redhat.com/mailman/listinfo/linux-cluster

_________________________________________________________________
Catch suspicious messages before you open them—with Windows Live Hotmail. http://imagine-windowslive.com/hotmail/?locale=en-us&ocid=TXT_TAGHM_migration_HM_mini_protection_0507


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]