[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

[Linux-cluster] startup problem: "cman not started: Can't bind to local cman socket /usr/sbin/cman_tool: aisexec daemon didn't start "



it is high likely that anybody already knows this error and knows how to fix it:

---snip---
[root box1 ~]# /etc/init.d/cman start
Starting cluster:
Loading modules... done
Mounting configfs... done
Starting ccsd... done
Starting cman... failed
cman not started: Can't bind to local cman socket /usr/sbin/cman_tool: aisexec daemon didn't start
[FAILED]
[root box1 ~]#

---snap--- 

the cluster does not start even what is written in the logfileif looks OK for me:

---snip---

Apr 24 19:59:06 box1 ccsd[5129]: Starting ccsd 2.0.60:
Apr 24 19:59:06 box1 ccsd[5129]: Built: Jan 24 2007 15:31:03
Apr 24 19:59:06 box1 ccsd[5129]: Copyright (C) Red Hat, Inc. 2004 All rights reserved.
Apr 24 19:59:06 box1 ccsd[5129]: cluster.conf (cluster name = alpha_cluster, version = 6) found.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] AIS Executive Service RELEASE 'subrev 1204 version 0.80.1'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Copyright (C) 2002-2006 MontaVista Software, Inc and contributors.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Copyright (C) 2006 Red Hat, Inc.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Using default multicast address of 239.192.196.121
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_cpg loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais cluster closed process group service v1.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_cfg loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais configuration service'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_msg loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais message service B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_lck loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais distributed locking service B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_evt loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais event service B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_ckpt loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais checkpoint service B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_amf loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais availability management framework B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_clm loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais cluster membership service B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_evs loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais extended virtual synchrony service'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_cman loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais CMAN membership service 2.01'
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] entering GATHER state from 12.
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] Creating commit token because I am the rep.
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] Saving state aru 1e high seq received 1e
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] Storing new sequence id for ring 304
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] entering COMMIT state.
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] entering RECOVERY state.
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] position [0] member 192.168.50.194:
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] previous ring seq 768 rep 192.168.50.194
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] aru 1e high delivered 1e received flag 0
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] position [1] member 192.168.50.195:
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] previous ring seq 768 rep 192.168.50.194
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] aru 1e high delivered 1e received flag 0
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] position [2] member 192.168.50.196:
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] previous ring seq 768 rep 192.168.50.194
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] aru 1e high delivered 1e received flag 0
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] Did not need to originate any messages in recovery.
Apr 24 19:59:09 box1 openais[5070]: [CLM ] CLM CONFIGURATION CHANGE
Apr 24 19:59:09 box1 openais[5070]: [CLM ] New Configuration:
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.194)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.195)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.196)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Left:
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Joined:
Apr 24 19:59:09 box1 openais[5070]: [SYNC ] This node is within the primary component and will provide service.
Apr 24 19:59:09 box1 openais[5070]: [CLM ] CLM CONFIGURATION CHANGE
Apr 24 19:59:09 box1 openais[5070]: [CLM ] New Configuration:
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.194)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.195)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.196)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Left:
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Joined:
Apr 24 19:59:09 box1 openais[5070]: [SYNC ] This node is within the primary component and will provide service.
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] entering OPERATIONAL state.
Apr 24 19:59:09 box1 openais[5070]: [CLM ] got nodejoin message 192.168.50.195
Apr 24 19:59:09 box1 openais[5070]: [CLM ] got nodejoin message 192.168.50.196
Apr 24 19:59:09 box1 openais[5070]: [CLM ] got nodejoin message 192.168.50.194
Apr 24 19:59:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 30 seconds.
Apr 24 20:00:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 60 seconds.
Apr 24 20:00:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 90 seconds.
Apr 24 20:01:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 120 seconds.
Apr 24 20:01:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 150 seconds.
Apr 24 20:02:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 180 seconds.
Apr 24 20:02:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 210 seconds.
Apr 24 20:03:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 240 seconds.
Apr 24 20:03:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 270 seconds.
Apr 24 20:04:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 300 seconds.
Apr 24 20:04:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 330 seconds.
Apr 24 20:05:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 360 seconds.
Apr 24 20:05:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 390 seconds.
Apr 24 20:06:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 420 seconds.
Apr 24 20:06:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 450 seconds.
Apr 24 20:07:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 480 seconds.
Apr 24 20:07:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 510 seconds.
Apr 24 20:08:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 540 seconds.
Apr 24 20:08:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 570 seconds.
Apr 24 20:09:01 box1 ccsd[5129]: Stopping ccsd, SIGTERM received.

---snap--- 

thats the /etc/cluster/cluster.conf:

---snip---
[root box1 ~]# cat /etc/cluster/cluster.conf
<?xml version="1.0"?>
<cluster alias="alpha" config_version="6" name="alpha_cluster">
<cman>
<multicast addr="224.0.0.1"/>
</cman>
<fence_daemon post_fail_delay="0" post_join_delay="3"/>
<clusternodes>
<clusternode name="box1" nodeid="1">
<multicast addr="224.0.0.1" interface="eth0"/>
<fence>
<method name="1">
<device name="human" nodename="box1.sbe"
</method>
</fence>
</clusternode>
<clusternode name="box2" votes="1" nodeid="2">
<multicast addr="224.0.0.1" interface="eth0"/>
<fence>
<method name="1">
<device name="human" nodename="box2.sbe"
</method>
</fence>
</clusternode>
<clusternode name="box3" votes="1" nodeid="4">
<multicast addr="224.0.0.1" interface="eth0"/>
<fence>
<method name="1">
<device name="human" nodename="box3.sbe"
</method>
</fence>
</clusternode>
</clusternodes>
<fencedevices>
<fencedevice agent="fence_manual" name="human"/>
</fencedevices>
<rm log_level="7" log_facility="syslog">
<failoverdomains/>
<resources/>
</rm>
</cluster>


---snap---

[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]