It is highly likely that somebody already knows this error and how to fix it:
---snip---
[root box1 ~]# /etc/init.d/cman start
Starting cluster:
Loading modules... done
Mounting configfs... done
Starting ccsd... done
Starting cman... failed
cman not started: Can't bind to local cman socket
/usr/sbin/cman_tool: aisexec daemon didn't start
[FAILED]
[root box1 ~]#
---snap---
The cluster does not start, even though what is written in the log file looks OK to me:
---snip---
Apr 24 19:59:06 box1 ccsd[5129]: Starting ccsd 2.0.60:
Apr 24 19:59:06 box1 ccsd[5129]: Built: Jan 24 2007 15:31:03
Apr 24 19:59:06 box1 ccsd[5129]: Copyright (C) Red Hat, Inc. 2004 All rights reserved.
Apr 24 19:59:06 box1 ccsd[5129]: cluster.conf (cluster name = alpha_cluster, version = 6) found.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] AIS Executive Service RELEASE 'subrev 1204 version 0.80.1'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Copyright (C) 2002-2006 MontaVista Software, Inc and contributors.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Copyright (C) 2006 Red Hat, Inc.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Using default multicast address of 239.192.196.121
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_cpg loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais cluster closed process group service v1.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_cfg loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais configuration service'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_msg loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais message service B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_lck loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais distributed locking service B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_evt loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais event service B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_ckpt loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais checkpoint service B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_amf loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais availability management framework B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_clm loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais cluster membership service B.01.01'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_evs loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais extended virtual synchrony service'
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] openais component openais_cman loaded.
Apr 24 19:59:08 box1 openais[5135]: [MAIN ] Registering service handler 'openais CMAN membership service 2.01'
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] entering GATHER state from 12.
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] Creating commit token because I am the rep.
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] Saving state aru 1e high seq received 1e
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] Storing new sequence id for ring 304
Apr 24 19:59:08 box1 openais[5070]: [TOTEM] entering COMMIT state.
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] entering RECOVERY state.
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] position [0] member 192.168.50.194:
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] previous ring seq 768 rep 192.168.50.194
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] aru 1e high delivered 1e received flag 0
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] position [1] member 192.168.50.195:
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] previous ring seq 768 rep 192.168.50.194
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] aru 1e high delivered 1e received flag 0
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] position [2] member 192.168.50.196:
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] previous ring seq 768 rep 192.168.50.194
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] aru 1e high delivered 1e received flag 0
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] Did not need to originate any messages in recovery.
Apr 24 19:59:09 box1 openais[5070]: [CLM ] CLM CONFIGURATION CHANGE
Apr 24 19:59:09 box1 openais[5070]: [CLM ] New Configuration:
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.194)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.195)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.196)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Left:
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Joined:
Apr 24 19:59:09 box1 openais[5070]: [SYNC ] This node is within the primary component and will provide service.
Apr 24 19:59:09 box1 openais[5070]: [CLM ] CLM CONFIGURATION CHANGE
Apr 24 19:59:09 box1 openais[5070]: [CLM ] New Configuration:
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.194)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.195)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] r(0) ip(192.168.50.196)
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Left:
Apr 24 19:59:09 box1 openais[5070]: [CLM ] Members Joined:
Apr 24 19:59:09 box1 openais[5070]: [SYNC ] This node is within the primary component and will provide service.
Apr 24 19:59:09 box1 openais[5070]: [TOTEM] entering OPERATIONAL state.
Apr 24 19:59:09 box1 openais[5070]: [CLM ] got nodejoin message 192.168.50.195
Apr 24 19:59:09 box1 openais[5070]: [CLM ] got nodejoin message 192.168.50.196
Apr 24 19:59:09 box1 openais[5070]: [CLM ] got nodejoin message 192.168.50.194
Apr 24 19:59:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 30 seconds.
Apr 24 20:00:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 60 seconds.
Apr 24 20:00:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 90 seconds.
Apr 24 20:01:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 120 seconds.
Apr 24 20:01:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 150 seconds.
Apr 24 20:02:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 180 seconds.
Apr 24 20:02:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 210 seconds.
Apr 24 20:03:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 240 seconds.
Apr 24 20:03:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 270 seconds.
Apr 24 20:04:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 300 seconds.
Apr 24 20:04:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 330 seconds.
Apr 24 20:05:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 360 seconds.
Apr 24 20:05:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 390 seconds.
Apr 24 20:06:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 420 seconds.
Apr 24 20:06:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 450 seconds.
Apr 24 20:07:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 480 seconds.
Apr 24 20:07:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 510 seconds.
Apr 24 20:08:05 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 540 seconds.
Apr 24 20:08:35 box1 ccsd[5129]: Unable to connect to cluster infrastructure after 570 seconds.
Apr 24 20:09:01 box1 ccsd[5129]: Stopping ccsd, SIGTERM received.
---snap---
This is the /etc/cluster/cluster.conf:
---snip---
[root box1 ~]# cat /etc/cluster/cluster.conf
<?xml version="1.0"?>
<cluster alias="alpha" config_version="6" name="alpha_cluster">
  <cman>
    <multicast addr="224.0.0.1"/>
  </cman>
  <fence_daemon post_fail_delay="0" post_join_delay="3"/>
  <clusternodes>
    <clusternode name="box1" nodeid="1">
      <multicast addr="224.0.0.1" interface="eth0"/>
      <fence>
        <method name="1">
          <device name="human" nodename="box1.sbe"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="box2" votes="1" nodeid="2">
      <multicast addr="224.0.0.1" interface="eth0"/>
      <fence>
        <method name="1">
          <device name="human" nodename="box2.sbe"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="box3" votes="1" nodeid="4">
      <multicast addr="224.0.0.1" interface="eth0"/>
      <fence>
        <method name="1">
          <device name="human" nodename="box3.sbe"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>
  <fencedevices>
    <fencedevice agent="fence_manual" name="human"/>
  </fencedevices>
  <rm log_level="7" log_facility="syslog">
    <failoverdomains/>
    <resources/>
  </rm>
</cluster>
---snap---