[Linux-cluster] Multihome network configuration not working .!

Andriy Galetski andriy at druzhba.lviv.ua
Tue Sep 28 08:00:03 UTC 2004


> On Mon, Sep 27, 2004 at 11:23:15AM +0300, Andriy Galetski wrote:
> > Hi !
> > I want to use Multihome network configuration with broadcast
> > but it is not working. ...
> >
> > When disconnected main connection
> > and trying:
> > cman_tool join -n cl020
> >
> > On other node receive:
> > Sep 27 10:36:13 cl10 kernel: CMAN: node cl20 rejoining
> > Sep 27 10:36:17 cl10 kernel: CMAN: node cl20 is not responding -
removing
> > from the cluster
>
>
> You shouldn't specify the alternative node name in this case. Just a
normal
> cman_tool join should work, cman will try both interfaces during the join
> anyway.
>
> If it doesn't, can you enable COMMS & MEMB debugging in cnxman-private.h
and
> send me the output please? (yes,I know it means a recompile...sorry)
>
> patrick
>

Ok

Two node cluster with configuration:

<?xml version="1.0"?>
<cluster name="alpha" config_version="1">
  <cman two_node="1" expected_votes="1">
  </cman>
  <nodes>
    <node name="cl10" votes="1">
        <altname name="cl010"/>
      <fence>
        <method name="single">
          <device name="human" ipaddr="cl10"/>
        </method>
      </fence>
    </node>
    <node name="cl20" votes="1">
        <altname name="cl020"/>
      <fence>
        <method name="single">
          <device name="human" ipaddr="cl20"/>
        </method>
      </fence>
    </node>
  </nodes>
  <fence_devices>
    <device name="human" agent="fence_manual"/>
  </fence_devices>
</cluster>
......
On cl20 node I disconnect eth0 which belong to node name="cl20",
eth1 which belng to  altname name="cl020" left working.
Then on cl10 I run:

cman_tool join -d
alternative node name cl010
setup up interface for address: cl10
Broadcast address for c3cc90a is ff3cc90a
setup up interface for address: cl010
Broadcast address for a00a8c0 is ff00a8c0

The messages receive:
ep 28 10:41:57 cl10 kernel: CMAN: Waiting to join or form a Linux-cluster
Sep 28 10:41:57 cl10 ccsd[30333]: Connected to cluster infrastruture via:
CMAN/SM Plugin v1.0
Sep 28 10:41:57 cl10 ccsd[30333]: Initial status:: Inquorate
Sep 28 10:42:13 cl10 kernel: CMAN: forming a new cluster
Sep 28 10:42:13 cl10 kernel: CMAN: quorum regained, resuming activity

After that on cl20 run:
[root at cl20 root]# cman_tool join -d
alternative node name cl020
setup up interface for address: cl20
Broadcast address for 143cc90a is ff3cc90a
setup up interface for address: cl020
Broadcast address for 1400a8c0 is ff00a8c0

CL20 messages:
Sep 28 10:48:22 cl20 kernel: CMAN: Waiting to join or form a Linux-cluster
Sep 28 10:48:22 cl20 ccsd[27495]: Connected to cluster infrastruture via:
CMAN/SM Plugin v1.0
Sep 28 10:48:22 cl20 ccsd[27495]: Initial status:: Inquorate
Sep 28 10:48:23 cl20 kernel: : 02 00 1a 99 c0 a8 00 0a 00 00 00 00 00 00 00
00
Sep 28 10:48:23 cl20 kernel: CMAN: sending membership request
Sep 28 10:48:23 cl20 kernel: : 02 00 1a 99 c0 a8 00 0a 00 00 00 00 00 00 00
00
Sep 28 10:48:23 cl20 last message repeated 7 times
Sep 28 10:48:23 cl20 kernel: CMAN: got node cl10
Sep 28 10:48:23 cl20 kernel: : 02 00 1a 99 c0 a8 00 0a 00 00 00 00 00 00 00
00
Sep 28 10:48:23 cl20 last message repeated 3 times
Sep 28 10:49:08 cl20 kernel: CMAN: Being told to leave the cluster by node 1
Sep 28 10:49:08 cl20 kernel: CMAN: we are leaving the cluster

CL10 messages:
Sep 28 10:48:23 cl10 kernel: : 02 00 1a 99 c0 a8 00 14 00 00 00 00 00 00 00
00
Sep 28 10:48:23 cl10 last message repeated 2 times
Sep 28 10:48:23 cl10 kernel: CMAN: got node cl20
Sep 28 10:48:23 cl10 kernel: : 02 00 1a 99 c0 a8 00 14 00 00 00 00 00 00 00
00
Sep 28 10:48:23 cl10 last message repeated 6 times
Sep 28 10:48:27 cl10 kernel: CMAN: node cl20 is not responding - removing
from the cluster
Sep 28 10:48:31 cl10 kernel: CMAN: node cl20 is not responding - removing
from the cluster
Sep 28 10:48:42 cl10 kernel: : 02 00 1a 99 c0 a8 00 14 00 00 00 00 00 00 00
00

[root at cl10 root]# cat /proc/cluster/nodes
Node  Votes Exp Sts  Name
   1    1    1   M   cl10
   2    1    1   X   cl20

[root at cl10 root]# ping cl020
PING cl020.druzhba.lviv.ua (192.168.0.20) 56(84) bytes of data.
64 bytes from cl020.druzhba.lviv.ua (192.168.0.20): icmp_seq=0 ttl=64
time=0.155 ms
64 bytes from cl020.druzhba.lviv.ua (192.168.0.20): icmp_seq=1 ttl=64
time=0.078 ms


In result CMAN don't use altname interfaces (

Any Idea ???

Thanks.




More information about the Linux-cluster mailing list