[Linux-cluster] Cluster node hung ( SM: Assertion failed )

Andriy Galetski andriy at druzhba.lviv.ua
Thu Sep 9 07:50:44 UTC 2004


Hi!
I have two node cluster. (CL10 and CL20)
Generally it's working good.
But sometimes after rebooting one of nodes
I get unstable situation:
For example ... after incorrect shutdown node CL10
Remaining node CL20 regaining quorom and left operational until
other node CL10 was and start joining to cluster.
In that moment console CL10 get message ...

kernel: CMAN: no HELLO from cl20, removing from the cluster

On CL20 side console view ..

SM: Assertion Failed on line 52 of file
/usr/local/src/cluster/cman-kernel/src/sm_misc.c

SM: assertion: "!error"

SM: time 1729980

Kernel panic: SM:
    Records message above and reboot.
...
and  CL20 hung after that.

My config is:
<?xml version="1.0"?>
<cluster name="alpha" config_version="1">

  <cman two_node="1" expected_votes="1">
  </cman>

  <nodes>
    <node name="cl10" votes="1">
        <altname name="cl010"/>
      <fence>
        <method name="single">
          <device name="human" ipaddr="cl10"/>
        </method>
      </fence>
    </node>
    <node name="cl20" votes="1">
        <altname name="cl020"/>
      <fence>
        <method name="single">
          <device name="human" ipaddr="cl20"/>
        </method>
      </fence>
    </node>
  </nodes>

  <fence_devices>
    <device name="human" agent="fence_manual"/>
  </fence_devices>

</cluster>

__
Thanks for any information.




More information about the Linux-cluster mailing list