We have configured a two-node RHEL 6.2 cluster with clvmd, gfs2, cman and smb. Each server has four NICs: two are bonded for the heartbeat (mode=1) and two are bonded for public access (mode=0). The heartbeat network is connected directly from server to server. Once every 3 to 4 days, the heartbeat link goes down and comes back up on its own within 2 to 3 seconds. We are not sure why this happens. As a result, one node gets fenced by the other.
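For illustration, a mode=1 (active-backup) bond of the kind described above is typically defined on RHEL 6 roughly as follows. This is only a sketch: the device names bond1 and eth2, the address, and miimon=100 are assumptions, not values taken from these servers.

```shell
# /etc/sysconfig/network-scripts/ifcfg-bond1
# Hypothetical heartbeat bond; name and address are placeholders.
DEVICE=bond1
BOOTPROTO=none
ONBOOT=yes
IPADDR=192.168.100.1
NETMASK=255.255.255.0
# mode=1 is active-backup; miimon=100 (link check every 100 ms) is assumed.
BONDING_OPTS="mode=1 miimon=100"

# /etc/sysconfig/network-scripts/ifcfg-eth2
# One of the two slave interfaces; the second slave would look the same.
DEVICE=eth2
BOOTPROTO=none
ONBOOT=yes
MASTER=bond1
SLAVE=yes
```

The public bond would differ mainly in using mode=0 (round-robin) in BONDING_OPTS.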
Is there any way to increase the time the cluster waits for the heartbeat before it fences a node? That is, if the cluster could tolerate a heartbeat loss of 5 to 6 seconds, the node would not get fenced during these short outages. Kindly advise.
Sathya Narayanan V