[Linux-cluster] qdisk problems during/after network problems
Lon Hohberger
lhh at redhat.com
Mon Dec 18 18:25:34 UTC 2006
On Mon, 2006-12-18 at 11:14 +0000, Frederik Ferner wrote:
> Hi List,
>
> I am currently testing Redhat Cluster Suite for a number of two node
> clusters accessing EMC storage systems. Everything seems to be running
> fine expect for qdisk.
>
> On Friday we had a network problem during which the nodes were still
> able to see each other but none of the addresses used in my heuristics
> for qdisk. The result was not what I expected, when the network came
> back, both nodes claimed to be master.
> Dec 15 10:31:47 duoserv2 ccsd[5595]: Cluster is quorate. Allowing connections.
> Dec 15 10:31:50 duoserv2 clurgmgrd: [7950]: <info> /dev/mapper/logs1-logs1 is not mounted
> Dec 15 10:31:51 duoserv2 qdiskd[31393]: <crit> Critical Error: More than one master found!
> Dec 15 10:31:51 duoserv2 qdiskd[31393]: <crit> A master exists, but it's not me?!
> Dec 15 10:31:52 duoserv2 qdiskd[31393]: <info> Node 1 is the master
> ...
>
> At the same time on the second node:
> Dec 15 10:31:45 duoserv1 qdiskd[316]: <notice> Score sufficient for master operation (5/3; max=6); upgrading
> Dec 15 10:31:46 duoserv1 qdiskd[316]: <info> Assuming master role
> Dec 15 10:31:47 duoserv1 kernel: CMAN: quorum regained, resuming activity
> Dec 15 10:31:47 duoserv1 ccsd[5624]: Cluster is quorate. Allowing connections.
> Dec 15 10:31:47 duoserv1 clurgmgrd[3631]: <notice> Quorum Achieved
> Dec 15 10:31:51 duoserv1 qdiskd[316]: <crit> Critical Error: More than one master found!
> Dec 15 10:31:52 duoserv1 qdiskd[316]: <info> Node 2 is the master
> Dec 15 10:31:52 duoserv1 qdiskd[316]: <crit> Critical Error: More than one master found!
> ...
>
> This continues until I finally notice and restart qdiskd on both nodes,
> when they agree on one master again.
This should never happen. Please file a bugzilla.
-- Lon
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20061218/5cca7065/attachment.sig>
More information about the Linux-cluster
mailing list