
Re: [Linux-cluster] "dlm_controld[nnnn]: cluster is down, exiting" on node1 when starting node2




On Fri, 5 Jun 2009, David Teigland wrote:

They are all complaining that the cluster is down, which is a polite way
of saying that aisexec has died/crashed/failed/killed/gone-away.

Thanks. Why might that have occurred? Where would I look for clues? How
can I increase logging output from aisexec?

If you're lucky it'll leave a core file, otherwise aisexec is notorious for
disappearing without leaving any clues about why.

That's very disconcerting to hear. Doesn't sound like HA. :-(

To clarify, aisexec does not often disappear; it's very reliable.  The point
was that in the rare case when it does, it's notorious for not leaving any
reasons behind.

Thanks for the clarification.

