[Fedora-xen] Xen and Cluster Manager (OpenAIS::TOTEM)

Fabien MALFOY fm at startx.fr
Fri Mar 9 14:24:17 UTC 2007


Hi all

I am having problems with Xen while trying to build a two-node cluster
with Cluster Manager. The Domain-0 host (called M0 here) runs Fedora
Core 6 with Xen 3.0.3. On it, I created two virtual machines, M1 and M2
(also FC6).

The problem is:
When I try to initialize the cluster using M1 and M2, they are not able
to start the service correctly. Cluster Manager depends on the new
cluster framework OpenAIS, which uses the TOTEM protocol. Here is part
of the log:

Mar 7 15:29:17 M1 openais[2220]: [CMAN ] CMAN 2.0.60 (built Jan 24 2007 15:30:39) started
Mar 7 15:29:17 M1 openais[2220]: [SYNC ] Not using a virtual synchrony filter.
Mar 7 15:29:17 M1 openais[2220]: [MAIN ] AIS Executive Service: started and ready to provide service.
Mar 7 15:29:18 M1 ccsd[2214]: Initial status:: Inquorate
*Mar 7 15:29:32 M1 openais[2220]: [TOTEM] The consensus timeout expired.*
*Mar 7 15:29:32 M1 openais[2220]: [TOTEM] entering GATHER state from 3.*
*Mar 7 15:29:47 M1 openais[2220]: [TOTEM] The consensus timeout expired.*
*Mar 7 15:29:47 M1 openais[2220]: [TOTEM] entering GATHER state from 3.*
*Mar 7 15:30:02 M1 openais[2220]: [TOTEM] The consensus timeout expired.*
*Mar 7 15:30:02 M1 openais[2220]: [TOTEM] entering GATHER state from 3.*
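
The spacing of the highlighted messages can be checked with a small
script. This is only a sketch: the function names and the year are
assumptions added for illustration (syslog does not record the year),
and the month parsing assumes an English/C locale. The sample lines are
copied from the log above, asterisks included.

```python
import re
from datetime import datetime

# Sample lines copied from the log excerpt (emphasis asterisks kept).
LOG = """\
*Mar 7 15:29:32 M1 openais[2220]: [TOTEM] The consensus timeout expired.*
*Mar 7 15:29:32 M1 openais[2220]: [TOTEM] entering GATHER state from 3.*
*Mar 7 15:29:47 M1 openais[2220]: [TOTEM] The consensus timeout expired.*
*Mar 7 15:29:47 M1 openais[2220]: [TOTEM] entering GATHER state from 3.*
*Mar 7 15:30:02 M1 openais[2220]: [TOTEM] The consensus timeout expired.*
*Mar 7 15:30:02 M1 openais[2220]: [TOTEM] entering GATHER state from 3.*
"""

# Optional leading asterisk, syslog timestamp, then the TOTEM message.
PATTERN = re.compile(
    r"^\*?(\w{3}\s+\d+ \d{2}:\d{2}:\d{2}) .*"
    r"\[TOTEM\] The consensus timeout expired"
)


def consensus_timeouts(text, year=2007):
    """Return the timestamp of every TOTEM consensus-timeout message."""
    stamps = []
    for line in text.splitlines():
        match = PATTERN.match(line)
        if match:
            # The year is not in the log, so the caller supplies it.
            stamps.append(
                datetime.strptime(f"{year} {match.group(1)}", "%Y %b %d %H:%M:%S")
            )
    return stamps


def intervals(stamps):
    """Return the gaps, in seconds, between consecutive timestamps."""
    return [(b - a).total_seconds() for a, b in zip(stamps, stamps[1:])]


if __name__ == "__main__":
    print(intervals(consensus_timeouts(LOG)))
```

For the three timeouts shown, the gaps are 15 seconds each, so the node
keeps retrying at a fixed interval without ever hearing from its peer.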

The TOTEM messages above repeat until the timeout.
Since this cluster was made of two virtualized nodes, I wanted to be
sure that the problem did not come from my configuration files or
something similar.
So I adapted my cluster.conf to replace M2 with M0 and installed
Cluster Manager on M0. When I started the service on M0 and M1, only the
one on M0 (the non-virtualized machine) started correctly, and it did
not receive any network data from M1.
For these tests, I disabled all network traffic filtering, so I conclude
that something goes wrong between Xen and OpenAIS.
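
Since OpenAIS normally carries TOTEM over UDP multicast, one way to
narrow this down is to test whether plain multicast datagrams pass
between M1 and M0 at all, independently of the cluster software. The
following is a minimal sketch, not a definitive tool: GROUP and PORT are
placeholders (use the values your cluster actually uses), and the
function names are made up for illustration.

```python
import socket
import struct
import sys

# Placeholders, NOT taken from the configuration discussed here:
# substitute the multicast address and port used by your cluster.
GROUP = "239.192.0.1"
PORT = 5405


def membership_request(group, iface="0.0.0.0"):
    """Build the ip_mreq structure needed to join a multicast group."""
    return struct.pack("4s4s", socket.inet_aton(group), socket.inet_aton(iface))


def receive(group=GROUP, port=PORT, timeout=10.0):
    """Join the group and return (payload, sender), or None on timeout."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    try:
        sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
        sock.bind(("", port))
        sock.setsockopt(
            socket.IPPROTO_IP, socket.IP_ADD_MEMBERSHIP, membership_request(group)
        )
        sock.settimeout(timeout)
        try:
            return sock.recvfrom(1500)
        except socket.timeout:
            return None
    finally:
        sock.close()


def send(message, group=GROUP, port=PORT, ttl=1):
    """Send one datagram to the group; TTL 1 keeps it on the local segment."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    try:
        sock.setsockopt(
            socket.IPPROTO_IP, socket.IP_MULTICAST_TTL, struct.pack("b", ttl)
        )
        return sock.sendto(message, (group, port))
    finally:
        sock.close()


if __name__ == "__main__":
    # Usage: "recv" on one machine, then "send" on the other.
    if len(sys.argv) == 2 and sys.argv[1] == "send":
        send(b"probe from " + socket.gethostname().encode())
    elif len(sys.argv) == 2 and sys.argv[1] == "recv":
        print(receive())
```

Run it with "recv" on one machine and "send" on the other, with the
cluster services stopped so the port is free, then swap the roles. If
the probe arrives in one direction only, the problem is more likely in
the Xen bridge or virtual interface than in the cluster configuration.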

I should add that I have tried everything I know to solve this. I even
thought of disabling checksum offloading, but that did not help either.

Can anyone help me? I would also be glad to hear from anyone who is in
the same boat :-D

-- 
Fabien MALFOY
+---------------------------------+
| Société StartX                  |
| 08.70.33.38.70 - 06.69.67.71.55 |
| http://startx.fr                |
+---------------------------------+




