
Re: [Linux-cluster] cluster suite crashing



I grabbed the RHEL5 branch out of CVS, but compilation fails with:

make[2]: Entering directory `/usr/src/cluster-cvs/cluster/dlm/lib'
gcc -Wall  -g -I. -O2  -D_REENTRANT -c -o libdlm.o libdlm.c
libdlm.c: In function ‘set_version_v5’:
libdlm.c:324: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:325: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:326: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c: In function ‘set_version_v6’:
libdlm.c:335: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:336: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:337: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c: In function ‘detect_kernel_version’:
libdlm.c:443: error: storage size of ‘v’ isn’t known
libdlm.c:446: error: invalid application of ‘sizeof’ to incomplete type ‘struct dlm_device_version’
libdlm.c:448: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:449: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:450: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:452: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:453: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:454: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:443: warning: unused variable ‘v’
libdlm.c: In function ‘do_dlm_dispatch’:
libdlm.c:590: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c: In function ‘ls_lock_v6’:
libdlm.c:835: error: ‘struct dlm_lock_params’ has no member named ‘xid’
libdlm.c:837: error: ‘struct dlm_lock_params’ has no member named ‘timeout’
libdlm.c: In function ‘ls_lock’:
libdlm.c:892: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c: In function ‘dlm_ls_lockx’:
libdlm.c:916: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c: In function ‘dlm_ls_unlock’:
libdlm.c:1067: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c: In function ‘dlm_ls_deadlock_cancel’:
libdlm.c:1099: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:1115: error: ‘DLM_USER_DEADLOCK’ undeclared (first use in this function)
libdlm.c:1115: error: (Each undeclared identifier is reported only once
libdlm.c:1115: error: for each function it appears in.)
libdlm.c: In function ‘dlm_ls_purge’:
libdlm.c:1134: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:1145: error: ‘DLM_USER_PURGE’ undeclared (first use in this function)
libdlm.c:1146: error: ‘union <anonymous>’ has no member named ‘purge’
libdlm.c:1147: error: ‘union <anonymous>’ has no member named ‘purge’
libdlm.c: In function ‘create_lockspace’:
libdlm.c:1311: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c: In function ‘release_lockspace’:
libdlm.c:1415: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c: In function ‘dlm_kernel_version’:
libdlm.c:1501: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:1502: error: invalid use of undefined type ‘struct dlm_device_version’
libdlm.c:1503: error: invalid use of undefined type ‘struct dlm_device_version’
make[2]: *** [libdlm.o] Error 1
make[2]: Leaving directory `/usr/src/cluster-cvs/cluster/dlm/lib'
make[1]: *** [all] Error 2
make[1]: Leaving directory `/usr/src/cluster-cvs/cluster/dlm'
make: *** [all] Error 2

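I guess it doesn't like the officially supported RHEL kernel (2.6.18-8.1.8): the undefined types and missing members suggest the CVS code expects newer DLM kernel headers than that kernel provides.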
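
One rough way to test that guess is to look for the identifiers gcc complains about in the DLM device header that libdlm.c is built against. The following is only a minimal Python sketch: the helper name `missing_identifiers` is made up for illustration, the header location is an assumption (commonly something like /usr/include/linux/dlm_device.h, but check your own include path), and a plain text search is a crude stand-in for what the compiler actually sees.

```python
import re

# Identifiers taken from the gcc errors above.
EXPECTED = ["dlm_device_version", "DLM_USER_PURGE", "DLM_USER_DEADLOCK"]


def missing_identifiers(header_text, names=EXPECTED):
    """Return the names that never appear as whole words in header_text.

    This is a text search only; it does not parse C or follow #includes.
    """
    return [
        name
        for name in names
        if not re.search(r"\b%s\b" % re.escape(name), header_text)
    ]
```

To use it, read the header into a string (for example `open(path).read()`, with `path` pointing at wherever your build picks up dlm_device.h) and pass it in. A non-empty result would be consistent with the headers being older than what the CVS code expects; an empty result would point to some other cause, such as a different header being picked up first.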

We are also trying to get the 5.1 Beta RPMs going, with no success. So far we get a kernel panic on the 5.1 kernel (2.6.18-36).



Lon Hohberger wrote:
On Thu, Aug 02, 2007 at 11:08:51AM -0500, Chris Harms wrote:
rgmanager-2.0.24-1.el5

I'm not sure whether this is useful, but I had just rebooted Node B when we pulled the cables on Node A. It is possible that not all of the services had started, or that inter-node communication had not finished coming up.

Could you pull from CVS (RHEL5 or 51 branches)?  The current code has a
couple of crash bugs fixed.

Note that if you store:

DAEMON_COREFILE_LIMIT="unlimited"
RGMGR_OPTS="-w"

... in /etc/sysconfig/cluster, rgmanager will generate a core file in
the root directory.  Attaching the core to the bug report will help
determine whether it's something already fixed in CVS.
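
If you would rather script that change than edit the file by hand, here is a minimal Python sketch. It assumes /etc/sysconfig/cluster is a plain list of KEY="value" lines; the helper name `set_sysconfig_options` is hypothetical and not part of rgmanager or any Red Hat tool.

```python
def set_sysconfig_options(text, options):
    """Return text with each option present exactly once as KEY="value".

    Existing assignments of the same key are replaced in place, later
    duplicates are dropped, and missing keys are appended at the end.
    Comments and unrelated lines are left untouched.
    """
    seen = set()
    out = []
    for line in text.splitlines():
        stripped = line.strip()
        key = stripped.split("=", 1)[0]
        if "=" in stripped and not stripped.startswith("#") and key in options:
            if key not in seen:
                out.append('%s="%s"' % (key, options[key]))
                seen.add(key)
            continue  # skip the old assignment (or a duplicate of it)
        out.append(line)
    for key, value in options.items():
        if key not in seen:
            out.append('%s="%s"' % (key, value))
    return "\n".join(out) + "\n"


# The two settings from the message above.
RGMANAGER_DEBUG_OPTIONS = {
    "DAEMON_COREFILE_LIMIT": "unlimited",
    "RGMGR_OPTS": "-w",
}
```

Read the current file contents (or start from an empty string if the file does not exist), pass them through `set_sysconfig_options(text, RGMANAGER_DEBUG_OPTIONS)`, and write the result back as root. Running it twice gives the same output, so it is safe to repeat.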

But seriously, if you see 'daemon died, rebooting' it's either user
error (you did a 'kill -9' of only one rgmanager pid) or a bug (crash).

-- Lon


