[Linux-cluster] bug? mount hangs.

Ling, Xiaofeng xiaofeng.ling at intel.com
Wed Aug 25 01:59:31 UTC 2004


Hi,
    When I trying to setup GFS on two node. some times it triggers the
kdb and the mount hangs.
Follow is the dmesg and config file.
I use kernel 2.6.6up no preemption with kdb patch. two nodes are both
DELL desktop with Intel P3 and P4 CPU.
Is this a know issue?

------------------------------------------------------------------------
---------------------------------------------------
Unable to handle kernel NULL pointer dereference at virtual address
00000046
 printing eip:
d087c916
*pde = 00000000
Oops: 0000 [#1]
CPU:    0
EIP:    0060:[<d087c916>]    Not tainted
EFLAGS: 00010286   (2.6.6kdb)
EIP is at send_to_sock+0x41/0x20a [dlm]
eax: 00000002   ebx: c763c060   ecx: 00000000   edx: 00000000
esi: d089124c   edi: c763c060   ebp: 00000000   esp: c7007f88
ds: 007b   es: 007b   ss: 0068
Process dlm_sendd (pid: 19895, threadinfo=c7006000 task=caeaa7b0)
Stack: d0887851 0000002b c7007fb0 c011475f c994bc68 c12a31b0 c763c068
00000286
       c763c060 d089124c c7006000 00000002 d087cce0 c763c060 c7006000
00000000
       00000000 00000000 d087cfda d0887905 00000000 00000000 0000007b
0000007b
Call Trace:
 [<c011475f>] __wake_up_common+0x31/0x50
 [<d087cce0>] process_output_queue+0x55/0x75 [dlm]
 [<d087cfda>] dlm_sendd+0x95/0xe9 [dlm]
 [<d087cf45>] dlm_sendd+0x0/0xe9 [dlm]
 [<c0104291>] kernel_thread_helper+0x5/0xb

Code: 8b 40 44 89 44 24 1c 8d 47 30 89 44 24 14 8b 5f 30 3b 5c 24
 <6>CMAN: Being told to leave the cluster by node 2
CMAN: we are leaving the cluster
SM: 00000001 sm_stop: SG still joined
SM: 01000002 sm_stop: SG still joined
input: AT Translated Set 2 keyboard on isa0060/serio0

my config file.
----------------------------------------------------------------
<?xml version="1.0"?>
<cluster name="god" config_version="1">

<cman two_node="1" expected_votes="1">
</cman>

<nodes>
    <node name="xling" votes="1">
      <fence>
        <method name="human">
          <device name="last_resort"/>
        </method>
      </fence>
    </node>
    <node name="ocfs2" votes="1">
      <fence>
        <method name="human">
          <device name="last_resort"/>
        </method>
      </fence>
    </node>
  </nodes>
  <fence_device>
    <device name="last_resort" agent="fence_manual"/>
  </fence_device>
</cluster>
                          
-------------------
Ling Xiaofeng(Daniel)
Intel China Software Lab.
iNet: 8-752-1243
8621-52574545-1243(O)
xfling at users.sourceforge.net
Opinions are my own and don't represent those of my employer 




More information about the Linux-cluster mailing list