
Re: [Linux-cluster] Corosync goes cpu to 95-99% (same problems with RHEL6.2)



On 11/04/2011 07:05 PM, Nicolas Ross wrote:
get a support signoff. Also the corosync updates have not finished
through our validation process. Only hot fixes (from support) are
available

Regards
-steve


Sorry to re-open this thread, but is there any news about this problem?

In fact, there is!

It appears that the problem lies in the microcode of some specific
Xeon "Nehalem" processors. It has to do with C-state switching:
RHEL 6.1 now switches C-states in a way that 6.0 did not.

You can look at Bugzilla #710265 and KB doc #61105.

Our temporary fix was to disable C-state transitions by adding:

intel_idle.max_cstate=0 processor.max_cstate=1

to the kernel line in grub.conf, then updating and rebooting. We haven't
had any CPU spikes on any of the 5 nodes we've updated so far. The 3
remaining nodes still haven't been updated because that requires
production downtime.
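
To confirm after the reboot that a node actually picked up both
parameters, something like the following could be used. This is only a
rough sketch: the function names are made up for illustration, and it
assumes the running kernel's command line is readable at /proc/cmdline,
as it is on a standard Linux system. It only checks that the options
were passed to the kernel, not that the CPU spikes are gone.

```python
# Sketch: check a kernel command line for the two C-state parameters
# mentioned above. Function names here are illustrative only.

# The parameter values suggested in this thread.
WANTED = {
    "intel_idle.max_cstate": "0",
    "processor.max_cstate": "1",
}


def parse_cmdline(cmdline):
    """Split a kernel command line into a dict of key -> value.

    Bare flags such as "ro" or "quiet" map to None.
    """
    params = {}
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        params[key] = value if sep else None
    return params


def missing_cstate_params(cmdline):
    """Return the wanted parameters that are absent or set differently."""
    params = parse_cmdline(cmdline)
    return sorted(k for k, v in WANTED.items() if params.get(k) != v)


def check_running_kernel(path="/proc/cmdline"):
    """Check the command line of the currently running kernel."""
    with open(path) as f:
        return missing_cstate_params(f.read())
```

Calling check_running_kernel() on a node returns an empty list when both
parameters are present with the values above, and otherwise lists the
ones that are missing or set to something else.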

Get a support signoff for this; I'm in no way endorsing this solution,
as I can't know whether you're in the same situation as I am.

Have fun!

OK, I have upgraded one RHEL 6.2 host (in a two-node cluster) and the problem persists. Versions:

corosync-1.4.1-4.el6.i686
corosynclib-1.4.1-4.el6.i686
cman-3.0.12.1-23.el6.i686
rgmanager-3.0.12.1-5.el6.i686

Will there be a solution at some point?




--
CL Martinez
carlopmart {at} gmail {d0t} com

