Re: Need help with Reboot cause


excuse delay. personal reasons.

i have been over your readings and they are not far off, but they are not
as close as i would like to see. this may be from not being calibrated.

Peter J. Stieber wrote:

> g> i do hope peter stieber is picking up on all this. :)
> He is :-)
> I loaded gnome-applet-sensors and was able to monitor the temperature on
> each of the two Opterons in the system.  They were a little high.

you need to check oem spec for temp ranges for cpus.

> I also installed the hddtemp package.  They were warm as well.

drives are warm, but they will be from running.

> I moved the machine (it was in a corner without good air flow), took off
> the side panel, pointed a small room fan into the case.  This
> significantly cooled off the CPUs and HDs, but I could still get the
> machine to reboot by running my build.  After one reboot, just trying to
> startx (the machine starts at run level 3) caused a reboot.

startx can be a load. dumping sensors will show this.

running a 3 minute sensor dump would be a good way to see just what is
actually happening when running build. using a 1 minute dump would be
better for watching when you know that you are loading down system.
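
a rough sketch of what i mean by a dump, in python. it runs a command at a
fixed interval and appends the output to a log with a time stamp. i am
reading "1 minute dump" as one reading every 60 seconds. the names
dump_once and dump_loop, the log file name and the reading count are made
up for illustration, and it assumes `sensors` from lm_sensors is on PATH.

```python
import os
import subprocess
import time
from datetime import datetime


def dump_once(cmd, log):
    """Run cmd once and append its output to log under a time stamp header."""
    result = subprocess.run(cmd, capture_output=True, text=True)
    log.write("=== %s ===\n" % datetime.now().isoformat(timespec="seconds"))
    log.write(result.stdout)
    if result.returncode != 0:
        # keep the error text so a failed reading is visible in the log
        log.write("(exit %d) %s\n" % (result.returncode, result.stderr.strip()))


def dump_loop(cmd, path, interval_s, count):
    """Append `count` dumps to `path`, sleeping `interval_s` seconds between them."""
    with open(path, "a") as log:
        for i in range(count):
            dump_once(cmd, log)
            # push each reading to disk right away so the last one
            # has a chance of surviving a sudden reboot
            log.flush()
            os.fsync(log.fileno())
            if i + 1 < count:
                time.sleep(interval_s)


# example (not run here): one reading a minute for half an hour
# dump_loop(["sensors"], "sensors.log", 60, 30)
```

start it, then kick off the build. after the reboot, the tail of the log
shows the last voltages, temps and fan speeds that made it to disk.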

i would be concerned with voltage and temp for cpus. voltages need to be
checked against a dvm and calibrated if off.

getting temps calibrated is a little harder to do. because of this, i would
recommend pulling cpus, cleaning old heat sink paste and applying new. if
old is dry, this will cause a loss of heat transfer. also, you need to ensure
that fans are up to speed. check oem for actual speeds.

default high/low temp, voltage, and fan settings should be changed to
proper ranges.

something is wrong here:

> V1.5:        +2.61 V  (min =  +0.00 V, max =  +3.32 V)
v1.5 is high.

> CPU_Fan:    1633 RPM  (min =    0 RPM)
fan is slow. can cause cpu to overheat. unless it is a variable speed fan.

> CPU Temp:    +41.5 C  (low  = -127.0 C, high = +127.0 C)
high, but not excessive. a variable speed fan could cause this.

> w83627hf-isa-0290
> Adapter: ISA adapter
> VCore 1:     +1.52 V  (min =  +1.74 V, max =  +2.14 V)   ALARM
> VCore 2:     +1.49 V  (min =  +1.74 V, max =  +2.14 V)   ALARM

> +5V:         +4.46 V  (min =  +5.56 V, max =  +5.75 V)   ALARM
> +12V:        +4.80 V  (min = +14.04 V, max = +15.38 V)   ALARM
> -12V:        -1.83 V  (min =  +2.52 V, max =  +5.81 V)   ALARM
> -5V:         -4.04 V  (min =  +5.00 V, max =  -4.85 V)   ALARM
> V5SB:        +5.48 V  (min =  +1.26 V, max =  +2.77 V)   ALARM
> VBat:        +3.26 V  (min =  +2.99 V, max =  +0.61 V)   ALARM
these need to be checked against specs.
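
before trusting any of those ALARM flags, it helps to see whether the
limits themselves make sense. a small python sketch (the function name
check and the verdict strings are mine, not part of lm_sensors) that pulls
value, min and max out of a voltage line and says whether the reading is
outside the limits or the limits are upside down (min above max):

```python
import re

NUM = r"[+-]?\d+(?:\.\d+)?"
VOLT_LINE = re.compile(
    r"^(?P<label>[^:]+):\s*(?P<value>" + NUM + r") V\s+"
    r"\(min =\s*(?P<min>" + NUM + r") V, max =\s*(?P<max>" + NUM + r") V\)"
)


def check(line):
    """Return (label, value, min, max, verdict) for a voltage line, else None."""
    # drop mail quoting ("> ") and surrounding blanks before matching
    m = VOLT_LINE.match(line.lstrip("> ").strip())
    if m is None:
        return None
    value, lo, hi = float(m["value"]), float(m["min"]), float(m["max"])
    if lo > hi:
        verdict = "limits inverted"   # the limits are bogus, alarm means nothing
    elif not lo <= value <= hi:
        verdict = "outside limits"
    else:
        verdict = "ok"
    return m["label"], value, lo, hi, verdict
```

fed the lines above, -5V and VBat come back as "limits inverted", and the
rest of that chip's voltages come back as "outside limits". with limits
like these, the alarms only say that the limits (and maybe the labels and
scaling) were never set up for this board.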

> fan1:          0 RPM  (min = 168750 RPM, div = 2)  ALARM
> fan2:          0 RPM  (min = 2667 RPM, div = 2)  ALARM
> fan3:          0 RPM  (min = 4245 RPM, div = 2)  ALARM
fans missing?
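
those min values look odd for a reason. as far as i recall from the
winbond data sheet and the lm_sensors driver, the chip counts clock ticks
between tach pulses and rpm = 1350000 / (count * div), with a count of 255
meaning no pulses were seen. treat that formula as my assumption and check
it against the data sheet. a python sketch of the arithmetic (function
names are mine):

```python
CLOCK = 1350000  # constant from the rpm formula; assumed, verify in data sheet


def rpm_from_count(count, div):
    """RPM for a raw fan count register value and fan divisor."""
    if count == 255:
        return 0  # counter ran out: no tach pulses seen
    return CLOCK // (count * div)


def count_from_rpm(rpm, div):
    """Raw count that would produce a given RPM figure."""
    return round(CLOCK / (rpm * div))


for rpm in (168750, 2667, 4245):
    count = count_from_rpm(rpm, 2)
    print(rpm, "RPM with div 2 is raw count", count)
```

this gives raw counts of 4, 253 and 159 for the three min values. to me
that looks like whatever happened to be in the registers rather than
limits anyone set. a reading of 0 RPM only says this chip saw no pulses on
those inputs.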

> temp1:       +28.0 C  (high = -65.0 C, hyst = -79.0 C)  ALARM  sensor = thermistor
> temp2:       +32.0 C  (high = +80.0 C, hyst = +75.0 C)  sensor = thermistor
> temp3:       -48.0 C  (high = +80.0 C, hyst = +75.0 C)  sensor = thermistor

do you have a manual to tell you what these sensors are for?

> cpu0_vid:   +1.500 V
> beep_enable:enabled
> I didn't even know there was an ISA adapter in the machine, but the
> values all look wacky.

indeed they are. manual?

> Here is the output of hddtemp
> hddtemp --unit=F SATA:/dev/sda SATA:/dev/sdb SATA:/dev/sdc
> /dev/sda: Maxtor 6Y160M0: 77 F
> /dev/sdb: WDC WD2500KS-00MJB0: 96 F
> /dev/sdc: ST3750330AS: 80 F

warm, but not a problem. drives do get warm. when they get up in triple
digits (F) is when you need to worry.
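
drive data sheets usually give the operating range in C, so it is easier
to compare after converting. a short python sketch that parses the
hddtemp lines above and converts with C = (F - 32) * 5 / 9. the function
names are mine, and the pattern only handles output in the exact shape
shown here.

```python
import re

HDD_LINE = re.compile(r"^(?P<dev>/dev/\w+): (?P<model>.+): (?P<temp>-?\d+) F$")


def f_to_c(deg_f):
    """Convert Fahrenheit to Celsius."""
    return (deg_f - 32) * 5 / 9


def parse_hddtemp(line):
    """Return (device, model, temp in C rounded to 0.1), or None if no match."""
    # drop mail quoting ("> ") and surrounding blanks before matching
    m = HDD_LINE.match(line.lstrip("> ").strip())
    if m is None:
        return None
    return m["dev"], m["model"], round(f_to_c(int(m["temp"])), 1)


for line in (
    "/dev/sda: Maxtor 6Y160M0: 77 F",
    "/dev/sdb: WDC WD2500KS-00MJB0: 96 F",
    "/dev/sdc: ST3750330AS: 80 F",
):
    print(parse_hddtemp(line))
```

that works out to 25.0 C, 35.6 C and 26.7 C. check each drive's data sheet
for its rated range.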

> This is becoming a science experiment because we are going to purchase a
> replacement for this machine, but I am trying to keep it up until the
> new machine becomes available,

a new box will be a wise investment. having a spare would be even wiser.
do you have spare hdd?

> and maybe this exercise in futility may
> help someone else searching the list.

not to mention tech support getting old system. :)


peace out.



