[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

Re: [Linux-cluster] service stuck in "starting" state



Well it looks like that you're missing a SSL Certificate, so Apache isn't starting. It also looks like the server is trying to stop the service after an unsuccessful startup and since it isn't started, the init script returns an error. I had something similar with Squid a while back.

On Fri, Jul 10, 2009 at 4:35 PM, <jason monsterjam org> wrote:
hey cluster gurus..
I have a 2 node cluster thats been running without issue for quite a while.. all of a sudden one of the nodes will not
completely start the apache webserver service.. it looks like this

[root tf1 ~]# clustat
Member Status: Quorate

 Member Name                              Status
 ------ ----                              ------
 tf1                                      Online, Local, rgmanager
 tf2                                      Online, rgmanager

 Service Name         Owner (Last)                   State
 ------- ----         ----- ------                   -----
 Apache Service       tf1                            starting
 postfix service      tf1                            started
[root tf1 ~]#

and I see that the httpd is NOT started. although, if I do
/etc/init.d/httpd start
the service starts without issue.

grepping for apache and http in the logs, I see this..

Jul 10 14:32:13 tf1 httpd: httpd shutdown failed
Jul 10 14:32:52 tf1 httpd: httpd shutdown failed
Jul 10 14:33:11 tf1 httpd: httpd shutdown failed
Jul 10 14:33:57 tf1 httpd: Syntax error on line 117 of /etc/httpd/conf.d/ssl.conf:
Jul 10 14:33:57 tf1 httpd: SSLCertificateFile: file '/etc/httpd/conf/ssl.crt/server.crt' does not exist or is empty
Jul 10 14:33:57 tf1 httpd: httpd startup failed
Jul 10 14:34:06 tf1 httpd: Syntax error on line 117 of /etc/httpd/conf.d/ssl.conf:
Jul 10 14:34:06 tf1 httpd: SSLCertificateFile: file '/etc/httpd/conf/ssl.crt/server.crt' does not exist or is empty
Jul 10 14:34:06 tf1 httpd: httpd startup failed
Jul 10 14:34:08 tf1 httpd: httpd shutdown failed
Jul 10 16:23:33 tf1 clurgmgrd: [6168]: <info> Executing /etc/init.d/httpd stop
Jul 10 16:23:34 tf1 httpd: httpd shutdown failed
Jul 10 16:24:31 tf1 httpd: httpd shutdown failed
Jul 10 16:24:36 tf1 httpd: httpd shutdown failed
Jul 10 16:24:41 tf1 httpd: httpd startup succeeded
Jul 10 18:10:13 tf1 clurgmgrd: [6231]: <info> Executing /etc/init.d/httpd stop
Jul 10 18:10:13 tf1 httpd: httpd shutdown failed
Jul 10 18:22:00 tf1 httpd: httpd startup succeeded
[root tf1 log]# grep apache  messages
Jul 10 04:40:00 tf1 clurgmgrd[6267]: <notice> stop on script "cluster_apache" returned 1 (generic error)
Jul 10 10:04:33 tf1 clurgmgrd[6149]: <notice> stop on script "cluster_apache" returned 1 (generic error)
Jul 10 14:29:54 tf1 clurgmgrd[6281]: <notice> stop on script "cluster_apache" returned 1 (generic error)
Jul 10 16:23:34 tf1 clurgmgrd[6168]: <notice> stop on script "cluster_apache" returned 1 (generic error)
Jul 10 18:10:13 tf1 clurgmgrd[6231]: <notice> stop on script "cluster_apache" returned 1 (generic error)
[root tf1 log]#


Im guessing its the  stop on script "cluster_apache" returned 1 (generic error)
but I looked at the /etc/init.d/httpd on tf1 and tf2 and they are both the same size

[root tf2 ~]# ls -al /etc/init.d/httpd
-rwxr-xr-x  1 root root 3201 Jan 30  2007 /etc/init.d/httpd

[root tf1 log]# ls -al /etc/init.d/httpd
-rwxr-xr-x  1 root root 3201 Jan 30  2007 /etc/init.d/httpd

and the apache service starts/stops just fine on tf2 when the services get failed over to that machine.

any ideas on what can be wrong?

Jason

--
Linux-cluster mailing list
Linux-cluster redhat com
https://www.redhat.com/mailman/listinfo/linux-cluster


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]