/srv/irclogs.ubuntu.com/2012/09/20/#upstart.txt

=== j4m3s__ is now known as j4m3s_
=== Kiall is now known as zz_Kiall
=== zz_Kiall is now known as Kiall
astrostlis there a way to 'unstick' the upstart system on rhel6 without rebooting?  stop: Job has already been stopped: elasticsearch - fair enough.  when i start it, though, it hangs indefinitely.  i've had this happen before when developing scripts, and the 'fix' is to reboot it and start with a good script.18:52
astrostlrhel6 upstart = upstart-0.6.5-12.el6.x86_64 .  old, i know.18:52
astrostli know my scripts are right now because if i copy elasticsearch.conf to elasticsearch2.conf it starts and stops correctly18:56
SpamapSastrostl: stuck how?19:12
astrostli ended up rebooting to clear it, but19:12
SpamapSastrostl: can you paste the output of 'status elasticsearch' and the job.conf ?19:13
astrostlstuck as in 'start elasticsearch' does nothing19:13
astrostlhangs indefinitely19:13
SpamapSastrostl: hangs indefinitely sounds like a problem with the .conf file19:13
astrostlas verified with the copy, it isn't.19:13
astrostlhttp://pastebin.com/6tmJ4tau19:13
astrostl(there is a leading s in the actual file)19:14
SpamapSweird, that should return as soon as su is executed19:14
astrostl'start elasticsearch' - hangs forever19:14
SpamapSbtw, using su has some problems19:14
astrostlcp elasticsearch.conf elasticsearch2.conf && 'start elasticsearch' - works perfectly19:14
SpamapSit opens a pam session19:14
astrostlthis isn't an su thing19:14
SpamapSagreed, but you should be aware of that19:14
SpamapSastrostl: status elasticsearch shows what?19:15
astrostlif i ctrl-c, says it's running19:15
astrostlif i stop, hangs indefinitely again19:15
astrostlif i ctrl-c that, then status, says stopped19:15
SpamapS"says its running" is a bit vague19:15
SpamapScan you paste the full output?19:15
astrostli ended up rebooting to clear it19:15
astrostlexample: elasticsearch start/running, process 175319:16
SpamapSastrostl: got syslogs for around that time?19:16
astrostlyes, they have nothing of note19:16
SpamapSastrostl: should be something like 'init: ....'19:16
astrostli watched messages live, it reports nothing at all when it's in "stuck" mode19:17
astrostlnot on start, not on stop19:17
SpamapSastrostl: I've only ever seen start hang forever when there's a really long post-start or expect fork where the main process never forks19:17
astrostl'kill -HUP 1' doesn't resolve it either19:17
SpamapSHUP'ing init is definitely not advised19:17
astrostlas i said twice, i ended up *REBOOTING*19:17
astrostlrelative to that, a HUP on init is not significant in my view19:18
SpamapSI understand that. Trying to prep you for the next time.19:18
SpamapSastrostl: HUP doesn't do what I thought it did... so ignore that warning. :p19:19
astrostlinit is designed to take a hup for reloads (e.g. inittab changes)19:19
SpamapSastrostl: ok so your question, how do I unstick a job, is hard to answer without some extra logs..19:20
SpamapSastrostl: if you expect it to happen again, perhaps raise log priority with 'initctl log-priority info'19:20
astrostli've had this happen 2-3 times during upstart script development19:20
astrostlbasic pattern: make an upstart script, start it, oops, try to stop, hangs indefinitely, now we're screwed19:21
astrostlcertain conditions from failed script starts seem to put that *NAME* in a hosed state19:22
SpamapSyes there is one well known way to do that19:23
astrostlfixing it won't do - fixing it and *RENAMING* (or rebooting) does19:23
SpamapSnotably, bug #40639719:23
astrostlis there a well-known way to undo that, aside from rebooting?19:23
SpamapShttps://bugs.launchpad.net/upstart/+bug/40639719:23
SpamapSastrostl: if its that bug, what has happened is upstart has lost track of the pid it thinks it should be tracking...19:24
SpamapSastrostl: the way to know if you've hit that problem is if 'status $jobname' shows a pid that does not exist19:24
SpamapSastrostl: the way to fix it w/o reboot is to exhaust the pid space so it does exist, then upstart will kill it19:24
astrostlthat sounds like the problem exactly19:24
astrostllol @ the solution - but that's exactly what i need to know.  thx!19:25
* dluna had the exact same problem a couple of weeks ago19:25
SpamapSyeah, I'm hoping a fix can be worked out in the next Ubuntu dev cycle, but I doubt that will land in any RHEL release any time soon with systemd looming19:28
SpamapSastrostl: please mark yourself as being affected by that bug.. it helps us figure out what to work on next19:28
SpamapSthat bug, by far, has the highest "heat"19:29
astrostlwill do, although i'm less optimistic that rhel will notice or care given how far back they are from the prod version of upstart19:30
astrostlupvoted, subscribed19:31
astrostlthx again, cya20:32

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!