/srv/irclogs.ubuntu.com/2012/03/14/#launchpad-yellow.txt

gary_posterbenji, frankban gmb, my older son lost more than a cup of blood today from a mongo nose bleed.  (On Sunday a friend of his pushed him off a trampoline and he landed on his nose, which may be related.)  I'm not sure what my schedule is yet--the bleed has stopped, but we're trying to get a dr appt for him asap.  In any case, whether it's with me or not, the people who are available for a call in a bit over an hour12:04
gary_postershould have a call12:04
benjigary_poster: will do; good luck with the nose bleed12:06
gary_posterthank you12:08
gary_posterfrankban, I'm going to file a bug for the testrepository unicode issue and then you can offer your branch as a possible solution.  Have you or gmb started a juju instance or shall I?12:58
gmbgary_poster, I haven't; I'm having juju bootstrap problems after upgrading.12:59
gmb(again)12:59
gary_postergmb, the one I mentioned yesterday, with the booleans?12:59
gary_posterI ended up removing all charms except the buildbot ones from the local repo13:00
gmbgary_poster, No, bootstrap just hangs, at least for ec2.13:00
gary_posterugh13:00
gmbI need to actually poke it with a stick to find out what's going on.13:00
gmbs/stick/strace/13:00
gary_poster:-)13:00
gary_postergmb, I'll see if I dupe13:00
gary_postergmb, so, you mean, you say "juju bootstrap" and the command never exits?  If so, that's not happening for me13:03
gmbgary_poster, Yeah, that's what I mean. Hmm. I'll break out strace after a call.13:04
gmbs/a/the13:04
gary_posterok13:04
benjigmb and frankban: the horde awaits: https://talkgadget.google.com/hangouts/extras/canonical.com/goldenhorde13:09
benjigary_poster: you probably aren't here, but you can join too if you are13:10
gary_posterthank you all.  I'll be able to attend too.  ignore the sounds of baby crying in the background13:10
gary_posterdr appt is 10:4513:10
gmbgary_poster, Firefox went bye-bye.13:14
gmbIn fact  it looks like a lot have things have gone  a bit sideways...13:16
gary_posterack gmb :-/13:16
gmbYay, power socket.13:47
* gmb -> lunch13:54
benjigary_poster: have you seen this in any of your test runs?14:11
benji**********************************************************************14:11
benjiCould not communicate with subprocess14:11
benji**********************************************************************14:11
gary_posterbenji I don't think so but will search through tee'd results from yesterday, one sec14:12
benjigary_poster: oops, it looks like the OOM killer got me14:12
gary_posterbenji, oh :-/14:12
gary_posterbenji, actually, yes14:13
gary_posterbenji, twice in my run, not in yours14:13
benjihmm14:14
gary_posterbenji, http://pastebin.ubuntu.com/883280/14:15
gary_posterbenji, I'm making a card for that14:15
gary_posterbenji, could be xvfb issue14:15
gary_posterbenji, it would be very interesting to have you run list 1 locally *not* in an lxc14:17
gary_posterto see if that shows up14:17
benjigary_poster: yep14:18
benjigary_poster: I may have an opportunity to do that soonish (I might have just fixed the read-only problem (with a one-line change to a test tearDown))14:18
gary_postergo you, benji  :-)14:18
benji:)14:19
gary_posterthough the read-only thing is a run list 2 issue14:19
benjigary_poster: right, I meant that once I fix this (list 1) issue, I could do a run of list 214:20
gary_posterThere was something else like that I saw yesterday that I did not put a card in for...looking through results again14:20
benjier, swap 1 and 2 above14:21
gary_postercool :-)14:22
frankbangary_poster: should I handle the subtle differences in leases path between lucid and oneiric/precise in setuplxc? Or can I just assume the lxc is lucid?14:22
gary_posterfrankban...it would be nice to handle precise because that will be what we use within a few months.  If it is not easy now, put a card in slack time for the precise version14:24
gary_posterMaybe that's the right thing to do either way; I'll leave it up to you14:24
frankbangary_poster: ok thanks14:25
gary_posternp14:25
gary_postergmb, I hain't fergotten ya.14:26
gary_poster(that's in "random silly accent Gary made up" to be clear)14:26
gary_posterAlmost ready, hopefully14:26
gmbgary_poster, Okay. I've not finished lunch yet though :)14:28
gary_postergmb, lol, ok14:28
gary_posterbenji, this happend on my testlist 2 run (yours) locally, but did *not* show up in the buildbot failure output from two days ago.  Did you see anything like this? http://pastebin.ubuntu.com/883307/14:34
* benji looks14:34
gary_posters/ my testlist 2 run (yours)/ my testlist 2 run (the one you ran first)/14:34
benjigary_poster: I haven't seen that failure.14:35
gary_posterok, benji, I'll say it was unique to something on my system then.  Thanks14:35
gary_posterbarry's absence will make it difficult to pick his brain14:51
gary_postergmb, I'm doing a bzr branch lp:~gmb/charm-tools/add-charm-helpers on the assumption that this is the right thing to do14:58
gmbgary_poster, It is; you'll also need to get lp:~gmb/ubuntu/precise/charm-tools/new-packaging for the debian stuff that I'm currently using14:59
gmb(Since charm-tools has separate source and packaging branches)14:59
gary_posterah ok14:59
gmbhttps://code.launchpad.net/~gmb/+recipe/charm-tools-daily has some of the details.15:00
gary_posterack15:00
* gary_poster decides he's still sick enough to be allowed to sit down while staring at build logs15:01
benjiThe review season begins.15:05
* gary_poster doesn't understand why a "successful build" would contain something that failed to build15:05
gmbgary_poster, Because the recipe built successfully - creating the source package - but the binary didn't :)15:06
gary_postergmb, bah :-)15:06
gmb:)15:07
gary_posterbenji frankban gmb, I added miscellaneous, high priority, dated cards to represent the work each of us needs to do for the reviews.  We should look at this on every call to make sure we are making sufficient and timely progress on it.15:22
benjik15:22
gary_posterWe now, relatively suddenly, have a *lot* of miscellaneous cards. :-) To get rid of one...15:22
gary_postergmb, what's up with lp2kanban?  Did you get that running, or is Brad still doing it, or...?15:23
gmbgary_poster, I got it running; let me check it.15:23
gary_posterthanks15:23
gary_posterthe "..." cards from Francesco ought to have had their titles filled afaik15:23
gary_posterfrankban, I'm trying to decide whether I feel qualified reviewing your branch. ;-) Meanwhile though, I did see one thing that I learned this week.  In this line...15:26
gary_posterwhile [ $delay -gt 0 -a ! -s {leases1} -a ! -s {leases2} ]15:26
gary_posterI suggest quoting "$delay"15:26
gary_posterIf you do not, in some edge cases the "[" command will get upset and confused, in my experience and per some advice I've read recently.15:26
gary_poster(in particular, the case in which $delay is empty...which will probably never happen here since I assume 0 is not empty, but it is supposedly just good practice to quote substitutions because of this kind of fragility)15:27
frankbangary_poster: humm, maybe when $delay is something starting with "-"?15:27
gary_posteryeah maybe that too15:27
frankbangary_poster: ok thanks for the hint15:28
gary_postersure15:29
gmbgary_poster, I've run it manually; looks like it's not running on cron for some reason; I'll poke around.15:29
gary_posterthanks gmb15:29
gmbAnyway; /me -> afk for a short while15:29
gmb(Bus home)15:29
gary_posterWhat the heck, I'll claim this review frankban :-)15:31
frankban:-D15:31
gary_posterfrankban, what does this mean: "truncate -c -s0 {leases1}"15:31
frankbangary_poster: if the file exists, truncate it at size 0, otherwise, do nothing15:32
gary_postergotcha15:32
gary_posterback in a few15:32
* gary_poster is lunching now16:25
gmbgary_poster, Okay, so I'll catch you when you get back, but interesting aside: if I build the package manually using Brad's steps, I get something that actually installs the python files. Not to anywhere useful, but it does _actually_ install them instead of ignoring them.16:57
gmbs/you/I/16:57
gary_postergmb, I'm here, but reviewing and emailing and stuff16:58
gary_posterand, interesting16:58
gmbgary_poster, Ah, okay. I'm going to be heading out shortly anyway. Haven't spoken to Barry yet; I'll ping him now and see if he'll be free for a chat later.16:58
gary_postergmb, cool.  barry doesn't seem to be around today16:59
gmbgary_poster, Ah, yes, I just noticed. Darnit.16:59
gary_postergmb, so what should our plan be?16:59
gmbgary_poster, My first choice is to get the python stuff out of charm tools and package that. This other way is proving unrewarding.17:00
gary_posterand in particular, what are our goals for the handover, and what are our goals for when I'm working.17:00
gary_postergmb, that's fine with me, but Clint is the maintainer of the charm helpers project17:00
gary_posterso I'd like to convince him17:00
gary_posterI'm happy to give him an ultimatum, of sorts:17:01
gary_postereither focus on helping us get his preferred approach to work, or let us do what you propose17:01
gary_posterwe've spent (via you primarily and brad secondarily) a *lot* of time on this17:02
gary_posterand we should wrap it up one way or the other17:02
gary_posterI'm happy to speak managerially about this to Clint ;-)17:02
gary_posteralso, what is your schedule for the next few hours work-wise, again?17:03
gmbgary_poster, Okay. I agree with you. My thought was that having a separate package that works would be a nice convincer :).17:03
gary_posterheh17:03
gmbgary_poster, I'll be afk until about 20:30 UTC, and will then finish my day ~1h.17:03
gary_postergmb, afk starting when? now-ish?17:03
gmbBut TBH I'll likely be around quite late; Sarah's got reports to write, so she pretty much needs me to shut up and get on with other things.17:04
gmbgary_poster, Yes.17:04
gary_poster:-)17:04
gary_posterok gmb, I'll see if I can corral SpamapS while you are gone17:04
gmbThanks :).17:04
gary_poster:-) welcome.  ttyl17:04
* gmb -> exeunt to divers alarums17:04
gary_posterheh17:04
gary_posterfrankban, bash-as-programming-language frightens me more and more as I learn more and more about it ;-) but what you've done looks very good.  There are some constructs that I understand from context but will simply trust you on (case in point: why the initial dollar sign in "delay=$(( $delay - 1 ))"?  Why can't it be "delay=( $delay - 1 )"?)17:11
gary_posterSo, will approve :-)17:12
gary_posterDone17:13
frankbangary_poster: thanks. that block was taken as is from start-ephemeral... so, no idea either, and I am not curious...17:13
gary_posterlol17:13
gary_postercool17:13
gary_posterI'm somewhat amazed that people still program in this stuff :-)17:14
frankbanyes, only perl is worse than this...17:14
frankban;-)17:15
gary_posterheh17:15
frankbangary_poster: landing, and, I am not a buildout expert, so maybe you could take a quick look at https://code.launchpad.net/~frankban/lpsetup/add-buildout/+merge/9746617:23
gary_posterfrankban, will do.17:27
frankbanty gary_poster17:27
gary_posterfrankban, approved buildbot branch conditionally17:54
gary_poster./bin/test doesn't work17:54
gary_posterother ideas in my reply17:55
frankbangary_poster: we still don't have unit tests in lpsetup: it is actually the next step. Having test, bin/test should work17:56
gary_posterfrankban, :-) ok cool17:56
frankbangary_poster: buildout was suggested by benji (basically to have a test runner for free I think). I think pip is already supported (setup.py), once the project will be registered to PyPI17:59
gary_posterfrankban, ok cool18:00
frankbanty gary_poster, EOD, have a nice evening18:00
gary_posteryou too frankban, bye18:00
benjiok, my list is down to 36 failures and 126 errors; submitting MP for read-only fix now18:20
gary_posterawesome benji!19:16
gary_postervery large number19:16
gary_posterbut not as large as 3000 :-)19:16
benjigary_poster: I'm persuing making bugs for all of the test isolation failurs, but I wonder if that's the right thing to do.  Thoughts?19:16
gary_posterbenji, that's fine as long as it doesn't take too long.  I've identified four "high" priority cards19:17
gary_posteryou and frankban have addressed two19:17
gary_posterthe other two on board are, as you'd expect, the next things I'd like to see19:18
gary_posterbecause they prevent us from getting a true full run of test19:18
gary_posters19:18
benjiI am in a state of kanban conflict: the board is now not over the limit (with me moving my branch into landing) but we can't add anything else without going back over19:19
gary_posterbenji, if there is a high card, there is clear visible reason for why we would do so19:19
gary_posterI don't normally encourage this19:20
gary_posterbut the high cards, and the reasoning/concern behind them, warrant it in my opinion19:20
benjiok, that sounds reasonable to me; I'll take one of the two remaining high cards19:20
gary_postercool19:20
gary_posterbenji, does test list 2 still hang at the end even with your isolation fix?  It would have been nice if... :-)19:31
benjigary_poster: if test list 2 is tmpXctd5i, then no19:31
benjiso, you're suggesting that bug 954384 may already be fixed?19:32
_mup_Bug #954384: test teardown can hang <paralleltest> <Launchpad itself:Triaged> < https://launchpad.net/bugs/954384 >19:32
gary_posterbenji, it doesn't hang?  yes!  that is what I am suggesting!  With glee in my heart!19:32
* benji watches as gary_poster skips through a meadow and whistles with songbirds.19:33
gary_poster:-)19:33
gary_posteryeah, I've been kind of a ball of tension this week for one reason and another.19:33
gary_posterthis relieves a big reason19:33
benjiI can say definitively that it didn't hang, but I can't say for sure that it hung before my fix.... hmm, or can I; let me look at something.19:34
gary_posterright, it could still be lxc19:34
gary_posterunless it hung for you before19:35
benjinope, I don't have any evidence that it hung before19:36
benjiI can do another test run, maybe late tonight (because it takes forever, 199 minutes last time) to try to get it to hang (running the pre-fix code).19:37
benjiall this casts doubt on whether or not I can or should work on 954384 (which I planned on doing)19:37
benjiI could try to reproduce the hang wiht a smaller test subset.  That seems like a smart thing to do.19:38
benjiI also really need to get a precise lxc or vm up.19:38
gary_posterbenji, why don't you do the precise vm/lxc19:41
gary_posterIf we don't have the hang anymore then trying to repro doesn't make a lot of sense to me19:42
gary_posteroh19:42
gary_posterright19:42
gary_posteryou don't have any evidence that it hung before19:42
gary_posterwhich means it could still be an lxc thing19:42
benjiright, since I haven't provoked it, we can't be sure that me not seeing it is evidence that it is gone19:43
benjibut the not having a vm thing is really hurting me too, for example, I can't land the fix for the read-only bug until I have one.  I have to depend on the kindness of strangers, hint hint.19:44
benjispeaking of, is lpsetup supposed to work?  I get TypeError: unsupported operand type(s) for +: 'NoneType' and 'str' when I run it19:45
gary_posterbenji, lpsetup it is not supposed to work19:49
gary_posterbenji, kindness of strangers, lol, ok19:50
benjiok, good, I guess :)19:50
benjithanks!19:50
gary_posterbenji, I am setting up my juju buildbot to run your branch, then will land19:50
gary_posterI mean, I will land it once I have started the tests, not after they have run19:51
gary_posterwell19:51
gary_posteroh heck, yes19:51
gary_posterlxc-wait is really a drag19:52
benjiheh19:54
benjiok... it is apparently both required that we devleop on precise (because we can't land otherwise) and impossible to set LP up on precise (email thread started by 20:03
benjiDeryck20:04
benjibang! bang!20:04
* benji shoots the ferral tabs that got in here.20:04
* benji is orthographically challenged, but can still spell "orthographically challenged"20:05
gary_posterbenji, lxc works :-)20:38
benjigary_poster: I don't follow.20:39
gary_posterbenji, which is what I do (precise host, lucid lxc instances)20:39
gary_posterand that appears to work20:39
gary_posterbenji, on another note20:39
gary_posterlxc-start-ephemeral is adding to the number of days of my life that I have lost this week from stress :-P20:39
benjiheh20:39
benjiwait, you develop on lucid?  I thought we were supposed to use precise?20:40
gary_posterbenji, I edit my code on precise.  I run my tests and my dev tools on lucid20:40
gary_posterwhich is AOK20:41
gary_posterbenji, back on my stress-inducing topic...take a glance at http://ec2-50-16-1-238.compute-1.amazonaws.com:8010/builders/lucid_lp/builds/2/steps/shell_8/logs/stdio20:41
benjioh!  I needed that bit of info!  I'll try to set up a lucid lxc container then.20:41
gary_posternotice the __init__ complaint, #1.  Where the heck did that come from?20:42
benjihmm "exit code 3"20:42
gary_poster#2, I try to run the given command on the slave20:42
gary_posterThis fails, complaining that xvfb fails20:42
gary_poster#3, I try to run the "start a ephemeral container and then I'll log into it".  This works.  Then I log in, become buildbot, and run the other part of the command.  This succeeds20:43
benjiooh, I didn't see that.  That is odd.20:43
gary_posterSo...I'm about to try duping the ssh command20:43
gary_posterAnd all I *really* want to do is verify that frankban's fix from yesterday fixes our problem, and verify that your fix from today fixes the majority of our tests, and might or might not fix the hang problem20:45
gary_posterbut instead I keep getting "one step forward two steps back" issues20:45
gary_posterok, venting over.  thanks benji. ;-) now I'll go back to trying to do the ssh20:45
benjigood luck :)20:46
gary_poster;-) thanks20:46
gary_posterbenji argh:20:50
gary_posterssh -n -o StrictHostKeyChecking=no -o UserKnownHostsFile=/dev/null -i /var/lib/buildbot/.ssh/launchpad_lxc_id_rsa buildbot@10.0.3.104 -- 'xvfb-run --error-file=/var/tmp/xvfb-errors.log --server-args=-screen 0 1024x768x24 -a /var/lib/buildbot/slaves/slave/lucid-devel/build/bin/test --subunit'20:50
gary_posternotice the lack of quotes that we so carefully added20:50
benjipfft20:50
gary_posterbenji, what did we do to fix that again?  I forgot20:50
benjiI'm trying to remember.20:51
benjigary_poster: I think this was the very long line and regex to remove extra spaces in setuplxc.20:53
gary_posterbenji, this is what we are producing now, from the setuplxc script:20:54
gary_posterlxc-start-ephemeral -u buildbot -S /var/lib/buildbot/.ssh/launchpad_lxc_id_rsa -o lptests -- xvfb-run --error-file=/var/tmp/xvfb-errors.log --server-args='-screen 0 1024x768x24' -a /var/lib/buildbot/slaves/slave/lucid-devel/build/bin/test --subunit20:54
gary_posterIOW, it is good20:54
gary_posterafaict20:54
gary_posterI'm going to try random quoting in lxc-start-ephemeral...20:54
benjigary_poster: I don't think that's good.  Remember that ssh handles the arguments incorrectly, we need to put everythin after -- in double-quotes.20:57
gary_posterbenji, well...20:58
gary_posterbenji, maybe that's what we did.  I don't remember, but we could dig it up.  BUT...20:59
gary_posterIf I call this20:59
gary_posterlxc-start-ephemeral -u buildbot -S /var/lib/buildbot/.ssh/launchpad_lxc_id_rsa -o lptests -- xvfb-run --error-file=/var/tmp/xvfb-errors.log --server-args='-screen 0 1024x768x24' -a /var/lib/buildbot/slaves/slave/lucid-devel/build/bin/test --subunit20:59
gary_posterthen ISTM that lxc-start-ephemeral ought to be able to quote everything itself20:59
gary_posterhowever21:00
gary_posterif you run that command21:00
gary_poster$@ (in lxc-start-ephemeral) is this21:00
gary_posterxvfb-run --error-file=/var/tmp/xvfb-errors.log --server-args=-screen 0 1024x768x24 -a /var/lib/buildbot/slaves/slave/lucid-devel/build/bin/test --subunit21:01
gary_posterthat is, the quotes have been lost already21:01
gary_posterbefore we even get to ssh21:01
benjiwell, not exactly, the "--server-args=-screen 0 1024x768x24" bit is in a single arugment, but $@ formats the arguments as a string, loosing the grouping information which is then re-interpreted by ssh (incorrectly)21:02
benjisetuplxc on devel looks like it should do the right thing to me.  I suggest looking at the setuplxc on your box to be sure it has the latest code21:03
gary_posterbenji, what I believe is supposed to be the fix is there:21:06
gary_posterhttp://pastebin.ubuntu.com/883900/21:06
benjigary_poster: yeah, that looks right to me... hmm, at one point we also put double-quotes around the $@ passed to ssh inside lxc-start-ephemeral; I can't remember if that was part of the fina fix or not, but that'd be the next thing I try21:07
benjithis is irritating, reasoning about this stuff shouldn't be this hard21:09
gary_posterbenji, on late team lead call21:09
gary_posterbenji, I'm still on call21:21
gary_posterbut21:21
gary_posteroh21:21
gary_posterI bet you have stopped :-)21:21
benjigary_poster: I'm trying to stop. ;)21:22
gary_posterbenji, :-) ok.  So, I think that might have been a red herring.  I'll explain last :-)21:22
gary_posterlater I mean21:23
gmbgary_poster, So, did you hear anything back from Spamaps?21:42
gary_postergmb, ugh, no, I've been deep into my private hell of buildbot no longer working, sorry.  Let me see if he is still around21:43
gmbgary_poster, No worries. Seems like this week is a week of private hells.21:43
gary_posteryeah :-/21:43
gmbIf we all had a hangout, at least we'd be toasty together.21:43
gary_posterheh, yeah :-)21:43
gary_postergmb, you on #juju?22:00
gmbgary_poster, yes; I was looking at the wrong #juju :)22:01
gary_poster:-)22:01
gmbgary_poster, Awesomesauce.22:01
gmbAlso, never saying "awesomesauce" again.22:02
gary_posterlol22:02
gary_postergmb, do you have any champagne?22:03
* gary_poster steps away22:05
gmbgary_poster, No, but I'll sleep better tonight :)22:11
gary_postergmb, cool :-)22:11

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!