[00:24] sinzui: ping [00:34] thumper: you about? [00:34] ya [00:35] http://pastebin.ubuntu.com/7187528/ [00:35] the test passed for me [00:35] just ran it locally, not ppc obviously [00:36] wallyworld_: which gccgo ? [00:36] but i thought you said that just using gccgo would cause it to fail [00:36] sorry [00:36] waigani: [00:36] davecheney: gcc version 4.9.0 20140330 [00:36] waigani: your ppc details are incoming [00:36] waigani: now we need to get you set up on the ppc machine [00:36] wallyworld_: I'm stealing your identity ;) [00:36] waigani: your login has been setup... [00:36] nothing worth stealing really [00:36] waigani: I can add you as an ssh user to my vm [00:37] thumper: waigani got a new vm for waigani [00:37] davecheney: already? [00:37] thumper: okay. Didn't you say it failed for you on your local machine? [00:37] yeah... [00:37] davecheney: nice :) [00:37] waigani: run the entire package [00:38] waigani: what lp is this again ? [00:38] thumper: all pass [00:38] hmm... [00:38] waigani: definitely need to get you running that on power [00:38] davecheney: https://bugs.launchpad.net/juju-core/+bug/1300032 [00:38] <_mup_> Bug #1300032: charm: gccgo test failure [00:38] davecheney: what is that line to import ssh keys from lp? [00:38] waigani: yeah that will pass on your local macine [00:38] disconnect from your network [00:39] ah [00:39] or export http_proxy="www.microsoft.com" [00:39] or something [00:39] thumper: ssh-copy-id [00:39] ssh-import-id [00:39] i forget [00:39] i think the latter [00:39] ssh-import-id $LAUNCHPAD [00:39] kk [00:40] waigani: your machine is winton-09 [00:40] instructions are here -> https://docs.google.com/a/canonical.com/document/d/1m9R2n6LPLNLGjdopcNkQYVG8D5V4FTyvc1vvn-9ZifM/edit [00:44] 11:40 < davecheney> waigani: your machine is winton-09 [00:44] 11:40 < davecheney> instructions are here -> https://docs.google.com/a/canonical.com/document/d/1m9R2n6LPLNLGjdopcNkQYVG8D5V4FTyvc1vvn-9ZifM/edit [00:44] davecheney: cheers. I've got a screen full of errors now :) yay [00:49] hows this coming https://blueprints.launchpad.net/ubuntu/+spec/servercloud-p-arm-service-orchestration [00:49] and could i use devices such as http://www.amazon.com/Rikomagic-Android-Cortex-A9-External-Miracast/dp/B00GPJLVQW [00:51] Valduare: wrong channel made [00:51] mate [00:51] sorry [00:52] this is where the developers of juju are? [00:52] the guys who are implementing juju for arm or is there a #juju-arm channel [00:52] Valduare: we write juju [00:52] we don't know anything about ubuntu on arm [00:52] specfically [00:52] try #server [00:53] or #ubuntu [00:53] ok [00:54] Valduare: sorry, i can't really tell you who to talk to, only that we're not the right people [00:55] someone here has to be interested in testing juju on arm? [00:55] doesnt make sense there wouldnt lol [00:59] Valduare: there is... [00:59] Valduare: mwhudson looks at arm of some form [00:59] Valduare: depends what you are wanting to know [00:59] axw: hey, here on time? [00:59] thumper: yo, I am in fact [00:59] cool [01:00] thumper: im interested in some suggestions for arm servers that will be supported well [01:00] i only have experience with mk808 style hdmi stick devices heh [01:06] hmm... I'm getting: ssh_exchange_identification: Connection closed by remote host [01:15] waigani: add a -vv on there [01:20] thumper: sorry, closed wrong tab [01:21] davecheney: emailed you output [01:22] waigani: something between you an batuan is blocking port 22 [01:23] waigani: is tim using the internet again [01:23] heh [01:34] c.Assert(a, gc.DeepEquals, []string{"amd64"}) [01:34] ... obtained []string = []string{"i386", "amd64"} [01:34] ... expected []string = []string{"amd64"} [01:34] OOPS: 81 passed, 1 FAILED [01:34] --- FAIL: TestMAAS (11.60 seconds) [01:34] FAIL [01:34] FAIL launchpad.net/juju-core/provider/maas 11.699s [01:34] ffs [01:35] Valduare: there is still (!) no really compelling arm server product [01:35] mwhudson: highbank ? [01:35] davecheney: calxeda went bust [01:35] so if you have one, sure [01:35] but you can't get any more... [01:35] mwhudson: though not a lot of ram ie 2 gigs each, these mk902 devices look pretty neat [01:36] Valduare: when the ubuntu server team look at those arm boards [01:36] the lack of an onboard LOM or IPMI controllre [01:36] pretty much knock them out of contention [01:36] Valduare: just check that the ethernet isn't some usb2 rubbish [01:36] yes, and that [01:36] ah [01:36] I was talking to the linux-sunxi guys about a u-boot that could pause and wait for the kernel over network [01:37] Valduare: sure [01:37] but there is a difference between being able to do that in software [01:37] and being able to do it in hardware [01:37] and it's that distinction that is what draws the line between consumer grade and server grade [01:37] at least in my mind [01:37] aye [01:38] if you want to solder things and play around and have fun, then yes, go for it [01:38] but it's not something you put in a data centre [01:38] i dream of running my small DC on solar one day lol [01:38] mwhudson: except in our data center :) [01:38] i've heard that is *exactly* how the lp arm builders work :P [01:38] davecheney: not any more, we have some highbanks now :-) [01:38] lucky we got them in time [01:38] * davecheney doffs hat [01:38] but yes, we did have panda boards in our dc [01:39] is _really_ hated them [01:39] mwhudson: yeah [01:39] mwhudson: i love my panda [01:39] i've never had a more reliable builder [01:39] what do you think of panda board vs mk902 type device [01:39] although probably not as much as some of the strange things we had before [01:39] Valduare: I don't have much experience with mk902s [01:39] Valduare: panda is better supported for running ubuntu-type distros on i think [01:39] i've used the A20's from cubie [01:40] imx6's [01:40] Ti's [01:40] and the exegons [01:40] whatever [01:40] stupid name [01:40] -1 for marketing a product that nobody can spell [01:40] lol [01:40] then i was looking at these little beasts http://www.ebay.com/itm/FREE-SHIP-Supermicro-A1SAM-2750F-O-Intel-Atom-C2750-DDR3-SATA3-V-4GbE-/191091952279?pt=LH_DefaultDomain_0&hash=item2c7df7ca97 [01:40] 8 core atom - 64 gigs ecc ram [01:42] not enough money in the world hwen your buying for yourself heh [01:48] on your icydocks what kind of drives are you running in them [01:48] Valduare: i appreciate you really want to talk about arm [01:48] but this isn't the channel [01:49] aye - [02:30] wallyworld_: hangout? [02:30] sec [02:31] wallyworld_: https://plus.google.com/hangouts/_/76cpj3ag8c2517af97k8drl72c?hl=en when you are ready [02:42] thumper: I'm back [02:46] wallyworld_: https://plus.google.com/hangouts/_/7acpj3620q4qsuh0smb7157iqg?hl=en [02:46] nope [02:46] waigani: https://plus.google.com/hangouts/_/7acpj3620q4qsuh0smb7157iqg?hl=en [02:46] wallyworld_: sorry [02:46] np [03:03] thumper: packages are all installed now [03:03] waigani: oh, good [03:12] davecheney: I'm in! But ... trying to install golang, it has unmet dependencies which can't be installed [03:15] waigani: paste ? [03:15] davecheney: http://pastebin.ubuntu.com/7187924/ [03:18] waigani: where did you get those instructions ? [03:20] waigani: instructions are here -> https://docs.google.com/a/canonical.com/document/d/1m9R2n6LPLNLGjdopcNkQYVG8D5V4FTyvc1vvn-9ZifM/edit [03:25] waigani: thumper mwhudson http://paste.ubuntu.com/7187946/ [03:25] current state on ppc64 [03:39] * thumper looks [03:39] wow, joyent tests suck [03:41] thumper: is that why they pass ? [03:41] or the whole, 90% chance of timing out [03:41] davecheney: I was just looking at the time [03:56] davecheney: what am I missing? Can't install go on ppcvm: http://pastebin.ubuntu.com/7188020/ [03:59] thumper: yeah, i never got to fix the tests run time [03:59] didn't look into it [04:00] also, testing bug 1300032 locally, gccgo without network, I get a pile of linker errors: http://paste.ubuntu.com/7187960/ [04:00] <_mup_> Bug #1300032: charm: gccgo test failure [04:01] I rebuilt pkg, updated and upgraded via apt-get - still get the same errors [04:01] thumper: ^ [04:01] wallyworld_: ok, we can put it on the TODO list [04:02] waigani: where did you read those instructions ? [04:02] waigani: hmm... I don't know [04:03] waigani: lets fix one problem at a time [04:03] davecheney: trying to follow yours, go get ..., no go, so trying to install go [04:03] waigani: can you please show me where in the instructions that it it says to do the line that you are having trouble with [04:03] $ go get -v launchpad.net/juju-core/… [04:03] waigani: which document [04:04] davecheney: the one you sent me: https://docs.google.com/a/canonical.com/document/d/1m9R2n6LPLNLGjdopcNkQYVG8D5V4FTyvc1vvn-9ZifM/edit# [04:04] what does the line directly preceeding that one say ? [04:05] $ sudo apt-get install gccgo-4.9 gccgo-go bzr git [04:05] did that command work ? [04:06] davecheney: nop [04:06] ok, so lets fix that one [04:06] then following commands might work [04:06] okay [04:06] waigani: http://pastebin.ubuntu.com/7188020/ [04:07] ^ this paste does not match the command you said you entered [04:07] can you check please [04:08] davecheney: that was my attempt to install go [04:08] davecheney: http://pastebin.ubuntu.com/7188045/ [04:08] ok, so those are all installed [04:09] apt is complaing because a prevoius installation failed [04:09] what happens if you follow its advice ? [04:09] it'll probably tell you to uninstall something, probably the 'golang' meta package [04:09] apt-get -f install.. [04:10] yep removed [04:10] you should be good to go [04:12] yep, reran install and I've got go. Thanks [04:13] cool [04:14] /usr/bin/go is probably provided by an alternative [04:14] so it won't be present til the post install of gccgo-go runs [04:14] and that doesn't run if there are apt errors [04:38] * thumper taps fingers waiting for lbox [04:40] * davecheney plays a few hands of solitare [04:41] for someone... simple test tweak https://codereview.appspot.com/82940044 [04:43] thumper: you need a defer to close that listenre [04:43] well, need is a bit strong [04:43] maybe should === vladk|offline is now known as vladk [06:07] protip: if you've been running juju tests all day, check /tmp === vladk is now known as vladk|offline [07:22] mornin' all [07:30] morning rogpeppe [07:30] axw: hiya [07:31] rogpeppe: I have an LGTM from fwereade on the fixes to parallel open, did you want to take a look before I land? [07:31] axw: just looking [07:33] fwereade: no rush, just checking: did you see my email about InstanceDistributor? [07:36] axw: TestDialWebsocketStopped looks a little dubious to me [07:36] axw: doesn't it run the risk of panicking? [07:37] rogpeppe: I don't see how, can you please expand? [07:38] axw: you're passing websocket.Config into newWebsocketDialler [07:38] axw: s/passing/passing a nil/ [07:39] axw: istm that you're only lucky that it doesn't get into websocket.DialConfig before it receives the stop notification [07:40] rogpeppe: no luck involved, it's intentional that it should not use the config if the channel is stopped. but I can pass in a non-nil config if you feel strongly [07:41] axw: ah, of course, you close the channel before calling the function. [07:41] axw: doh! sorry, ignore me [07:41] rogpeppe: no worries :) [07:42] axw: LGM [07:42] axw: LGTM even :-) [07:42] cheers [07:43] axw, not yet [07:43] rogpeppe: what's wrong if "else if"? [07:43] fwereade: okey dokey [07:44] axw: it makes me look to see if there's some reason that the if is necessary [07:45] axw: s/if is/else is/ [07:45] axw: the only reason that i can see that it's necessary here is to save a line, but i don't think that's a great reason [07:46] rogpeppe: ok. I tend to do it not so much to save lines, but to group related comparisons [07:46] related tests [07:46] whatever [07:48] axw: yeah, fair enough. i guess whereever i see an else after a break or return, i wonder if there's been a mistake [07:48] axw: but if you feel strongly about it, keep it [07:48] not particularly :) === vladk|offline is now known as vladk [08:13] good morning [08:16] morning vladk [08:21] morning all [08:22] morning dimitern [08:26] rogpeppe: isn't "HA: agent cache current API addresses" (i.e. with a worker) pointless if we keep provider-state up-to-date in storage? [08:27] axw: only if the agent can get access to provider-state [08:27] axw: (which it can't currently) [08:27] ah right [08:28] rogpeppe, fwereade when you both get a moment can you take another look at https://codereview.appspot.com/75600044/. fwereade: It's changed since you LGTMed it [08:28] mattyw: will do [08:30] rogpeppe: https://codereview.appspot.com/83050045 - jujud generates shared secret & calls SetStateServingInfo [08:31] axw: brill, looking [08:33] axw: voidspace is shortly to propose a CL which adds a SetStateServingInfo to agent config. [08:33] axw: (and a StateServingInfo getter too) [08:34] ah ok [08:34] axw: but actually, that should merge ok with your branch [08:35] axw: isn't the "between 6 and 1024 chars and base64" stipulation funny? [08:37] rogpeppe: it is a bit, not sure what the deal is there [08:38] rogpeppe: it's not clear on whether it's inclusive/exclusive either, but mongo doesn't complain with 1024 at least [08:38] axw: good [08:38] axw: 1024 seems a bit like overkill though [08:39] axw: especially as the randomness won't compress and cloudinit files are limited in size [08:39] axw: although... it actually never goes in a cloudinit file, does it? [08:39] axw: so it probably makes no odds [08:40] rogpeppe: right, it doesn't. it only goes to disk and mongo [08:41] axw: i'm just looking at MongoUpstartService. I *think* that's never used to generate an upstart conf that goes across the network. does that seem right to you? [08:42] axw: in which case filepath would be more appropriate than path, but then again, this is *upstart*, so path will be fine in fact. [08:42] rogpeppe: MongoUpstartService is used inside environs/cloudinit [08:42] for how long I don't know [08:42] axw: that needs to go away though [08:43] I assume once this is all done, we'll generate the mongo upstart service in the agent [08:43] sure [08:43] axw: when EnsureMongoService is called by jujud [08:43] axw: yeah, there's a long-standing CL that will do that [08:44] axw, thumper: hopefully I requeued the right branches to land. I had to do some bot surgery to get 1.18 available, and I noticed we were out-of-disk-space, which is probably why you were resubmitting them earlier. [08:45] jam1: thanks, sorry if I was interfering [08:46] axw: I didn't do my normal safety checks to see if it was running anything and grab the fslock, but hopefully I didn't break anything too badly. === axw is now known as axw-afk [08:50] morning all [08:50] g'morning [09:23] voidspace: hiya === axw-afk is now known as axw [09:44] * fwereade will be a few mins late to the meeting to allow the nearby hoovering to complete [09:44] don't wait for me [09:44] fwereade: it's an hour later [09:45] so much the better then :) [09:46] voidspace: https://code.launchpad.net/~axwalk/juju-core/jujud-bootstrap-mongo-sharedsecret/+merge/213610 [09:50] fwereade: that's a lot of hoovering :0 [09:50] haha [09:50] rogpeppe: I'm not muted [09:50] rogpeppe: I can hear [09:51] voidspace: hmm, you just went ultra-quiet [09:58] wwitzel3: I'm just grabbing a coffee, then I'll be in our 1:1 hangout [09:59] good morning [09:59] jam: sounds good [10:01] good morning perrito666 [10:09] morning all [10:11] morning natefinch [10:14] natefinch: morning [10:14] wwitzel3: morning [10:15] voidspace: morning [10:24] voidspace: environs/cloudinit/cloudinit.go [10:29] rogpeppe1: ~mfoord/juju-core/stateservinginfo [10:39] so running tests is pooing in my tmp directory [10:40] fwereade: will you have a moment after the standup to discuss SetPrivate/PublicAddress? [10:40] axw: re: https://code.launchpad.net/~axwalk/juju-core/jujud-bootstrap-mongo-sharedsecret/+merge/213610 [10:41] axw: what happens if you pass --keyFile when we don't use replSet, especially on a Mongo that was initialized differently ? [10:46] wallyworld_, ofc [10:58] jam: I tested with the local provider, it just worked [10:58] jam: I created another card to create the keyfile on upgrade [10:59] axw: thx. I think the idea is that EnsureMongoServer code (that hasn't landed yet) is intended to "make the state consistent" [10:59] which would be writing that file [10:59] and rewriting the Upstart file [11:00] makes sense [11:00] I will update the card to reflect that [11:03] voidspace: about the tmp directory. That can happen if you kill the tests. In theory it shouldn't ever happen if you don't kill the tests. The problem with killing them is that deferred statements don't happen if the application dies, and that's how we do all our cleanup. [11:04] (note the "in theory" ... we seem to leak stuff in tmp occasionally on the bot, which shouldn't be killing anything) [11:06] perrito666: I think someone changed canonical admin, because I just got emails for 5 holiday requests from you. Approving them now. [11:06] jam: sarah did [11:06] jam: I was under the wrong person [11:07] natefinch: right, thanks [11:14] fwereade, post call.. ran into a strange bug around proper garbage collection of machines in status ... seems to be consistent around the 150 machine point.. https://bugs.launchpad.net/juju-core/+bug/1299584 .. was curious if you might be able to identify src location so i could poke at a bit more. [11:14] <_mup_> Bug #1299584: dead machines stuck in status [11:16] hazmat, that sounds like a provisioner bug, but I can't see why it only shows up after a while [11:17] hazmat, the provisioner is responsible for the last step of machine cleanup -- the cleanup actually removes other stuff, and leaves the machines dead for the provisioner [11:17] fwereade, ic, thanks [11:19] rogpeppe1, which tool is that? [11:20] rogpeppe1, re deps and go get [11:20] hazmat: https://github.com/tools/godep [11:21] hazmat: but we should go the api stabilisation route too [11:34] rogpeppe1, natefinch: https://codereview.appspot.com/83150043 IsMaster API review [11:35] wwitzel3: will take a look after the meeting [11:35] rogpeppe1: yep, thanks [11:42] fwereade: who is the relevant landscape guy? [11:44] wwitzel3: just a personal opinion, I would like a less pythonic way to mock SeectPeerAddress :) but it its completely personal and with no justification [11:46] perrito666: comment on the review with an example and I'd be happy to consider it [11:50] voidspace: shall we move into another hangout? [11:51] rogpeppe1: sure [11:51] rogpeppe1: ah, wrapping up [11:51] voidspace: or... we could stay, yeah [11:53] wwitzel3: done, I am not sure if mine is better :) It could actually be worse [11:57] local coffee substitute beverage time, brb [12:00] wwitzel3: you have a review [12:00] perrito666: i made an alternative suggestion [12:12] rogpeppe1: I dont fully understand your suggestion (although I sense is better than mine) === psivaa is now known as psivaa-lunch [12:28] rogpeppe1: I'm going on lunch - if you produce a branch that fixes the config flow, or some test code to generate config files, can you email me the details please [12:38] epic - http://0value.com/my-Go-centric-Vim-setup [12:41] bodie_: see also http://blog.gopheracademy.com/vimgo-development-environment [12:42] rogpeppe1: I definitely need to read that in depth [12:44] it's not surprising i've got problems with hangouts. my internet speed is currently terrible. (0.74/0.18 Mbps) [12:45] rogpeppe1: measured how? [12:46] jam, I think you just beat me to make an MP to merge 1.17.7 into 1.18 [12:46] perrito666: speedtest.net (usually fairly reliable) [12:49] rogpeppe1: 15/1.2 :| I did not see that coming, being in the internet equivalent to congo [12:49] perrito666: that's better than i usually get [12:50] perrito666: but your effective speed will depend on routers out of your country too [12:50] well I do live in a neighborhood where internet is only used by me during work hours [12:50] I'm pretty sure the internet equivalent of the congo is the congo [12:51] mgz: well, I dont know, our topology is rather shameful :p but you might be right there jaja (the most used example outside is argentina) [12:51] mgz: the issue here is that I live on the country usually used as bad example [12:52] sinzui: good morning, dont hate me, did you see my email? :) [12:52] Not yet [12:52] mgz: lol [12:56] rogpeppe1, very nice :) I'm still using Sublime mostly because I'm lazy. [12:57] the gosublime package is pretty handy. a little bloaty, perhaps. [12:57] bodie_: i'm an outlier when it comes to editor usage :-) [12:58] it's lonely at the top ;) [12:58] bodie_: yeah, I use sublime because I like having mouse support and rational hotkeys. Gosublime does have a lot of stuff I don't need. [12:59] bodie_: i did use vi once upon a time though [12:59] I'm really comfortable with vi; so, "vintage mode" in sublime is great! ;) [13:00] i wrote a vi-mode command line editor once [13:00] natefinch: I left ST3 now and moved back to the good old vim [13:00] fun [13:01] I'm slowly transitioning off my training wheels, but for now it helps to be able to click around and see the filetree at a glance [13:01] sinzui: :). I'm still sorting out the bot, but it should be good now. [13:01] I believe Ian wants to get the joyent stuff into a 1.17.8 release. [13:02] thank you jam. I saw that in the log. We will see if I configured the joyent env properly for testing [13:05] mgz, jam, fwereade, https://codereview.appspot.com/83200043 - provisioner starting machines with networks specified [13:06] mgz, poke re status of that getnetworkslist branches? [13:09] dimitern: I'll get it landed now-ish I think, and tweak after [13:09] mgz, great, thanks! [13:11] voidspace: here's a 1.16 agent config: http://paste.ubuntu.com/7189587/ === psivaa-lunch is now known as psivaa [13:26] rogpeppe1: awesome [13:26] rogpeppe1: just updating my coffee - thanks [13:27] your coffee has revision numbers? [13:27] who's branch of coffee are you using? [13:28] * fwereade collecting laura [13:28] dimitern, in the provsioner, shouldn't we just accept the error from the provider? StartInstance errors shouldn't take down the task, they should be recorded as machine errors === rogpeppe2 is now known as rogpeppe [13:28] my internet connection is dodgy currently [13:28] my provider's running tests on the line [13:36] hey, is anyone working on "CLI: Ensure deploy --to and add-unit --to reports an error when the selected machine's provisioned networks match the service's" or can I take it? [13:40] sinzui: so we have a logical conflict about 1.17.7 in the 1.18 branch. Probably some test that needed to be updated for new 3rd party lib locations. I won't be able to get to it tonight, but will work on it first thing tomorrow. [13:50] wwitzel3, natefinch, perrito666: to help sinzui get it done, is it possible to look into this: https://code.launchpad.net/~jameinel/juju-core/1.18-merges-1.17.7/+merge/213615 [13:51] It looks like merging the juju/1.18 branch with my branch causes a problem with a bad import [13:51] I'm guessing we just moved it to a 3rd party lib. [13:51] you should be able to reproduce it locally, as it looks like a build failure. [13:55] jam: I'm starting an interview in 5 minutes, or I would look into it. [13:55] np, it might still work better for you, since I'm EOD [13:55] but don't pick it up unless nobody else steps up [13:55] natefinch: ^^ [13:55] jam: sure [13:56] jam, natefinch: I can look as well just want to finish a thought. [13:56] natefinch: we can pair on it? [13:57] wwitzel3: sorry, doing an interview, but possibly after, though that may be after noon. [13:57] I can look at it too also, wwitzel3 let me know if that thought becomes too long [13:58] perrito666: if you want to hangout while I wrap this up, then we can pair on it [13:58] perrito666: I glanced at it, but didn't have any quick ideas [13:59] well, if for a hangout you need my screen shared it will be a problem :) my linux pc is not hangoutable :p [14:03] rogpeppe: so I've pushed the test for 1.16 StatePort parsing based on the data you gave me [14:03] voidspace: cool [14:04] wwitzel3: branching 1.18 to test it locally [14:04] rogpeppe: just tweaking the test to write out a 1.18 config file from it so I can push the 1.18 test for the same thing [14:04] voidspace: i'm working on changing environs/cloudinit to use StateServingInfo [14:04] rogpeppe: that will be the only missing piece for my branch [14:04] subject to exhaustive review of course [14:04] voidspace: i got the data by bootstrapping a 1.16 local environ, BTW [14:04] voidspace: so you could do the same thing with 1.18 potentially [14:05] rogpeppe: ok [14:05] perrito666, rogpeppe: pushed up dates for https://codereview.appspot.com/83150043/ when you have time for another review. Thank you. [14:06] perrito666: I pulled down jam's branch and will see what I can get from that locally [14:08] fwereade, agreed [14:08] fwereade, but i haven't changed that - just added an extra check before StartInstance [14:08] dimitern, cool [14:08] oh wait [14:08] dimitern, why would we do that though? [14:09] fwereade, to save time? [14:09] dimitern, it's a whole new code path specifically to avoid setting StartInstance errors in the usual place where people will see them [14:09] fwereade, ok, i see your point [14:10] fwereade, will drop the check and just call start instance [14:10] dimitern, that's great, tyvm [14:10] dimitern, if we want to save provider calls at some point we can do basiacl [14:10] dimitern, actually [14:11] dimitern, if yu can make it clean, I endorse what you're doing -- but please just make sure the errors end up on the machine status as they would have done had you called StartInstance [14:11] dimitern, do not feel obliged though [14:12] dimitern, threading it nicely through the provisioner task might easily not be worth the complexity [14:13] ERROR cannot use 37017 as state port, already in use [14:13] when I use netstat it reports that the port is open [14:13] but for pid it reports "-" [14:13] ah, but "sudo netstat" gives me the real answer [14:18] wwitzel3: do you know anything about dummy environ state tests must be run with MgoTestPackage ? [14:21] perrito666: nope [14:28] rogpeppe2: so I get an error trying to bootstrap with trunk [14:28] rogpeppe2: https://pastebin.canonical.com/107549/ [14:28] error: --instance-id option must be set [14:29] voidspace: that means you've got a jujud that's not in the same directory as your juju executable, i think [14:29] voidspace: how did you bootstrap? [14:29] rogpeppe2: see the pastebin [14:30] voidspace: ah, sorry. will get me 2-factor auth key [14:30] 526487 [14:30] ha ha [14:31] hah [14:31] rogpeppe2: ah, I've done a "go install" instead of a "go build" [14:31] rogpeppe2: and it might now be working... [14:31] voidspace: ah, right [14:31] voidspace: go build isn't useful for much, generally [14:31] rogpeppe2: yeah, this is taking much longer! [14:32] sorry for the noise... [14:33] voidspace: this change is proving a little invasive. i think it might be better to do it as a followup to yours. [14:34] rogpeppe2: right [14:34] rogpeppe2: except mine has a conflict with trunk [14:34] rogpeppe2: so we can't land mine without it [14:34] still bootstrapping... [14:34] voidspace: it does take ages to bootstrap - it downloads a whole image, i think [14:35] voidspace: although, this is the local provider and on trunk, right? [14:35] rogpeppe2: I might have an agent.cof though [14:35] rogpeppe2: yes [14:35] voidspace: hmm, that generally completes quite quickly for me [14:35] "Bootstrapping Juju machine agent" [14:36] sinzui, around? [14:36] rogpeppe2: if it's downloading anything it will be sloooow [14:36] rural internet [14:36] sinzui, I have some juju-core CI questions for the Ubuntu MRE [14:36] I think I can just use this conf though [14:37] voidspace: sgtm [14:37] rogpeppe2: ok to put the real certs straight into the test file? they'll be regenerated next time I bootstrap, right [14:37] jamespage, I just added my first comment, I need to add more [14:37] sinzui, thanks [14:38] mgz: ping [14:38] voidspace: yeah, i think so. [14:39] rogpeppe2: hah, the existing test data for 1_18 is for machine-0 [14:39] so I actually need a config *without* the secrets [14:39] hmmm... and it's still bootstrapping [14:40] ah, and this is why [14:40] a million log lines saying [14:40] 2014-04-01 14:31:22 DEBUG juju.state open.go:101 connection failed, will retry: dial tcp 127.0.0.1:37017: connection refused [14:40] mgz, ping [14:41] hmm, does anyone else see a test failure of state/api in trunk? [14:41] the test looks wrong to me [14:43] rogpeppe2: passes for me [14:44] well, now trying api/... [14:44] yep - all passes [14:44] voidspace: ah, i think it's probably to do with talktalk [14:46] hmm, not sure actually [14:47] voidspace: could you paste me the output of running go test -gocheck.vv -gocheck.v 'TestOpenMultiple$' launchpad.net/juju-core/state/api [14:47] voidspace: please [14:47] oops [14:47] s/-gocheck.v /-gocheck.f / [14:48] voidspace: ^ [14:48] i can't see how it can work - the address from Listen isn't ok to dial AFAICS [14:49] wwitzel3, dimitern: hey [14:49] rogpeppe2: yep [14:50] mgz, i'll need that branch soon ;) [14:50] rogpeppe2: 0 passed [14:50] wwitzel3: I got it to fail, that is for sure [14:50] :p [14:50] dimitern: just done some fiddling on the maas code, merging it now [14:51] mgz, thanks! [14:51] mgz, did you manage to live test it? [14:51] rogpeppe2: however, if I do [14:51] voidspace: oh, frick. go test launchpad.net/juju-core/state/api -gocheck.vv -gocheck.f 'TestOpenMultiple$' [14:51] go test launchpad.net/juju-core/state/api -gocheck.vv -gocheck.f 'TestOpenMultiple$' [14:51] I get ok launchpad.net/juju-core/state/api 0.325s [14:51] and no other output [14:52] voidspace: weird [14:52] voidspace: what if you cd to state/api and do go test -gocheck.vv -gocheck.f 'TestOpenMultiple$' ? [14:52] dimitern, I am confused... we have StartInstanceParams which has a Networks field; and a MachineConfig field, which itself has Networks fields [14:52] dimitern: no, not yet [14:52] dimitern, which one counts? why does the other exist? [14:52] rogpeppe2: a lot of output but a pass [14:53] rogpeppe2: want a pastebin? [14:53] voidspace: please [14:53] fwereade, StartInstanceParams had this added initially, but the machineConfig one will be used for the cloudinit setup [14:53] rogpeppe2: https://pastebin.canonical.com/107557/ [14:53] dimitern, can we drop it from StartInstanceParams then? otherwise it's just a bug waiting to happen [14:53] fwereade, and maas provider uses StartInstanceParams one to call acquireNode with networks, when set [14:53] brb [14:54] fwereade, i'd like to do that in a follow-up, when integrating with mgz's branch [14:54] voidspace: weird [14:54] voidspace: [LOG] 36.85478 INFO juju.state.api connection established to "wss://[::]:35729/" [14:55] voidspace: that doesn't seem to work on my machine [14:55] dimitern, btw, what led us to try to do network setup in cloudinit? is it *really* easier that an one-shot job in the machine agent? [14:55] voidspace: ahhh, perhaps because i've got ipv6 disabled [14:55] fwereade, it's easier for me at least [14:55] fwereade, and less code i think [14:55] dimitern, what's easier? [14:56] mgz: wondering how I would pull down a remote branch and switch to it? without making a whole new folder [14:56] fwereade, do i need to test anything wrt provisioning machines with networks when we have a test that errors from StartInstance are transformed to StatusError? [14:56] dimitern, "easier" is a valid argument, I just can't see how it applies here :) [14:56] mgz: can't seem to remember or find the syntax [14:57] fwereade, I need to add 1 API call and a few scripts and everything can be done in the provisioner and cloudinit [14:57] dimitern, don't think so, there's already a test for that [14:57] fwereade, otherwise we'll also need a worker [14:57] fwereade, ok, thought so [14:58] wwitzel3: `bzr branch --switch lp:PATH/TO/BRANCH co:BRANCH` in the dir with the colo branch setup [14:58] mgz: co: .. that is what I kept forgetting [14:58] mgz: thank you [14:58] mgz: what is supposed to happen if you make a tcp connection to an all-zeros IP address? [14:59] the socket library should give you an error [15:00] dimitern, honestly I'm not seeing the complexity in a one-shot worker that ignores Stop() and just uses a tomb for error transmission [15:00] mgz: well, it doesn't (on voidspace's machine, at any rate, and it looks like the gobot machine too) [15:00] perrito666: I'm running the tests now, I was accidently merging trunk which was fixing what ever issue this merge is having [15:01] fwereade, i have it mostly done already [15:01] dimitern, whereas every little bit of weight we add to cloudinit worries me [15:02] fwereade, it's a temporary solution until we have the worker [15:02] dimitern, the bulk of it should be identical, right? [15:02] dimitern, it's just when those same scripts happen to run? [15:02] wwitzel3: it blows when merged to jam's branch, running testsuite now, bbl [15:02] Ill go lunch [15:02] fwereade, yeah, and a new api facade and methods [15:03] fwereade, and the worker itself [15:03] dimitern, it's a one-method facade, though, right? [15:03] fwereade, so you're against setting up networks in cloudinit in general? [15:04] rogpeppe2: you can use a known-invalid hostname instead optionally [15:04] voidspace: i'm presuming this program passes if you run it on your machine: http://play.golang.org/p/1TD2_F1NTd [15:04] dimitern, somewhat against it, yeah [15:04] mgz: in this case, the tests are relying on it being valid [15:04] fwereade, if you're really feeling that way, i can go with the worker [15:04] (still vulnerable to bad dns servers... ideally we don't want bogus connections getting out at all) [15:04] mgz: which is troubling [15:04] dimitern, where are you getting the info about what networks actually exist at cloudinit time? [15:04] mgz: does this code run ok on your machine? http://play.golang.org/p/1TD2_F1NTd [15:04] fwereade, but i can't guarantee the whole lot will be ready by tomorrow eod, as we talked [15:05] dimitern, I'm still talking it through and may yet be convinced [15:05] fwereade, provisioner gets that and passes the needed information to render the appropriate cloudinit [15:05] fwereade: from maas /api/1.0/networks/?node= [15:07] dimitern: sending to bot with version dep bump [15:08] mgz, ah, dependencies.tsv needs updating yes [15:09] dimitern, mgz: so, I may be missing something -- what's the difference between the method on the provisioner facade vs the one on the mooted new one? [15:09] rogpeppe2: trying [15:09] dimitern: right, that's why it was a bit more of a dance [15:10] fwereade, so the provisioner one is called before StartInstance to get the networks and pass them in StartInstanceParams/MachineConfig, from which the cloudinit scripts get their params [15:11] fwereade, and the networker api is basically the same thing, but prepares and runs the scripts at startup [15:11] fwereade, in the latter case we don't have the provisioner api call, in the former we don't have the worker or its api [15:12] dimitern, I'm not convinced yet I'm afraid -- that StartInstanceParams/MachineConfig thing feels like a pretty strong sign we're DIW [15:12] fwereade, and also, in the latter case if you specify invalid networks (unknown or not available on the machine) you'll get an error after MA starts and no status error, whereas in the former case the provisioner can set statuserror in these cases [15:13] rogpeppe2: seems to work [15:13] [::]:48575 &net.TCPAddr{IP:net.IP{0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0}, Port:48575, Zone:""} [15:13] accepted [15:13] dialled [15:13] dimitern, surely not? those networks still go into StartInstanceParams, and the provider barfs then if it can't give them to us [15:13] fwereade, DWI=Direct Inside Wire ? :) [15:13] dimitern, Doing It Wrong ;p [15:13] ah [15:13] voidspace: right, thought so. otherwise tests would've failed on your machine [15:14] voidspace: but tests were passing on my machine, so i guess something's changed. [15:14] fwereade, well, acquireNode either gives us a node with these networks or returns an error [15:14] dimitern, yeah [15:14] fwereade, so i stand corrected, with a worker we should still have startinstance-time errors, which will be recorded in status [15:15] dimitern, should be, I think [15:15] rogpeppe2: well, that does the same on a canonistack machine as it does for voidspace [15:15] godammit, still can't bootstrap on my machine [15:15] 2014-04-01 15:15:23 DEBUG juju.state open.go:101 connection failed, will retry: dial tcp 127.0.0.1:37017: connection refused [15:15] dimitern, my big concern with the worker approach is that machine agent tests will remain a godawful fucking nightmare to test by side-effect, and doing it right *there* really will push us past allmanner f deadlines [15:15] mgz: yup, i'm not surprised [15:16] rebooting, something's futzed [15:16] mgz: but AFAICS it should not be working, right? [15:16] I'm pretty sure if I write it in C, it doesn't [15:16] fwereade, not sure I'm following you there - why MA tests will suffer? [15:16] * rogpeppe2 tries to remember his rusty sockets skills [15:17] dimitern, it's hell testing anything in the machine agent because there's no indirection layer -- we can't test that it starts a Fooer, we need to start the agent and hang around watching other stuff until we get evidence roughly aligned with the notion that it might have done so [15:18] dimitern, and this will try to rewrite config that expects sudo access, I would imagine [15:18] fwereade, yeah right [15:18] dimitern, so all the other tests will start barfing in really weird ways [15:18] fwereade, correct [15:18] dimitern, not necessarily reproducible, etc etc [15:18] fwereade, we can mock IsRootUser or something though, like for the local provider [15:18] dimitern, it still sucks doing that at arm's length [15:19] dimitern, ok, I have convinced myself [15:19] fwereade, :) so? [15:20] dimitern, go ahead with the cloudinit approach, but *please* make sure the code at the sharp end is so nicely isolated you can just move it and be done [15:20] dimitern, and we need to have networks on *one* of MachineConfig and StartInstanceParams :) [15:21] fwereade, ok, will try doing it sensibly [15:21] dimitern, if it happens in a followup I'm not too bothered, so long as it does happen -- add a card maybe? [15:21] dimitern, actually add a followup card for the worker [15:21] fwereade, i'll poke it around some more and might do it now actually [15:21] dimitern, this is mr.right-now, not mr.right [15:21] fwereade, there are several cards re the worker [15:22] dimitern, great :) [15:22] dimitern, I find myself wishing for dependency graphs in kanban and stuff [15:22] didn't help, still can't bootstrap trunk with the local provider [15:22] 2014-04-01 15:21:29 DEBUG juju.state open.go:101 connection failed, will retry: dial tcp 127.0.0.1:37017: connection refused [15:22] fwereade, cheers [15:22] * fwereade bbiab again [15:22] fwereade, i still need the review though ;) [15:32] rogpeppe2: final test pushed [15:32] voidspace: brill [15:32] so only resolving the APIServerDetails / StateServingInfo conflict left for my branch [15:33] rogpeppe2: want a review on your branch? [15:34] voidspace: please - trivial: https://codereview.appspot.com/83090044 [15:34] rogpeppe2: yep, looks good to me [15:34] voidspace: ta [15:35] another for good measure [15:35] mgz: apparently it does work in C (try telnet 0.0.0.0 22) [15:35] :-) [15:36] rogpeppe2: even gethostbyaddr doesn't complain, which surprises me [16:13] fwereade, I dropped the networks from startinstanceparams and now only the ones inside machineconfig are used [16:16] wwitzel3: ping [16:17] dimitern: branch landed [16:18] mgz, awesome [16:19] voidspace: lp:~rogpeppe/juju-core/mfoord-stateservinginfo [16:22] wwitzel3: perrito666: I think it is just that the 1.18 branch had a test case in it that needs to be updated to use a new import [16:23] I would think you could just change to that dir, run "go test" directly, and it should print out the bad imports [16:24] jam: I got two failing here, replicaset and .. [16:25] machine [16:25] so, for example :FAIL launchpad.net/juju-core/cloudinit [build failed] [16:25] cd cloudinit [16:25] go test [16:25] should just show a bad import [16:27] jam: sadly, here they work [16:27] perrito666: did you take the 1.18 branch and merge it with mine? [16:28] jam: nope, merge had failed [16:29] I think I found the problem.... It looks like the build process had created a library for a directory we had deleted, if I just nuke $GOPATH/pkg it works [16:30] rogpeppe2: merged, now running tests [16:30] rogpeppe2: I see a SharedSecret failure already [16:30] at least, on the bot [16:30] jam: let me check [16:31] jam: same here [16:31] perrito666: so that isn't what *you* will see. I think the steps to reproduce are: "checkout the 1.18 branch, install it, merge my branch, try to run the test suite". The initial build will have left some cruft behind that go doesn't realize is stale. [16:31] jam: it what i did [16:32] since i first tried 1.18 [16:32] a rm of pkg fixed it [16:32] brb [16:34] mgz, fwereade, updated https://codereview.appspot.com/83200043/ - should be much more straightforward now [16:42] heya, does anyone know a way to fake a public-address with the local provider? [16:44] perrito666: was grabbing some lunch [16:46] perrito666: I must be doing something wrong because I pulled down jam's branch and all the tests passed for me after running godeps -u [16:47] mgz: looks like there might've been a failure to update dependencies.tsv [16:47] rogpeppe2: when you get a chance, if you could have another look at https://codereview.appspot.com/83150043/ [16:47] mgz: i see these failures running tests in provider/maas [16:47] ./environ_whitebox_test.go:342: suite.providerSuite.testMAASObject.TestServer.NewNetwork undefined (type *gomaasapi.TestServer has no field or method NewNetwork) [16:47] ./environ_whitebox_test.go:538: suite.providerSuite.testMAASObject.TestServer.ConnectNodeToNetwork undefined (type *gomaasapi.TestServer has no field or method ConnectNodeToNetwork) [16:47] wwitzel3: it appears to be a Go issue. If you branch juju-core/1.18 and then "go get" it to build the dependencies, and *then* merge my branch, your $GOPATH/pkg will have some libs in it [16:47] that should be rebuilt [16:47] but it would seem are not [16:47] and then it can't link properly [16:48] wwitzel3: perrito666: I cleaned out the dir on the bot, and the branch looks to be landing now. [16:49] jam: I wonder if we should make that part of the build process for the bot? [16:49] rogpeppe2: I messed up the edit, only the revno change was applied, not the revid [16:49] mgz: for shame! [16:52] mgz: ah [16:52] mgz: in general, it's a not a bad idea to generate the whole file automatically [16:53] mgz: i.e. godeps -t $(go list ./...) > dependencies.tsv [16:53] (-t should probably be the default actually) [16:54] (as should ./...) [16:55] rogpeppe2: stamp on cr 83060044 please [16:55] rogpeppe2: that would be a bad idea, because I have serv [16:56] *several, branches on other revs [16:56] or non-trunk [16:56] mgz: link? [16:56] take and [16:56] *take an existing code review link and replace the number [16:56] oh, ok then :-) [16:57] https://codereview.appspot.com/83060044 [16:57] there was one up the screen a bit [16:59] mgz: LGTM [17:03] sinzui: it has now landed [17:06] thank you jam [17:08] sinzui: so we just need to land the joyent stuff there [17:08] jam, no rush...it doesn' pass qa. I wont accept it unit it does [17:09] jam: wwitzel3 sorry, was having a second lunch [17:10] :p [17:10] rogpeppe2: branch proposed [17:10] https://codereview.appspot.com/82960044/ [17:17] wallyworld: when you can, land your stuff into the lp:juju-core/1.18 === vladk is now known as vladk|offline [17:21] wallyworld, Juju CI doesn't love the provider. We found several bugs preventing CI from passing. https://bugs.launchpad.net/juju-core/+bugs?field.tag=joyent-provider [17:23] We are blocked by bug 1300877 [17:23] <_mup_> Bug #1300877: joyent fails to bootstrap due to missing shared-secret [17:24] sinzui: having a look as wallyworld is likely not on yet [17:26] mgz, It's 3:25 AM for him. If he is up, he wont be comprehensible [17:27] mgz, This issue might be more crucial to some devs. trunk is broken: bug 1300889 [17:27] <_mup_> Bug #1300889: azure and hp cannot deploy as of r2524 [17:28] why are we instance polling against the provider? [17:28] * hazmat forgot [17:30] see you all tomorrow [17:30] g'night [17:30] EOD [17:32] EOD for me too [17:33] hazmat: because a) we don't know when an instance gets an address without polling and b) addresses can change [17:33] g'night all [17:33] night rogpeppe2 [17:36] rogpeppe2, all of which is discoverable from the instance.. yes? [17:36] hazmat: yes [17:36] so no need to poll. [17:37] the provider.. the instance already knows and can update. [17:37] rogpeppe2, thanks [17:38] hazmat: the state needs to know the instance addresses [17:38] rogpeppe2, machine agents can update state [17:39] hazmat: how does the machine agent find its own addresses without calling a provider method (the instance Addresses method in fact)? [17:40] rogpeppe2, depends on which environment its brought up in.. it can talk to the metadata source in most of those environments.. or it can query its nics addresses. [17:40] hazmat: ah, by discoverable from the instance you mean "discoverable by code running on the instance", not "discoverable from the Instance value" :-) [17:40] rogpeppe2, yes [17:41] metadata servers are live data streams in openstack, ec2 between instance and underlying iaas [17:46] hazmat: i have no problem with getting machines to update their own addresses - it's compatible with the current model. [17:46] rogpeppe2, and stops some of silliness of dos'ing the user's cloud account on rate limits. [17:47] hazmat: yeah. we do make some effort to avoid making lots of calls now, BTW. [17:47] hazmat: although the constants should probably be checked for reasonableness [17:47] rogpeppe2, its an inherent problem imo to how we're creating iaas resources.. [17:47] rogpeppe2, creating machines individually, polling them, creating security groups per machine, checking them all the time .. [17:48] rogpeppe2, working on sprint topics for later this month.. [17:48] hazmat: right [17:48] hazmat: creating machines individually probably isn't *too* much of a problem, as you don't create machines that often. but polling them definitely can be. [17:49] rogpeppe2, you may not.. but i do.. [17:49] rogpeppe2, which is how i ended up finding the provisioner bug that garbage collecting machines after 150.. [17:49] still want to source dive on that one.. but in meetings all day.. [17:50] rogpeppe2, your right though its not as much of an issue as the instance polling or security group polling/per server stuff [17:50] hazmat: i think the provisioner is wasteful because it assumes that something else may be actively messing with the machines under its control [17:51] hazmat: if it assumed an environment largely static apart from through its own intervention, i think it could be a lot better [17:51] rogpeppe2, the entire iaas interaction model hasn't changed from pyjuju days .. i wrote that code at the first juju sprint at my house.. [17:51] hazmat: but that's definitely an assumption we'd have to think hard about [17:51] we've just copied the model forward.. [17:51] hazmat: yup [17:51] hazmat: didn't want to reinvent everything... [17:55] hazmat: but at least you're admitting it's your fault ;) [17:55] natefinch, well some of it.. there were quite a few regressions nonetheless.. [17:55] i autoretried transient provider errors for example. [17:56] natefinch, and i never polled provider for addresses, i had the machine agent do it in provider specific manner (ifconfig, metadata) etc. [17:56] natefinch, but sure i'll take responsibility for my mistakes :-) [17:57] hazmat: I was really just joking. There's definitely stuff we need to improve. It's just a matter of prioritizing it. [17:57] natefinch, but i can't talk all the blame for those decisions.. [17:57] we had quite some 'discussions' [17:57] yup [17:58] hazmat: once we get the newbies up to speed, we'll have a lot more throughput. And once this damn HA stuff lands... [17:59] speaking of newbies going up to speed, I am trying to do "CLI: Ensure deploy --to and add-unit --to reports an error when the selected machine's provisioned networks match the service's" but I am not sure what is it about, hints? [17:59] trying to bootstrap trunk and I am getting Unable to locate package juju-mongodb .. what does that seem like I should know about that [18:01] wwitzel3: broken soething when switched to 1.18 and back? [18:02] perrito666: yeah, looks like one of the machines in the maas was dirty [18:04] perrito666, heyhey [18:05] perrito666, this'll be inside the state package [18:05] perrito666: I presume that is supposed to be "Report an error when the networks *do not* match". Basically, if you have a machine on a specific network, and the service has a *different* network it requires, we shouldn't let you try to deploy the service to that machine [18:05] perrito666, yeah, what nate said [18:06] sinzui: what do you need to get a 1.17.8 out the door? *I'm* not aware of the QA failures that we should be fixing. [18:06] fwereade: natefinch thank you [18:06] perrito666, in particular take care of the fact that machines have two different sets of networks -- the include/exclude with which they were created, and the *actual* set of networks they're on [18:09] jam, Bug #1300889 is serious for trunk. The brokenness delays CI from testing 1.18. 1.17.8 can be any set of passing revisions. I am not demanding anything new be landed after 1.17.7 [18:09] <_mup_> Bug #1300889: azure and hp cannot deploy as of r2524 [18:09] fwereade: I presume we are speaking about the actual set [18:10] perrito666, yeah, desired service networks need to match actual machine networks [18:10] sinzui: so, AIUI, we'll want to cherrypick wallyworld's Joyent provider changes into the 1.18 branch, and then not create it from trunk. [18:10] which is why I moved 1.17.7 over to the 1.18 branch. [18:11] perrito666, you might be able to draw inferences about possible matches based on requested machine networks alone,for the situations where the machines don't yet have actual networks, but I don't think that's a particularly major concern [18:11] Ok, new issue bootstrapping trunk .. the shared-secret for mongo in /var/lib/juju doesn't seem to be getting created. [18:11] "mongod.37017[8686]: Tue Apr 1 18:07:38.932 error getting file /var/lib/juju/shared-secret: No such file or directory" [18:11] sinzui: which brings us back to r2499 on trunk [18:11] and then a couple patches on top of that [18:11] wwitzel3, rogpeppe2 was going to move where that was stored? [18:11] rather than everything in trunk today [18:12] jam, I could do that. But as I said. the joyent test doesn't pass. [18:12] fwereade: ok, I will investigate that [18:12] fwereade: thaks [18:12] wwitzel3, he said it would be in the agent confg [18:13] jam, CI will not bless the joyent provider in its current state. We have marked the test as not required. I would claim it works [18:13] fwereade: do you know the source of why we are saying "Joyent should be in 1.18" ? [18:14] *I'm* personally ok with blessing 1.17.7 as 1.18, but there seem to be conflicting requirements [18:14] jam, so long as we get it into 14.04 it's not a big deal -- the theory was that it was close enough to complete to go in without stress and hassle [18:15] fwereade: from what I can tell there is stress and hastle :) [18:15] jam, sinzui: this is plainly not the case, so let's not block 1.18 on it [18:15] fwereade: sinzui: are *you* both ok if we bless 1.17.7 as 1.18.0 ? [18:15] should we be marshalling the troops into testing it as such? [18:16] I know of bug https://bugs.launchpad.net/juju-core/+bug/1299802 but I wasn't able to reproduce it [18:16] <_mup_> Bug #1299802: upgrade-juju 1.16.6 -> 1.18 (tip) fails [18:16] jam I am okay with that [18:17] jam, I think we need to address bug 1299802 to say we have 1.18.0 [18:17] <_mup_> Bug #1299802: upgrade-juju 1.16.6 -> 1.18 (tip) fails [18:17] sinzui: if I could reproduce it, yes. But upgrade just worked for me. [18:17] jam, CI never saw this issue. Clould the bug be invalid [18:18] sinzui: I was following up on it for rogpeppe2 who filed the bug. I haven't been able to reproduce it (though I've only tried one time) .The symptoms are baffling (as we have 2 requests in a row and the second one is returning an old value.) === johnweldon is now known as jcw4 [18:18] jam, anything that gets us a stable release with the last cycle's goodness in gets my approval [18:18] jam, +1 [18:19] sinzui: so it is sort of "do we run the test N times and see if we can trigger it"? That is what CI has been doing, right? [18:19] jam, right it does. [18:19] sinzui: https://bugs.launchpad.net/juju-core/+bug/1300877 looks like something new from axw [18:19] <_mup_> Bug #1300877: joyent fails to bootstrap due to missing shared-secret [18:20] at least, I didn't think we needed shared-secret until the recent HA changes. [18:20] sinzui: which implies it isn't the Joyent provider's fault [18:20] jam, well we will see when 1.18 is tested. [18:21] oh, well we will see when the additions are merged [18:21] sinzui: well.... do we want to delay 1.18 for the additions? [18:21] there is some doubt that it will be worthwhile [18:21] and we can fix trunk and just have it there. [18:22] it sounds like fwereade is happy to let the Joyent provider slip [18:22] I don't know of a specific stakeholder requirement for Joyent in 1.18 [18:22] I'd rather just call 1.17.7 1.18.0 if we are happy. [18:22] jam, The only reason I think we want to extra revisions is because we promised something would be in 1.18 [18:22] cmars, does bug 1290824 have to be in 1.18.0 [18:22] <_mup_> Bug #1290824: juju should ask the charm store to decide the default series for a charm [18:22] sinzui: "somebody promised someone" but I don't know who to put in those blanks. [18:23] and it sounds like fwereade isn't one of those people [18:23] and *I'm* not. [18:23] so we can wait for Ian to find out where his requirements are coming from. [18:23] jam, arosales cares very much that it lands for 14.04, but that is the only specific stakeholder I am aware of [18:23] fwereade: right. 14.04 is separate (IMO) [18:23] because all the stuff we're doing is critical for 14.04 :) === vladk|offline is now known as vladk [18:26] jam, It might be fair to say that 1.18.0 docs is the blocker. I think I need to get the collated release notes together, and update the docs. for the release. I need a day at least. I think 2 days is more realistic [18:26] sinzui: then we're not in a rush and can sort out the release stuff. There is bug #1282690 [18:26] <_mup_> Bug #1282690: ensure joyent provider gets included in 1.18 release [18:27] but I have the feeling that is "it must be in trusty" [18:27] not "it must be in 1.18" [18:27] jam, That is my feeling too [18:27] 1.18.1 is easy to make [18:28] * arosales reads back scroll [18:29] jam, are stopping odd-even versions with the 1.18.0 release? What do I update trunk to 1.19.a1...1.19.b1...1.19.0 [18:29] sinzui: we have to do a 1.20, because 1.18 doesn't know that 1.19 would be a "stable" release. [18:29] jam fwereade: I think this may be some building blocks that cmars is working on. But we can get around it by specifically having users state what the preference is. [18:30] arosales, ah, Iwas talking about joyent [18:30] jam, understood [18:30] jam, fwereade: the only issue is when a charm lives in both precise and trusty and the user does not make a statement on what their preference is. [18:30] fwereade: ah joyent, yes we need that :-) [18:30] arosales, cmars is hard at work on default-series as we speak :) [18:30] fwereade: ack [18:30] yep [18:30] jam what are the concerns with joyent? [18:31] arosales: does it have to be in 1.18 [18:31] we have a 1.17.7 release that looks nice and stable [18:31] but doesn't have it [18:31] arosales, joyent has no *specific* 1.18 requirement, though? AIUI it's 14.04, and we hoped it was already good enough to slot in, but it's not [18:31] do we delay 1.18 to get it in. [18:31] can we land the majority, and bug fix post? [18:32] wallyworld: had suggested we don't "turn it on" and land bug fixes post 1.18 to "turn it on" [18:32] but we would need the majority of code to land, or SRU paperwork is a lot harder [18:32] jam: better question, what is blocking Joyent? [18:32] jam, arosales, we might need to call joyent 1.20.0 if the additional code is not "like* a bug fix to the eyes of Ubuntu [18:33] arosales: it landed in trunk, trunk is not stable, it did not land in an otherwise-stable branch. [18:33] so it is "unknown" if there are problems [18:33] though we haven't gotten it to work in trunk [18:34] sinzui: that is my point for landing it in 1.18. SRU'ing it in will be a lot more since it is a feature, but we can call it enablment [18:35] jam: work required to test joyent in current stable is how much? [18:35] arosales, we have a test [18:35] arosales, the test failed because of another bug introduced into trunk a few hours ago [18:35] arosales, when wallyworld merged joyent into the 1.18 branch, CI will test it. We will know if it works or not [18:36] ok, as I understand it that is the issue for landing joyent in 1.18 is the testing [18:36] current tests are failing due to a bug outside joyent, that perhaps joyent provides [18:36] so if we take out joyent the bug is still there [18:37] jam: do you have a bug # ? [18:37] s/provides/exposes/ [18:37] Juju code and features really grew in the last 5 months. Can I say "Juju is 20% more fucking awesome" in the release notes [18:38] ha :-) [18:38] jam sinzui sorry for being dense here, but I'll try to summerize [18:39] 1. There is a question of landing Joyent in 1.18 [18:39] 2. The reason is Joyent tests are not passing atm [18:39] 3. Test aren't passing due to a new found bug which Jam is referencing that may or may not be dependant on joyent [18:40] arosales: so, I think the bigger thing is "why does it have to be in 1.18" vs "it needs to be in trusty" [18:40] jam, sinzui ^ is this correct? [18:40] arosales, yes. [18:40] if 1.20 is in trusy before final freeze I am ok with that [18:40] jam, 1.18.0 entered CI. I expect CI to love it like it loved 1.17.7 [18:41] * arosales was the understanding 1.18 was the final juju relase before trusty GA [18:42] arosales: we have a bunch of stuff that is also meant to be for trusty GA [18:43] jam: to your point joyent does not have a hard dep for 1.18, but does have a hard dep for trusty GA [18:44] sinzui: fwereade: so if it is going to be ~2 days to get the docs together, we can take that time and try for a 1.17.8 w/ Joyent, but be ready to pull the plug and go with 1.17.7, does that sound fair? [18:44] arosales: I respect that getting it in early if everything looks good is better PR wise [18:44] jam, +1 [18:44] so as long as we have plan in place for trusty GA I am ok with that. Now we should be really transparent to the Ubuntu Archive Admins and let them know what is still planning on being delivered so they are not surprised with the branch post 1.18. [18:44] jam, sinzui: sgtm [18:45] jam, CI really tells us yes or no. It is easy to revert if we get a no [18:45] sinzui: r [18:45] right. [18:45] fwereade: jam: sinzui: thanks for trying to get all the stake holder issues in. I know its a tough job. [18:46] I just mean, 1.18 isn't *delayed* if we try to get Joyent in. If it looks like it would be, we don't delay we just go with 1.17.7d [18:46] sinzui: another question for you [18:46] sinzui: arm vs armhf [18:47] do we know why the tools we build have armhf in them, but juju internally wants to call them arm [18:47] https://code.launchpad.net/~jason-hobbs/juju-core/fix-armhf/+merge/212014 is a patch towards fixing it [18:47] but we have to decide if "the tools must be right" [18:47] so we have to conform to them [18:47] or if we can change the naming on the tools. [18:48] sinzui: because I might be wrong, but if you untar one of those juju-1.17.6.1-trusty-armhf.tgz files, and have it run jujud --version [18:48] I *think* it would spit out juju-1.17.6.1-trusty-arm [18:48] jam, I have never understood the internal names. The tool naming script is convention/guess. Not reason [18:48] and not juju-1.17.6.1-trusty-armhf [18:49] jhobbs: do you have an Arm board that can give us an answer? [18:49] jhobbs: I *can* say that simplestreams uses armhf: http://juju-dist.s3.amazonaws.com/tools/streams/v1/com.ubuntu.juju:released:tools.json [18:49] jam http://bazaar.launchpad.net/~juju-qa/juju-core/ci-cd-scripts2/view/head:/assemble-public-tools.bash [18:50] jam, The see the get_arch() rule [18:50] does this log help answer? [18:50] https://code.launchpad.net/~jason-hobbs/juju-core/fix-armhf/+merge/212014/comments/503262 [18:50] this is from a local bootstrap on an armhf [18:50] running juju-1.17.6-trusty-arm [18:51] jhobbs: that just tells me the "juju" client thinks it is 1.17.6-trusty-arm, but not what the jujud binary thinks it is inside the tarball. [18:51] However, I think given simplestreams, and other sources of information into this system [18:51] it is easier to rename all the internal-to-juju stuff to call it armhf [18:51] so the function [18:51] jam, Ubuntu spits out archs, the script has to decide what to do when it finds one in an archive. The script used to abort. I removed it because powerpc is supposedly legitimate from ubuntu's perspective. I assume if we made a package for the arch, it is fine to make the same tool. But juju/go has it's own thoughts on the matter [18:51] ubuntuArch() which maps from 386 => i386 [18:52] needs todo the same for "arm" => "armhf" [18:52] sinzui: and somewhere it is ppc64le and somewhere ppc64el, I haven't quite sorted all that out [18:52] jam, sorry. I mean to say the script used to abort when it found an unknown arch. Now It warns [18:53] jam, exactly the issue that made me choose to warn. [18:53] The difference looks like a typo [18:53] jhobbs: so I think the answer is. The path of least resistance is to get "juju version" to report "juju-1.17.6-armhf", and then tool lookup, etc can just continue using armhf [18:54] jam ok cool [18:54] jujud --version? [18:54] jhobbs: well, they both use the same logic [18:54] or juju version? or are they same thing? [18:54] so yes, both of them. [18:55] ok cool, i will try to do that and resubmit the patch [18:55] sinzui: my understanding is "ppc64el" stands for Power PC 64 Little Endian [18:56] jhobbs: so it *should* be changing it in juju/arch/arch.go [18:56] jam. I need to increment the rev of the 1.18 branch. CI knows the version is less than what we are working on. I can use 1.17.8 or 1.18.0. I favor the latter. [18:56] and tracking down any other constants that we're missing (hopefully few) [18:56] sinzui: I think we're going to try for Joyent, so 1.17.8 [18:57] jam, so "le" means "endian little"? [18:57] okay [18:57] jam yeah, the patch i submitted just changed it in juju/arch/arch.go (and unit tests), which was enough to have it find the right tools [18:57] but i didn't check output of juju version [18:59] jhobbs: so we used to have an "ubuntuArch" function [18:59] in version/version.go IIRC [18:59] I thought that moved to juju/arch/arch.go [18:59] that maps go arch to ubuntu arch it sounds like [19:00] i have to get on a call, thanks for the help! [19:00] jhobbs: NormalizeArch has a bunch of regexes, one of them maps "386" to "i386", and we have "armv.*" mapping to ARM. We just need to map "arm" to ARM [19:43] quick question about this diagram http://blog.labix.org/wp-content/uploads/2013/06/config-changed.png [19:43] we're trying to understand how Set works [19:44] my understanding is that apiclient Call triggers stuff on the state server [19:44] but it's not clear how it gets to Machine 2 and more importantly the unit [19:45] I'm also not certain my understanding is correct [19:46] I thought the units / services query the state server for changes, and download them if there are changes === vladk is now known as vladk|offline [19:57] bodie_: sorry, I don't know that part of the code well [19:57] bodie_: I presume they must poll for changes [19:58] we touched on some of it in my email thread, but how the unit knows when there's new content on the stateservice is unclear. I believe you are correct [20:00] bodie_: charms have hooks, which are just executable files, which get called when things happen. Config-changed is a hook that gets called when the config changes. [20:01] I see, so the state service triggers config-changed on the unit? [20:03] bodie_: I'm not sure of the exact mechanism that goes from {data updated in mongodb} to {config-changed hook runs} [20:26] natefinch, do you have a minute to review a branch that increments the 1.18 branch version https://codereview.appspot.com/83300043 [20:27] natefinch, nm, wrong target [20:30] sinzui: let me know when/if you need me to do that [20:31] o/ [20:32] thumper: yo [20:33] thumper: what's accounts.conf? [20:33] natefinch: the idea that we don't have configuration for environments, just accounts, like amazon or hp-cloud [20:33] you then name the environment during bootstrap and use an account [20:34] I'm sure there is a blueprint somewhere [20:35] thumper: I get it. so separating the name from the provider info [20:35] acj [20:35] ack [20:35] fwereade: i notice in the backscroll there is an issue with the joyent provider? i'd like to know what it is as 1. units tests pass, 2. i tested live and it worked fine with wordpress/mysql [20:36] thumper: so almost like extending what we have to allow you to bootstrap multiple copies of what we now consider a single environment defined in environments.yaml, just giving it an extra name on top of that name. [20:36] sinzui: hi, i wanted to ask you about the status of getting juju ci going on arm64 [20:37] natefinch: kinda a bit *waves hands* [20:37] thumper: lol, good enough [20:37] natefinch, https://codereview.appspot.com/83310043 is correct [20:39] sinzui: lgtm [20:40] mwhudson, The experience has been dismal. The AMI is under powered. 2 in 5 attempts to start it and run tests fail because the instance dies while were are using it. 3 out of 5 attempts fail because we cannot compile it. And one of the reasons it takes forever is because we need to update the image with the correct tool chain [20:40] thank you natefinch [20:41] sinzui: AMI? [20:41] mwhudson, yes, http://ec2-54-84-137-170.compute-1.amazonaws.com:8080/job/walk-unit-tests-arm64-trusty/66/console [20:41] oh, you are using the fast model? [20:42] er foundation model [20:42] mwhudson, I cannot say. I am using the AMI that was built by danf [20:42] oh [20:42] hmf [20:43] ^ that instance has been up for 50 minutes and we still haven't been able to ssh in to it to do apt upgrades and installs [20:43] * mwhudson CANNOT WAIT for hw that is not nda-ed up the ass [20:43] 50 minutes seems appalling [20:43] mwhudson, The test can accept ssh credentials to an instance that is up or start an AMI [20:44] mwhudson, I haven't been able to keep an instance alive for more than 18 hours. I cannot reuse one that has the current tools :( [20:44] ok [20:45] sinzui: are you managing to run the tests on ppc64 ? [20:45] sinzui: also, do you test amd64 but built with gccgo? [20:45] mwhudson, yep they are great. The same test is run by sshing to an instance [20:45] cool [20:46] so if i can scare some hw up, you could use that? [20:46] yep [20:46] which is probably not likely in the short term but oh well [20:48] i wonder if qemu system is there enough yet [20:49] export INSTANCE_TYPE=m1.xlarge [20:49] that does seem underpowered [20:49] sinzui: could you try, i dunno, c3.8xlarge? [20:50] sinzui: do you know anything about the discussion in the backscroll about joyent and 1.18? there's a claim joyent tests aren't passing. does that refer to CI tests? unit tests pass cause otherwise it wouldn't have landed. and I tested live with wordpress/mysql. so is the discussion just around CI setup? [20:50] wallyworld, good that you asked. [20:51] wallyworld, the main bug is also the bug that broke hp and azure deploy. joyent may be fine [20:51] ah ok. is that a recent bug in trunk? [20:52] wallyworld, I am merging the version inc into 1.18 now. You can gather your joyent changes and merged them. The joyent test is already setup and it will run [20:52] wallyworld, yes r2524 landed 12 hours ago is the problem [20:53] sinzui: i was expecting to be asked to merge joyent into 1.18 once 1.18 was updated with 1.17.7 revision. joyent is pretty much standalone [20:53] i was xpecting that we would ship 1.18 with joyent [20:53] i will merge joyent into 1.18 after breakfast [20:54] wallyworld, we are waiting for gobot to merge my branch. you can prepare merges now [20:55] ok, will do. i was concerned when i saw what seemed to be a joyent problem (or what people were saying was a joent problem) [20:57] wallyworld, so was I. abentley happened be looking at both issues as I was reporting them in IRC. He reported the root cause in a different channel. [20:57] sinzui: ok, np. i was worried when stakeholders like arosales might have thought joyent was broken also [20:58] sinzui: do you have the link for your fancy CI dashbaord? i have misplaced it [20:59] wallyworld, CI is currently at http://ec2-54-84-137-170.compute-1.amazonaws.com/ [20:59] sinzui: i found that one. you also had a summarised one [21:00] wallyworld, http://ec2-54-84-218-58.compute-1.amazonaws.com:6543/ci summarises an older version we just ran [21:00] wallyworld, It it is several weeks behind what CI does since jcsackett has been working on other reporting issues [21:01] ok, np. i liked looking at it :-) [21:01] i'll use the standard jenkins one [21:02] * thumper rants [21:02] * thumper takes a deep breath... [21:02] wallyworld, any test/job running with -devel means we know it can fail, but it wont curse the revision [21:03] ok [21:03] anyone know how to have gnuflags accept the same arg multiple times? [21:03] like the python "append" type [21:03] so we can go "juju debug-log --include foo --include bar ..." [21:03] * thumper wants to match pyjuju [21:04] thumper: no idea. but i fear it may not be possible [21:04] fuzz bucket [21:04] rog is the best person i think [21:04] * thumper ignores for now [21:04] and works on the api server side [21:10] hehe fuzz bucket, I am going to start using that [21:13] wallyworld, Do you have a moment to review a branch that increment trunk to 1.19.0 https://codereview.appspot.com/83060045 [21:13] sure [21:14] jeez, that was a big one [21:17] thumper: this is is a backport of joyent into 1.18 so sinzui can run the CI tests https://code.launchpad.net/~wallyworld/juju-core/joyent-1.18/+merge/213724 [21:17] * thumper looks [21:17] it's a cherry pick of 3 trunk revs [21:17] wow [21:18] wallyworld: lgtm [21:18] thumper: sorry :-( it's essentiall bzr merge -r 2515..2516 trunk [21:18] 3 times [21:18] for rev 2512, 2515, 2516 [21:18] wallyworld: cherry pick is fine IMO [21:18] do it [21:18] ok ta :-) [21:18] * thumper goes back to debug-log [21:24] rogpeppe2: I don't suppose you are around? [21:24] hmm... half 10 at night [21:24] perhaps a bit hopeful here [21:27] natefinch: done anything with web sockets? [21:28] thumper: not really, sorry [21:28] hmm... [21:28] I'm not sure this code works... [21:34] ah... [21:34] found it [21:34] that elusive line that connects everything [21:36] perrito666, I got the backup test to work by selecting a slower env/region. [21:37] perrito666, I think speed is the issue. I am going to change CI to run the test itself with my new Hp env [21:52] guys, have you seen this error before? [21:52] 2014-04-01 21:41:49 INFO juju runner.go:262 worker: start "lxc-provisioner" [21:52] 2014-04-01 21:41:49 ERROR juju runner.go:220 worker: exited "lxc-provisioner": permission denied [21:53] I used juju add-machine to add a machine, got it, then I tried to deploy services to it using lxc [21:53] and it's looping on that error [21:53] juju 1.17.7 [21:53] and then [21:53] 2014-04-01 21:41:49 INFO juju runner.go:254 worker: restarting "lxc-provisioner" in 3s [21:53] and it just loops [21:54] https://bugs.launchpad.net/ubuntu/+source/juju-core/+bug/1299588 [21:54] <_mup_> Bug #1299588: LXC permission denied issue with 1.17.7 === hatch__ is now known as hatch [22:11] why do we still have m1.* instances defined for ec2.. [22:11] sinzui: hey I was answering to you [22:12] sinzui: now I wonder if the problem is in the testing script or in the actual restore [22:12] if it is the second we have another.. bug [22:15] perrito666, The failure is in the juju-restore. I have run just it repeatedly in the past. [22:15] * perrito666 curses out loud in spanish [22:18] ok, as much as I am tempted to add a 1st world problems meme on the bug I will add the new info so the bug is addressed properly and the information remains there [22:23] ahasenack: with maas? [22:24] sinzui: do we have maas in CI yet? [22:24] thumper, no. [22:24] sinzui: is there an ETA with that? [22:24] or are we still missing the hardware? [22:25] possibly next month when either hardware or a cloud that lets me run vmaas comes into existence. === hatch__ is now known as hatch [22:44] sinzui: 1.18 branch now has joyent provider merged in [22:45] wallyworld_: is that the last bit required to use the joyent provider or is there more to come? [22:45] rick_h_: it works. there's some followup needed to make it "perfect" [22:45] wallyworld_: ok, but we should be able to grab the 1.18 branch and bootstrap a joyent env? [22:46] rick_h_: yep. i tested in trunk [22:46] wallyworld_: cool thanks [22:47] wallyworld_, fab. 1.18 as 1.17.8 is completing tests now [22:47] rick_h_: while i have you - for a while, trunk has has public/private addresses moved off units and onto their assigned machines, even though there's a method on unit that delegates through to the machine to get the info [22:48] rick_h_: i'm doing some clean up in trunk, and now i need to change some megawatcher tests [22:48] to formalise the fact that the setter methods are now redundant [22:48] wallyworld_: k, I think frankban had a little heads up but let us know and we'll keep an eye out and see if it effects the api stuff we need for machine view [22:49] which means that the tests need to change because there's no event published for units when the underlying machine address changes [22:49] i'll send an email - i think it's all good, just want to be sure [22:51] wallyworld_: yea, not sure without looking into stuff more. If the proxy method is there we should be good [22:51] ok [22:52] thumper: no, manual provider [22:53] ahasenack: I'll ask axw when he starts [22:53] * thumper goes to hit things and think how to fix his filtering problem [22:53] ok [22:58] thumper: I added a comment and logs to https://bugs.launchpad.net/ubuntu/+source/juju-core/+bug/1299588 [22:58] <_mup_> Bug #1299588: LXC permission denied issue with 1.17.7 [23:31] thumper: news: same thing happened with aws, so no weird setup is needed [23:31] I'll append to the bug