[00:21] worker/instancepoller/aggregate.go:67:22: error: reference to undefined identifier ‘ratelimit.New’ bucket := ratelimit.New(gatherTime, 1) [00:21] i have the latest github.com/juju/ratelimit [00:25] same test failure on my part again [00:26] http://paste.ubuntu.com/7117154/ [00:26] hrm [00:26] broken pipe [00:33] davecheney: godeps -u dependencies.tsv === thumper-afk is now known as thumper [00:46] wallyworld_: the reason we don't use version.readSeries is because it conflates the host with what we are going to start using tools [00:46] ok [00:46] is there no way to avoid "some-series" [00:46] sure, can use "precise" [00:46] however [00:47] in order for that method to work correctly, they need to pass in series [00:47] so it should use that when it happens [00:47] that function is unused as yet [00:47] ok [00:47] just make sure you tell them..... [00:47] ack [00:55] davecheney: ping [00:55] davecheney: I'm getting the same failure for juju-mongodb that I was getting for mongodb-server on power [00:55] it seems to think there is no journal present [00:56] is mongod changing the user from root? [00:58] davecheney: you had the local provider working, did you do anything special for mongod? [01:00] davecheney: I want to check in prior to filing a bug [01:00] o/ axw [01:00] hey thumper [01:03] thumper: all I did was the symlink [01:03] to be fair [01:03] tests don't pass on ppc64el [01:03] davecheney: I get it failing to even start [01:03] davecheney: complaining about no journal [01:04] * thumper does an update [01:04] lets see if that changes anything [01:05] nope [01:07] whaaaaaaaaaaaa ? [01:07] when you deploy the local machine [01:07] sorry, bootstrap local env [01:07] umm... not long ago [01:07] worker/instancepoller/aggregate.go:67:22: error: reference to undefined identifier ‘ratelimit.New’ bucket := ratelimit.New(gatherTime, 1) [01:07] I grabbed my branch [01:07] ^ any idea how to fix this one [01:07] rebuilt [01:07] then I can try to have a look [01:08] run godeps? [01:08] ubuntu@winton-02:~/src/launchpad.net/juju-core$ go get -u -v github.com/juju/ratelimit [01:08] github.com/juju/ratelimit (download) [01:08] github.com/juju/ratelimit [01:08] what ? [01:08] \ [01:08] head isn't buildable / [01:08] where is the source for godeps ? [01:08] launchpad.net/godeps [01:08] sorry [01:08] dumb question [01:09] ubuntu@winton-02:~/src/launchpad.net/juju-core$ godeps -u github.com/juju/ratelimit [01:09] godeps: cannot parse "github.com/juju/ratelimit": open github.com/juju/ratelimit: no such file or directory [01:09] oh for fuxcks sake [01:10] why the fuck doesn't it default to something sane [01:14] ubuntu@winton-02:~/src/launchpad.net/juju-core$ pstree -p | grep -C2 mongo | |-{jujud}(31735) | `-{jujud}(31742) |-mongod(31700)-+-{mongod}(31705) | |-{mongod}(31706) | |-{mongod}(31707) | |-{mongod}(31708) [01:14] urgh [01:14] thumper: bzr up'd [01:14] and apt-get updated [01:14] still works for me [01:14] thumper: have you removed the mongodb-server package ? [01:14] davecheney: no [01:15] that'll be the problem [01:15] why? [01:15] first in the path maybe ? [01:15] nah, full path specified [01:15] yes, but /usr/bin/mongod => mongodb-server <-- borken! [01:18] thumper: hang on, let me test that scenario [01:19] davecheney: my upstart script specifies /usr/lib/juju/bin/mongod [01:19] thumper: i'm still trying to reproduce the failure [01:20] * thumper is reading mongo source files ... [01:20] that'll only make it worse [01:22] davecheney, my eyes are bleeding already [01:22] davecheney, the assert seems to be from here.. https://github.com/mongodb/mongo/blob/v2.6/src/mongo/util/logfile.cpp#L241 [01:23] thumper: i've just bootstrapped [01:23] root 32316 2.1 0.5 445460 43088 ? Ssl 01:19 0:00 /usr/bin/mongod --auth --dbpath=/home/ubuntu/.juju/local/db --sslOnNormalPorts --sslPEMKeyFile /home/ubuntu/.juju/local/server.pem --sslPEMKeyPassword xxxxxxx --bind_ip 0.0.0.0 --port 37017 --noprealloc --syslog --smallfiles [01:23] worked fine [01:23] hmm [01:23] davecheney, are you on the same emulation thingy thumper is on? [01:23] davecheney: so what is different for you and me? [01:23] this seems to be an os alignment bug [01:24] I could really use a second pair of brain cells on this [01:24] ii juju-mongodb 2.4.9-0ubuntu2 ppc64el MongoDB object/document-oriented database for Juju [01:24] ii mongodb-server 1:2.4.9-1ubuntu1 ppc64el object/document-oriented database (server package) [01:24] hazmat: davecheney is on a different machine [01:24] bodie_: sure, mine are lonley [01:24] it's in TestWriteFailure in environs/sshstorage [01:24] http://paste.ubuntu.com/7117154/ [01:25] no idea what's supposed to happen, or what's going wrong, except the broken pipe bit [01:25] davecheney: can I get you to try my branch? [01:25] replicable [01:25] it's on a remote [01:25] thumper: sure [01:25] lp:~thumper/juju-core/juju-mongodb [01:25] bodie_: you might ahve to turn up the debug on the remove sshd [01:25] bodie_: my guess is /bin/ssh is [01:25] davecheney: build it, and make sure that juju-mongodb is installed [01:25] a. crashing [01:25] b. being hung up on [01:25] bootstrap a simple local provider [01:26] davecheney: and it should start with no tweaking at all [01:26] OT: hazmat where can I get a trusty install image that is < 2.5 Gb [01:26] davecheney: in theory [01:26] could it be caused by disabling root login? [01:26] bodie_: absolutely [01:27] hm [01:27] check /var/log/secure.log [01:27] or audit.log [01:27] or wherever the AUTH syslog target points too [01:28] bodie_: the bug is that that test is discarding the output of stderr [01:28] davecheney, hmm.. i just use cloud-images most of the time.. their minimal and small.. but you mean like a boot installer.. if you need super small and have the bandwidth.. there's always net install [01:29] hazmat: i just want to download it and try it [01:29] do I really need a 2.5 Gb iso ? [01:29] hrm ? ... I'm redirecting stderr into the test output.. so you're saying it didn't go in because it was being discarded in the first place? [01:29] davecheney, http://cdimage.ubuntu.com/daily-live/current/ look like 950mb [01:29] hazmat: ta, that'll do nicely [01:29] davecheney, kvm with cloud image.. gets you down to even smaller [01:30] bodie_: by default stderr and stdout go to dev null unless otherwise requested [01:30] davecheney, 240mb http://cloud-images.ubuntu.com/trusty/current/ [01:30] hazmat: ta muchly [01:30] davecheney, bottom of that page for the raw downloads [01:30] much better solution [01:30] I called it like so: go test ./... > output.txt 2>&1 & [01:30] davecheney, also.. virtualbox images here.. http://cloud-images.ubuntu.com/vagrant/trusty/current/ [01:31] bodie_: this isn't something you've done [01:31] ok [01:31] this is a bug inside the test code [01:31] hazmat: a bounty of resources [01:31] hazmat: that error [01:31] da fuq [01:31] thumper: pingy [01:32] that isn't a journal problem [01:32] this is somethig to do with the 64mb pages option [01:32] coming [01:32] thumper: this may be the problem [01:32] Linux winton-02 3.13.0-8-generic #28-Ubuntu SMP Mon Feb 17 08:22:39 UTC 2014 ppc64le ppc64le ppc64le GNU/Linux [01:32] thumper: what you got ? [01:32] i reckon you've got a different kernel [01:32] uname -a? [01:32] thumper: yup [01:33] Linux rockne-02 3.13.0-15-generic #35-Ubuntu SMP Mon Mar 3 15:56:08 UTC 2014 ppc64le ppc64le ppc64le GNU/Linux [01:33] hmm [01:33] ok, that is another angle to try [01:39] thumper: hangouts is not loading for me :( [01:41] wallyworld_: you lot still in hangout? I can't even load https://plus.google.com/ [01:41] waigani_: yeah, we're here [01:41] wtf? [01:41] :( [01:41] talk about you :-P [01:41] waigani_: thumper is using the internet [01:41] you'll have to wait your turn [01:42] lol [01:42] lol - bastards [01:49] davecheney: can you run 'getconf PAGESIZE' [01:50] davecheney: I get 65536 [01:50] ubuntu@winton-02:~$ getconf PAGESIZE [01:50] 4096 [01:50] bingo [01:50] i get 4096 also [01:50] arse [01:50] thumper: https://docs.google.com/a/canonical.com/spreadsheet/ccc?key=0Aje1qBKEwuLVdDIxdG9NeWF2REJjemdLbllzbGotamc&usp=drive_web#gid=0 [01:50] thumper: patch getconf :-) [01:50] line 10 [01:50] wallyworld_: on rockne? [01:51] yeah [01:51] thumper: wallyworld_ the 4k/64k thing is provided by the kvm emulation [01:51] it's totally possible that different vm's get different views of the underlying hardware [01:51] but why does a log funtion vare? [01:51] care [01:52] we could just patch it to see if it works as an experiment [01:52] wallyworld_: simpler/safer just to apt-get update a new kernel image [01:53] davecheney: what does your uname -r say? [01:53] 3.13.0-8-generic [01:53] i'm a week or two behind [01:53] i wonder that thumper's is? [01:53] what [01:54] wallyworld_: check scrollback [01:54] ah. -15 [01:55] wallyworld_: i thnk that assert is checking that the log file is aligned to the os page size [01:55] probably because they use mmap a lot [01:55] ^ just a guess [01:55] but probably not totally wrong [01:55] makes sense i guess [01:55] mongo does a lot of dumb shit for performance without proof that the simple way was a problem to begin with [01:55] seems like they could do better though [01:56] s/mongo does a lot of dumb shit for performance without proof that the simple way was a problem to begin with/mongo does a lot of dumb shit [01:56] fassert( 16142, _fd >= 0 ); [01:56] what the bollocks is this ? [01:56] every assert has a unique number [01:57] but this isn't a constant ? [01:57] davecheney: it'll be a number they can grep the source for :-) [01:57] fassert(42, 'the meaning of life') [01:57] * thumper takes a stab in the dark [01:57] thumper: i hope youre joking [01:58] it's mongo, of course not [01:58] * thumper guesses not [01:58] fassert( 2 , 2 != 2 ) // totally unique, bro [01:58] chrome does not load plus.google.com anymore, yet firefox does ?!? You'd think, if anything, it would be the other way around [02:04] davecheney: can you try my branch? [02:04] I was told that there was a second VM for me, but it looks like not... [02:05] permissing denied (public key) [02:27] wallyworld_: added cloudinit test for trusty state server [02:27] thanks :-) [02:27] np [03:04] wallyworld_: bug: 2014-03-19 03:04:00 DEBUG juju.environs.simplestreams simplestreams.go:995 metadata: &{map[com.ubuntu.juju:14.04:ppc64:{ 1.17.6.1 ppc64 map[20140319:0xc21042bde0]} com.ubuntu.juju:12.04:ppc64:{ 1.17.6.1 ppc64 map[20140319:0xc21042b720]}] map[] Wed, 19 Mar 2014 03:03:55 +0000 products:1.0 com.ubuntu.juju:released:tools} [03:04] thumper: root 11980 0.7 0.4 1076640 37628 ? Ssl 03:04 0:00 /usr/lib/juju/bin/mongod --auth --dbpath=/home/ubuntu/.juju/local/db --sslOnNormalPorts --sslPEMKeyFile /home/ubuntu/.juju/local/server.pem --sslPEMKeyPassword xxxxxxx --bind_ip 0.0.0.0 --port 37017 --noprealloc --syslog --smallfiles [03:04] branch looks fine, LGTM [03:04] urgh, bugs in the server package [03:05] Removing mongodb-server (1:2.4.9-1ubuntu1) ... [03:05] arg: remove [03:05] mongodb stop/waiting [03:05] davecheney: me not understand? [03:05] the middle line [03:05] some output that shouldn't leak to the user [03:05] that's debug [03:05] * thumper goes to drink coffee [03:05] davecheney: thanks for testing [03:06] thumper: looking good [03:06] sorry your vm has brain issues [03:06] ttps://bugs.launchpad.net/ubuntu/+source/mongodb/+bug/1294455 [03:07] <_mup_> Bug #1294455: mongodb-server leaks debug information during remove [03:07] not important [03:07] so I tried upping the log level on sshd to see about that broken pipe, and it caused a bunch of other test failures for some reason. [03:08] I guess my next step is to try a 12.04 vm, and if that doesn't work, i don't know [03:08] bodie_: you said you disabled root login [03:08] that is probably the root cause [03:08] yeah, still breaks with root login enabled [03:08] ok [03:08] bodie_: are you canonical [03:08] ? [03:08] contracting w/ canonical, so, sorta [03:08] hai o/ [03:08] right, you have my sympathies [03:09] davecheney: as soon as outgoing ssh and access to port 10170 is sorted out, you can pull trunk and should be able to bootstrap ec2, hp cloud etc environments [03:09] i have no idea how you have managed to get yourself into this knot [03:09] i suspect it's a reality tv show [03:09] i recommend a 12.04 vm [03:09] that is what the landing bot users [03:09] wallyworld_: OH CRAP, i forgot about the api port [03:09] this is a nightmare [03:09] davecheney: i asked for the api port in the rt [03:09] hey man, I'm getting paid to write Go and work from home. I'LL. TAKE. IT. [03:09] wallyworld_: could I try it now [03:10] I just really want to get rolling. the testing stuff is driving me #go-nuts. [03:10] davecheney: you could, but it won't bootstrap cause it will get stuck on ssh [03:10] unless your vm is special [03:10] wallyworld_: could I bootstrap from my amd64 machine to a precise/386 image, or from an armhf machine to an amd64 precise image [03:10] that is effectively the same thing [03:10] yes [03:10] yes you could [03:10] good, i'll test that while I wait [03:10] so long as the tools are there [03:11] or, so long as they are packaged and available locally [03:11] wallyworld_: i'll choose a version that I know tools exists for [03:11] bodie_: same, https://twitter.com/davecheney/status/446075333360361472 [03:11] davecheney: if you have the tarballs, you can use the --metadata-source bootstrap are and they will be uploaded [03:11] arg [03:12] just fyi [03:12] wallyworld_: whoa there big fella, i'm not that keen [03:12] davecheney: I came to the realisation the other day that I basically never wear shoes anymore [03:12] my feet never felt so free [03:27] thumper: https://codereview.appspot.com/77340046 when you're not too busy [03:28] I'd kinda like to get it in for 1.17.x so we don't have backwards compat nightmares [03:28] * axw goes to pick up his daughter [03:31] kk [03:31] * thumper will look now [03:44] wallyworld_: I don't thnk we should/need to ask for a firewall hole for 17017 [03:44] that must be proxyable [03:44] and we should prxy it [03:44] it's http after all [03:45] hmm, yeah [03:45] so the proxy will have to allow [03:46] CONNECT machine-0:17017 HTTP/1.1 [03:46] 17070 i think you mean [03:47] i'll update the rt [03:47] yes [03:47] i don't know what i'm talking abouit [03:47] jsut kind of wildly stabbing at the keys [03:49] davecheney: so, with my branch and your VM, did the local provider start and work with no tweaks? [03:50] yes [03:50] \o/ === vladk|offline is now known as vladk [03:50] davecheney: I've made your tweak suggestion [03:50] hang on, lemmie get some proof [03:50] davecheney: I was kinda relying on us refactoring that code in the next six months due to other OS support [03:50] but was a good call [03:50] so I changed it [03:50] o/ vladk [03:51] thumper: hi [03:51] O_o! http://paste.ubuntu.com/7117795/ [03:51] thumper: http://paste.ubuntu.com/7117798/ [03:51] winning! [03:52] yeah baby, yeah! [03:52] davecheney: make sure you write that one up for the end of day email! [03:52] wallyworld_: so for power -> ec2 we are just waiting firewall poking? [03:53] thumper: believe so. of course my code will first first time, guarantee it [03:53] coolio [03:54] thumper: before we ship 1.18, i want to get my current branch in if possible, which is proper detection of supported arches on each provider, based on what images are available via simplestreams [03:54] wallyworld_: line it up [03:55] thumper: still wip [03:56] wallyworld_: root 11980 0.1 0.4 1076640 41212 ? Ssl 03:04 0:05 /usr/lib/juju/bin/mongod --auth --dbpath=/home/ubuntu/.juju/local/db --sslOnNormalPorts --sslPEMKeyFile /home/ubuntu/.juju/local/server.pem --sslPEMKeyPassword xxxxxxx --bind_ip 0.0.0.0 --port 37017 --noprealloc --syslog --smallfiles [03:56] out of the box [03:56] noice [03:57] davecheney: you mean the password being logged? [03:58] wallyworld_: eh ? [03:58] davecheney: sorry, i didn;t grok your paste [03:58] mongo obscures the passwd on the cmdline itself [03:58] i didn't do that [04:00] marcoceppi: what would it take to promulgate a trusty/mysql charm ? [04:05] thumper: does debug-log work for the local provider ? [04:06] davecheney: not until it is handled over the api [04:06] ubuntu@winton-02:~/src/launchpad.net/juju-core$ juju debug-log [04:06] Warning: Permanently added 'localhost' (ECDSA) to the list of known hosts. [04:06] Permission denied (publickey). [04:06] ERROR rc: 255 [04:06] ok [04:06] davecheney: there is an all-machines.log [04:06] yeah, i just forgot [04:06] np [04:06] ubuntu@winton-02:~/src/launchpad.net/juju-core$ tail -f ~/.ju-bash: cannot create temp file for here-document: No space left on device [04:06] ^C [04:06] uh oh [04:06] somethign has gone batshit [04:07] thumper: thanks, yeah I tested with local and canonistack - just about to test with azure [04:08] ok cock [04:08] root@winton-02:~# du -sh /var/lib/mongodb/* [04:08] 3.1G /var/lib/mongodb/journal [04:08] 64M /var/lib/mongodb/local.0 [04:08] 17M /var/lib/mongodb/local.ns [04:08] 0 /var/lib/mongodb/mongod.lock [04:08] 4.0K /var/lib/mongodb/_tmp [04:08] mongo has gone batshit [04:08] oh hang on [04:08] this is the system mongo db [04:08] not outs [04:08] ours [04:08] ok [04:08] i'll just delete that [04:08] haha [04:09] it probably is still batshit insane [04:09] lemmie raise a bug for that [04:09] * thumper tries to land that branch again [04:11] If you're happy and you know it, land your branch ♬ [04:14] root@winton-02:~# du -sh /var/lib/mongodb [04:14] 8.0K /var/lib/mongodb [04:14] great, now I can't trigger the problem [04:14] asshole [04:29] davecheney: branch landed === vladk is now known as vladk|offline [07:38] morning all [07:39] mgz, jam, relatively quick review ? https://codereview.appspot.com/77340044/ [08:05] mornin' all [08:05] dimitern: 1000 line diff is "relatively quick"? :-) [08:13] morning [08:14] relative to a 2000 line diff it is ;) [08:16] :) [08:17] rogpeppe, most of that are tests [08:18] dimitern: so we're not supposed to read tests as closely then? :-) [08:18] :D [08:18] axw: hiya [08:18] i'll remember that one [08:18] dimitern: i confess that i probably *don't* read tests as closely as production code. [08:19] rogpeppe, well then ;) give it a try ? https://codereview.appspot.com/77340044/ [08:20] dimitern: currently reviewing https://codereview.appspot.com/77500045 [08:20] rogpeppe, sure, np [08:21] fwereade, so it turned out there *will be* a sprint in Malta after all.. just not for us ;) [08:21] last week of may for the apps & client guys [08:22] I'd rather go to Malta than Las Vegas... hmm that is quite the first world problem [08:22] dimitern, ha, yeah [08:22] lol [08:22] ya'all are writing the the wrong Go application [08:23] dimitern, I will continue to quietly agitate for malta sprints [08:39] axw: do we now have a valid environment configuration in the state from the word go now? [08:39] rogpeppe: yup [08:39] axw: cool. we can probably simplify various pieces of agent logic because of that [08:40] rogpeppe: yeah, there are definitely some things I haven't gotten around to simplifying. e.g. workers waiting for valid config [08:40] axw: that's what i was thinking of === vladk|offline is now known as vladk [08:41] rogpeppe: it is still possible to get an invalid config *after* bootstrap, but that's very close to being fixed [08:41] rogpeppe: there's a small amount of work required to ensure config changes in state are transactionally validated&updated [08:42] morning all [08:42] morning vladk [08:42] axw: ah yes, because if you've got two set-environments concurrently, the combination might not be valid, i guess [08:42] vladk: hiya [08:43] rogpeppe: yup. waigani has been working on fixing it. there's a bit more to go, but it's still racy for the moment [08:57] mgz, ping [09:03] axw, fwereade, rogpeppe, jam, i'd really appreciate a review so i can land https://codereview.appspot.com/77340044/ === Ursinha is now known as Ursinha-afk [09:03] dimitern: sorry, still on that other review [09:03] dimitern, I'll take a look [09:04] about to go afk, will take a look later if it's not reviewed [09:04] fwereade, ta! === Ursinha-afk is now known as Ursinha [09:24] fwereade, jam: do you think we need to elide passwords from service-set configuration attributes in the log? [09:38] dimitern, reviewed, let me know what you think [09:39] fwereade, cheers, looking [09:41] fwereade, i'm not quite following you on that one https://codereview.appspot.com/77340044/diff/1/agent/agent.go#newcode135 [09:42] fwereade, how do you suggest to improve the doc comment? [09:42] dimitern, so, we've got a global mutex for config changes [09:42] fwereade, I agree it's not perfect, but it's a step in the right direction [09:42] fwereade, yeah [09:42] dimitern, right, yeah, as I say prgress not perfection [09:43] dimitern, just add a note to configMutex explaining the situation if there isn't already one [09:43] fwereade, that it's used to guard against modifying a created config? [09:44] dimitern, that it sort of implies a singleton config, but all we really have is an accidental-singleton config [09:45] fwereade, ok, got it [09:45] rogpeppe: can I get a follow up review on: https://codereview.appspot.com/76120044/ I simplified it [09:45] jam1: looking [09:48] jam1: what was the problem with my suggested (much simpler still) approach? [09:49] rogpeppe: the only difference is I use package vars (that I can change in a threadsafe manner), and do some logging. [09:49] jam1: utils.OverrideGOMAXPROCSFuncs seems ugly to me [09:49] jam1: and it doesn't seem necessary [09:50] jam1: i'd prefer to keep mocking inside package boundaries when reasonable to do so [09:50] rogpeppe: so the alternative is to make them public variables, which doesn't seem great, vs a public function to set them. [09:50] rogpeppe: how do you suggest telling if GOMAXPROCS was called? by calling it once to force it to a given value, and then calling it again to observe that the runtime has changed? [09:51] jam1: we could use the approach already used inside cmd/jujud: var useMultipleCPUs = utils.UseMultipleCPUs [09:51] (runtime.GOMAXPROCS(1); run the agent; finalN := runtime.GOMAXPROCS(0); c.Assert(finalN, gc.Equals, 2) ) ? [09:52] jam1: (see newDeployContext for an example) [09:52] fairy nuff [09:52] we miss a bit of end-to-end of a test, but we do test the two layers [09:52] fwereade: I reworked the DistributionInstances CL. I'm now looking at reworking (replacing) the UnitAssigner one. Based on our discussion last night, I'm going to add a method, SupportsUnitPlacement, to the new EnvironCapability interface that wallyworld introduced [09:53] fwereade, re https://codereview.appspot.com/77340044/diff/1/agent/agent.go#newcode176 [09:53] rogpeppe: TBH I don't really give a ****, I just want GOMAXPROCS getting called. [09:53] fwereade, the _DELETE_ key is used to specify values to delete [09:53] fwereade: then I'll move EnvironCapability into state, and check it during AddMachine/Unit.AssignTo* [09:53] axw, I'm going to pre-send a single comment on one of your reviews, please read it and ping me :) [09:53] jam1: yeah, sorry. i just didn't like the complexity i saw there. [09:53] fwereade: sure [09:54] NOT LGTM? ;) [09:54] jam1: particularly as it cluttered the public interface of utils [09:54] axw, ;p [09:54] dimitern, yeah, and it makes me sad [09:54] jam1: FWIW your test wasn't properly end to end either [09:54] dimitern, it feels likewe're kind of abusing that Paramsstruct because it has some fields that overlap with what we really want [09:55] fwereade, initially I tried having "" to signal delete, but it's not quite enough if you just need to add/set an empty field [09:55] dimitern, it seems like it'd be clearer and less vulnerable to hilarious screwups to use a different struct for the migrate func [09:55] fwereade, ah, you're saying we need a different struct than AgentConfigParams ? [09:55] dimitern, eg changing tag/nonce is likely to be fatal [09:56] fwereade: why would StateServer not be true for new state servers? are you thinking about repurposed machine agents? [09:56] fwereade, *any* reckless change is likely to be fatal :) [09:56] dimitern, granted, but I don;t see much reason to allow changes that basically can't ever make sense [09:57] axw, the trouble is that MachineConfig.StateServer really means MachineConfig.Bootstrap IIRC [09:57] axw, maybe everything already ignores StateServer if passed into StartInstance? [09:58] fwereade: it's only used in environs/cloudinit [09:58] fwereade, ok, what then, to summarize? change the params struct and remove some of the things you can change? or just change the struct [09:58] axw, if so that's sort of ok but still very vulnerable to accidental screwups [09:58] dimitern, wait, was that struct new in that CL? I thought it was repurposed from creation-time [09:59] dimitern, I think we want a separate struct [09:59] fwereade, AgentConfigParams was chosen because it fits nicely with the NewAgentConfig() taking params and constructing one [09:59] dimitern, but maybe I missed something? [09:59] dimitern, yeah, my contention is that they're different jobs and want different params [10:00] fwereade, yeah, but if we're going to have mostly the same fields? [10:00] dimitern, stuff like "" meaning "leave alone" in one context, and "set to empty" in another, etc [10:00] fwereade, ok, I'll look into it and repropose [10:00] dimitern, how many fields are you actually changing though? [10:01] fwereade, potentially DataDir, LogDir, Jobs and Values [10:01] dimitern, I'd really prefer a struct with those fields (and a separate DeleteValues field, rather than a magic key) [10:01] fwereade, changing DataDir is as dangerous as changing Tag or Nonce, but it's required for this migration [10:02] fwereade, will do [10:02] dimitern, understood, this is a sharp knife we're building [10:02] fwereade: I was thinking I could change the DistributionInstances code to return the instances for state server machines. You still need to know whether it's going to be a state server at provisioning time (I had planned to use MachineConfig.StateServer to decide whether to do this) [10:03] axw, I think we will know that, but we don't currently express it well [10:03] ... because instances can't be moved between Cloud Services later [10:03] rogpeppe, can I get your input on this discussion with axw? see comments in https://codereview.appspot.com/73910043/ and discussion here [10:03] fwereade: alternatively, base it on the jobs in state for the machine being provisioned [10:03] fwereade: looking [10:03] axw, yeah, ISTM that the DistributionInstances path is the rightone [10:04] fwereade: if MachineConfig.StateServer really means "BootstrapMachine" then I think that's the way to go [10:05] axw, I think it pretty much does *but* you have done the most stuff to bootstrap lately, and I have only *read* that code rather than *actively worked with it* so your instincts may well be more finely tuned [10:05] fwereade: that CL is going to have to change a bit anyway, because labels won't be useful anymore [10:05] axw, if you can say for sure I'm wrong there then I will gladly defer to your superior perspective :) [10:05] fwereade: it is now, but that's only because HA isn't merged yet :) [10:06] axw, yeah, indeed :) [10:06] fwereade: that code was all kinda circumspect [10:06] I figured it'd have to change a bit [10:06] axw, there's a whole load of stuff that sync-bootstrap has rendered irrelevant-shading-to-crazy, and it'd be lovely to get time to rationalise all that one day [10:07] indeed [10:07] axw, but while it's not actively hurting us it remains a copious-free-time deal ;) [10:09] fwereade: does the EnvironCapability thing sound reasonable? [10:09] I think it's basically what you were saying last night [10:10] axw, that name sounds good, but I sorry, which CL? [10:12] fwereade: none, I was referring to a message before [10:12] fwereade: I reworked the DistributionInstances CL. I'm now looking at reworking (replacing) the UnitAssigner one. Based on our discussion last night, I'm going to add a method, SupportsUnitPlacement, to the new EnvironCapability interface that wallyworld introduced [10:12] fwereade: then I'll move EnvironCapability into state, and check it during AddMachine/Unit.AssignTo* [10:13] * fwereade hunts frantically for the EnvironCapability code [10:14] fwereade: environs/interface.go [10:14] fwereade: landed yesterday I think [10:14] * fwereade sees it there and wonders how his grep failed [10:15] fwereade: we could also use this for enabling/disabling the firewaller job, for example [10:15] axw, yeah, that sounds sane to me [10:15] axw, thanks [10:15] goodo [10:15] thanks [10:19] fwereade: out of interest, why would/should StateServer not affect cloudinit? won't that be the place for installing juju-mongodb/mongodb-server still? [10:21] axw, it seemed rather better to let the new state servers know all their special info over the api connection rather than jamming important info into cloudinit (where other things can read it) [10:21] I think that might be the only thing to do though, the rest requires a secure connection - so probably a InstallMongo flag would be more appropriate [10:21] axw, also, it'd be nice to be able to promote a machine to a state server at some date in the future [10:21] yeah, makes sense [10:22] axw, even if we don't do so today [10:22] fwereade: sure, except for on azure of course :) [10:22] axw, quite so :) [10:22] axw, goddamn azure :) [10:22] heh [10:22] axw, anyway I think the DistributionInstances thing is good *anyway* [10:22] the new manual provider [10:23] axw, because we'll want to distrubute state servers where possible everywhere, right? [10:23] yeah I think so too, I was just curious [10:23] fwereade: yes it'll be useful for ec2 [10:23] good morning everyone [10:24] fwereade: bbl, gotta take care of my little one. thanks for the chat. [10:24] axw, cheers [10:26] rogpeppe: https://codereview.appspot.com/76120044 updated [10:32] jam1: Hey. [10:32] hi jpds [10:32] What happened to "juju image-metadata" ? [10:32] jpds: it is 'juju metadata generate-image' [10:32] or 'juju metadata validate-image' [10:33] And how do I get index and imagemetadata.json into my S3 stuff? [10:33] jpds: in 1.16 or in 1.17+ ? [10:33] We've been polishing it a bit in 1.17 [10:33] jam1: 1.16. [10:33] :( [10:34] jam1: OK, so I have the stuff; where am I suppose to put it? [10:34] jpds: you haven't bootstrapped yet, correct? [10:34] jam1: Trying to. [10:34] (in 1.17 you can juju bootstrap --metadata-source /path/on/local/disk) but that won't help you for 1.16 [10:35] Excellent. [10:36] I'm digging out 1.16 to see if I have answers for you [10:38] jpds: as a rough approximation, you need to put it somewhere preserving the directory structure (streams/v1/index.json) and then point "image-metadata-url=PATH_TO_ROOT" [10:40] I'm just going to upgrade to 1.17. [10:40] jpds: works for me :) [10:43] grabbing a cuppa before standup [10:47] jam1: It's still complaining that index file has no data for cloud. [10:47] jpds: do you have the backtrace of commands that you've run and their output? [10:47] jam1: Yes. [10:47] You need to make sure that your "generate-image" command has the settings that match what you are bootstrapping. [10:48] standup [10:48] natefinch: thanks for the reminder, switching machines [10:48] brt [10:50] jam1: reviewed [10:51] mgz: poke [11:19] fwereade, updated https://codereview.appspot.com/77340044/ [11:29] fwereade, should be ready to land now i think [11:33] so, to what degree of poking am I allowed to go to obtain a review? :p [11:34] * perrito666 starts mailing plastic fingers [11:34] wallyworld: if you're around, any ideas why "juju bootstrap" would be trying to look for "com.ubuntu.cloud:server:12.04:i386" ? [11:34] Is this a case of "if local Arch == i386, boot a machine with i386" ? [11:34] This is with released 1.17.5 [11:34] so not trunk. [11:34] um [11:34] wallyworld: this is custom image metadata, where the target arch should only contain amd64 [11:35] but the image metadata lookup fails with: [11:35] index file has no data for product name(s) ["com.ubuntu.cloud:server:12.04:i386"] [11:35] jam: is the client machine precise i386? [11:36] wallyworld: I think we should add a line at startup of "juju" processes which logs version.Current [11:37] that would be helpful [11:37] natefinch: http://paste.ubuntu.com/7119308/ [11:38] dimitern, LGTM [11:39] perrito666, hassle everybody mercilessly [11:43] fwereade, tyvm [11:44] fwereade, and I did test it live again :) [11:48] rogpeppe: thanks for the review. you're right about AS not being changeable, but I'm going to change the use of StateServer based on discussion with fwereade anyway [11:49] rogpeppe: it'll unify some of the logic between state/non-state grouping, and it'll also be useful for ec2 AZs later [11:49] (where we need to have a set of instances so we can manually distribute) [11:53] fwereade: what a reckles advice :p [11:54] axw: sgtm [11:59] rogpeppe: in the cmd/jujud/bootstrap_test.go , there are some tests to initBootstrapCommand, which calls InitializeState. That, after our changes from yesterday calls environs.New(envCfg). It works fine when running, but the tests are now failing with an "environment is not prepared" error from environs.New [12:00] rogpeppe: any thoughts on either refactoring the code to make those tests not fail OR refactoring the tests so that the environment is prepared by the time environs.New is called? [12:01] wwitzel3: i think the tests *should* be passing config from a prepared environment [12:02] wwitzel3: if they're not, they need to be fixed [12:03] rogpeppe: ok, so the testConfig getting passed in to initBootstrapCommand ? [12:04] wwitzel3: probably [12:05] rogpeppe: or I guess, how would I ensure that the config being passed in is from a prepared environment? Is there an example of that in a test somewhere? [12:05] rogpeppe: testConfig right now is just a a dummy.SampleConfig that gets run through the b64yaml helper. [12:52] mgz: ping [12:53] wwitzel3: you can get the prepared environment config from the dummy environment that is started up in most tests [12:53] fwereade, jam, mgz: could we talk about API addresses for a moment or two? [12:53] rogpeppe, got a call in 7 mins, but if it's quick, sure [12:54] fwereade: https://plus.google.com/hangouts/_/canonical.com/juju-core ? [12:54] fwereade: might be quickest [12:55] rogpeppe: I'll be there [13:10] mgz: thanks, that's very useful [13:10] rogpeppe: that sounded good to me, [13:10] and hangouts hate me [13:10] so, carry on :) [13:15] natefinch: I pushed tests for the initiateReplicaSet and some other minor fixes to my branch [13:15] natefinch: I am still trying to figure out this environment not prepared stuff [13:17] wwitzel3: ok, I just got back, I'll merge in and see what I can see [13:29] hi guys, https://bugs.launchpad.net/juju-core/+bug/1271144 is still happening, I added some log files to it [13:29] <_mup_> Bug #1271144: br0 not brought up by cloud-init script with MAAS provider [13:29] can someone reopen the bug? [13:29] morning fine fellas [13:30] ahasenack: i *thought* we'd fixed it [13:30] rogpeppe: so did we ;) [13:30] rogpeppe: I just tried with 1.17.5, still no br0 and lxc deployments to bootstrap fail [13:30] ahasenack: but it was very hard for us to test that our fix worked [13:30] ahasenack: because we don't have access to a maas to test on [13:31] rogpeppe: I reproduced it both with a real metal maas server, and with maas running on kvm [13:32] ahasenack: do you know which revno corresponded to 1.17.5 ? [13:32] let me see if the branch is tagged [13:32] is juju-core still in lp? That's where I'm looking [13:32] rogpeppe: juju-1.17.5 2422 [13:33] yeah, after the fix, I wasn't able to reproduce the error on my vbox maas [13:33] current is 2443 [13:33] rogpeppe: we can help debug whatever you need [13:33] that was just running juju tags, I'm assuming that is the official tag for 1.17.5 [13:34] wwitzel3: did you tear down and re-deploy each time? [13:34] er, bzr tags [13:34] wwitzel3: emminently reproducible here :) [13:34] wwitzel3: can you run with this? [13:36] wwitzel3: a quick question, were you testing LXC on machine-0 or on one of the other machines? [13:39] hmm... i'm getting breakage on a fresh go get launchpad.net/juju-core [13:39] log/syslog/config.go:228: method slConfig.CACertPath is not an expression, must be called [13:39] and a few others [13:39] I don't imagine broken code would get merged [13:42] yeah tons of stuff is broken. the only thing I can understand that might explain this is a bad connection.... [13:43] bodie_: a fresh go get probably got updates from 3rd party branches that are newer than we are using [13:43] (go get grabs the tip of all dependencies) [13:43] we have a dependencies.tsv to state what version we are using [13:44] you should be able to "go get launchpad.net/godeps", and then do "godeps -u dependencies.tsv" [13:44] which should set your tree to be the exact version of dependencies we should use for building. [13:46] tried that, and install still complained about tons of random breakage [13:46] I just double-checked on my workstation to confirm, and nothing went wrong [13:46] but it's totally reproducible on my remote [13:46] starting to think the gopher gods have it in for me [13:47] bodie_: godeps doesn't actually update your branches, it just tells you which ones are broken. I'm not sure what the bzr command is to tell those branches to move to the right commit. [13:47] bodie_: *potentially* it is a different version of go libraries? [13:47] natefinch: godeps -u dependencies.tsv does the right thing *if* you've already downloaded the full histories [13:47] it won't fetch new data [13:47] but if you have it locally, it will set the branch to the right revision [13:47] jam: ahh, so you need to bzr pull first, then run godeps [13:48] but go get already bzr pulls doesn't it? [13:48] natefinch: well, potentially "go get -u" to do the recursive fetch of all dependencies, but that potentially overwrites any local commit you have in a dependency. [13:48] bodie_: "go get" or "go get -u" ? [13:48] -u is needed to update in place [13:48] bodie_: go get puts you at head... we need some non-head stuff [13:48] go get -u -... [13:48] well i'm just saying, if you already go got, godeps should have the latest bzr content [13:49] bodie_: right, go get -u launchpad.net/juju-core/... ; godeps -u dependencies.tsv should leave you in a working state [13:49] jam: it seems like having head of all our branches not building is a big mistake for exactly this reason. ideally godeps should only be needed for release branches that need older stuff [13:49] natefinch: you can't develop a 3rd party lib and never land it independently of trunk [13:50] jam: oh, yeah, I see... if you're working on a feature branch, and the feature needs updates to external packages.... yeah ok. \ [13:50] I might have just screwed up here by starting off with go get -u -v when I never ran go get without -u [13:50] giving this a try [13:51] jam: I was deploying the lxc to machine-0 and I was doing a full destroy each time. I have it running now. [13:51] jam: this is why Gustavo's versioning of packages is good... prevents exactly this problem by just making a new version that the new code references. [13:51] natefinch: right, the recent one is that axw has been improving gwacl, but that meant gwacl tip was incompatible with juju-core for a while. [13:51] natefinch: "ish" [13:51] except Gustavo has broken us several times [13:51] because he *thought* his change was compatible [13:51] jam: have I? [13:52] when it really wasn't [13:52] niemeyer: you broke the test suite [13:52] jam: Really!? [13:52] fight fight! [13:52] when you added timeouts for mongo [13:52] jam: I'm now aware about that [13:52] because we were already doing timeouts in juju-core [13:52] and your timeouts and our timeouts doubled up [13:52] jam: heh, yeah, it's definitely not always possible to know if a change will break... but there are changes you *know* will break people [13:52] jam: Well, sorry.. but that's a completely different kind of "breakage" [13:52] jam: What else have I broken? [13:53] niemeyer: our test suite started failing because of an update to mgo. [13:53] I don't know of a case where stuff stopped compiling [13:53] jam: That's good.. [13:53] jam: What else? [13:53] niemeyer: the test suite has broken at least one other time, and we pinned to rev 241 for a while. [13:53] I don't remember the exact reason offhand [13:54] jam: Okay, phew.. glad the "several times" was an excited overstatement then [13:55] jam: In terms of timeouts, why did it break again? Just out of curiosity.. [13:55] oh god [13:55] digitalocean's mirror has go1 [13:55] niemeyer: we had a test that asserted we retried an appropriate number of times, but we ended up having mgo retrying on top of juju retrying [13:55] something like that [13:55] * bodie_ slowly claws skin off face [13:55] jam: Oh man.. okay.. [13:55] I'm going to go yell at someone. brb. [13:55] jam: That's *totally unspecified* [13:56] jam: If you assume mgo will connect a specific number of times, updates *may* break it [13:57] niemeyer: so, more context as I'm remembering. mgo used to flood connections to Mongo, and we needed it to slow down a bit. We did the slow-down in Juju code, but then mgo started slowing down, and then we were too slow, IIRC. [13:57] jam: Sure, that's okay.. [13:57] jam: I reckon the update broke the test logic [13:57] jam: the socket timeout issue was the most painful, that gave us random failures [13:57] jam: But the test logic was truting on unspecified behavior.. I simply cannot promise to not touch that kind of logic [13:58] rogpeppe will remember the details [13:58] trusting [13:58] mgz: There was an actual bug there in this case [13:58] So yeah, that kind of breakage is on me.. will pay a beer to the debugger, sorry. [13:58] But that's unrelated to natefinch's point. [13:59] rogpeppe will look forward to that :) [13:59] mgz: Yeah, I owe him more beers than I can pay [13:59] So, the point remains: the versioning of stable APIs is a good method to avoid API breaks. [14:01] voidspace: lp:~rogpeppe/juju-core/524-state-api-hostports [14:02] * rogpeppe agrees with niemeyer === marcoceppi is now known as marcoceppi-mobil [14:04] niemeyer++ [14:11] okay, now I just think I'm going insane [14:11] ubuntu 12.04 golang is version 1?! [14:11] this can't be right [14:12] bodie_: Why not? [14:12] bodie_: this is why I just always build go from source, plus you need to do that to enable cross compilation anyway [14:13] I don't know... I guess it was my mistake to assume something would be a proper version on 12 LTS [14:13] natefinch: bodie_: precise is just that old [14:13] you need to "add-apt-repository ppa:juju/golang-go" [14:13] and we'll give you 1.1/1.2 (I don't quite remember now) [14:14] bodie_: Hmm.. 1 looks like a proper version...? :) [14:14] bodie_: 12 was 2 years ago [14:15] niemeyer: almost all go code written these days won't compile in go 1. [14:15] natefinch: These days is two years after two years ago.. :) [14:15] niemeyer: yes... and there's been plenty of time to update the golang package to 1.1 [14:16] natefinch: Distributions don't get updated with new code just because it got released [14:16] natefinch: LTS means long term *support* [14:16] just asked the same question in #go-nuts and they pointed me to niemeyer's godeb. heh [14:16] natefinch: Security fixes and the such will get applied [14:16] (a couple of minutes ago) [14:16] natefinch: If you want new code, just update to a new release [14:16] I feel special [14:16] niemeyer: natefinch: yeah, generally Ubuntu doesn't change your toolchain underneath you in the official archives. [14:17] it would have been possible to potentially add a "golang-go-1.1" sort of package. [14:17] but not to change the "golang-go" package out of 1.0 status. [14:17] yeah, cause that's the version it was at, at that point in time. I get that. [14:17] it's just annoying to realize... lol [14:17] bodie_: :) [14:17] well, I would have pushed to be compileab [14:17] spent an hour trying to figure out why everything was broken until I checked the version -_- [14:18] compilable on default tools of Precise for a bit longer. [14:18] bodie_: That's reasonable :) [14:18] but 1.1 was just too much better than 1.0 [14:21] I don't think making a habit of making sure you have a properly updated toolchain is a bad idea [14:21] I just get lazy and assume I'll have the things I need from repos [14:22] didn't even cross my mind to check the version until I noticed the output of go get -v was so different [14:22] bodie_: you probably should run trusty. It's what most of us are developing on at this point [14:22] maybe I'll make a mention of this in CONTRIBUTING... [14:22] oh I'm running trusty on my workstation [14:22] not fun [14:22] this is why I'm making a 12.04 vm [14:23] no? I have no problems with it [14:23] it's been literally one thing after another for an entire week including the weekend [14:23] maybe I'm just being dumb, maybe not everything is totally shipshape on 14.04 [14:23] ruling out the latter [14:24] you also built your mongo on a different release [14:24] the packaged mongo doesn't work with juju [14:24] the packaged juju-mongo doesn't work with juju [14:24] bodie_: true. I guess if mongo doesn't even build on trusty, that's a problem. Though there was a fix to the mongo problem last night. You just need to install juju's mongodb on trusty now [14:24] GCC 4.8.2 doesn't work with mongo source [14:24] okay, cool :) [14:24] maybe I'll do that after I get this 12.04 vm running. lol [14:25] bodie_: if you want. It seems like you're much more likely to run into problems running an old version than a new version [14:26] pretty stoked that Go is moving to a Go-compiled Go. pretty awesome :D [14:26] yeah, hopefully I can just make it work on this box. [14:26] otherwise to the VM I go [14:26] also had issues with a 13.10 vm last night .... [14:26] ssh breaking [14:26] I don't even... [14:27] I'm about ready to decide I should have gone into a career as a gardener [14:28] has anyone tried to download a bundle lately from the charm store? I've tried to export several different ones and all I'm getting is [14:28] envExport: [14:28] services: {} [14:28] relations: [] [14:28] series: precise [14:29] ahasenack, jam: run through it a few times now, still unable to reproduce with my local maas. tear and clean instance each time .. http://paste.ubuntu.com/7120061/ [14:29] wwitzel3: try getting rid of the squid cache [14:31] ahasenack: be happy to if I knew how [14:32] if [ ! -d /var/cache/squid-deb-proxy/00 ]; then [14:32] $SQUID -z -f /etc/squid-deb-proxy/squid-deb-proxy.conf [14:32] fi [14:32] maybe trashing that directory and restarting [14:33] the initscript will create it again [14:37] ppetraki: also may want to try #juju-gui [14:37] hmmmm [14:38] getting the same weirdness as I was last night on my 13.10 [14:38] http://paste.ubuntu.com/7120110/ [14:38] anyone know what might be up? I tried debugging sshd last night, but I just got more breakage [14:38] ahasenack: removed the folder and turned off the squid-deb-proxy service, giving it another try now [14:39] wwitzel3: I'm not sure you can leave it off, juju might expect it to be running [14:41] ahasenack: true [14:47] ahasenack: removed the squid-deb-proxy directory, rebooted, and the br0 interface comes up .. my maas is providing DNS and DHCP. Is yours not? Maybe that is the difference? [14:47] wwitzel3: ours is [14:47] wwitzel3: I tried another maas, an older one on precise (v1.4), and I also nuked the squid cache there. There it worked this time [14:48] wwitzel3: so we only have one maas where this is happening right now, before nuking the cache there I think we need to know what is going on [14:50] bodie_: those tests don't run a real ssh executable [14:51] bodie_: see storageSuite.sshCommand; it writes a fake ssh command to $PATH [14:51] ahasenack: I am able to replicate the behavior before rogpeppe's fix .. maybe if revert to prefix, bootstrap, and populate the cache, then upgrade to 1.17.5 and try to bootstrap without removing the cache .. if I can replicate it that way. [14:51] ahasenack: I will try that to see if i can get in to the same error pattern you're experiencing [14:51] wwitzel3: ok, I'll try again on that maas machine to see if it wasn't just an archive fluke, but without touching the squid cache [14:52] ahasenack: sounds good [14:52] fwereade: https://codereview.appspot.com/77820044 - when you have some time [14:52] (please) === marcoceppi-mobil is now known as marcoceppi [15:01] wwitzel3: can you paste your cloud-init-output.log from a successful run? [15:01] (cc: ahasenack ^^) [15:01] wwitzel3: I see it trying to install bridge-utils immediately *before* an apt-get update === marcoceppi is now known as marco-traveling [15:02] wwitzel3: also line 24 from http://paste.ubuntu.com/7120061/ looks like template fail :) [15:04] sparkiegeek: wwitzel3: http://pastebin.ubuntu.com/7120231/ [15:05] wwitzel3: that's /var/lib/cloud/instance/cloud-config.txt on the bootstrap node [15:05] axw..... ok... so how am I supposed to tackle this [15:06] it's the same on the other bootstrap node where it worked [15:06] could be "luck". Would love to see if it's just a missing apt-get update [15:06] oh, btw [15:06] sparkiegeek: wwitzel3: where it's failing is trusty, and where it worked for me not is precise [15:07] ahasenack: "not is precise" ? [15:07] s/not/now/ [15:07] weirdest typo that happens to everyone [15:07] :) [15:08] so what I mean, is that it's using different archives [15:09] bodie_: sorry, I'm not sure what the root cause is. it just looks like a broken test, but I can't tell at the moment [15:09] axw: isn't it like 2am there? [15:09] natefinch: nah, just after 11pm [15:09] it's a broken test, I think that's as far as I've gotten. heh [15:10] bodie_: I meant as opposed to broken code under test [15:10] axw: ahh, what time zone is that? I don't have it in my world clock list [15:10] ko [15:10] ok* [15:10] probably caused by my custom shell [15:11] natefinch: UTC+8, aka AWST [15:11] although that doesn't make sense either [15:11] axw: cool. I'll add that to my list. Gotta keep track of everyone's time zone so I know when it's reasonable to bug them :) [15:11] :) [15:11] yey, for he who loves reviews now we have a 2 for 1 great offer, review https://codereview.appspot.com/77850043 and get to also review its dependency https://codereview.appspot.com/77490043 [15:12] do not miss this opportunity dear juju hackers [15:12] bodie_: it's explicitly executing as bash, though it's possible that your ~/.bashrc is influencing [15:13] perrito666: I'll look [15:13] axw: that's my best guess too, but I'm a little surprised .bashrc gets triggered [15:14] mgz: that's *a* bug, we should disable it wiht --norc in the test code [15:14] some of the tests do that, but evidently not this one [15:14] bodie_: try sticking "--norc" after "#!/bin/bash" in sshCommand, see if that does anything [15:15] ok [15:22] fwereade: yeah, Base doesn't quite sit well with me but I can't think of a better name :\ [15:22] EnvironDefaults? [15:24] axw: in i prefer entities that do one thing. "Base" types can so easily become a bin to chuck all kinds of random stuff in [15:24] s/in/in general/ [15:25] woot, got broken pipe test passing. [15:25] sshstorage [15:25] on on to the next one [15:25] bodie_: how? --norc? [15:26] rogpeppe: hmm... point taken. maybe it can be replaced with multiple types with default behaviour for specific things [15:26] axw: that would seem better to me [15:27] axw: that means that if you add some new methods, you'll need to explicitly add the default behaviour to all providers, but i actually think that's an advantage [15:28] axw: as it forces providers to think about whether the default behaviour is actually appropriate for that provider [15:28] yeah [15:28] fair enough. I will take another look at that part tomorrow. [15:30] axw: thanks [15:31] I'm off, g'night folks [15:32] axw, I got it working by changing my shell back to bash [15:33] bodie_: weird.... I was pretty sure rogpeppe was running uh... some non-bash shell, and he can run the tests. [15:33] totally weird [15:33] natefinch: yeah, i do [15:34] natefinch: my $SHELL [15:34] % echo $SHELL [15:34] /home/rog/other/plan9port/bin/rc [15:34] :) [15:34] hah! [15:34] I'm using oh-my-zsh [15:34] probably some plugin messing with things [15:34] I'm quickly realizing I need to be as absolutely stock as possible [15:34] THEN get my tests to pass [15:35] then start experimenting, I guess [15:35] bodie_: you're doing a fine job finding test suite isolation issues... [15:35] just not really want you want to be doing [15:35] mgz, bodie_: a good point.... that actually is quite useful, just, yeah.... [15:35] even then, it surely would be better to get tests passing first so I at least know what's breaking what, and then start customizing my environment [15:35] * fwereade is disappearing for a bit to have a snippet of his public holiday [15:36] heh, enjoy! [15:36] * fwereade will be around for a call in 4 hours at the very least [15:51] ppa:juju/golang-go isn't found -- is it $1/golang ? [15:52] bodie_: yup [15:54] fyi, ppa is 1.1.2 [15:55] maybe THAT'S the problem with my 14.04? it's running 1.2.1? [15:55] oy. [15:55] natefinch, go version? [16:08] bodie_: I'm running 1.3, anything >= 1.1 should be fine [16:08] bodie_: er 1.2 [16:08] it would be a trick to be running 1.3 [16:09] I would like to do that as well lol [16:09] rogpeppe: do you have me on mute? [16:10] rogpeppe: or can you just not hear me? [16:10] rogpeppe: or are you ignoring me? [16:10] hah [16:11] mgz: a review for you? https://codereview.appspot.com/77820043 [16:13] ahasenack: http://paste.ubuntu.com/7120556/ , also I was not able to reproduce the error state by bootstrapping with the pre-fix version, then destroying and bootstrapping with the fix. [16:13] rogpeppe: looking [16:13] mgz: ta [16:29] rogpeppe: now actually reviewing it [16:29] mgz: ta#2 [16:34] rogpeppe: lgtm. [16:34] mgz: ta#3! [16:35] mgz: the document has only been around for a few revisions. it's not used by anything else [16:35] as in, wasn't in the last 1.17 release either? that's perfectly fine then. [16:41] Am I the only one that thinks this formatting is terrible? [16:41] func StartInstance( [16:41] env environs.Environ, machineId string, [16:41] ) ( [16:41] instance.Instance, *instance.HardwareCharacteristics, error, [16:41] ) { [16:41] .... [16:41] } [16:43] yeah, it's dire [16:43] wwitzel3: how could we inject some debugging into that cloud-init script? I would love to see the contents of sources.list and sources.list.d/* at that point, and/or maybe just try adding an apt-get update before it tries to install bridge-utils [16:44] wwitzel3: like http://pastebin.ubuntu.com/7120722/ (untested) [16:44] ahasenack: can be done. [16:48] rogpeppe: I wonder if SetAPIHostPorts should actually error out on a zero port [16:48] mgz: perhaps [16:49] mgz: there are so many other ways that the addresses can be wrong though, it seems like it's not really worth doing that one thing [16:49] yeah, maybe, not sure what the right level to be checking these things is really [16:59] is it possible to change the timeout on the bootstrap command? it appears to be 10 mins right now - I can think of physical server deployments where this is not long enough :-) [17:01] jamespage: yep, there's a config value you can set in the environments.yaml [17:01] jamespage: bug 1257649 (see status) [17:01] <_mup_> Bug #1257649: ssh timeout for bootstrap could be configurable [17:01] natefinch, mgz: phew [17:01] jamespage: juju help bootstrap will show you how, I believe === vladk is now known as vladk|away [17:09] mgz, fwereade: currently we store instance ids in the control bucket. can you think of a strong reason to continue to do that for the api addresses, rather than storing the addresses themselves? [17:09] if we store addresses, then we won't need an Environ to be able to work out the API server addresses - just read-only access to the control bucket [17:10] although, i guess we will need to know where the control bucket is [17:11] yeah, we also don't promise that one is globably readable either I think [17:11] but putting addresses in it ('as well') should be fine as a basic optimisation [17:11] mgz: i'm wondering if we need to maintain the instance ids at all [17:11] mgz: ah, of course we do... [17:12] mgz: for destroy-environment --force [17:12] yup. [17:12] weeeel... [17:12] you can implement that in other ways [17:13] just nuking all machines with matching names is how we generally do it I think [17:13] mgz: good point [17:13] mgz: so... maybe we don't need to do it [17:14] mgz: it would make some logic simpler, and it opens the door to having long-term useful .jenv files without provider credentials === DrabMakyo is now known as ShinyMakyo === vladk|away is now known as vladk === vladk is now known as vladk|offline [17:27] here's the branch that changes the peergrouper to publish the api addresses in the state: https://codereview.appspot.com/77600048 [17:28] reviews appreciated [17:28] mgz, dimitern, fwereade, natefinch, wwitzel3: ^ [17:30] rogpeppe: will take a look in a bit [17:30] wwitzel3: ta [17:34] wow, launchpad just sent me an email stating it errord trying to send an email from a comment I added, how rude [17:34] :p [17:34] oh my god [17:34] I think my tests may finally be passing [17:34] you're special now :) [17:39] mgz, I'm not getting a all-machines.log aka debug-log with the MAAS provider and 1.17.5 on trusty - is that something new? [17:39] can't find a bug that looks like it [17:41] ugh, probably, we did change several things around rsyslog [17:41] wwitzel3: around? [17:42] jamespage: the big change was bug 1281071 [17:42] <_mup_> Bug #1281071: juju internal use of rsyslog should use ssl/tls for aggregation [17:42] ahasenack: yep [17:42] wwitzel3: it worked with this patch: http://pastebin.ubuntu.com/7121040/ [17:42] ...but that was in 1.17.4? [17:42] wwitzel3: apt-get update being the fix for me, the rest was just debugging [17:43] Setting up bridge-utils (1.5-2ubuntu7) ... [17:43] stop: Unknown instance: [17:43] networking stop/waiting [17:43] installed bridge-utils, restarted networking, br0 is up [17:44] jamespage: only obvious change since then is the adding of the rsyslog-gnutls package [17:44] ahasenack: ok, I will figure out a way to get that in there [17:45] ahasenack: have to ping some people to see if there is a prefered way to make that happen or if we can just do as you did and add the apt-get update line. [17:45] mgz, yeah - I've seen that work OK on OpenStack with 1.17.4 [17:45] wwitzel3: I'm building debs in my junk ppa, others can try too (I see jamespage was affected too) [17:46] jamespage: we probably need a new bug filed, if you can find anything on a unit about rsyslog and the forwarding, that would be relevent [17:46] mgz, doing that now [17:46] https://bugs.launchpad.net/juju-core/+bug/1294776 [17:46] <_mup_> Bug #1294776: No debug-log with MAAS, 14.04 and juju-core 1.17.5 [17:46] thanks! [17:47] mgz: you have an option on the br0 fix? just added an apt-get update before the explict apt-get install bridge-utils that was rogpeppe's fix? [17:47] mgz: since we have it confirmed fixing the issue for ahasenack [17:47] s/option/opinion [17:48] wwitzel3: ah, so apt-get update is necessary? [17:48] it was for me, not for wwitzel3 [17:48] ahasenack: weird [17:48] could be a difference in the images that were used to install the node? [17:48] ahasenack: do you know what's actually going on there? [17:48] rogpeppe: apt-get install bridge-utils was not finding bridge-utils, that's all I know [17:48] ahasenack: ahhh [17:49] I checked sources.list, they look fine [17:49] so I tried apt-get update [17:49] later during the bootstrap process apt-get update is ran [17:49] ahasenack: so i guess some images ship with a sources.list that doesn't include bridge-utils [17:49] and later again, bridge-utils is installed by something else, and then it works [17:49] rogpeppe: well, no, sources.list is ok, if it weren't a simple apt-get update wouldn't have fixed it [17:50] rogpeppe: maybe some images were generated after apt-get update was run, and some were generated without apt-get update having ever been run [17:50] I can check /var/lib/apt if you want, probably [17:50] ahasenack: sounds worrying :-) [17:50] it's always good to start with a fresh apt-get update run anyway [17:51] you do it in the sync bootstrap [17:51] "just to be sure" [17:51] Adding apt repository: deb http://ubuntu-cloud.archive.canonical.com/ubuntu precise-updates/cloud-tools main [17:51] Running apt-get update [17:51] Running apt-get upgrade [17:51] more because you add another sources.list, sure [17:52] to me it all seems like things to slow down our already slow instance start time, but perhaps that's just me [17:52] i do see that we need to do it [17:52] but i look at docker and thing "a better way is surely possible" [17:52] s/thing/think/ [17:53] I'm a little nervous about fixes when we don't understand why... [17:53] cloud-init does update/upgrade as one of its earliest steps [17:53] well before we get to our runcmd [17:53] why do we need it twice? [17:54] mgz: +1 [17:55] rogpeppe, mgz, ahasenack: when I look at the order that things run on my maas, we issue an apt-get update before attempting to install bridge-utils [17:55] mgz: where does it do it? [17:55] wwitzel3: where's the evidence? There are two places where bridge-utils gets installed [17:56] wwitzel3: in my case, the first one fails [17:56] my first one: [17:56] Reading state information... [17:56] E: Unable to locate package bridge-utils [17:56] stop: Unknown instance: [17:56] networking stop/waiting [17:56] + install -D -m 644 /dev/null /var/lib/juju/nonce.txt [17:56] that comes from that piece of code I patched to include apt-get update [17:56] it failed [17:56] then later, [17:57] The following NEW packages will be installed: [17:57] bridge-utils [17:57] and it goes on and installs it [17:57] I don't know what prompted that one [17:57] ahasenack: the cloud-config specifies apt_update as true [17:57] I think it was bootstrap [17:57] right, it's in that part where it installs package by package [17:57] git [17:58] ahasenack: check the cloud-config file in /var/lib/cloud or whereever it is [17:58] mgz: that apt_update true kicks in after [17:58] o_O [17:58] mgz: that file does not have apt-get update [17:58] only after I added it via the patch [17:58] oh god, is this an ssh race [17:59] wwitzel3: is the bridge-utils install done via runcmd or via the ssh-in-and-do-shit step? [17:59] first attempt, that failed, is via runcmd [17:59] it's in that patch, I pasted it [18:00] http://pastebin.ubuntu.com/7121040/ [18:00] I just added -y, and apt-get update before [18:00] runcmd should absolutely be *after* apt_upgrade [18:00] but apt-get install bridge-utils was there already [18:00] smoser!!!! [18:00] and that's the one failing [18:00] welp, got my tests to pass [18:01] yes [18:01] wait, I think we already had this conversation [18:01] runcmd is after cloud-init implmented apt-get install [18:01] did when runcmd happens get changed? [18:01] ahasenack is seeing odd things [18:02] mgz: where do we set apt_update True? [18:02] running 'apt-get update' is not "just to be sure", its really a requirement. if you want to avoid missing packages in -updates. [18:03] environs/cloudinit/cloudinit.go AddAptCommands [18:03] mgz: I have it being installed twice (bridge-utils) once during the ssh-in .. it is one of the first things to happen after SSH, then later another install is attempted, but at that point it is already at the latest version. [18:03] wwitzel3: you have it as part of cloud-init too [18:03] smoser: the question is, why do we have to run it *again* in runcmd when the cloud-config has apt_update set [18:04] wwitzel3: check /var/lib/cloud/instance/cloud-config.txt [18:04] mgz: how does the cloud-config "file" generated by provider/maas/environ.go get to play with the one from environs/cloudinit/cloudinit.go ? [18:04] rogpeppe: sorry, I thought the call was at 7pm [18:04] rogpeppe: it's a EuroPython committee meeting call [18:05] voidspace: np [18:05] voidspace: it's 7pm CET, i guess [18:05] yep [18:05] mgz: and, remember, this issue of br0 not coming up only happens with the bootstrap node, not with other nodes that are brought up by juju later on [18:05] hints at different cloud-inits [18:06] mgz, well there are race conditions that are unavoidable in apt [18:06] ahasenack: well, does you /var/lib/cloud/instance/clopud-config.txt have the update key or not? [18:06] but i doubt your'e hitting that [18:06] do you see evidence of apt-get update having run in /var/log/cloud-init-output.log ? [18:06] mgz: without my patch, no apt-get update in it. [18:06] smoser: yeah, seems more like some odd bug in our code if you've not done anything evil recently [18:07] ahasenack: okay, that's the bug then. [18:07] mgz, trusty ? precise ? [18:07] for sure neither one *should* hvae had a regression to this affect. [18:07] smoser: I don't think it's an issue with cloud-init. [18:07] and look in /var/log/cloud-init.log [18:07] look for WARN in /var/log/cloud-init.log [18:07] there is no evidence that apt_update: True was passed to it. [18:07] well, look at /var/log/cloud-init-output.log [18:08] it will clearly show you if it was urn [18:08] I did [18:08] (and you'll also see mention of it in /var/log/cloud-init.log if it ran it) [18:08] it's clear that it is following the commands juju gave it via http://pastebin.ubuntu.com/7121040/ [18:08] and there was no apt-get update in that [18:08] so, somehow wayne has that, and you don't [18:09] luck I guess [18:09] also, we use the fast-path-installer, which dumps a cloud image on the node [18:09] wwitzel3: do you use that too? [18:10] I'm assuming he does not [18:10] ahasenack: nope [18:11] ok, that narrows it down a lot [18:11] if you use the debian installer, then it's more or less guaranteed that apt-get update was run [18:11] and /var/lib/apt is still "fresh" [18:12] whereas the fast-path-installer uses a cloud image, that I think was downloaded to the maas server only once when it was first installed [18:12] or it downloads it from the internet, I don't know [18:12] ahasenack: yeah the first 3 lines when I bootstrap after the ssh keys are generated are adding the cloud-tools repository, then an update, and then an upgrade [18:12] wwitzel3: ok, the bug happens before that [18:12] wwitzel3: you only see it in the cloud-init-output.log file [18:13] it happens before juju bootstrap ssh'ed in [18:14] wwitzel3: if you want, switch to the fastpath installer and try again, maybe you'll hit the bug too [18:14] ahasenack: yeah, my cloud-config.txt doesn't have a apt-get update in it, but my bridge is setup fine probably because update has already been run [18:15] that's my conclusion [18:15] mgz: ^^^ wwitzel3's cloud-config.txt doesn't have apt-get update [18:15] like mine [18:15] he got lucky :) [18:15] I didn't :) [18:16] okay, so we need to find out why that's not getting in, and fix it. [18:16] hazmat: remember the br0 not coming up bug on the bootstrap node? There were two cloud-init configs involved, right? Because br0 does get up in services deployed to new machines, just not in the bootstrap one [18:17] hinting that the one used for bootstrap is different than the rest [18:17] there was a long discussion here in the channel when that was first filed [18:17] ahasenack, yes.. there was.. i thought it was addressed with 1.17.5 === vladk|offline is now known as vladk [18:17] hazmat: nope, still happening [18:17] ahasenack, nutshell was on bootstrap node previously bridge-utils wasn't installed because of a divergent code path for bootstrap when constructing cloud-init [18:18] hazmat: in my case, a simple 'apt-get update' just before 'apt-get install bridge-utils' worked [18:18] hazmat: so the fix was to just add "apt-get install bridge-utils" in bootstrap's cloud-init? [18:18] ahasenack, ah.. the package is old.. [18:18] hazmat: right [18:18] wwitzel3: btw, when you get time to get back on the stuff we were working on, I merged from trunk and fixed some tests. Still have the environment not prepared failure, but that's the only one left right now. [18:18] hazmat: I think it was assumed that apt_update: was set to True in cloud-init [18:18] hazmat: which it might as well be, but not for bootstrap's cloud-init === Ursinha is now known as Ursinha-afk [18:19] ahasenack, i'd file a bug.. i gotta run out for a min.. bbiab [18:19] hazmat: sure, it's filed, we were "remembering" it [18:20] that should be enough info to fix it === Ursinha-afk is now known as Ursinha [18:33] natefinch: great, I will pull those changes and take a look [18:33] g'night all [18:33] i'm done too [18:33] happy evenings, all [18:35] later rogpeppe [18:45] natefinch: could you give a second look to https://codereview.appspot.com/77850043/ ? I think it is much better now [18:45] tx :) [18:47] perrito666: LGTM [18:52] natefinch: thank you [18:53] wwitzel3: looks like the dummy environ expects some configuration to be set when you get the state out of it. Looks like it expects a state-id to be set [18:53] natefinch: is there an example of that? [18:58] wwitzel3: not surte [18:58] k, I'll take a look around, brb === vladk is now known as vladk|offline === vladk|offline is now known as vladk === Ursinha is now known as Ursinha-afk === Ursinha-afk is now known as Ursinha === vladk is now known as vladk|offline [19:47] thumper: how's your knowledge of the dummy provider? I'm trying to update the jujud bootstrap tests to include a valid environment and it's bombing out due to an environment not being prepared, but I can't figure out how I'm actually supposed to prepare an environment for the dummy provider. [20:07] natefinch: sorry, no idea [20:08] thumper: heh ok [20:16] http://paste.ubuntu.com/7121842/ [20:16] any ideas? [20:17] I'm on 14.04 [20:17] go1.2.1 [20:17] have the mongo from make install-dependencies [20:17] have all the right revisions via godeps -u dependencies.tsv [20:17] bodie_: you're building trunk, right? [20:17] just wiped all my source code and go-got the new thing before doing all of the above [20:17] yeah [20:18] bodie_: do you have mongod in /usr/lib/juju/bin/ ? [20:19] http://paste.ubuntu.com/7121855/ [20:19] so I guess I have it, but not in my PATH [20:19] this is just what I got via make install-dependencies [20:22] bodie_: is this running the replicaset tests? [20:23] bodie_: nm, I see at the bottom of the paste [20:25] it's not crucial, but i'd really like to get the tests passing on my local [20:25] do I need juju-mongo in my PATH? [20:26] bodie_: no, I don't have it in my path [20:28] bodie_: ha, I'm lying, I do have *a* mongod in my path, and removing it causes that exact error [20:28] bodie_: so, yeah, add that directory to your path, should fix that error [20:29] nice [20:30] thought it looked familiar to what was going wrong when I had no mongo [20:30] you know, it is very hard for me to code in this and not think permanently on http://www.gamefabrique.com/games/juju-densetsu/ which I played as a kid [20:30] that is perhaps the worst error handling I've ever seen, though. No "can't find mongo" instead a nil pointer error. [20:31] heh [20:31] yeah, the tests need some work [20:31] I just am keen to get rolling on actions [20:31] perrito666: I don't know the game, but man, do those kind of graphics bring back memories [20:33] natefinch: It was a famicom game :) I have it ringing in my head since I heard of juju [20:33] yeesh. [20:33] *** Test killed: ran too long (10m0s). [20:33] FAIL launchpad.net/juju-core/worker/uniter/debug 600.003s [20:33] :( [20:33] sinzui: https://bugs.launchpad.net/juju-core/+bug/1294632 [20:33] <_mup_> Bug #1294632: lxc broken in trunk r2440 [20:34] sinzui: have there been any packaging changes with respect to jujud? [20:34] yeah. that is sad [20:34] sinzui: I can run the local provider locally and all good [20:34] the only difference is that I compile jujud [20:34] and the bot probably doesn't [20:34] which means it may not have a jujud locally [20:34] and hence goes looking for tools [20:34] thumper, I did more than compile. I packaged it [20:35] sinzui: what is it testing? the package? [20:35] and which packages? [20:35] does your package contain jujud? [20:35] bodie_: weird, those tests run in ~1 second for me. [20:35] juju-core_1.17.6 [20:35] sinzui: the CI machine, does it do any compile from source, or just tests packages? [20:35] what the hell... [20:36] hmm, maybe there's some driver I'm missing (this is on a VM) [20:36] thumper, I think you mean the new behaviour searches the env (PATH actually) for the jujuds and asks if they match [20:36] sinzui: no... the local provider always needs to get some tools [20:36] what it does by default is to look for a jujud that lives next to the command line juju [20:37] if it doesn't find one, it tries to compile one [20:37] thumper, CI makes the tarball, makes the package from the tarball, then installs the package in a non-system location. Paths are updated and the tests begin. [20:37] if it can't copmile one, it looks for tools [20:37] thumper, tests worked until the revision..tools were found before that rev [20:37] with juju-mongodb in my PATH now: http://paste.ubuntu.com/7121938/ [20:37] sinzui: ok, what is in the package? [20:38] What WE RELEASE [20:38] (this is my 14.04 machine, not the one taking 10 minutes) [20:38] and the contents hasn't changed recently? [20:38] thumper, nothing else failed, so we know the tools were found and also sent to the CPCs [20:38] hmm... [20:38] packaging rules haven't changed for a few weeks [20:39] sinzui: I've assigned it to wallyworld to fix [20:39] he should have a better idea what is going on [20:39] thumper, sinzui: this seems bad (from running the tests with the new juju-mongodb: value *mgo.QueryError = &mgo.QueryError{Code:16149, Message:"exception: cannot run map reduce without the js engine", Assertion:false} [20:39] sinzui: is there any way I can look at the CI machine? [20:40] natefinch: which test? [20:40] AFAIK, we shouldn't be using the javascript engine at all [20:40] thumper: TestCharmStreaming from store/server_test.go [20:40] thumper, ~/jobs/local-deploy/workspace/extracted-bin/usr/lib/juju-1.17.6/bin$ jujud version [20:40] 1.16.6-precise-amd64 [20:41] sinzui: what about ./jujud version [20:41] it doesn't search the path, it looks for one beside the juju being executed [20:42] ah correct, I didn't export path like the test does [20:42] thumper: looks like mgo's MapReduce function must require the javascript engine (used in Store/store.go Store.Counters) [20:45] sinzui: do we run the tests in CI with juju-mongodb? This seems like it should have failed there. [20:46] natefinch: CI runs precise [20:47] no juju-mongodb for precise [20:47] thumper, you are now subscribed to lp:~sinzui/+junk/cloud-city The staging-id_rsa key will let you ssh to jenkins@54.84.137.170 [20:47] thumper, [20:47] PATH=/var/lib/jenkins/jobs/local-deploy/workspace/extracted-bin ./jujud version [20:47] 1.17.6-precise-amd64 [20:47] sinzui: actually [20:47] you are right [20:47] bootstrap is broken locally [20:47] for me too right now [20:47] * thumper sends evil looks wallyworld's way [20:48] thumper: how can we know trusty works if we don't have CI running on it? [20:48] natefinch: we cross our fingers [20:48] thumper: it didn't work [20:48] sinzui: can we have CI set up for trusty? [20:48] :) [20:48] sinzui: sorry to give you more work. I know you've been terribly bored lately. [20:48] natefinch, mongodb-server is forced on the unittests because juju-mongodb support was reverted [20:49] natefinch, ppc64el and arm64 always have juju-mongodb installed [20:50] natefinch, the lxc tests to *not* use juju-mongodb...it broke deployments to the cloud [20:51] so is this why my 14.04 getup is failing? [20:51] pardon me for being obvious here, but... [20:52] I just want to know if I need to nuke my workstation and replace it with a bulletproof version [20:52] bodie_: it seems juju has an incompatibility with the version of mongodb we built specifically to work with juju... [20:52] oy [20:52] bodie_: when you find a bulletproof version, let us know [20:52] sinzui: so I landed a branch yesterday that forces juju-mongodb for all trusty deployments [20:54] bodie_: we stripped out the javascript engine because it had problems on some OSes... but I guess we didn't realize one of the functions we call actually gets sent out to the javascript engine. [20:54] ah, right [20:56] thumper, the aws and canonistack trusty tests are happy [20:56] interesting [20:56] sinzui: what do they do? [20:57] Deploy a wordpress stack, then another tests deploys a stack on stable, then upgrades to the RC [20:57] bodie_: do you want to borrow my mongod? I can zip up the whole directory and you can just drop it on disk. So long as you're on 64 bit, it should work [20:58] thumper, I can see juju-mongodb was installed and started in th last test [20:58] well, it's working on my remote. I'd rather get it to legitimately work on trusty [20:58] ootb type thing [20:58] that way I know it's ... well. trusty [20:58] i'll just use the vps for now [20:58] bodie_: I'm not sure there is a "legitimate" way right now, if you can't actually build mongodb on trusty [20:59] well there's the packaged version and the juju packaged version [20:59] packaged version seems to not work because it's 2.7.0-pre [20:59] sinzui: was mongodb-server installed too? [20:59] * sinzui looks [20:59] quick, what does juju set-env do [21:00] because juju help is a bit ambiguous [21:00] thumper, There is no evidence of it [21:01] hi marco-traveling [21:01] hi thumper [21:01] marco-traveling: it updates the config kept in the state database [21:01] marco-traveling: that is used by everything [21:01] marco-traveling: doesn't touch anything local [21:01] thumper: cool, so it doesn't update the environmet variables for a hook env [21:01] thumper, I am confident that juju-mongodb was the only mongo installed. [21:02] gotta run, sorry. Good luck with mongo everyone [21:02] marco-traveling: no, it updates the juju environment config [21:02] sinzui: thanks [21:06] is there a preferred way for injecting environment variables such that the juju hook context can see them? I tried baking the cloud image with /etc/environment populated with http_proxy, socks_proxy, no_proxy, etc, but that didn't make its way into $charm/hooks/hooks.py [21:07] ev: not at this stage [21:07] ev: what are you after? [21:08] ev: juju is now proxy aware (for http, https, ftp and no) [21:08] ev: no socks_proxy though [21:08] thumper: no_proxy support, basically [21:08] I can't send traffic to swift through the proxy [21:09] ev: you can either put it in your environments.yaml, or for a running environment: 'juju set-env no_proxy=foo,bar' [21:09] thumper: if you had to do this, how would you? Right now I've hacked os.environ['http_proxy'] into every single charm, which is...less than ideal. [21:09] wait what?! How do I get this in my environments.yaml, and how did I miss this option [21:10] ev: 'juju set-env http_proxy=http://myproxy.com' [21:10] ev: because I'm terrible at documentation... [21:10] ev: it is very recent... [21:10] ish [21:10] * thumper goes to look at docs [21:11] ev: sorry about that... [21:12] ev: environment now has: [21:12] http-proxy, https-proxy, ftp-proxy, no-proxy, apt-http-proxy, apt-https-proxy, apt-ftp-proxy [21:12] the apt values default to the plain values (ie. apt-http-proxy defaults to http-proxy) [21:12] but can be overridden by themselves [21:13] eg. my local provider has "apt-http-proxy: http://10.0.3.1:8000" for squid deb proxy [21:13] thumper: that's excellent [21:13] thank you [21:13] np [21:13] thumper: https://codereview.appspot.com/77930043 [21:23] hmmm [21:23] getting "no reachable servers" in my store_test [21:24] anyone know what might cause that? [21:24] (i.e. is the store down?) [21:24] intermittent test failure [21:24] we all get that [21:24] really annoying [21:24] feel free to fix [21:24] will trade fix for beer [21:25] * bodie_ puts up an ultraviolet bug-zapper lamp. [21:36] apparently that worked [21:36] passed this time [21:37] I did say it was intermittent [21:38] thumper, is you scratch out some notes about proxy changes, I will write it up for the release notes and put it into the unstable docs [21:38] s/is/if/ [21:38] sinzui: ok, ta [21:47] thumper: is there any way to tell juju to not clean up the bootstrap node on failure? I constantly find myself wanting this. [21:48] so I can peek at its brain [21:48] ev: no... not that I'm aware of [21:50] rubbish [21:55] so if I'm seeing: [21:55] Installing package: git [21:55] 2014-03-19 21:54:31 ERROR juju.provider.common bootstrap.go:127 bootstrap failed: rc: 1 [21:55] Stopping instance... [21:55] do I have any recourse? [21:55] ev: which provider [21:56] openstack [21:56] does it need an apt proxy? [21:56] it has one set [21:56] hmm, maybe this is on my end [21:57] ev: can you pastebin a 'juju get-env' [21:57] let me just make sure spawning an instance normally works [21:57] thumper: that wont work without a bootstrap :) [21:58] haha, yeah [21:58] ev: environtments.yaml config values? [22:04] thumper: https://codereview.appspot.com/77960043 [22:04] * thumper looks [22:05] * wallyworld goes to make breakfast now he has fixed his regression [22:07] thumper: https://codereview.appspot.com/77970043 [22:11] thumper: it was on my end - dns was missing [22:12] ev: ok, cool [22:12] suspected something was up when apt /immediately/ returned 1, instead of hung like when it's not going via the proxy :) [22:12] woooo! "starting juju machine agent" [22:15] wallyworld: https://bugs.launchpad.net/juju-core/+bug/1285923 is that part of your branch? [22:15] <_mup_> Bug #1285923: provider/ec2: tests fail expecting a ppc64 image for precise [22:15] ev: \o/ [22:16] thumper: not that i know of, i'd need to look [22:16] could be [22:17] ok [22:17] d'you guys have an opinion on goimports? [22:18] I like being lazy, but I don't like being wrong and lazy [22:18] thumper: the arch bit is fixed. i wasn't aware of the series issue. i'll have to read that more closely to grok what the claimed issue is [22:18] wallyworld: ok [22:18] bodie_: I don't use it [22:19] not yet anyway [23:22] is it necessary to use this PPA to get the lbox tool? [23:22] ppa:gophers/go [23:22] when I try to apt-get update, it looks like it's down [23:23] can't I just go get launchpad.net/lbox ? [23:25] giving that a shot to see if it works [23:31] bodie_: it's just a command [23:31] use go get [23:32] yeah, that worked [23:32] I'm trying to use lbox now -- it's asking me to navigate to my browser, but I'm on a headless remote. is this a problem? [23:33] yes that will be a problem [23:33] you could try [23:33] like, is there any other way I can do it, or should I just rsync it to my local and do it from here? [23:33] bah.... [23:33] bzr login [23:33] you need an OAUTH key from LP [23:33] yeah [23:33] hm [23:33] bzr unknown command login [23:33] does that require cobzr? [23:34] er, no [23:34] well, can I just push my branch and then use launchpad.net to propose it? [23:36] bzr lp-propose-merge? [23:40] bodie_: try copying the file ~/.lpad_oauth [23:41] from where to where? It's not on either machine [23:42] can't I at least use bzr to push to my account on launchpad.net and then merge it with my web browser on my workstation? [23:42] er, "propose" the merge [23:46] oh LOL it lets me view the site in my terminal [23:46] excellent [23:48] w3m [23:48] sensible-browser for the win [23:49] hmmm [23:49] works til I get to here [23:50] bodie@juju-dev:~/go/src/launchpad.net/juju-core$ bzr launchpad-login [23:50] binary132 [23:50] bodie@juju-dev:~/go/src/launchpad.net/juju-core$ bzr register-branch fix-bson-references [23:50] launchpad.net password for binary132@gmail.com: : [23:50] bzr: ERROR: Invalid url supplied to transport: "https://binary132%40gmail.com:@xmlrpc.launchpad.net/bazaar/": nonnumeric port [23:50] pardon my paste [23:51] bodie_: that isn't your LP login [23:51] thats your gmail login, yes ? [23:56] well, I used bzr launchpad-login to set it as binary132, which is my lpad login [23:56] so I'm not quite sure why it's asking for "launchpad.net password for binary132@gmail.com"