[00:22] axw: menn0:thumper:wallyworld: an awesome, small review plz :D http://reviews.vapour.ws/r/5523/
[00:22] alexisb_: ping
[00:22] otp with reed can look soon
[00:22] alexisb_: we should have a quick word
[00:23] anastasiamac: looking
[00:24] menn0: \o/
[00:25] anastasiamac: well that was easy. ship it
[00:27] menn0: cmars: tyvm :) m loving to see this beta's bug count going down or fixes count going up - glass half full :D
[00:46] dog walk time
=== thumper is now known as thumper-dogwalk
[00:50] Bug #1471770 changed: TestPrunes fails occasionally still
[00:50] Bug #1580501 changed: cloudimg-base-url parameters not in Juju2 anymore <4010>
[01:10] anastasiamac: I think you might've broken something: ERROR failed to bootstrap model: model "controller" of type manual does not support instances running on "amd64"
[01:11] (I'm looking into it)
[01:11] axw: well, the only thing i can think of is that manual constraints validator does not have arches, and if there are no images for it, then nothing will be merged...
[01:12] anastasiamac: yeah, same thought
[01:12] axw: i think I'll need to put arches in there
[01:12] anastasiamac: yup
[01:12] k. i'll do it now :)
[01:12] anastasiamac: I can probably fix it in my branch
[01:12] axw: ooh even better ;)
[01:12] should be a pretty small change
[01:13] axw: i wonder if other providers will need to have similar thing.. i think manual slipped through fingers because maybe it never had arches vocab defined..?
[01:13] anastasiamac: yeah, it doesn't
[01:14] axw: \o/ tyvm. let me know if there is something i can do to assist
[01:32] wallyworld: have you noticed that "model-config" says FROM=model for logging-config and resource-tags in a fresh model?
[01:33] axw: logging-config i'd expect because juju itself sets the value via the api. resource-tags i'd need to look at, but i suspect the issue is the schema coercion to a map from a string
[01:34] bah, string to map i mean
[01:34] wallyworld: yeah, almost certainly. I was thinking we should just set the default logging-config though?
[01:34] to the same as what the agent runs
[01:35] and configure the client differently if needed
[01:35] axw: it starts out as one thing and juju sets to another (debug->info)
[01:35] or something like that
[01:36] but yeah, maybe we could do better, i recall at the time it made sense how it is
[01:36] wallyworld: *shrug* it seems odd to me that OOTB the config says it's not default
[01:37] agreed. i can't recall the specifics off hand, but juju messes with it
[01:37] we can clean up next week
[01:42] anastasiamac: would you please review https://github.com/juju/juju/pull/6083/commits/32fdee6ef69e1355480ed9dbd208ca69c97fdd0f
[01:45] axw: looks awesome :) did u get a chance to test live too?
[01:45] anastasiamac: yup, bootstraps fine after this change
[01:45] axw: \o/ LGTM for this commit :D
[01:45] anastasiamac: ta
=== thumper-dogwalk is now known as thumper
[02:09] Bug #1449210 changed: cloudsigma index file has no data for cloud
[02:09] Bug #1616197 changed: juju restore-backup error <20160826>
[02:09] Bug #1616298 changed: DebugMetricsCommandSuite.TearDownTest fails due to "no reachable servers."
[02:12] is canonical IRC down for anyone else?
[02:13] natefinch: yes
[02:13] Just back now
[02:13] oh good, then I haven't been fired yet.
[02:13] miken: not for me... still down \o/
[02:13] natefinch: unless we've all been :)
[02:14] Oh - I'm connecting from an internal IP address, and it just reconnected to irc.c.c 2mins ago.
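The bootstrap failure above boils down to the manual provider's constraints validator registering no architecture vocabulary, so once image metadata stops supplying arches the merged set is empty and nothing is "supported". A toy sketch of that mechanism — illustrative types only, not juju's actual constraints package:

```go
package main

import "fmt"

// validator maps a constraint attribute (e.g. "arch") to its allowed values.
type validator struct {
	vocab map[string][]string
}

func (v *validator) RegisterVocabulary(attr string, values []string) {
	v.vocab[attr] = values
}

func (v *validator) Validate(attr, value string) error {
	allowed, ok := v.vocab[attr]
	if !ok {
		// No vocabulary registered and no image metadata to merge in:
		// the effective set of supported values is empty.
		return fmt.Errorf("no %s vocabulary registered: %q cannot be supported", attr, value)
	}
	for _, a := range allowed {
		if a == value {
			return nil
		}
	}
	return fmt.Errorf("%q not among supported %s values %v", value, attr, allowed)
}

func main() {
	v := &validator{vocab: make(map[string][]string)}
	fmt.Println(v.Validate("arch", "amd64")) // fails, like manual before the fix
	v.RegisterVocabulary("arch", []string{"amd64", "arm64", "ppc64el", "s390x"})
	fmt.Println(v.Validate("arch", "amd64")) // nil once arches are registered
}
```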
[02:25] thumper: you'll need to resubmit your race fix pr i think, after adding "Build failed:" comment to trick the bot
[02:25] wallyworld: ack
[02:27] axw: got some time to talk manual providers?
[02:31] thumper: sure
[02:31] axw: https://hangouts.google.com/hangouts/_/canonical.com/manual?authuser=0
[02:42] * anastasiamac about to lose electricity - going afk for lunch and fun
[02:43] thumper: you guys talking about that bug I was looking at?
[02:45] natefinch: this one? https://bugs.launchpad.net/juju-core/1.25/+bug/1610880
[02:45] Bug #1610880: Downloading container templates fails in manual environment
[02:45] anastasiamac: yeah
[02:45] anastasiamac: was going to ask you if you had any thoughts about that one
[02:46] natefinch: looks like the fix needs to go into 1.25 not master
[02:46] natefinch: no thoughts whatsoever - hence, m happy to go with advice from wallyworld to mark it as Invalid. I do wish there was a bit more explanation as to why it is Invalid...
[02:47] anastasiamac: well, the customer who is experiencing it is on 1.25, yeah. I don't know if it happens in 2.0, honestly
[02:47] it has to be invalid as it's lxc only
[02:47] natefinch: k :) do u have enough context to fix?
[02:47] oh yeah, dug
[02:47] 2.0 uses a totally different mechanism
[02:47] duh
[02:47] it's not invalid for 1.25
[02:47] but it is invalid for 2.0
[02:47] correct
[02:47] yes.
[02:48] all it did was remove the targeted juju project
[02:48] left it targeted to juju-core
[02:48] Bug #1610880 changed: Downloading container templates fails in manual environment
[02:48] you can tell it's only 1.25 from reading the logs
[02:49] yes, of course. I hadn't really thought that part through :)
[02:50] i've removed juju-core and left for 1.25
[02:50] I added a note as to why it's not applicable to 2.0 :)
[02:50] Hi, is there a way to tear down a model from the gui?
[02:50] generally if the bug is in 1.25, we also keep it to be fixed in 2.0
[02:50] wallyworld: updates in http://reviews.vapour.ws/r/5510/
[02:50] however, in this case, lxc is not in 2.0; so bug is invalid
[02:50] looking
[02:50] from command line I can go 'juju destroy-model blah'
[02:50] natefinch: thnx :D
[02:50] anastasiamac: correct
[02:51] wallyworld: m glad that u r agreeing \o/
[02:51] that it is 1.25 only? yes :-)
[02:52] thumper: it's possible that the "should be terminated" is coming from the pkill issued by DestroyController, and the SIGABRT stack trace is due to the killall run by the CI script
[02:52] wallyworld: and to a new world order, no? :D
[02:52] axw: hmm... I'll poke around
[02:52] thumper: so maybe it's just a case of the agent not shutting down fast enough
[02:52] that too, peace in the middle east and all that
[02:54] axw: actually... you may well be right
[02:55] perhaps we need to make the destroy controller method on the manual provider wait until the process has been removed
[02:56] axw: I think this is the most likely case, and what I'll try for first
[02:56] thumper: maybe, could be that the agent is wedged though? we wouldn't normally care, because the controller machine gets destroyed in cloud environments
[02:56] the agent will hang around for a while to answer current api calls
[02:56] but no, don't think it is fully wedged.
[02:56] but I'll look too
[02:57] still think it is worthwhile waiting
[02:57] thumper: hmmk. seems reasonable to wait, yeah
[03:06] redir: really close, got a fix it then ship it, let me know if anything is unclear
[03:12] wallyworld: k. tc
[03:12] tx even
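The wait that thumper and axw converge on above (make the manual provider's DestroyController block until the agent process is really gone) could look roughly like this — a sketch of the polling idea only, shelling out to pgrep purely for illustration, not the actual provider code:

```go
package main

import (
	"fmt"
	"os/exec"
	"time"
)

// waitForProcessGone polls until no process matches name, or the timeout
// expires. pgrep exits non-zero when it finds nothing, which we treat as
// "the agent has shut down".
func waitForProcessGone(name string, timeout time.Duration) error {
	deadline := time.Now().Add(timeout)
	for time.Now().Before(deadline) {
		if err := exec.Command("pgrep", "-f", name).Run(); err != nil {
			return nil
		}
		time.Sleep(time.Second)
	}
	return fmt.Errorf("%q still running after %v", name, timeout)
}

func main() {
	fmt.Println(waitForProcessGone("jujud", 30*time.Second))
}
```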
[03:14] wallyworld: hmm...
[03:14] things that make you go
[03:14] wallyworld: trying to bootstrap a manual provider in lxd
[03:15] ERROR failed to bootstrap model: model "controller" of type manual does not support instances running on "amd64"
[03:15] wat?
[03:15] thumper: damn, that's fallout from anastasia's changes for fixing simplestreams issues, you running from tip?
[03:16] yep
[03:16] ok, will need to be fixed
[03:16] trying to look at the manual provider leaving stuff behind
[03:16] but can't bootstrap
[03:16] I don't mind fixing if I can be told what needs fixing
[03:16] you can comment out the error
[03:17] huh?
[03:17] it's a bit complicated, image metadata has been reworked, so different rules to figure out what can be bootstrapped
[03:17] comment out the error return
[03:17] ie don't do the check - that will get you going so you can do your fix
[03:18] where is that?
[03:19] don't commit that change of course
[03:19] validateUploadAllowed
[03:19] environs/bootstrap/tools.go
[03:19] ack
[03:20] wallyworld: that'll be the s390x manual bootstrap bug too
[03:20] ah, yes, could be
[03:21] i'll talk to her when she's back online
[03:24] So Postgres 9.6 tl;dr notes: parallel query scans, joins, and aggregates, incremental vacuum freeze, synchronous replication with multiple standbys, 10 will start a new version scheme (think firefox/chrome)
[03:24] other than that mostly lots of talk about the uber paper.
[03:25] redir: oh?
[03:26] uber: PostgreSQL -> Some schemaless object store thing on MySQL
[03:26] Apparently this has caused some hubbub https://eng.uber.com/mysql-migration/
[03:27] thumper: live from the East Bay Postgres meetup at Pandora...
[03:29] Time to go be social at the pub, getting kicked soon.
[03:39] wallyworld: http://reviews.vapour.ws/r/5530/ when you have a chance later. I haven't yet updated what we talked about earlier.
[03:40] wallyworld: also ignore the bits from the other PR, this is stacked on that so it has those issues too.
[03:40] later juju-dev
[03:40] ty
[03:41] axw: in my local lxd testing, it took two to three seconds from the time kill-controller had exited to the time the jujud agent stopped running
[03:43] thumper: sounds about right. in the CI failure it's still running 10 minutes later...
[03:44] 10 minutes?
[03:44] I thought it was much sooner than that
[03:44] * thumper double checks
[03:44] thumper: terminationworker says to terminate at 3:41, then the SIGABRT stack trace comes at about 3:50
[03:44] 3:51 actually, I guess the CI script is waiting 10 minutes
[03:45] anastasiamac: did you see there's fallout with the arch / image stuff and manual provider with lxd?
[03:46] wallyworld: the one that axw and i discussed and he has fixed (and i lgtm-ed) on his branch?
[03:46] wallyworld: I've got a fix for manual in my branch, can't land because master is blocked
[03:46] axw: i'd say make it a blocker and use $$fixes$$
[03:46] axw: i think u can land ur branch
[03:46] wallyworld: is there a bug #?
[03:46] axw: if u do not have a bug, jfdi
[03:46] anastasiamac: thanks, didn't see there's a fix, thumper ran into it before
[03:46] axw: no... wasn't 10 minutes
[03:47] ok
[03:47] was almost immediate
[03:47] AFAICT
[03:48] thumper: http://paste.ubuntu.com/23087267/
[03:49] from http://reports.vapour.ws/releases/4301/job/manual-deploy-precise-amd64/attempt/4018
[03:49] on attempt 4017 it was more immediate
[03:51] axw: that timing doesn't match the log outputs at all from 4018
[03:51] kill was here: 02:34:05
[03:51] thumper: I think the logs are appended to
[03:51] thumper: I'm probably looking at something old
[03:51] that test log output is now in local time
[03:52] but still
[03:52] thumper: I was looking from the top of the log, it looks like there's multiple test runs in the same log file
[03:52] searching from the back, I concur that it's immediate
[03:53] ok... good :)
[03:54] thumper: though there *is* a very slow one at the top of the log, so it's not consistent
[03:55] one problem at a time :)
[04:09] wallyworld: in your lxd PR, there's another target var lower down. I didn't realise that it exited early if the alias exists - maybe the check is still needed? does CopyImage return immediately if the image is already there?
[04:10] axw: in my testing, i deleted all lxd images. bootstrap the first time downloaded the image (slowly, with progress shown). then another bootstrap did not
[04:11] and lxc image list shows the one image
[04:11] wallyworld: but that might be because there's still another call to GetAlias
[04:11] the instance started immediately though
[04:12] so it's using whatever it cached the first time
[04:12] i can't see any obvious difference in behaviour
[04:12] wallyworld: no, I'm just saying there's still another call to GetAlias that looks like it should be removed. but I'm not sure of the impact.
[04:13] oh, i misunderstood you. i saw that call too but didn't follow what it did so left it
[04:25] i've seen a few cpu/mem spike related bugs... if i were a memory leak in juju, where would i be? :D
[05:06] axw, menn0: http://reviews.vapour.ws/r/5532/
[05:10] thumper: LGTM, thanks
[05:24] thumper: double ship it :)
[05:29] axw: i've got a few very small reviews up if you get a chance later. one is the lxd one which seems ok to me given it behaves as expected when testing
[05:30] wallyworld: sure, just finishing up QA for my add-model changes
[05:30] no worries
[05:36] wallyworld: add-model changes: http://reviews.vapour.ws/r/5534/
[05:36] looking
[05:42] wallyworld: I'm QAing your lxd branch, and bootstrap is fetching images that I have again. possibly due to that code removal
[05:42] hmmm, it didn't fetch mine again
[05:42] but i started from a clean slate
[05:42] what are your aliases?
[05:42] ubuntu-xenial etc?
[05:43] wallyworld: yep
[05:43] I have ubuntu-xenial
[05:43] hmmm, ok, i'll bootstrap again and see what it does
[05:43] nfi why it doesn't work for you
[05:43] wallyworld: yep, I put the code back in and it doesn't do it now
[05:44] wallyworld: possibly once it has the image again, it wouldn't copy again
[05:44] yeah, that's what i was thinking
[05:44] there might be some implicit alias or something
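For reference, the two LXD behaviours being compared in this thread, with GetAlias/CopyImage as stand-in names following the chat rather than the real lxd client library API:

```go
// Package imagecache sketches the old and new LXD image handling debated above.
package imagecache

// imageClient is a stand-in for the LXD client wrapper under discussion.
type imageClient interface {
	GetAlias(alias string) (fingerprint string, ok bool)
	CopyImage(source, alias string) error
}

// ensureImageOld is the removed behaviour: if a local image already
// carries the alias, trust it and never refresh - so stale images are
// kept forever.
func ensureImageOld(c imageClient, alias string) error {
	if _, ok := c.GetAlias(alias); ok {
		return nil
	}
	return c.CopyImage("https://cloud-images.ubuntu.com/releases", alias)
}

// ensureImageNew always delegates to CopyImage, which no-ops when the
// local image still matches the source and re-fetches when it doesn't -
// the auto-update behaviour wanted here, at the cost of one forced
// refresh for users whose existing alias no longer matches.
func ensureImageNew(c imageClient, alias string) error {
	return c.CopyImage("https://cloud-images.ubuntu.com/releases", alias)
}
```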
[05:45] which i think is ok behaviour - so long as it only fetches once. stephane was adamant we should be doing it this way or else auto update would not work
[05:50] wallyworld: it's probably fine, just doing one last test to satisfy myself
[05:50] sounds good, best to be sure
[05:50] i'm testing again too
[05:50] but download is sloooooooooow
[05:59] wallyworld: so, I think the issue is that the local alias I had did not match the image that was in the source
[05:59] wallyworld: so it replaced it
[05:59] yeah, whereas before maybe we were setting the alias name
[05:59] wallyworld: if you were to put that GetAlias code back in, people could continue using their existing images... but I guess they wouldn't auto-update
[05:59] that's my understanding
[05:59] and we want auto update
[06:00] wallyworld: ok. seems fine, maybe just add a release note that it will force an image refresh on everyone?
[06:00] sure
[06:00] axw: in the add-model / cloud branch - i've just started looking - do we reject add-model cloud where the controller doesn't support the cloud asked for?
[06:01] wallyworld: it will complain that "foo" is not a cloud or a region
[06:01] wallyworld: because you can't add clouds to a controller, the only cloud it'll find is the one that was bootstrapped
[06:01] wallyworld: I did test that actually, just didn't add in the QA steps
[06:01] will do that now
[06:01] ta, that would be good as i was wondering
[06:02] wallyworld: updated steps under LXD
[06:02] great ty
[06:02] axw: and you can +1 the lxd pr?
[06:03] wallyworld: sorry yes
[06:03] done
[06:03] not sure if i should land before beta
[06:04] might be good to get auto update fixed
[06:06] axw: "is neither a cloud nor a region". i don't like that message because aws is a cloud. it's just not supported by the current controller. so people will get confused by the message i think?
[06:07] wallyworld: well, it's not a cloud so far as the controller is concerned
[06:07] wallyworld: I agree it's a sucky message
[06:07] wallyworld: I guess we could look in the client's list of clouds first?
[06:07] could we rephrase to say that this controller doesn't support models on cloud "aws", only clouds "lxd" are supported
[06:08] yeah, look at client clouds, and if it is a valid cloud name, be smart about the message
[06:08] "... are supported by this controller"
[06:08] or something
[06:09] wallyworld: we don't have an API to list clouds yet. I suppose I could add it
[06:09] axw: that's one of the things martin asked for
[06:09] so it won't go to waste
[06:09] and we have the cloud facade
[06:09] wallyworld: yeah, was trying to keep this minimal. shouldn't take too long tho
[06:10] understood
[06:10] but the message sucks :-)
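One possible phrasing of the friendlier rejection wallyworld is asking for, assuming the client can fetch the controller's cloud list first; the helper is hypothetical, not juju's code:

```go
// Package clouderr sketches the improved add-model error message.
package clouderr

import (
	"fmt"
	"strings"
)

// unsupportedCloudError names the rejected cloud and lists what the
// controller actually supports, instead of "is neither a cloud nor a region".
func unsupportedCloudError(name string, supported []string) error {
	return fmt.Errorf(
		"controller does not support models on cloud %q; supported clouds: %s",
		name, strings.Join(supported, ", "))
}
```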
[06:22] axw: gotta duck out to do school pickup, but one last question - on the apiserver side where an unsupported cloud is passed in - it returns an annotated not found error but i think we can do better with the error message there also
[06:22] "such and such cloud is not supported, try one of these instead" type thing
[06:22] wallyworld: where's that?
[06:22] wallyworld: "getting cloud definition" ?
[06:23] yeah
[06:23] didn't matter before but now that we are allowing people to specify the cloud themselves
[06:23] need to tighten it up IMO
[06:23] wallyworld: isn't that redundant if we have the client query the supported clouds?
[06:24] that's in add-model, what about via the api
[06:24] python juju client, controller proxy etc
[06:25] hmm I guess so
[06:25] auto pilot, conjure up etc - they all use the api
[06:25] and in conjure up, someone could easily specify an unsupported cloud
[06:26] gotta run, bbiab, got to update release notes at some point
[06:45] axw: ping - larry shared his vsphere setup which i think you've used recently. Did you have any problems bootstrapping? For me it doesn't complete using beta15.
[06:46] frobware: hey. I didn't get past authentication. I think the issue I was seeing was that the client downloads the cloud image and then uploads it to vsphere. I'm quite far away, so that was so slow it timed out
[06:46] frobware: sorry I mean, it authenticated but didn't get any further (functionally) than that
[06:47] axw: I get as far as... https://pastebin.canonical.com/163942/
[06:48] axw: lines 75 & 76 repeat until timeout
[06:48] axw: I have never bootstrapped on vsphere before so could be operator error too
[06:48] frobware: ah, well you got further than me :p sorry, I don't know what's up with it. I've never used vsphere before that one time
[06:48] and I was just verifying that my auth changes were good
[06:51] axw: the only addition I made to the cloud definition was adding to clouds.yaml: vsphere: regions: dc0 {}
[06:51] axw: which was largely done based on a bug comment I think you made... somewhere... :)
[06:52] frobware: if the issue was with clouds.yaml, it would have failed much earlier
[06:52] I don't think it's user error
[06:52] more likely the provider or vsphere is broken
[06:53] axw: which it did. could not bootstrap because 'datacenter' was undef
[06:53] axw: I'll try going back to beta8 as that's where the bug was reported, but largely to see if bootstrap has regressed since.
[06:53] frobware: oh I see what you mean. yeah, larry's original clouds.yaml was broken
[06:53] oh
[06:54] frobware: this is what I've got: https://pastebin.canonical.com/163945/
[06:54] axw: you mean it was broken and needed the regions bit?
[06:55] yep
[06:55] frobware: well, and he was trying to use non-standard keys. that one I linked is in the valid format
[06:56] axw: this is what I'm currently using: https://pastebin.canonical.com/163946/
[06:56] frobware: yep that's fine
[06:56] auth-types is unnecessary but won't cause a problem
[06:57] wallyworld: how's this? https://pastebin.canonical.com/163947/
[06:57] looking
[06:58] axw: yay, much nicer, thank you
[06:58] wallyworld: cool. just gotta write some tests, and improve error messages on the server side now
[06:58] sgtm
[07:01] axw: you could potentially make the add-model cmd dumb and not do any checks and allow them all to be done on the server side
[07:02] since you need to make an api call to list clouds anyway
[07:02] you could avoid that call
[07:02] and just make the create model call
[07:02] wallyworld: thought about it, but that makes the cloud/region unstructured which I'm not too keen on
[07:02] you could still split on /
[07:02] wallyworld: this way we may also support auto-upload of cloud definition, if we want to do that
[07:03] wallyworld: sure, but you still don't know if it's cloud or region if there is no /
[07:03] true
[07:03] ok, ignore me, just thinking out loud
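The split-on-"/" parsing being debated, and why it can't fully resolve the ambiguity client-side — a minimal sketch:

```go
package main

import (
	"fmt"
	"strings"
)

// parseCloudRegion splits an add-model style argument on "/". The point
// made above: a bare name is ambiguous - it may be a cloud or a region
// of the controller's cloud - so the client still has to ask the
// controller which clouds it knows about.
func parseCloudRegion(arg string) (cloud, region string) {
	if i := strings.Index(arg, "/"); i >= 0 {
		return arg[:i], arg[i+1:]
	}
	return arg, "" // cloud name or bare region: can't tell locally
}

func main() {
	fmt.Println(parseCloudRegion("aws/us-east-1")) // aws us-east-1
	fmt.Println(parseCloudRegion("aws"))           // aws "" - ambiguous
}
```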
[07:17] wallyworld: hey
[07:17] wallyworld: still hanging around?
[07:18] maybe
[07:19] thumper: what's up?
[07:19] pretty sure bug 1615839 is that bit you got me to comment out
[07:19] Bug #1615839: Manual-provider claims s390x is not supported
[07:19] is anastasiamac on that?
[07:20] or shall I take a look?
[07:20] might take me longer
[07:20] thumper: i think axw landed a driveby
[07:20] ot has one in train
[07:20] or
[07:20] but I could muddle through it
[07:20] all good, we broke it, we fix it
[07:20] thanks for offering
[07:20] who shall I assign the card and bug to?
[07:21] check that axw is/has done it, otherwise to anastasia
[07:21] axw: have you fixed it?
[07:22] is hudson back?
[07:24] redir: late for you, go to bed :-)
[07:24] yeah just got home and eating something, then bed
[07:25] who knew postgres folks were such talkers :)
[07:35] thumper: sorry was on school run, it should be fixed by my latest merge, have marked Fixed Committed
[07:35] axw: ok, cool
[07:35] what was the fix by the way?
[07:57] wallyworld: updated my PR, PTAL
[07:57] looking
[08:03] axw: looks great, ty
[08:05] axw: when it lands, let urulama and mhilton know as they've started to need the Clouds() API and are assuming a return of []string whereas we are offering a map of cloud details
[08:06] wallyworld: sure
[08:07] wallyworld: gonna have to get a second review, this is >500
[08:07] I'll point martin at it, maybe he'll be willing :)
[08:07] hmmm
[08:07] stupid rule
[08:11] wallyworld: well, the rule would not bite if the PRs are manageable
[08:12] >500 is not manageable for any reviewer
[08:12] disagree
[08:12] depends on the type of change
[08:12] of course u do
[08:12] and who's reviewing
[08:12] we had a much larger limit in launchpad
[08:12] 800
[08:12] 500 is too small
[08:12] no, usually only the dev knows what they wrote for a PR >500
[08:13] not just dev
[08:13] i know what's in that pr and i didn't write it
[08:13] u r very specail
[08:13] special*
[08:15] axw: off to make dinner, updated the pr, thanks for reviewing
[08:15] wallyworld: will look in a sec. I'm reviewing your show-user one now
[08:17] * frobware is back in ~1 hour
[08:27] wallyworld, axw: what have you done to my API design!
[08:36] mhilton: we needed more than just the cloud names :-) you get the names as the map keys
[08:37] and also you have allowable regions etc which are really useful for the gui
[08:38] wallyworld: It's fine, I'm curious. We were getting the regions from the Cloud() endpoint. but doing it all in one go is probably better.
[08:39] mhilton: yeah, we think so, one call to get all the info you need
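The Clouds() shape that surprised mhilton, approximately; field names here are guesses for illustration, not the exact apiserver/params structs:

```go
// Package cloudapi sketches the map-of-cloud-details payload described above.
package cloudapi

type Region struct {
	Name     string `json:"name"`
	Endpoint string `json:"endpoint,omitempty"`
}

type Cloud struct {
	Type      string   `json:"type"`
	AuthTypes []string `json:"auth-types,omitempty"`
	Regions   []Region `json:"regions,omitempty"`
}

// CloudsResult gives callers the cloud names (the map keys) plus regions
// and auth types in a single call, instead of a bare []string of names
// followed by one Cloud() call per name.
type CloudsResult struct {
	Clouds map[string]Cloud `json:"clouds"`
}
```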
[08:50] rogpeppe1: thanks for review, did you just want to check my answer to your question in the review http://reviews.vapour.ws/r/5533/
[09:09] wallyworld: i've just published a review of http://reviews.vapour.ws/r/5533/
[09:09] ta
[09:09] wallyworld: weird, i didn't think i'd published anything until now...
[09:10] wallyworld: i think you were maybe talking about axw's question
[09:10] oh dear, i was
[09:10] wallyworld: i was wondering about external user access too, although i forgot to mention it in my review
[09:11] this review isn't about any of that
[09:11] wallyworld: i think if we left the access field out entirely, things would become more obvious
[09:11] it's all already been done
[09:11] the access field is also pre existing so i don't really want to move it
[09:11] this review is just about properly looking up external user access
[09:11] wallyworld: the problem is that it *looks* as if the access field tells you what access rights a given user has
[09:12] it does
[09:12] wallyworld: but actually it doesn't tell you that
[09:12] why?
[09:12] it tells you what access that user has to the controller
[09:12] wallyworld: because if access has been granted to everyone, everyone will have at least those rights
[09:13] wallyworld: and when we implement general group checking, that issue will become still worse
[09:13] oh, i see what you're saying. yes right now it just tells you what that user has specifically been granted
[09:13] groups will require a whole lot of change
[09:13] wallyworld: yup. i think that's misleading, and we'd be better off fixing things now.
[09:13] wallyworld: i.e. remove the Access field
[09:13] wallyworld: because that's the only problematic part
[09:13] wallyworld: otherwise it's just about looking up information about a specific user
[09:14] why can't we look up the access transitively and just fill in the access bit
[09:14] we need to give the access value back to the caller
[09:14] wallyworld: why does it need to be in the same API call?
[09:15] why not? for distributed systems you aim to minimise the api calls
[09:15] fewer bulk calls is the design goal
[09:15] wallyworld: that's an optimisation - i generally prefer to start by being as clear as possible and optimise later
[09:16] we disagree there, i remember this discussion when juju's api was first being designed
[09:16] wallyworld: so how many bulk calls are actually being used as bulk calls now? :)
[09:16] mhilton: heh sorry :p but yeah, one query lets us get all the stuff we want in one go, rather than getting names and then calling Cloud a bunch of times
[09:16] wallyworld: POitRoAE... still!
[09:16] i can't answer that - i don't know what api clients people have written
[09:17] wallyworld: juju is the only api client for all the agent stuff
[09:17] i can imagine landscape etc would use bulk calls
[09:17] and the gui certainly *should*
[09:17] wallyworld: and bulk calls are used approximately zero times
[09:18] then they're doing it wrong if that's the case
[09:18] wallyworld: no. mostly you do only have one thing to do at a time
[09:18] wallyworld: and this isn't HTTP
[09:18] more's the pity
[09:18] it should be restful
[09:18] but that's another discussion
[09:19] wallyworld: HTTP1 is bad because the calls are expensive and cannot return replies out of order
[09:19] wallyworld: the RPC API doesn't have that limitation
[09:19] wallyworld: the overhead of a call is small
[09:19] sure, but HTTP1 is so last century
[09:19] wallyworld: we still only use HTTP1
[09:19] why is that out of curiosity?
[09:20] besides that Go is stuck in the 70s :-)
[09:20] wallyworld: for all the bulk calls we have, making several calls concurrently is faster than actually using the bulk call as a bulk call
[09:20] wallyworld: because we make our own http transport
[09:20] wallyworld: otherwise we'd get HTTP2 out of the box
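For context, the bulk-call convention this whole debate is about, in the spirit of juju's apiserver/params types (simplified — for instance, the real error field is a struct, not a string):

```go
// Package params sketches juju's bulk-call request/response convention.
package params

type Entity struct {
	Tag string `json:"tag"`
}

// Entities is the standard bulk request: a slice, even when a caller
// only has one thing to do.
type Entities struct {
	Entities []Entity `json:"entities"`
}

type ErrorResult struct {
	Error *string `json:"error,omitempty"` // nil means success
}

// ErrorResults holds one result per input entity, in order; a singular
// caller wraps its one tag in Entities and reads Results[0].
type ErrorResults struct {
	Results []ErrorResult `json:"results"`
}
```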
[09:21] hmmm, i'd like to see the numbers. may be fast for some things, but chatty api calls are evil for a distributed system
[09:21] that's why we should just stick to standards instead of rolling our own
[09:21] wallyworld: tell that to google, amazon, heroku, etc etc
[09:21] wallyworld: they all use RPC systems
[09:21] i've seen way too many distributed systems fall over due to inefficient apis
[09:21] wallyworld: and none of them have a "bulk calls only" policy
[09:21] wallyworld: HTTP APIs, right?
[09:22] rpc
[09:22] wallyworld: if a system is inefficient, optimise it
[09:22] wallyworld: it's not a hard thing to do
[09:22] easier said than done once the apis are set
[09:22] wallyworld: that's why we have versioning
[09:22] you can use a bulk call singularly but not the other way around
[09:23] and versioning is horrible for us to try and use
[09:23] each time we rev a facade version it introduces a world of hurt
[09:24] anyway, i need to change the access look up to take account of the everyone group
[09:24] regardless of the api design
[09:24] wallyworld: you can make many singular calls concurrently
[09:24] wallyworld: there really is very little overhead in doing so
[09:24] at the cost of many network resources
[09:24] wallyworld: no
[09:24] wallyworld: at the cost of *some* extra bandwidth
[09:24] wallyworld: but much less than you'd think
[09:25] do we really have bandwidth issues?
[09:25] frobware: not AFAIK
[09:25] bandwidth is a finite resource
[09:25] wallyworld: so are developer resources
[09:25] try living in australia
[09:26] the point being?
[09:26] where there's latency, bulk calls are much better
[09:26] wallyworld: the point being that we've expended 1000s of hours of extra effort making every call "bulk" and we never use that capability
[09:26] wallyworld: actually no
[09:26] what extra effort?
[09:27] we designed the api once
[09:27] and we do use it? how do you know we don't?
[09:27] have you audited every external juju api client?
[09:27] wallyworld: because most of the entry points in the api package don't even expose the bulk functionality
[09:27] wallyworld: i'm talking about the agent API here
[09:28] wallyworld: because that's easily checked
[09:28] the juju api layer only exposes singular calls, but python juju client, conjure up, etc etc don't use that
[09:29] wallyworld: testing and implementing a bulk API call is probably 5 times more effort than a single one
[09:29] wot?
[09:29] i don't agree with that
[09:30] i don't find it any different
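rogpeppe's "many singular calls concurrently" pattern, sketched with plain goroutines — replies are consumed as they arrive, so total wall time is about one round trip rather than N:

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// fetch stands in for one singular API call.
func fetch(id int) string {
	time.Sleep(50 * time.Millisecond) // simulated round trip
	return fmt.Sprintf("result-%d", id)
}

func main() {
	var wg sync.WaitGroup
	results := make(chan string)
	for i := 0; i < 5; i++ {
		wg.Add(1)
		go func(id int) {
			defer wg.Done()
			results <- fetch(id)
		}(i)
	}
	go func() { wg.Wait(); close(results) }()
	for r := range results {
		fmt.Println(r) // handled as each reply lands, possibly out of order
	}
}
```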
[09:30] wallyworld: there are lots more edge cases to be tested
[09:30] wallyworld: zero, one, many, all error, some errors, etc
[09:30] that is true, but once you get standard patterns in place, it falls out pretty easily
[09:31] wallyworld: BTW when there's latency, concurrent calls are better because you can get replies out of order and start dealing with them sooner, rather than waiting for all replies at once
[09:31] i wonder why there's such stark difference of opinion here
[09:31] wallyworld: you mean you copy and paste
[09:31] no, we have helper structs also
[09:31] like our errors
[09:32] % find api apiserver -name '*.go' | xargs cat | wc
[09:32] 125883 386115 3838814
[09:33] wallyworld: in a very few cases, you can have helper structs. but if you're returning actual results, you can't use 'em
[09:33] wallyworld: and you still need to have all those test cases
[09:33] sure
[09:33] not hard though
[09:33] wallyworld: it all adds up
[09:33] wallyworld: our api code is *huge*
[09:33] wallyworld: and it's mostly noise
[09:33] lol
[09:34] Go code is mostly noise :-P
[09:34] so much boilerplate and copy and paste
[09:34] wallyworld: you're writing it wrong then
[09:34] due to not having generics etc etc
[09:34] so all those sort functions are wrong
[09:34] different ones for int vs string etc
[09:34] wallyworld: very little of what we're doing in the API could be made better with generics
[09:35] no, i was making a point about the fact that you criticised our api for cut and paste when the language itself is just as bad :-)
[09:35] wallyworld: i honestly don't see that much copy and paste in decent Go code
[09:36] we have so many cut and paste functions for "is this string in this slice"
[09:36] etc
[09:36] wallyworld: even implementing sort only involves copying and pasting two lines
[09:36] and each time i have to do it i die a little inside
[09:36] no other language i've used makes you do that
[09:36] wallyworld: i guess you've never used C then
[09:37] not for years
[09:37] luckily
[09:37] bbiab, SIGWIFE
[09:55] Bug #1616832 opened: manual environment juju-db timeout
[10:57] * fwereade bbl
[11:04] dimitern: ping - I tried your patch but it didn't work for me.
[11:05] frobware: oh, what was wrong?
[11:05] dimitern: just trying to repro again to ensure it's all true...
[11:06] frobware: ok
[11:06] dimitern: but it was essentially the same problem that ivoks ran into initially
[11:07] frobware: DNS hostname (resolved) != PUBLIC ADDRESS in status?
[11:09] dimitern: double-checking. have too many pots on the go.
[11:31] Bug #1616832 changed: manual environment juju-db timeout
[11:55] juju server certs need unique serial numbers, it seems. this PR adds them. small PR, review appreciated :) https://github.com/juju/juju/pull/6100
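What "unique serial numbers" amounts to in practice: draw a random 128-bit serial per certificate instead of the zero value a bare new(big.Int) yields. A sketch of the standard approach with crypto/rand, not the PR's exact code:

```go
package main

import (
	"crypto/rand"
	"fmt"
	"math/big"
)

// newSerialNumber returns a uniformly random serial in [0, 2^128),
// the usual way to give each x509 certificate a unique serial.
func newSerialNumber() (*big.Int, error) {
	limit := new(big.Int).Lsh(big.NewInt(1), 128) // 2^128
	serial, err := rand.Int(rand.Reader, limit)
	if err != nil {
		return nil, fmt.Errorf("generating serial number: %v", err)
	}
	return serial, nil
}

func main() {
	serial, err := newSerialNumber()
	if err != nil {
		panic(err)
	}
	fmt.Println(serial)
}
```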
[12:14] please could someone give this small PR a review? (i need review from someone in -core), as it's blocking us right now: http://reviews.vapour.ws/r/5538/
[12:18] dimitern, frobware: ^
[12:18] rogpeppe1: looking
[12:18] frobware: thanks
[12:20] rogpeppe1: can you please list some QA steps
[12:20] frobware: ah, ok, sure
[12:21] frobware: done
[12:22] rogpeppe1: without your change you cannot connect?
[12:23] frobware: no, there's no externally visible behaviour change
[12:23] frobware: except, i guess, that if you use a browser to connect, it can do so
[12:23] frobware: hmm, maybe the QA steps could specify that i guess
=== plars_ is now known as plars
[12:24] rogpeppe1: right - I wanted to look at before and after
[12:24] frobware: let me check how they've been doing it - it involves creating a new CA key, adding its cert to your browser, and making a websocket connection to the API
[12:25] frobware: actually, even that won't quite check it. i think you need to do that with two controllers.
[12:27] frobware: oh yes, the controllers need to be bootstrapped using the new CA key
=== mup_ is now known as mup
[12:42] frobware, http://reviews.vapour.ws/r/5539/ is up if you have a moment :)
[12:42] and now I'm going to have some lunch, ping me if you need me and I'll catch up when I'm back
[12:43] fwereade: ack
[12:45] anastasiamac: when you deleted the juju-core task, but left a 1.25 task on bug 1616832, you removed the bug from search. All bugs that affect a project series must also affect the project.
[12:45] Bug #1616832: manual environment juju-db timeout
[12:49] Bug #1616832 opened: manual environment juju-db timeout
[12:50] frobware: i replied to your question
[12:51] rogpeppe1: ok, will take a look. looking at the other review atm
[12:51] frobware: as we're the ones affected, perhaps we should do the QA
[12:52] rogpeppe1: I think so.
[12:52] frobware: ok, cool. in that case your LGTM would be much appreciated.
[13:13] rogpeppe1: I dropped another question on the review
[13:16] frobware: i don't get which line 170 you're talking about
[13:17] the only new(big.Int) we're now using is on cert.go:133
[13:17] frobware: is that the one you're referring to?
[13:18] frobware: otherwise i'm seeing cert_test.go:170 is expiry, err := time.Parse("2006-01-02 15:04:05.999999999 -0700 MST", "2012-11-28 15:53:57 +0100 CET")
[13:18] frobware: and cert.go:170 is if !ok {
[13:19] rogpeppe1: ah, sorry. line 170 was in the old file.
[13:20] frobware: so we're not using new(big.Int) there any more
[13:20] rogpeppe1: dropped
[13:20] frobware: ok, ta
[13:25] dimitern, fwereade: Could you look at this? http://reviews.vapour.ws/r/5540/
[13:26] babbageclunk: looking
[13:26] babbageclunk: looking (I'm OCR)
[13:26] It's the machine undertaker worker with tests. (Also I worked out how to get it onto RB, since the bot wasn't helping!)
[13:26] dimitern, frobware: Thanks!
[13:27] babbageclunk: one quick question, why did the pattern of s.waitRemoved() & s.waitForRemovalMark() calls appear to now be the other way around?
[13:28] frobware: I've changed the provisioner not to remove machines anymore, so the tests can't wait for the machine to be removed.
[13:29] frobware: Instead they wait for it to be marked for removal.
[13:31] frobware: I'm not sure if that was quite what you were asking.
[13:32] babbageclunk: yes
[13:32] babbageclunk: thx
[13:33] frobware: cool cool. I saw something weird while testing manually, just seeing if I can reproduce it.
[13:34] frobware, dimitern: MAAS was giving the error "node with this hostname already exists" if I tried to create containers on two hosts at the same time.
[13:35] babbageclunk: but goes away if done in series?
[13:36] frobware: That was what it seemed like - trying to reproduce it now.
[13:36] frobware: (Most importantly, trying to reproduce it on upstream/master)
[14:01] dimitern: frobware: standup time
[14:02] katco: omw
[14:02] frobware: Hmm, can't reproduce it on master or my branch now.
[14:02] omw
[14:10] Hi. I have a question. While selecting components from landscape UI for autopilot openstack deployment, can we set external configuration parameters for a particular component?
[14:16] ram____: you'd do better to ask in #juju
[14:21] frobware: sorry, didn't mean to overlap
[14:21] katco: not a problem
[14:21] natefinch: Ok. thank you.
[15:10] mgz: ping?
[15:11] heya
[15:11] I'm trying to investigate this: https://bugs.launchpad.net/juju/+bug/1606308
[15:11] Bug #1606308: Restore cannot initiate replica set
[15:12] I can't really remember how to go about running the CI tests.
[15:13] babbageclunk: so following the links through to a recent failure gives you the rough outline
[15:13] this is a test we run on aws, so it's pretty easy
[15:13] want to do a ho or something quickly to go over?
[15:14] yeah, that would be brilliant
[15:14] babbageclunk: okay, I am in the meeting named core
[15:28] fwereade: I'm looking at this: https://bugs.launchpad.net/juju-core/+bug/1485784
[15:28] Bug #1485784: Error creating container juju-trusty-lxc-template; Failed to parse config
[15:29] fwereade: actually, sorry, wrong link, this one: https://bugs.launchpad.net/juju-core/1.25/+bug/1610880
[15:29] Bug #1610880: Downloading container templates fails in manual environment
[15:29] though they're similar
[15:30] fwereade: we're running lxc-create and trying to download the lxc image from the server - lxc-create [-n juju-trusty-lxc-template -t ubuntu-cloud -f /var/lib/juju/containers/juju-trusty-lxc-template/lxc.conf -- --debug --userdata /var/lib/juju/containers/juju-trusty-lxc-template/cloud-init --hostid juju-trusty-lxc-template -r trusty -T https://10.2.0.186:17070/environment/80234a11-2d53-436e-855c-da998c76d6ca/images/lxc/trusty/amd64/ubuntu-14.04-server-cloudimg-amd64-root.tar.gz]
[15:30] That's getting a cert error
[15:31] I notice that I get a similar error if I just try to curl that URL
[15:32] babbageclunk: sorry, been sidetracked. I wanted to try your changes
[15:33] natefinch, I don't have any immediate insight I'm afraid :(
[15:33] fwereade: np
[15:39] frobware: no worries
[15:39] mgz: oops, I guess firefox gave up? I think that's enough to go on with, thanks heaps!
[15:40] babbageclunk: yup, think it's pretty hung
[15:40] babbageclunk: no problems, yell if you need more
[15:40] mgz: I almost certainly will!
[15:43] frobware: If there are some issues that you want me to look at in the meantime you can put up a partial review while you're testing?
[15:44] babbageclunk: haven't got that far. :( still trying to get vsphere to boot whilst larry is about
[15:44] frobware: ok, fair enough :)
[15:45] frobware: I'm not blocked, so no stress
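An aside on the cert error natefinch hit with lxc-create above: the controller serves that image URL with a certificate chained to the model's private CA, so any client that doesn't trust that CA (curl, lxc-create's downloader) fails TLS verification. A sketch of what trusting it explicitly looks like in Go; caCertPEM is a placeholder, not a value from the bug:

```go
package main

import (
	"crypto/tls"
	"crypto/x509"
	"fmt"
	"net/http"
)

// newCAClient builds an HTTP client that trusts a private CA - the
// missing piece when a plain client hits the controller's image URL.
func newCAClient(caCertPEM []byte) (*http.Client, error) {
	pool := x509.NewCertPool()
	if !pool.AppendCertsFromPEM(caCertPEM) {
		return nil, fmt.Errorf("could not parse CA certificate")
	}
	return &http.Client{
		Transport: &http.Transport{
			TLSClientConfig: &tls.Config{RootCAs: pool},
		},
	}, nil
}

func main() {
	fmt.Println("pass the model's CA PEM to newCAClient before fetching the image URL")
}
```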
[16:03] Hi. For testing purposes I developed a simple charm using a shell script to modify the cinder configuration file post-deployment of OpenStack. The cinder configuration is modified, but I saw some errors in the charm log. I pasted information about my issue: http://paste.openstack.org/show/563408/
[16:04] Please anyone provide me some solution.
[16:19] mgz: I'm getting a "no such file or directory" when it tries to call euca-terminate-instances - where should I get that from?
[16:20] anyone that has knowledge of migrations on this time zone?
[16:20] mgz, oh looks like euca2ools
[16:24] babbageclunk: hm, that really should be part of the deps
[16:24] but it should also be switched to boto....
[16:30] perrito666, voidspace but I don't see him online
[16:31] alexisb: he is travelling
[16:31] katco, thanks
[16:31] katco, there is a mail I need him to respond to as well
[16:31] I will loop you in for tracking purposes
[16:32] alexisb: he mentioned he might check-in. if so, i'll let him know
[17:12] natefinch: do you know how we've ended up with 5 packages under github.com/juju/juju/resource with no tests at all?
[17:13] natefinch: it makes life a bit awkward when refactoring in that area
[17:14] rogpeppe1: yeah.
[17:15] rogpeppe1: I think the only one with any substantial amount of code is the resourceadapters directory
[17:16] rogpeppe1: I don't have a good answer for that.
[17:22] Hi. I tried to deploy the "cinder-xtremio" charm in our local Juju openstack environment like $juju deploy cinder-xtremio. I was facing errors. pasted error log: http://paste.openstack.org/show/563432/. Please anyone provide me some solution for this.
[18:23] Hi. I want to develop a cinder-storagedriver charm. And i want to integrate it with Ubuntu-autopilot. So can I give input parameters like san IP, san user and san password from landscape autopilot UI? Otherwise we have to hardcode everything into the charm. And different users for the same storage array have different credentials.
[18:27] ram____: you might get a better response if you send an email to the juju mailing list
[18:28] ram____: most of us here work on the core code for juju itself, and we don't know much about the ubuntu autopilot code, or the openstack charms in general
[18:30] natefinch: Ok. thank you
=== mup_ is now known as mup
=== mup_ is now known as mup
[19:28] tych0: you around?
[20:02] tvansteenburgh: hey, have you made any progress on bug 1616574? still stuck?
[20:02] Bug #1616574: Can't deploy local charm via API
[20:07] katco: i haven't made progress
[20:08] tvansteenburgh: ok; did you have a look at the go code?
[20:08] if i'm bootstrapping and I see "2.0-beta16-xenial-amd64.tar.gz", does that mean i'm downloading the agent from simplestreams?
[20:08] sorry "Fetching agent 2.0-beta16-xenial-amd64.tar.gz" i mean
[20:09] cmars: should be, yeah
[20:09] natefinch, is there a way to get juju to pick up a jujud binary i've already built?
[20:09] natefinch, i can do --build-agent, but if i've already built it..
[20:10] cmars: it just has to be in your $PATH i believe. see wallyworld's email a week or so ago
[20:10] cmars: if you're bootstrapping with a built juju, it's supposed to automagically figure it out and do --upload-tools
[20:10] i've got jujud in my path, but it's not getting picked up
[20:10] so open a bug?
[20:10] it needs to be in the same dir as juju
[20:10] $GOPATH/bin usually
[20:10] ah
[20:11] yeah, i think it was
[20:11] but, i'll try again
[20:11] and it needs to match juju exactly in terms of version
[20:11] --show-log will have more info
[20:11] wallyworld, ok, thanks!
[20:11] it should all work, let me know if now and we can debug
[20:12] *not
[20:12] katco: i did, yes
[20:12] tvansteenburgh: i'm looking through and comparing the 2 now; did anything pop out?
[20:12] it's not critical, but it would shave a minute or two off our CI to not build twice
[20:13] tvansteenburgh: the logic i'm using is: if the juju binary can do this, there's no reason python-jujuclient shouldn't be able to as well. i.e. i don't think it's a fix on our end?
[20:13] tvansteenburgh: is there a flaw in that reasoning?
[20:13] katco: yes :)
[20:13] tvansteenburgh: haha
[20:13] what am i missing?
[20:14] katco: the logic i'm using is, this works with juju1 but not juju2
[20:15] katco: maybe it works with juju2 but there's another step or something, i dunno
[20:15] tvansteenburgh: there have been many many breaking api changes between the 2 versions
[20:15] katco: no, that's not the problem
[20:15] tvansteenburgh: but that's my point: i'm going to help you figure out what's wrong, but i don't know why this is targeted against the juju project and not python-jujuclient?
[20:17] katco: here's the thing. customer comes and says "how do i deploy a local charm using the juju2 api". i can't fix python-jujuclient until i know the answer to that.
[20:17] katco: so far no one has been able to tell me how to do it
[20:17] that's what the bug is for
[20:18] tvansteenburgh: ok, well let's get this figured out. the go code i pointed you at is how we do it, so we just have to figure out what the difference is
[20:33] katco: is it possible that the local charm should be uploaded to the controller and not the model?
[20:36] no, that didn't work either
[20:41] tvansteenburgh: if a customer came and asked how to deploy a local charm using the juju2 api, I'd say "don't"
[20:42] our API is not designed to be used directly by third parties. It's too granular and requires too much knowledge of the internal workings of juju.
[20:42] natefinch: well, that's why we supply libs to wrap that, which is what tvansteenburgh is trying to fix
[20:42] wallyworld, ah, i figured out how to force bootstrap to use jujud out of $PATH. i set the agent & image metadata url to localhost and streams to "nope"
[20:42] katco: yes, I get that
[20:43] wallyworld, that fails over to the "Preparing local Juju agent binary" case
[20:43] wallyworld, do you think that's expected behavior?
[20:44] (actually, i'm not sure i should have messed with image... i have no idea what i'm doing!)
[20:44] tvansteenburgh: i see your placement args are empty. placement.Scope must be the model UUID i think
[20:44] cmars: it will only use a local juju if it can't find any binaries in streams. setting the url like that will cause that search to fail
[20:45] cmars: we have beta16 binaries now, i bet your master source code still says beta16
[20:45] katco: cmars: maybe the issue is that your client is reporting beta16
[20:45] katco: thanks i'll try that
[20:45] wallyworld, ah! i bet that's it
[20:45] cmars: it has to report higher than that
[20:45] wallyworld, i think i'll keep it like this.. i want to test exactly what i've built
[20:45] wallyworld, kind of a hacked-up --agent-binary feature
[20:45] that will happen 99.999% of the time
[20:45] it's just we have an hour window just after a release
[20:46] where the source code is not yet updated to say beta+1
[20:46] * wallyworld needs coffee
[20:59] sinzui: k. thnx
[21:00] wallyworld: awake again?
[21:01] almost
[21:01] katco: no luck with that http://pastebin.ubuntu.com/23090486/
[21:04] katco: if i call the CharmInfo with the charm-url i also get a "charm not found" error back
[21:05] CharmInfo api i mean
[21:05] tvansteenburgh: here is the entire client-side call-chain serialized out, freshmen in cs101 failing miserably style: http://pastebin.ubuntu.com/23090503/
[21:06] tvansteenburgh: let me ponder your CharmInfo comment a moment
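The call chain katco pasted amounts to roughly the following. The path, query args (series, plus the schema/revision args noted just below), auth, and response fields are reconstructed from this conversation for illustration — they are not verified against the beta16 API:

```go
// Package charmupload sketches a local-charm upload against the juju 2.0 API.
package charmupload

import (
	"encoding/json"
	"fmt"
	"io"
	"net/http"
)

// uploadLocalCharm POSTs a charm zip to the model's charms endpoint and
// returns the charm URL the server assigns.
func uploadLocalCharm(c *http.Client, addr, modelUUID, series string, charmZip io.Reader) (string, error) {
	url := fmt.Sprintf("https://%s/model/%s/charms?series=%s&schema=local",
		addr, modelUUID, series)
	req, err := http.NewRequest("POST", url, charmZip)
	if err != nil {
		return "", err
	}
	req.Header.Set("Content-Type", "application/zip")
	resp, err := c.Do(req)
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	var result struct {
		CharmURL string `json:"charm-url"`
		Error    string `json:"error"`
	}
	if err := json.NewDecoder(resp.Body).Decode(&result); err != nil {
		return "", err
	}
	if result.Error != "" {
		return "", fmt.Errorf("charm upload: %s", result.Error)
	}
	return result.CharmURL, nil
}
```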
[21:07] tvansteenburgh: substitutions in that pastebin can be searched by triple "/" (e.g. ///)
[21:07] katco: cool, looking
[21:08] thumper, ping
[21:09] morning
[21:09] heya thumper
[21:09] morning thumper
[21:09] ping
[21:09] * thumper is munching on breakfast, been a busy morning
[21:09] kids off to ski trip
[21:09] nice
[21:10] thumper, do you mind joining a HO?
[21:10] sure
[21:10] I'll just be mute
[21:10] if you join now no need for you to be on release call
[21:10] https://hangouts.google.com/hangouts/_/canonical.com/bug-scrub
[21:10] thumper, ^^^
[21:11] * redir lunches
[21:20] tvansteenburgh: it looks like environment.py::add_local_charm is missing schema & revision queryargs
[21:21] tvansteenburgh: L65-67 in pastebin
[21:22] katco: i noticed that but figured i'd get an error back if they were needed. i'll try adding them though
[21:22] * katco doesn't know, is just pedantically going through the diff
[21:23] katco: after uploading my charm, and getting a url back, a call to the Charms.List api does not list my charm
[21:23] i'll try the extra args now
[21:23] tvansteenburgh: ok, so we're narrowing it down here at least ^.^
[21:23] tvansteenburgh: fyi i have a call in 7m which EOD me
[21:23] which will EOD me
[21:24] katco: ack
[21:25] katco: extra args didn't seem to make any difference
[21:26] tvansteenburgh: hm, can you verify that the lib's Charms.List call works at all? just so we know the scope of this problem?
[21:26] katco: yeah i have output from it
[21:27] it just doesn't include the charm i uploaded
[21:27] it does include other charms in the model thought
[21:27] though
[21:27] tvansteenburgh: k... so it has to be something with add_local_charm_dir down right?
[21:27] well i get a charm-url back from that, as if the upload was successful
[21:28] but then when i list via the api, it's not there
[21:28] can you manually verify it's in the environment? i.e. look in mongo?
[21:28] or use juju bin?
[21:29] katco: can the juju cli list apps that haven't been deployed?
[21:30] tvansteenburgh: it depends on if they're just "pending" or uploaded but not placed... i don't know which adding a charm does =/
[21:30] tvansteenburgh: i would just look in mongo to be sure
[21:31] katco: never done that, are there instructions somewhere? :D
[21:31] tvansteenburgh: yeah sec
[21:33] tvansteenburgh: https://lists.ubuntu.com/archives/juju-dev/2016-July/005772.html
[21:34] coool
[21:34] * tvansteenburgh tries
[21:38] * tvansteenburgh waits for mongo client to install
[21:44] katco: http://pastebin.ubuntu.com/23090623/
[21:47] tvansteenburgh: http://pastebin.ubuntu.com/23090638/
[21:47] tvansteenburgh: try this version
[21:49] katco: same :(
[21:49] is my mongo shell version ok?
[21:49] tvansteenburgh: oh, no... you need >= 3.2
[21:50] tvansteenburgh: sorry, didn't catch that
[21:50] ok
[21:52] katco: ok i'm connected
[21:53] tvansteenburgh: `use juju`
[21:53] thumper: target prechecks infrastructure: http://reviews.vapour.ws/r/5543/
[21:53] tvansteenburgh: `db.charms.find()`
[21:54] katco: i see my local ubuntu charms
[21:54] tvansteenburgh: the ones you've been uploading?
[21:54] yeah
[21:54] menn0: looking
[21:54] i'll pastebin one
[21:55] veebers: can we hangout?
[21:55] katco: http://pastebin.ubuntu.com/23090657/
[21:55] katco: now i notice that model uuid is not the one i passed to Placement.Scope
[21:56] tvansteenburgh: ah. worth a try
[21:57] katco: no luck
[21:58] thumper: sure thing, what about?
[21:59] veebers: running a ci test locally
[21:59] thumper: sure, where you want to meet?
[21:59] veebers: https://hangouts.google.com/hangouts/_/canonical.com/friday?v=1471633360&clid=9319256218B181C1&authuser=0
[21:59] tvansteenburgh: ok, we'll have to pick this up tomorrow. at least we've narrowed the scope
[22:00] katco: sounds good, thanks
[22:00] tvansteenburgh: np
[22:05] is master blocked?
[22:06] aaah, anastasiamac we should remove blocking tags
[22:06] yes. i will now \o/
[22:07] alexisb: there are no blocking bugs i see
[22:10] anastasiamac: https://bugs.launchpad.net/juju/+bugs?field.tag=blocker
[22:10] anastasiamac: I see 4 blockers
[22:11] Bug #1615986 changed: Agents failing, blocked by upgrade but no upgrade performed
[22:11] menn0: looking. i'll remove.. nothing is blocking according to juju.fail :D
[22:11] anyone know who runs juju.fail? I suspect it needs to be updated to look at "juju" instead of "juju-core" on launchpad
[22:11] menn0: i think it's marcoceppi
[22:11] ah... it's because juju.fail may look at launchpad juju-core, not the juju project
[22:11] side effect of the move :)
[22:12] \o/
[22:12] that's what I said :)
[22:12] haha
[22:12] :D
[22:12] * menn0 emails marcoceppi
[22:16] thanks all
[22:19] katco: i see what's happening. the uploads are being tagged with the controller uuid instead of the default model uuid. i'm not sure how to fix it though
[23:10] menn0 katco I'll update it, and add a link to bugs
[23:10] marcoceppi: thank you
[23:10] actually, there is a link, at the bottom of the page
[23:10] that says I made it
[23:10] ;)
[23:14] marcoceppi: the awesomeness of the rest of the page must have blinded us to that part ;-)
[23:14] the citools from QA have changed, I have to go patch the scripts
[23:15] menn0: this may seem odd, but apparently there are no blockers?
[23:16] anastasiamac, standup
[23:16] marcoceppi: that's correct... there were an hour or so ago, but not now
[23:16] perrito666, standup
[23:16] menn0: ah, I see, well it's switched over now
[23:17] menn0: the next time you have a blocking bug double check to make sure it works
[23:17] marcoceppi: will do!
[23:20] katco: i figured it out. bug updated with the details. TL;DR "I'm sorry"