/srv/irclogs.ubuntu.com/2015/07/08/#juju-dev.txt

thumpermenn0: are you getting time to finish off https://github.com/juju/juju/pull/2724?00:03
menn0thumper: working on that right now00:03
menn0thumper: not far00:03
thumpercool00:03
menn0thumper: done, about to merge00:11
perrito666mehh I was reviewing a pr and when I got to the end it got discarded01:26
anastasiamacperrito666: :(01:27
anastasiamacperrito666: what time is it for u?01:27
thumperwallyworld: I don't suppose someone on your team can fix the intermittent failure in uniter worker?01:30
thumperFAIL: uniter_test.go:892: UniterSuite.TestUniterUpgradeConflicts01:30
thumperhappens relatively regularly01:30
perrito666anastasiamac: 22:3001:31
wwitzel3axw: ping02:13
wallyworldthumper: sorry, missed ping about failing test. will take a look. currently full with WIP fixes for 1.25 release and soon a critical customer issue which is more a feature thana bug02:15
wallyworldi suspect will be end of week or early next02:15
wallyworldalso working on arm issue for 1.24.302:15
menn0wallyworld, thumper: the mongodb timeout PR has landed in 1.2403:07
wallyworldgreat03:07
wallyworlddo we know who to prod to see if it helps with anything?03:08
menn0wallyworld: ahasenack reported the bug that lead to me doing this work (i'm not sure if it will help or not)}03:09
wallyworldok03:10
menn0thumper also thinks it might help with a problem env he was looking at03:10
wallyworldtime will tell i guess03:10
menn0wallyworld: so what is the github.com/juju/juju/juju package all about?03:27
menn0seems like stuff that I would have expected to be in the state package03:30
wallyworldmenn0: that package is sort of an attempt to get stuff out of state as far as i understand03:41
wallyworldit is horribly named03:42
menn0wallyworld: so state helpers then?03:44
cherylj_wwitzel3: ping?03:45
wallyworldmenn0: more like juju core business logic03:45
wallyworldthat is not persistence related03:45
menn0hmm ok03:45
wallyworldi didn't add it :-)03:45
thumpermenn0: cool03:46
menn0wallyworld: I didn't think you did03:48
wallyworldi can't defend it too hard :-)03:48
menn0wallyworld: review done03:53
wallyworldty menn003:54
wallyworldthumper: have you picked up much about juju and arm via osmosis?04:02
thumpernope04:02
wallyworldand dave is away this week :-(04:02
thumperaye04:08
=== kadams54 is now known as kadams54-away
anastasiamacthumper: u got a fish trophy!05:25
wallyworldjam: hi05:56
jamhey wallyworld05:57
wallyworldjam: network related question if you have a moment05:57
wallyworldbug 147201405:57
mupBug #1472014: juju 1.24.0: wget cert issues causing failure to create containers on 14.04.2 with lxc 1.07 <openstack-installer> <juju-core:Triaged> <juju-core 1.24:Triaged> <https://launchpad.net/bugs/1472014>05:57
wallyworldsee the last couple of comments05:57
wallyworldit seems we don't store / report all the cloud local addresses for a machine05:58
wallyworldso a machine's AddressWatcher doesn't get told about all possible addresses an https request can arrive on05:58
wallyworlddo you know why we throw away some cloud local addresses?05:59
wallyworldi'm not looked at the code in detail, just going by jame's comment05:59
wallyworldbut i'm wary about changing network related code05:59
wallyworldas it has the habit of breaking things06:00
jamwallyworld: so offhand I'd say we don't actually cope with having multiple addresses where things could arrive06:02
jamwallyworld: consider charms, they can only really report 1 private address to eachother06:02
wallyworldhmmm, so which one to pick then06:02
jamwallyworld: so the issue is probably that we are thinking 10.0.6.* is the right address, when really the correct cloud-local address is the 10.0.3* one06:02
wallyworldas we are picking the wrong one06:02
wallyworldhmmm, so how to pick the right one06:03
jamwallyworld: so that machine has 3 addresses that I would consider "cloud-local" sort of addresess, a 10.0.3 a 10.0.6 and a 192.16806:04
wallyworldfor this purpose we should just stick everything in the SAN06:04
jamwallyworld: Surprisingly (for me) 10.0.3 is usually the LXC bridge (I thought)06:04
wallyworldjam: what about this line : setting API hostPorts06:05
jamwallyworld: I think in the  idealized model Juju would be aware of all the subnets and have labled names for them (spaces), in which case it would know that machine X is supposed to talk to machine Y on a given address.06:05
wallyworldit seems there we pass in everything06:05
jamwallyworld: internally it feels like we should be aware and save all the addresses06:05
wallyworldyes, save them all internally sounds good to me too, but if we do that now stuff would break i would tink06:05
jamwallyworld: short term, I think just adding all addresses to the SAN is fine.06:06
wallyworldbut06:06
wallyworldthat relies on AddressWatcher :(06:06
wallyworldso i'll need to change how it all works06:06
wallyworldbollocks06:06
wallyworldi'll go read the code and see what can be done06:06
jamwallyworld: well, I think you can certainly get help from Sapphire on this one.06:06
wallyworldjam: i asked but none were 100% sure about why only one address was saved etc06:07
wallyworldso maybe there's not the level of knowledge there to dive right in06:07
jamwallyworld: well, dimitern is away and the others probably not quite as familiar06:07
jamgophercon being this week.06:08
wallyworldyeah, that's what i figured, hence asking you :-)06:08
wallyworldi'll see if it's possible to tinker with the cert updater06:08
wallyworldi could re-read all machine addresses06:08
wallyworldbut may not be triggered06:08
dooferladwallyworld, jam: just read the backlog. Yell if you want a networking hand.08:09
dooferladwallyworld: also with ARM, I am an ex-ARM employee so if I can help with that please call on me08:10
wallyworlddooferlad: oh, i might take you up on that arm offer. i might ping you after dinner08:41
dooferladjam, fwereade: hangout?09:02
voidspacefwereade: git blame -L302,302 provider/ec2/config_test.go09:30
voidspacefwereade: that's the test that fails for me on master09:30
voidspacefwereade: git blame may be deceived of course...09:31
mattywfwereade, quick ping?11:15
fwereademattyw, heyhey11:15
mattywfwereade, is there any doc or something about the uniter operation/ callbacks arch? I'm finding myself getting in to it and was hoping I could make some decisions about my stuff without having to hassle you11:16
fwereademattyw, only what's inline, I'm afraid11:16
mattywfwereade, I probably only want to call a certain function when a certain hook has finished11:17
fwereademattyw, that sounds like the responsibility of the CommitHook bit to me?11:17
fwereademattyw, but the callbacks themselves are basically evil11:18
mattywfwereade, time for a 5 minute hangout?11:18
fwereademattyw, it's basically just a cut-down uniter facade/adapter for the use of the ops11:18
mattywfwereade, I'll try to timebox it at that11:18
fwereademattyw, sure, start one please?11:18
wwitzel3ericsnow: ping12:36
mupBug #1472596 opened: bootstrap failed yet retry says it succeeded <juju-core:New> <https://launchpad.net/bugs/1472596>12:52
mgzbogdanteleaga: is rr 2107 live or not atm?13:42
bogdanteleagamgz: no it's not13:49
bogdanteleagamgz: it's more of a weird interaction, 2109 is the same but should show a better diff13:49
bogdanteleagamgz: any ideas if I can delete that one?13:49
mgzbogdanteleaga: it is marked as discarded, so that's probably fine, is just getting updated still I guess as it's the same github branch13:52
bogdanteleagamgz: yeah, that was the one submitted to github, but the diff would be a bit funky since it contains another branch13:53
mgzbogdanteleaga: do you need anything else on those branches, or are you good to go?13:56
mgzbogdanteleaga: I think we'll want to backport to 1.24 after master has blessed the change13:56
bogdanteleagamgz: no, I was doing some final tests a couple of hours ago, but everything seems fine13:59
bogdanteleagamgz: got caught up with something else13:59
mgzbogdanteleaga: no problem14:00
bogdanteleagamgz: squashing now and I'll start merging14:01
* bogdanteleaga grabs popcorn14:01
mgz:)14:01
wwitzel3ericsnow: ping14:19
ericsnowwwitzel3: hey hey hey14:20
wwitzel3ericsnow: trying to work through an issue and I've run in to some code that I could use some help deubgging14:24
ericsnowwwitzel3: sure14:24
ericsnowwwitzel3: moonstone?14:25
wwitzel3ericsnow: we can go to a query, conference wifi probably won't work so well with a hangout14:25
ericsnowwwitzel3: right :)14:25
bogdanteleagamgz: you said they ought to be backported?14:35
mgzbogdanteleaga: I think we do need it on 1.24, yeah14:35
mgzwant to see it work on master first of course14:35
bogdanteleagamgz: Build failed: Does not match ['fixes-1472632']14:39
bogdanteleagacan't find a bug with that number14:39
mgzabentley: ^did you mean to make that bug a blocker and private?14:40
abentleymgz: Yes.14:40
mgzit does not really strike me as either14:40
abentleymgz: Regressions are blockers.14:40
abentleymgz: It has debug logs from SSH.14:40
mgzI am reading the debug ssh log and apart from containing your name and some of juju-ci's ips addresses it seems to have nothing personal14:42
mgzand I don't see how this bug prevents us releasing, which is the point of blockers14:42
mgzwe've been releasing fine with this for three 1.24-s14:43
abentleymgz: Well, I was erring on the side of caution with the SSH.  If you're willing to take responsibility for making it public, I'm fine with that.14:43
abentleymgz: I don't know why we've continued to make releases after it was discovered.  I assumed sinzui had filed a regression bug, since he knew about it.14:44
sinzuiabentley: I didn't know about it14:44
mgzabentley: okay. I can confirm this does not contain your private ssh keys. :)14:44
abentleysinzui: Oh? Didn't you say you'd had to rename your ssh key to id_rsa to deal with this issue?14:45
mgzabentley: the issue with the ssh stuff is it depends somewhat on your personal setup, so I knew that the combo of my local ssh config + juju ci scripts borked ssh for juju14:45
sinzuiabentley: yes, for just bootstrapping ec2 and openstack providers. I haven't seen any issue with other providers or ssh in general14:46
mgzI agree this is a regression, but given it has an annoying but somewhat trivial workaround (don't use your personal ssh config) I don't see how it's critical14:46
sinzuiabentley: nd this behaviour matches the windows setup from 18 months ago14:47
abentleymgz: I don't think the existence or lack of workarounds is a factor in whether an issue should block.  We don't want to break users' existing workflows, and this does break users' existing workflows.14:50
sinzuiabentley: I don't disagree14:51
mupBug #1472632 opened: regression: juju ssh dies with (publickey) <blocker> <regression> <ssh> <juju-core:Triaged> <juju-core 1.24:Triaged> <https://launchpad.net/bugs/1472632>15:11
alexisbabentley, do we know what commit caused the regression?15:17
alexisbI would like to see bogdanteleaga be able to land his fix, it is a critical fix for 1.24.315:17
abentleyalexisb: No, we don't know which commit.  We found it by hand, not with automated tests, so we don't have logs that would show it.15:17
alexisbabentley, ok15:18
alexisbsinzui, abentley, mgz there will not be anyone from core looking at that bug until NZ/AUS comes online15:21
mgzI don't see how blocking is productive for this issue, nor how it's justified by our procedure15:22
sinzuialexisb: it doesn't blocl15:22
sinzuinot tag blocker15:22
mgzsinzui: it has that tag currently, I was planning on raising in standup in 5 mins15:22
alexisbsinzui, ok, I must have miss read the back scroll15:23
alexisbI thought bogdanteleaga was blocked15:23
mgzsinzui: `./check_blockers.py check master`15:24
sinzuialexisb: if he is, he can add __JFDI__  to $$merge$$ to make it mege and test <- bogdanteleaga15:24
mgznoo..15:24
mgzeither the bug blocks or it doesn't, we shouldn't be bypassing15:24
mgzI don't think it should block.15:25
bogdanteleagait is blocked currently, but this is just the fix for master, the one for 1.24 is coming up15:25
bogdanteleaganot sure how long it takes for the upgrade ci job to test the fix though15:26
sinzuimgz: as we don't have a test for it and the regression is in the wild, we don't need to block. I think this is like the expressions closing the stable door after the horse has bolted15:26
abentleysinzui: That assumes that the number of people who have not upgraded to 1.24 is not significant.  I think that it is significant.  I think every time we put out a release, especially if we release 1.25, we encourage people who are using 1.23 and earlier to upgrade.15:29
sinzuibogdanteleaga: use "$$merge$$ fixes-1471332" to the pull requet comment to ensure CI will test and merge15:53
bogdanteleagasinzui: I did try it with __JFDI__ but I fluked a test15:55
bogdanteleagasinzui: should be fine on the next try; should I use jfdi or fixes?15:55
sinzuieither bogdanteleaga15:56
bogdanteleagacool15:56
voidspacedooferlad: TheMue: if you have a chance I'd appreciate a review http://reviews.vapour.ws/r/2116/16:10
TheMuevoidspace: one moment, hunting a test failure, but will start in a few seconds16:11
voidspacecool, thanks16:11
voidspacegood luck with the hunt :-)16:11
TheMueah, kewl, panic is gone. now I can jump into your review16:15
TheMuevoidspace: seeing your new ReleaseAddress() signature. could it be that the passed address doesn't match to the passed MAC address? IMHO they always should be a kind of pair with 1:N (one MAC, multiple IP)16:18
voidspaceTheMue: no they're always 1:116:18
voidspaceTheMue: the MAC comes from state.IPAddress16:18
voidspaceinstance id is stored there too16:19
TheMuevoidspace: ok, but technologically one MAC could have o´multiple IP, we only don't model it so right now16:19
voidspaceah16:19
voidspaceour model does allow that, sorry16:19
voidspacealthough we don't use that capability16:19
TheMueok16:20
voidspaceso yes, one mac address could have multiple ip addresses - and we handle that16:20
voidspacewe only delete the device if the IP address is the last one16:20
voidspaceotherwise we just release the address normally16:20
voidspacebiab, grabbing coffee whilst you read16:20
TheMuevoidspace: you've got a review16:31
* TheMue is afk for a moment, continue later16:34
=== liam_ is now known as Guest71963
voidspaceTheMue: thanks17:15
mupBug #1472711 opened: MAAS node has "failed deployment", juju just says "pending" <juju-core:Triaged> <juju-core 1.24:Triaged> <https://launchpad.net/bugs/1472711>17:32
=== kadams54 is now known as kadams54-away
mupBug #1472729 opened: juju stuck in "upgrade in progress " for 20min <juju-core:New> <https://launchpad.net/bugs/1472729>18:20
mupBug #1472729 changed: juju stuck in "upgrade in progress " for 20min <juju-core:New> <https://launchpad.net/bugs/1472729>18:23
mupBug #1472729 opened: juju stuck in "upgrade in progress " for 20min <juju-core:New> <https://launchpad.net/bugs/1472729>18:32
=== kadams54-away is now known as kadams54
mupBug #1472749 opened: github.com/juju/utils has contradictory licences <juju-core:Triaged> <juju-core 1.24:Triaged> <https://launchpad.net/bugs/1472749>19:47
=== kadams54 is now known as kadams54-away
=== kadams54-away is now known as kadams54
=== kadams54 is now known as kadams54-away
marcoceppihello world21:49
marcoceppimaas 1.8.0 juju 1.24.2 deploying to LXC containers seems stuck, it's trying to wget the image from the bootstrap node/api server and it's been doing that for like 15 mins21:49
marcoceppijk, it's moving on now21:53
alexisbhey marcoceppi welcome back from vacation22:00
marcoceppialexisb: hey, thanks!22:01
thumperholy shit balls22:37
thumperthese worker tests are taking ages to run...22:37
thumperFAILgithub.com/juju/juju/state/leadership1200.021s22:39
thumperhmm22:39
thumpertimeout kill22:39
marcoceppinbd, 20 min tests22:40
thumpernbd?22:41
marcoceppino big deal22:42
thumpernah man, it is a big deal22:44
thumperthis is broke22:44
menn0thumper: shall I pick up the juju ssh blocker?22:54
thumperbug?22:54
menn0thumper: bug 147263222:55
mupBug #1472632: regression: juju ssh dies with (publickey) <blocker> <regression> <ssh> <juju-core:Triaged> <juju-core 1.24:Triaged> <https://launchpad.net/bugs/1472632>22:55
menn0it's blocking master and 1.2422:55
alexisbsinzui, ^^ I thought we had decieded that bug was not a blocker22:58
alexisbmenn0, we discussed that one earlier today22:58
thumpermenn0: I think abentley's analysis is wrong23:00
thumpermenn0: if you look at the logs, it was his personal id_rsa that worked23:00
thumperbut 1.24 and master did not appear to be trying23:00
=== mwhudson_ is now known as mwhudson
=== mwhudson is now known as Guest82160
menn0thumper: i haven't looked at it in any detail23:02
alexisbso menn0, thumper, my understanding from this morning is that bug should not be a blocker and the tag was going to be removed23:03
* thumper removes blocker tag23:04
alexisbunfortunately I don't see any of the release dudes online atm23:04
alexisbsweet23:04
menn0thumper, alexisb: cool. that unblocks master23:04
menn01.24 is still blocked due to the window upgrade issue23:04
menn0i've just been talking to bogdanteleaga23:04
menn0he's waiting to see the problem is fixed on master before pushing the fixes to 1.2423:05
thumperthat shouldn't block 1.24 then23:05
menn0the PR to fix 1.24 is ready to go though23:05
menn0thumper: no?23:06
* thumper thinks23:06
thumperit will block us doing a release23:06
thumperbut I don't think it should block us landing other fixes on 1.2423:06
menn0thumper: remove the blocker tag then?23:07
thumpermenn0: do we know if it fixed the issue on master?23:08
menn0thumper: no we don't. CI hasn't gotten to running it yet23:08
menn0thumper: bogdanteleaga just noticed that his fix broke the unit tests under windows so he's going to do a fix for that now23:09
wallyworldwaigani_: can you take a look at 1.24 bug 1472711? it claims bug 1376246 may not quite be fixed23:11
mupBug #1472711: MAAS node has "failed deployment", juju just says "pending" <maas-provider> <juju-core:Triaged> <juju-core 1.24:Triaged> <https://launchpad.net/bugs/1472711>23:11
mupBug #1376246: MAAS provider doesn't know about "Failed deployment" instance status <landscape> <maas-provider> <juju-core:Fix Released by waigani> <juju-core 1.24:Fix Released by waigani> <https://launchpad.net/bugs/1376246>23:11
=== Guest82160 is now known as mwhudson
=== kadams54 is now known as kadams54-away
thumperrick_h_: hey there23:37
menn0thumper: so the windows upgrade still isn't working in CI23:59

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!