/srv/irclogs.ubuntu.com/2017/11/14/#juju-dev.txt

=== mwhudson_ is now known as mwhudson
axwwallyworld: https://github.com/juju/juju/pull/8065 is part of a fix for the enable-ha bug00:24
axwwill look at the replicaset stuff after school drop off00:24
wallyworldaxw: nw, ty, will look after talking to xtian00:24
* thumper needs food badly00:27
wallyworldhml: we need a unit test00:36
hmlwallyworld: okay00:36
wallyworldthere should be stuff to copy from; it's a bit hairy00:36
* thumper is grumpy walking through resources code01:22
hmlwallyworld: just a little hairy.  ha! - pushing the unit test now02:25
wallyworldgreat02:25
wallyworldhml: we just need to also check call names to ensure the rovider behaved as expected, in addition to not crashing wit hthe error02:29
wallyworldthere's examples to copy from02:29
hmlwallyworld: i saw examples for storage clients and such… but not the general sender02:33
wallyworldhml: yeah, guess so. it seems TestStopInstancesNotFound() for example just checks err is nil02:34
wallyworldso should be ok to land based on that precedent02:35
hmlwallyworld: looked around, not much setup for checking the call tree - though i did verify with some logger messages before finaliing02:35
wallyworldsgtm02:36
hmlwallyworld: had a few false positives so  i wanted to verify02:36
wallyworldyeah, testing manually is good for this type of issue02:36
hmlwallyworld: ty - merging now02:40
axwwallyworld: can you please take a look at https://github.com/juju/juju/pull/8065?03:03
wallyworldsure03:04
wallyworldsorry, forgot03:04
wallyworldaxw: done03:16
axwta03:16
axwjam: do you know why mongo.SelectPeerAddress allows machine-level addresses?04:23
jamaxw: you mean 127.* stuff?04:23
jamaxw: you can run an HA cluster for testing on just your local machine04:23
axwjam: I meant to say machine-local, but yeah04:23
axwhmm ok.04:24
jamaxw: we don't want to allow them ourselves04:24
jamso doing so is a bug04:24
jamaxw: but I think that's why *mongo* doesn't refuse them04:24
axwjam: ok, I'll change it then. I meant in our juju/mongo package04:24
jamaxw: so I don't think we personally ever do local-only testing04:25
jamand if we did, we could just use your eth0 ip address 3 times04:25
axwjam: can you please take a look at https://github.com/juju/juju/pull/8066?05:06
jamaxw: will do05:49
axwwallyworld: I've added another commit to https://github.com/juju/juju/pull/8056/, can you please look at the last commit? moves the CACert methods around06:14
wallyworldok06:14
axwwallyworld: sorry wait a sec06:14
axwI mucked up rebase06:14
axwwallyworld: ok, all good now06:14
wallyworldok06:14
wallyworldaxw: so there's 3 facades that dupe the getting of ca cert from controller config - you saying that sinces it's only half a dozen line sof code each time, it's not worth a common plugin06:22
jamaxw: 8066 lgtm06:23
axwjam: thanks07:17
axwwallyworld: sorry, was afk. I took it off APIAddresser because (a) it doesn't have anything to do with API addresses, and (b) it was being exposed by things that didn't care about API addresses, and vice versa07:17
axwwallyworld: i.e. things only cared about CACert and not API addresses07:17
axwwhich should be a pretty clear indication that they're orthogonal07:18
wallyworldsure, i was thinking about a new common plugin07:19
wallyworldbut probably overkill07:19
wallyworldfor what it saves07:19
wallyworldanyway, lgtm07:22
axwwallyworld: yeah I don't think it's worthwhile. if it's used again maybe, but I don't see that happening any time soon07:25
axwI guess the caas provisioner might need it. I'll add it then if required07:25
wallyworldnp07:25
=== frankban|afk is now known as frankban
thumperjam: ping08:17
jamthumper: pong08:26
thumperjam: got time for a quick chat about pingers?08:26
thumperI'm past EOD, but wanted to follow up08:26
thumperjam: I know you are on your standup so I'll leave ideas...08:27
thumperThe dealing with reasources is required but perhaps not sufficient08:27
thumperI agree that we should work out where the other pingers are coming from08:28
thumperhere's a thought...08:28
thumperapi.Open will try all the apiservers, and kill those that aren't the first to respond08:28
thumperperhaps some of those don't get a close noticed on the apiserver, so they hang around for ~1 minute before the agent pinger closes them for not calling Pinger.Ping08:29
thumperif we were trying to open every few seconds, and there were some left around, this might be a reason why it floats around 20-3008:30
thumperjust a thought08:30
thumpergiven that it is required I'd still like to land it08:30
thumperI'll leave it to you to do the $$merge$$ if you are happy enough with my comments and rationale08:31
* thumper out08:31
axwballoons: something's borked in CI, https://github.com/juju/juju/pull/8056 says it's been accepted, but there's nothing running in jenkins10:08
jamaxw: possibly. I've run into a few of those where the bot fails in such a way that it doesn't respond to the PR10:27
jamI can trigger a rebuild if you feel its ready to land10:27
jamI do see a http://ci.jujucharms.com/job/github-merge-juju/508/10:28
jamwhich says it failed10:28
axwjam: should be ready, it just failed on an intermittent unit test -- will try and fix that on develop tomorrow10:33
axwjam: what's the procedure? I can probably do it too, I have jenkins login10:33
jamaxw: I *do* think we should bring it up to balloons / veebers, since I know when it was happening to me, it was a bug in the test script that it wasn't talking back to the bug.10:34
jamaxw: if you log into CI (I use 'developer') you should be able to go back to the bug and just use "rebuild"10:34
jamon http://ci.jujucharms.com/job/github-merge-juju/508/ on the left hand side is a link to: http://ci.jujucharms.com/job/github-merge-juju/508/rebuild10:34
axwjam: ok, thanks10:35
jamexternalreality: can you confirm the PR that you wanted me to review? It seems I had linked to the wrong one earlier10:36
axwjam: seems like jenkins is busted. rebuilding, or starting a new build with the same parameters, does not result in a build job...10:37
axwballoons: ^10:37
jamaxw: hm. maybe the blue ocean stuff broke what I used to do10:38
jamaxw: the other option is that you just reply with the same message that the bot usually does10:39
axwjam: tried that :(10:39
axwnever mind, I can land this tomorrow10:39
jamah, I see you did try that10:39
externalrealityjam: https://github.com/juju/juju/pull/804810:53
jamthx10:53
externalrealitynp10:53
jamexternalreality: I'll see about running your stuff in a sec as well10:59
externalrealitycool11:00
jamwpk: did you do a patch to show normal machine error messages in tabular 'juju status'?11:05
jamI'm running 2.3b3 to test things out, and I had an upgrade try-but-fail which is weird in its own right, but then the machines went to "error" but I don't see it normal status11:06
wpkIt's even in 2.211:14
wpkthe 'Message' field, so it should be there11:14
jamwpk: is it not there because we only include INstance status and not Juju Agent status?11:30
jamwpk: bug #173215611:30
mupBug #1732156: juju upgrade-juju --build-agent allows invalid upgrades <upgrade-juju> <juju:Triaged> <https://launchpad.net/bugs/1732156>11:30
wpkwe're showing machine-status: message:11:32
wpknot juju-status:11:32
wpkIIRC11:32
jamwpk: so, arguably we should allow for both11:39
jamthe former shows provisioning errors11:39
jamthe latter shows machiner errors once things are up11:39
jamhttp://github.com/juju/juju/pull/8063 and http://github.com/juju/juju/pull/8068 could both use reviews11:40
jamexternalreality: wpk ^^ if you have a chance11:40
jamI'm happy to be on-hand if someone wants context11:40
jamthough I think axw effectively approved 8068 because he approved the upstream mgo patch.11:40
jamI think I figured out the problem with Trello's github integration, is that it doesn't default to hiding closed PRs11:42
mupBug #1732163 opened: juju status triggers some uninteresting DEBUG level mesasges <logging> <juju-core:Triaged> <https://launchpad.net/bugs/1732163>11:51
jamexternalreality: so, how were you testing this that you found sometimes it breaks? Is it the CI tests, or just running "go test" in the right directory?11:54
jamyou were mentioning you thought it might be your mongo version, so I'm guessing it was somewhere in local tests11:55
jamballoons: axw: I can confirm the same bad bot behavior for PR #805711:56
jamsomething seems very wedged with the bot.11:56
jamwpk: can you join: https://hangouts.google.com/hangouts/_/canonical.com/juju-doc?authuser=1 he had some FAN questions11:59
externalrealityjam: I can't be completely sure what it was11:59
jamexternalreality: right, I'm just trying to make sure that I'm exercising the same test that you saw failing12:00
jamI know you said it was blocked at one point, but I don't see what was actually failing.12:00
externalrealityAh, for example, initialization_test.go would fail attempting to build "txns.log" twice.12:01
externalrealityother tests would fail too, all suites that used stateSuite to establish connections to mongo12:02
jamexternalreality: I don't see an "initialization_test.go" file12:02
jamam I just missing it?12:02
externalrealityhmm12:03
jaminitialize_test.go ?12:03
externalrealityjam, correct. And a good example of a test that was failing is `TestDoubleInitializeConfig`12:05
jamexternalreality: so, that test doesn't have anything to do with your changes, and I don't think it could possibly fail because of your changes (AFAICT).12:33
jamsince its a state/state.go test12:33
jammight still be worth looking at, but otherwise its just a flaky test, and not related to your patch12:33
externalrealityYes, perhaps a flaky test or something related to the specific vm that I was running it on (some akin to a messed up clock or something).12:38
wpkjam: blah, missed it while lunching. Are you still there?12:39
jamwpk: no, we're done, but if you can respond to peter's questions around setting up VPC and the FAN would be useful.12:40
wpkkk12:48
jamballoons: just to note, the CI bot seems thoroughly wedged right now, not sure if there is something we could do to fix it. we should probably learn how, so that we can be landing code even when part of the world is asleep13:31
* jam heads away for EOD, though I'm likely to stop back again later.13:34
balloonsI'll look13:52
balloonsAnd I agree13:52
=== freyes__ is now known as freyes
balloonsjust fyi, I did nothing but it seems to have worked itself out14:51
balloonsI'm curious if someone can comment about what was wrong14:51
wpkjam: I realized that I've never created a VPC for Juju, always used existing ones15:27
wpk(and if we don't have a clear doc on how to do it that's bad...)15:27
jamballoons: we were submitting requests, and it was saying "going into the queue" but the queue itself was not updating.16:32
balloonsjam, are things still pending?16:33
jamballoons: I know axw had a PR, but also PR 805716:34
jamballoons: actually, still just as broken for us16:34
jamballoons: axw was trying to resubmit PR 805616:34
jamand that is the top of the queue, but didn't get retried, and nothing else got queued16:35
jamballoons: we also tried manually "rebuild" from the Jenkins UI, but didn't seem to do anything16:35
balloonshmm16:35
balloonsjam, ah-hah! the disk is full16:43
wpkballoons: ... and there's no nagios to tell anyone ;)17:19
balloonswpk, indeed. Jenkins monitors all the nodes; but not itself17:19
=== frankban is now known as frankban|afk
thedachml: fyi https://bugs.launchpad.net/juju/+bug/173223317:29
mupBug #1732233: Exiting from a debug-hook session puts hook in error state <juju:New> <https://launchpad.net/bugs/1732233>17:29
hmlthedac: was debug-hooks used because of an hook error?  if so was it resolved before exit?17:31
thedachml: I purposefully jumped into debug-hooks run them serially. Tried to exit clean but no matter what I do it goes into error state17:31
thedacall those log entries are me trying exit, exit 0 etc17:32
thedacI then have to do juju resolved --no-retry but this never actually passes relation data as juju thinks the hook has not "run"17:33
hmlthedac: well that’s not cool, i’m trying to remember if we changed debug-hooks recently…17:33
thedacShould be easily reproducible, not specific to openstack17:33
hmlthanks17:33
thedacno problem17:33
hmlballoons: are we back in business with jenkins?18:25
balloonshml, sorry, I missed your ping. I was following up on the pr's that seemed stuck20:48
balloonshml, yours failed to merge "FAILgithub.com/juju/juju/worker/firewaller1502.008s"20:49
balloonscan I get a reivew on https://github.com/juju/juju/pull/8072?20:49
balloonsjust bumping the version20:49
hmlballoons: ty for restarting my merge, that failure is really odd, esp with my change, retrying21:37
wallyworldbabbageclunk: how goes it with the ss stuff?22:12
babbageclunkwallyworld: got confused about it again yesterday afternoon. But going alright again now.22:13
wallyworldok, i'll review once it's ready22:13
babbageclunkwallyworld: have you got a moment for a quick hangout? want to check something with you.22:33
balloonsbabbageclunk, wallyworld, https://github.com/juju/juju/pull/8074. This does juju-versions.yaml now in the snap22:46
balloonswallyworld, babbageclunk, however, note the juju-versions.yaml file will be in /snap/bin/juju; aka, next to the binaries22:48
babbageclunkballoons: nice22:48
balloonstomorrow I'll get the patches included as well, and test it works for how we build / release22:49
balloonsthat will be a bit trickier. I may want to add a note about how to seed an agent yourself22:49
wallyworldballoons: yay, good progress23:20

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!