/srv/irclogs.ubuntu.com/2017/02/13/#juju-dev.txt

wallyworldaxw: ah, i think i understand what you might be saying - snap lxd daemon picks up the request from juju rather than deb lxd daemon and hence the wrong lxd get used. i thought setting LXD_DIR was supposed to cause the right lxd to get used (at least that's what nicholas thought).00:16
menn0wallyworld: I just realised that I hit the same thing with the conjure-up snap last week. it ships with its own version of juju and has LXD issues.00:22
wallyworldmenn0: yeah. adam and nick know about it. not sure exactly what their solution is. my system is still somewhat screwed even after removing lxd snap00:22
wallyworldneed to look at it again today00:23
menn0wallyworld: do you need to run sudo lxd init again?00:23
wallyworldthat's my last resort, yeah00:23
=== ben__ is now known as benk01
axwwallyworld: PR to update azure regions: https://github.com/juju/juju/pull/6969. seeing as we need to get the public-clouds.yaml updated anyway, figured we may as well try and get this one in too?01:25
wallyworldsure01:25
wallyworldaxw: sorry about delay. lgtm but there's a block on landing it due to current QA policy. there's a meeting tomorrow where that will be fixed01:45
axwwallyworld: thanks01:46
menn0wallyworld: were you running into this at bootstrap? 2017-02-13 03:05:33 ERROR cmd supercommand.go:458 new environ: Get https://10.0.8.1:8443/1.0: x509: certificate has expired or is not yet valid03:13
menn0anastasiamac: or is this what you saw? ^^03:13
wallyworldmenn0: similar, yeah, can't recall exact wording03:13
menn0its happening for me r03:14
menn0too03:14
menn0axw: could ^^^ be related to the lxd cert caching change?03:14
anastasiamacmenn0: i did not see this. wallyworld may have... just fixed my lxd to work over the weekend. my failures were related to lxc profile misconfiguration03:14
wallyworldmenn0: i've had to totally purge lxd, manually remove neworks and ip links, lxd init again etc. still not there though - juju can't talk to container03:15
axwmenn0: maybe. or the snap. are you using the snap?03:15
menn0no snaps involved03:15
axwmenn0: you haven't installed the juju or lxd snaps?03:16
menn0not on this machine03:16
axwmenn0: ok. seems most likely related to my changes then. it doesn't happen to me though. can you try and isolate it?03:17
menn0axw: sure. where's the cache file?03:17
axwmenn0: juju will pull certs out of either ~/.config/lxc or ~/.local/share/juju/lxd03:18
menn0axw: I don't have ~/.local/share/juju/lxd03:19
axwmenn0: either/or. if that one doesn't exist, it'll look in ~/.config/lxc03:19
axw(I don't have ~/.local/share/juju/lxd either)03:19
menn0axw: lxc is working fine by itself03:20
menn0axw: I just launched a container with "lxc launch"03:20
axwmenn0: lxc will use the unix socket locally03:21
axwmenn0: so HTTPS won't factor at all03:21
menn0axw: ok right03:21
menn0axw: moving the .config/lxc directory out of the way doesn't fix things03:31
jamthumper: menn0: coming by to say hi?03:31
thumperjam: yeah03:32
menn0jam: coming03:32
axwmenn0: can you try adding some logging to finalizeLocalCertificateCredential in provider/lxd/credentials.go? we should be generating a new cert, uploading it to lxd, and then using that03:35
axwwallyworld: can you pastebin the output of "lxc config trust list" for me?03:41
axwwallyworld menn0: one possibility is that the certs in ~/.config/lxc are expired. mine expire in 2026...03:43
menn0axw: but I moved them out of the way and the symptoms stayed the same?03:43
axwmenn0: ok, weird03:43
axwmenn0: even that doesn't repro for me03:45
axwmenn0: which version of ubuntu, lxd?03:45
menn0axw: well those certs *have* just expired03:47
menn0        Validity03:47
menn0            Not Before: Feb 10 02:21:23 2016 GMT03:47
menn0            Not After : Feb  9 02:21:23 2017 GMT03:47
menn0lxd 2.0.8, xenial03:47
axwmenn0: did you add a lxd cert credential to credentials.yaml? by autoload-credentials perhaps?03:49
menn0axw: no lxd creds in credentials.yaml03:50
axwmenn0: well if you moved the certs, I can't see how their expiry matters :/03:51
menn0axw: unless there's something inside the lxd daemon?03:53
axwmenn0: maybe the server cert is the thing that has expired?03:54
axwmenn0: the client credential changes could be a coincidence. seeing as the client certs you had have expired, possibly the server cert has too03:55
menn0axw: I think you're right. the problem happens with juju 2.0.3 too03:56
axwmenn0: ok, cool03:57
* axw wonders how to regen03:57
axwmenn0: looks like if you delete /var/lib/lxd/server.{crt,key}, lxd will recreate them on startup03:59
axwwallyworld: ^^04:00
menn0axw: they've expired too. that's got to be the problem.04:00
axwmenn0: terribly confusing coindidence :)04:00
wallyworldaxw: yeah, i did that earlier, full lxd reinstall and init was also needed for me04:10
wallyworlddue to wierd network issues04:10
=== frankban|afk is now known as frankban
jamwallyworld: ping if you're around12:29
wallyworldhey12:29
perrito666morning you two12:30
wallyworldevening here :-)12:30
perrito666wallyworld: any urgent bug that was left last night? otherwise ill just pick from the pile12:33
wallyworldperrito666: this one is important https://bugs.launchpad.net/juju/+bug/162321712:35
mupBug #1623217: juju bundles should be able to reference local resources <juju:Triaged> <https://launchpad.net/bugs/1623217>12:35
wallyworldnot sure if it can be done in time12:36
wallyworldie not sure of the scope of any change, haven't looked into it12:36
wallyworldjam: can i help with something?12:40
jamon bug #1577556 they just mentioned that they saw a 'statuses' doc that didn't have a txn-revno. I was a bit confused but it looks like statuseshistory is where we don't use TXNs but 'statuses' we *should* be using txns, right?12:41
mupBug #1577556: unit failing to get unit-get private-address in the install hook <intermittent-failure> <network> <uosci> <OpenStack Charm Test Infra:Confirmed> <juju:Triaged>12:41
mup<juju-core:Triaged> <juju-core 1.25:Triaged> <ubuntu-openstack-ci:Triaged> <mysql (Juju Charms Collection):Fix Released> <https://launchpad.net/bugs/1577556>12:41
jamwallyworld: sorry, wrong bug. actual is bug #148410512:43
mupBug #1484105: juju upgrade-charm returns ERROR state changing too quickly; try again soon <bug-squad> <canonical-is> <upgrade-charm> <upgrade-juju> <juju-core:Fix Released> <https://launchpad.net/bugs/1484105>12:43
perrito666wallyworld: tx12:44
wallyworldjam: yeah, status hisotry avoids txns. but the status doc itself in the status collection should use txn12:44
perrito666afaik the regular status collection does use txns12:45
wallyworldjam: all usages of statusDoc should be in the context of a txn.Op  slice12:46
jamperrito666: right, its supposed to, I was worried we had a case where sometimes we were and sometimes we weren't.12:47
jambut they are reliably (?) hitting a case where the statuses docs are missing the txn-revno, which is bad mojo for the TXN logic12:47
perrito666jam: odd, I wonder if someone confused status with status history12:49
perrito666jam: the latest status added was model status iirc12:49
wallyworldjam: i just did a (quick) code search and can't see anywhere where we are writing to the status collection not using txns12:49
jamperrito666: wallyworld: yeah, I grepped the code as well. It does mention an assert that "txn-revno == 0" which would indicate we tried to read it, got back 0 and then said "well, its gotta stay 0 for this updaet"12:50
wallyworldjam:  you talking about assert := bson.D{{"txn-revno", txnRevno}} in statusSetOps() I assume12:51
jamwallyworld: comment #16 on the bug12:51
jamthere is an enttry in txn queue that says: "a": { "txn-revno": NumberLong(0) }12:52
jammy guess is that it read the doc, saw there was no txn-queue so got the 'zero' value and then put that back into the assert.12:52
jamSo I don't think that is what *caused* the problem in the first place, just those txns all fall over because the data is bad.12:52
jamwe don't really know what caused it to be bad in the beginning.12:53
wallyworldjam: juju uses the txn-revno assert in a set status function (this pattern isused elsewhere too, eg updating ports). but i think that relies on a create being done first to insert the original doc and set the txnrev-no field. and we do have a createStatusOp.  i can't see how we would attempt to set a  status value on a doc with a given doc id without having done a create first12:56
wallyworldand that create should insert the txnrev-no field12:56
jamwallyworld: (a) race? (b) if we're creating the doc with an upsert, we're still using  the txn logic, and the mgo/txns is the thing that keeps txn-revno correct12:57
jamwe're just using an assert to say "if this doc is changed underneath us, ignore this change"12:57
wallyworldthat matches my understanding12:58
perrito666jam: just a wild idea, do you think this could be caused by mgopurge?12:58
wallyworldwe create the doc with an insert and a doc not exists assert12:59
perrito666wallyworld: https://bugs.launchpad.net/juju/+bug/1623217 seems like it will need some spec-ing and agreement from stakeholders12:59
mupBug #1623217: juju bundles should be able to reference local resources <juju:Triaged> <https://launchpad.net/bugs/1623217>12:59
wallyworldprobably, i just saw it and it looked important13:00
wallyworldshould move to 2.213:00
perrito666wallyworld: it has been like that since december, I dont think it is that urgent that we can skip proper procedure :)13:00
perrito666we should get someone started on that spec though13:01
wallyworldfair enough, i just skimmed it13:01
perrito666true, was just a comment not a rant13:01
rick_hperrito666: the goal there is just to allow local file paths like local charms13:01
rick_hperrito666: so hopefully a short spec13:01
perrito666rick_h: yes13:02
jamrick_h: perrito666: not sure how much of a spec it should have, vs you already have a syntax for 'use this local charm' ./path, we just support that for a resource blob as well13:09
rick_hjam: +113:11
jamperrito666: short enough spec for you ? :)13:11
perrito666jam: sure it is, if annyone comes asking ill post that line :p13:13
=== plars-away is now known as plars
=== perrito667 is now known as perrito666
jamperrito666: I didn't think that was actually going into 2.1, and doing it post rc feels a bit late, but if it isn't terribly hard, it is a nice quality-of-life for a bunch of people13:38
perrito666jam: I dont think ill be able to fit this into 2.1 most likely tonight ill move that to 2.213:48
=== balloons26 is now known as balloons
=== frankban is now known as frankban|afk
Dmitrii-Shhttps://github.com/cloud-green/juju-relation-mongodb/pull/6 cmars, mattyw - JFYI: sent out a couple of patches for the mongodb interface18:49
thumpermorning folks19:19
* perrito666 touches tip of hat19:25
mattywDmitrii-Sh, hey there, thanks very much, will take a look19:25
=== elmo_ is now known as elmo
tasdomascould there be a reason why juju 1.25.6 fails to bootstrap an aws model with authentication failure, while juju 2.0.0 works fine (with the same credentials)19:45
thumpertasdomas: no idea sorry19:48
mupBug #1664359 opened: Authentication fails for juju 1.25.6 on aws <juju-core:New> <https://launchpad.net/bugs/1664359>20:27
wallyworldperrito666: thumper: IMO the proposed test fix for that status history test is too lenient - it throws away the notion of the order of the expected status messges23:09
wallyworldthe fix should have been to be lenient with the expected count of messages, not the order23:10
perrito666wallyworld: I can very well send a follow up, since the test belo does test the order I thought it was not really that important23:15
wallyworldit does but the two tests are ostensibly similar - one uses a filter, the other doesn't. they should essentially test the same things the same way23:17
wallyworldjust with nd without  filter23:17
perrito666yes, but there is a race there, that infrastructure was built by william but he did not notice that by inserting the records with identical dates the ordering is non deterministic23:19
perrito666two items with the same date can be returned in any given order, and that is ok23:20
perrito666because in reality it will never happen that two items have the same date to the nanosecond23:20
wallyworldah, i see, that helps explain it a bit23:24

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!