/srv/irclogs.ubuntu.com/2017/01/17/#juju-dev.txt

anastasiamacthumper: babbageclunk: can we cap log collection in mongodb? bug 165643000:17
mupBug #1656430: juju logs should be a capped collection in mongodb <sts> <juju:New> <https://launchpad.net/bugs/1656430>00:17
babbageclunkanastasiamac, thumper: I don't think mongo allows deleting from a capped collection, so the pruner couldn't work. I'm not sure whether that would cause any knock-on problems.00:21
anastasiamaclove the error msg \o/ - "Something wicked happened" :D00:24
anastasiamacbabbageclunk: thnx!00:32
anastasiamacbabbageclunk: but interestingly, would we need prunner if log collection was capped anyway?..00:40
thumperanastasiamac: the problem with a capped collection is one noisy model can remove logs for another model00:55
thumperbut...00:55
thumperperhaps there are ways.00:55
anastasiamacthumper: yep. marked the bug as invalid... but thought it was interesting thought-flexing exercise :D00:56
redircollection per model01:01
thumperanastasiamac: I reopened that bug but with a different subject01:30
thumperMiguel_Ubuntu: what is the problem you are hitting?01:31
anastasiamacthumper: k01:31
redirbbl after dinner and stuff01:40
redirnight05:49
junaidaliperrito666: output of juju status -> http://paste.ubuntu.com/23814800/ after restore05:57
junaidaliI'm unable to remove machines with 'down' status and 'no-vote' as controller-member-status05:58
=== Spads_ is now known as Spads
=== akhavr1 is now known as akhavr
=== akhavr1 is now known as akhavr
jamwallyworld: ping. IIRC your team worked on "show-status-log". I'm trying to debug some provisioning errors, and I see the messages show up while the provisioner is trying10:48
jambut as soon as the provisioner gives up10:48
jamthe machine goes to "pending", ""10:49
jam(empty message)10:49
jamand "juju show-status-history" only shows the "pending", "" entry10:49
jamnone of the ones that have been giving me "failed because of X" messages.10:49
jamaxw: ^^ in case you know something about it, too10:52
perrito666jam: I can help you with that, just give me a couple of mins to discover where I left my glasses when I woke up11:28
perrito666jam: you mean the status history is being re-written?11:29
perrito666aaand I just realized you asked this like an hour ago11:30
jamperrito666: I mean that I'm testing what happens when provisioning fails. And if I watch "juju show-machine X" I can see the messages about failing and will retry11:30
jamperrito666: but as soon as the provisioner decides that its done trying11:31
jamperrito666: the message ends up as ""11:31
jamwhich is rather unhelpful11:31
jamlooking at the code in the Provisioner11:31
jamI see it wanting to call setErrorStatus which should set the machine into an Error state, instead of a Pending state11:31
jamand have a nice looking message there.11:31
jamwell, at least an informative one.11:31
perrito666jam: that is rather odd, and how is show-status-log involved here?11:32
jamperrito666: it also has only 1 entry "pending", ""11:32
jamwhich doesn't match the 5+ messages we just set about "could not do what you wanted"11:33
jamperrito666: and I was hoping  to, you know, *see* the reason why it had been failing.11:33
jamperrito666: mongodb has the same content as "juju status" and "juju show-machine" and "juju show-status-log" which means at least the reporting layer isn't lying11:33
perrito666jam: mmmm, interesting11:38
perrito666so I can do an educated guess, iirc, there used to be this rule: "You cant set error without the data field being populated" if still in place, the status set for error might be failing11:39
perrito666status history setting happens inside set status and is non guaranteed and non stopping so even though set status history fails, set status might succeed, so its not that breaking it, or should not be11:40
jamperrito666: hm. it seems to be setting the values on the agent's message11:44
jamperrito666: https://pastebin.canonical.com/176150/11:44
perrito666checking11:44
jamso while the values are there for something11:44
jamit isn't the machine object11:44
jamperrito666: is there a way to get the history for the "juju-status" portion of 'show-machine' ?11:46
jamperrito666: After debugging a bit more I do see a 'statuseshistory' entry for something11:46
jam"globalkey" : "m#1/lxd/6#instance", isn't interesting, but "globalkey" : "m#1/lxd/6" is11:47
jamperrito666: it looks like we added code so that if a machine isn't Stopped or Pending, then it overwrites the value of the juju-status field with "agent not communicating"11:50
jamhowever, it doesn't handle if the status is Error11:50
jamperrito666: juju show-status-log --type juju-machine XXX is what I wanted11:51
perrito666jam: sorry someone was at the door back to you11:59
perrito666jam: mm that is iirc, what cheril added and most likely I finished which is a proper "hardware" status12:00
jamperrito666: "what I wanted" meaning that's where the actual information *is* but it was quite confusing to find.12:00
perrito666we hold status for agent which is juju agent and "instance" which is the underlying status of the actual hardware12:00
jamand the fact that we were setting a field which defaults to being overridden12:01
perrito666jam: now, that was a bad design decision, I wonder we we did that12:04
jamperrito666: I think the idea is that you can't trust the status if the agent isn't communicating/you want to let the user know that the status is stale.12:04
jamperrito666: but I think it fundamentally is just "we should be setting InstanceStatus" during provisioning12:04
jamnot Status12:04
perrito666ahhh indeed, but that should not override error12:04
jamperrito666: it *doesn't* override Pending or Stopped12:05
jambut that is the only check12:05
jamI don't know whether Error was just not thought of12:05
perrito666jam: btw, instance status is set during provisioning iirc12:06
jamperrito666: *if* you call apiserver.provisioner...machine.SetInstanceStatus it will call that and machine.SetStatus12:08
jamperrito666: however, the provisioner code itself *only* calls SetStatus, *not* SetInstanceStatus12:08
jamperrito666: when there is a provisioning failure12:08
jamperrito666: unless the machine.SetStatus client-side is actually calling SetInstanceStatus12:08
jamperrito666: however, i'm not seeing any history in "juju show-status-log --type machine 1/lxd/3"12:09
perrito666jam: but there is an instancestatuspoller12:10
jamperrito666: this is container provisioning stuff that I'm specifically focused on.12:11
jamperrito666: but I'm pretty sure the maas status messages also end up in "juju-status" not "machine-status"12:11
jamI could be wrong there12:11
jamits been a while12:11
perrito666jam: mmm, odd, I wonder if the filter is ok... try getting all types and see if your global key shows (I dont recall the actuall syntax for this)12:12
perrito666I am pretty sure there is a thing called instancesomethingpoller that populates the instance status12:12
jamperrito666: so m#1/lxd/6/ is interesting m#1/lxd/6#instance is not12:13
jamfor the purposes of seeing "failed to create instance"12:13
jamsort of thing12:13
perrito666jam: I see, we need to polish that then12:14
jamperrito666: well, its what I'm working on *right now*, fortunately :)12:14
perrito666jam: would it be too much of a hassle to ask you to put up a bug with that info pointed in my direction?12:14
perrito666ahhh12:14
jamperrito666: bug #165025212:14
mupBug #1650252: juju add-machine lxd:N --constraints INVALID does not show provisioning error <lxd> <observability> <provisioning> <ui> <juju:Triaged> <https://launchpad.net/bugs/1650252>12:14
perrito666I thought you where workin on something else and got hit by this issue12:14
jamperrito666: I got hit by this issue when I refuse to start an LXD instance because of a misconfiguration, and no error is shown to the user.12:15
perrito666gotcha, I believe there is another ux pain point there where instance status is not getting the right info12:16
jamperrito666: so do you think that if juju-status is in Error it should not suppress the message when the agent is not alive?12:16
perrito666jam: I am unsure if that is the right place to show that error12:17
perrito666I mean, its not an error from the agent12:17
perrito666we are posting "there should be an agent here, but we could not give you one"12:17
perrito666why is it that memory leaks never come up when one needs them :p12:43
=== akhavr1 is now known as akhavr
=== akhavr1 is now known as akhavr
natefinchvoidspace: your mic is not working15:07
voidspacenatefinch: thanks15:20
=== akhavr1 is now known as akhavr
redirpong17:38
rediroops17:38
=== akhavr1 is now known as akhavr
=== akhavr1 is now known as akhavr
* thumper sighs22:57
thumpermore freaking intermittent failures22:57
* thumper picks one22:57
redir:|22:58
thumperfreaking peergrouper tests...22:59
thumperhttp://reports.vapour.ws/releases/issue/5617dbc6749a562f5cdd8efc22:59
* thumper dives on it22:59
* perrito666 tried to get mongo to accept 0.25G as a way of expressing 256M23:01
menn0babbageclunk: bug 1569632 is done right?23:02
mupBug #1569632: indicate "migrating" in show-model status output <juju:Triaged by 2-xtian> <https://launchpad.net/bugs/1569632>23:02
perrito666ghaaaaaaaaaaaaaa, this only became a float in 3.423:03
* perrito666 cries on the floor23:03
perrito666are we getting mongo 3.4 rsn?23:04
menn0perrito666: we probably should23:04
perrito666menn0: yup, especially because until that wired tiger cache is bound to take 1G as the minimum possible parameter23:05
menn0perrito666: really?23:06
perrito666menn0: well the command line param does not support floats until 3.423:06
perrito666so we can allow it to choose but that does half the ram minus 1g23:06
menn0perrito666: and it doesn't take a unit/23:06
menn0?23:06
perrito666nooope23:06
perrito666technically it does23:06
perrito666its /var/lib/juju/init/juju-db/juju-db.service23:06
perrito666--wiredTigerCacheSizeGB23:07
perrito666there23:07
perrito666so, it takes one unit :) GB23:07
menn0perrito666: well that just sucks23:07
perrito666after standup Ill glog my upload  by deploying a hughe bundle and see how this new setting bodes (even If I ask for 1G it will be better than allowing it to grow at will23:08
perrito666s/glog/clog23:08
menn0perrito666: regardless, it might be worth starting the ball rolling for moving to mongodb 3.423:11
perrito666yup, I just need to try and remember who was the packager23:11
menn0perrito666: was it mwhudson ?23:16
perrito666yes, tx23:16
perrito666sorry I am a bit distracted today23:16
perrito666mwhudson: hello, you might remember me from, lets upgrade to mongo 3.1 and lets upgrade to mongo 3.223:16
perrito666mwhudson: lets upgrade to mongo 3..423:16
menn0thumper: are you available for a quick hangout?23:20
thumpersure23:21
thumpermenn0: 1:1 hangout?23:21
menn0thumper: yep23:21
=== akhavr1 is now known as akhavr
wallyworldthumper: when you are free, PTAL at PR 6815, issues fixed23:39
wallyworldthanks for review23:39
perrito666so, hangouts or bluejeans?23:41
anastasiamacperrito666: ho23:45

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!