/srv/irclogs.ubuntu.com/2017/06/07/#juju-dev.txt

babbageclunkveebers: yes please!00:09
veebersbabbageclunk: cool, one snuck through, but failed. Let me check why00:13
babbageclunkaxw: eg looks really neat, thanks for the tip!03:25
axwbabbageclunk: cool :)03:25
axwbabbageclunk: FYI the PR I mentioned is https://github.com/juju/juju/pull/7446, the template I used is in the description03:25
wallyworldaxw: what do you think about adding the mongotop metrics to a prometheus collector? and other things like txn.logs size03:28
axwwallyworld: there is an existing prometheus exporter (https://github.com/dcu/mongodb_exporter) which I think we should use if possible. last time I tried to use it, it was a bit panicky03:30
axwwallyworld: not sure if that captures per-collection sizes. if it does not, adding txn.logs size sounds like a good idea to me03:30
wallyworldaxw: agree to use something existing if possible. top gets useful stats which IMO we'd want to graph over time and correlate with other measurements03:31
axwyup03:31
axwwallyworld: I think there might be one already, but if there's not we should look at snapping the mongodb prometheus exporter, to make it super easy to set up on the controller03:36
wallyworldaxw: that would be nice. as an aside, i had a brief look at the prometheus snap itself and didn't see an easy way to tell it to use a given config yaml, but i didn't look too hard03:37
axwwallyworld: there should be an existing config file, I forget where... search for prometheus.yml under /snap/prometheus03:38
axwwallyworld: also see https://awilkins.id.au/post/juju-2.1-prometheus/ if you haven't already, might be helpful03:38
wallyworldaxw: yeah there is one, but it sorta sucks to have to search for it and replace it and restart the process03:39
wallyworldaxw: i already have prometheus running against a local controller; not much to see as it's not busy03:40
axwwallyworld: maybe we should provide a tool to reconfigure a prometheus to add scrape targets for juju controllers?03:40
wallyworldnow that would be good03:41
wallyworldaxw: are you able to look at fixing the introspection worker to support cpu profiling as a quick win?03:45
axwwallyworld: it does support it, it's just the script that's broken03:45
axwwallyworld: I can look at fixing the script if it's really important03:46
wallyworldright, i meant the script. i'm not 100% sure what needs to change. replace GET with curl?03:46
axwwallyworld: I'm not sure either. I can look at it03:46
wallyworldwould be good to have it work out of the box for 2.203:46
wallyworldsince we are upgrading the customer to 2.2 controllers03:46
axwok03:46
=== thumper is now known as thumper-afk
wallyworldjam: on the surface of it, i can't see a way to intercept incoming http connections prior to the tls negotiation stage to reject logins at that point. there's some methods on the tls.Config that appear to be called for each request that we can override, but doing so results in an internal error in the std lib code. did you have any thoughts on how to implement?06:10
jamwallyworld: I didn't have any thoughts yet. my first instinct would be to have a custom Listener06:25
wallyworldjam: yeah, getting the right points to intercept before tls happens is the fun bit06:26
wallyworldbboab, school pick up06:26
jamwallyworld: tls.Config takes a net.Listener06:26
jamso if we wrap the passed in net.Listener with our own06:26
jamI think it could work06:26
jamline 226 of apiserver/apiserver.go06:27
mupBug #1696311 opened: layer-basic does not support centos7 <juju-core:New> <https://launchpad.net/bugs/1696311>06:27
mupBug #1696311 changed: layer-basic does not support centos7 <juju-core:New> <https://launchpad.net/bugs/1696311>06:30
wallyworldjam: yeah, just poking around. part of the issue it's only agent logins we want to throttle. and we only get to read the data off the rpc request to determine that once we've established the secure connection.06:42
mupBug #1696311 opened: layer-basic does not support centos7 <juju-core:New> <https://launchpad.net/bugs/1696311>06:42
jamwallyworld: right, if we just added a 1s sleep, or a load-based sleep, I think we could still get away with it, we could do a *bigger* sleep later06:52
jamor we could do it by IP address06:52
jam'local' addresses get a bigger delay as they are more likely to be agents vs client06:53
jamwe could just slow down all Connects when we're under load/based on number of active connections, etc.06:53
wallyworldthat might work initially06:53
jamand then slow down even further once we get to Login layer06:53
jamwallyworld: to slow down retries, I had initially investigated a sleep before returning the error06:54
jamwhich should still reduce total load06:54
jamits just nice to also reduce it before you get TLS handshake stuff06:54
wallyworldjam: right, i am adding an optional puase to the liniter06:54
wallyworldso Acquire() might not return immediately even if it can get a slot06:55
wallyworldactually, i am looking at pausing before polling the channel06:56
jamwallyworld: you mean pause-before-Accept?06:58
wallyworldin the Acquire() method of limiter06:58
wallyworldpause before attempting to acquire a login slot06:59
wallyworldjuju/utils/limiter.go06:59
wallyworldthat will throttle the agents. maybe not the best place to do it?07:00
wallyworldseems liked it was nice and transparent to the server07:00
wallyworldi guess login limit is only 1007:01
wallyworldso it may not help that much07:01
wallyworldbut it will delay any err retry07:02
wallyworldso that it limits the cost of the agents trying again and again07:02
jamwallyworld: so, I wouldn't do it universally in the generic code, but you could pass in an optional 'time.Duration' if we wanted07:02
jambut just doing it at line 92 of admin.go07:02
wallyworldright, that's what i'm doing07:02
jamknows that we're explicitly rate limiting *logins* right there07:02
wallyworldpassing in an optional duration to NewLimiter()07:03
jamwallyworld: sure, and that's also potentially testable, etc.07:03
wallyworldyep07:03
wallyworldand pausing before Acquire() means the agents are truely blocked07:04
wallyworldas no ErrRetry is issued07:04
wallyworldand s they can't just ping again07:04
wallyworldimmediately07:04
wallyworldor that's my theory anyway07:04
jamwallyworld: sure, before or after Acquire is fine07:05
jamjust before returning an error07:05
wallyworldyep07:06
wallyworldjam: here's a utils PR https://github.com/juju/utils/pull/28107:45
wallyworldbah, i broke APi, I will need to fix07:46
jamwallyworld: I feel like we need (min, max) instead of (0, max) thoughts?07:46
wallyworldyeah ok, can easily add07:47
jamor something like (avg, stddev) where we just pick some value for stddev based on avg07:47
wallyworldand i'll fix the api too07:47
wallyworldhmmm, do we really need that aside from min,max?07:47
jamwallyworld: so its the same effect, just thinking about what is useful to express07:48
jamwell, stddev means you would have a normal distribution instead of a flat one,07:48
jamnot sure that is useful07:48
mupBug #1696311 changed: layer-basic does not support centos7 <Charm Helpers:New> <https://launchpad.net/bugs/1696311>07:48
jamwallyworld: so even just 'max' is better than nothing07:49
jamit just means the 'average' time is going to be 'max/2'07:49
wallyworldjam: i'll add the min, easy enough07:52
wallyworldafter dinner though07:53
jamwallyworld: reviewed07:54
jamwallyworld: I do wonder if we could have a way to know "I've got a lot of load right now, lets slow down active connections a bit more", and provide backpressure08:07
wallyworldjam: i also think that we need to do more - this current change is just a small step08:18
MmikeHi, lads. Is there a way to configure juju to store less than 4GB of logs in mongodb?08:20
=== thumper-afk is now known as thumper
thumperhmm...09:56
thumpertrying to use the peer-xplod charm from the acceptance tests09:56
thumpergetting errors with lxd where it says '/usr/bin/env python' doesn't exist09:56
thumperroot@juju-61a95f-0:~# /usr/bin/env python09:57
thumper/usr/bin/env: ‘python’: No such file or directory09:57
thumperfrom the machine itself09:57
thumperseems like the current lxd xenial images only have python309:59
jamthumper: indeed, xenial doesn't come with python 210:05
thumper:-|10:05
jamthumper: I thought I had dealt with that once in the past, but maybe that was on my version of the charm and not the one they are using ?10:05
jamthumper: 'apt install python2' in 'install'10:05
thumperyep10:06
thumperdid that10:06
thumperalthough i used apt-get so it work on trusty too10:06
jamthumper: sure10:06
thumper:)10:06
jamI have 'apt install -y python' in mine10:06
jamthumper: is it a ~juju-qa charm ?10:08
thumperno, the one in acceptancetests dir in tree now10:08
jamthere are a couple small changes between the one in tree and lp:~jameinel/charms/trusty/peer-xplod10:10
jamnothing particularly major, just the 'apt-get install' and some small things about 'maximum=0' intending to be unlimited10:11
jamthumper: want me to put a PR that brings them in sync?10:11
thumperjam: sure, if you have the time10:11
jamthumper: https://github.com/juju/juju/pull/746310:16
wallyworldjam: here's a WIP which uses the login rate limiting plus a general connection throttle https://github.com/juju/juju/compare/2.2...wallyworld:throttle-controller-connections?expand=110:29
jamwallyworld: WIP, WIP it good :)10:29
wallyworlddoes it look reasonable? i plucked the numbers out of the air10:29
wallyworldfunny man10:30
jamwallyworld: so I'm wondering why we are sleeping longer for Conn than Login10:30
jamwallyworld: I would have thought 1s for conn, and 5s for login10:30
wallyworldi can do that10:30
wallyworldi thought login was limited to 10 at atome anyway10:31
wallyworldbut conns once logged in could grow more10:31
wallyworldprobably flawed thinking10:32
jamwallyworld: so conn affects users as well as agents, but you're right that the login rate limit only triggers once we're at 10 active10:32
jamah sorry, we always acquire so we would always hit that10:32
jambut only for agents10:32
wallyworldyeah, this latest wip does affect clients as well10:33
wallyworldbut if the system is really, really loaded, then even they should wait  abit?10:33
wallyworldthey will see a slow down anyway10:33
jamwallyworld: 1s is fine IMO10:33
wallyworld1s max10:33
jamthe question is whether that is *enough* generally, but adding an extra 5 for agents probably will be10:33
wallyworldand 5ms per conn?10:33
jamwallyworld: so a max 1s delay for Conn to return and a 5s extra delay for Agent Login to return 'go away'.10:34
jamneither is what I'd like in 'ideal world' which would be focused on scaling the numbers based on number of active connections10:35
jambut its probably a start10:35
wallyworldjam: so the 5s max for Accept() was really to attempt to throttle the thundering herd, and the pause time only grows by 5ms per conn10:35
wallyworldyeah, this is a quick win for 2.2rc210:35
jamah, I missed that throttling went up and down10:35
wallyworldon a normally loaded system there should be no noticable difference10:35
wallyworldyeah, it grows as we get more connections accepted10:36
jamwallyworld: so 5s on Conn isn't great. it affects 'juju status' when running on lxd10:36
jam'why is it taking 5s to get a result back with 2 machines'10:36
wallyworldthat's 5s max10:36
jamwallyworld: still avg 2.5s10:37
wallyworldonly if there are 1000 connections10:37
wallyworldthe max time grows10:37
wallyworldwell, that was the intent10:37
wallyworldstart at min 10ms or so, and then the max pause time grows with conn count10:37
jamwallyworld: ah sorry, I'v twisted it in my head,10:37
jamjust got coffe10:37
wallyworldnp, i'm tired so i could have messed up10:38
wallyworldso for accept, on a normally loaded system -> no dicernable difference10:38
wallyworldbut all connections are forced to wait  a bit as conn count grows10:38
jamwallyworld: so, all Accept() attempts have a 10ms floor that increases by 5ms for every active connection10:38
wallyworldyeah10:39
jamup to a max of 5ms from Accept until we do the SSL handshake10:39
wallyworldmax of 5s10:39
jamon Comcast world, that will, on average have 2500/3 = 800, say 1000 active agents10:39
jamevery 'juju status' will be slower by 5s10:39
wallyworldah right because the connections are long lived10:40
wallyworldi could do it based on rate of connection10:40
jamwallyworld: right, not for the *clients* which have to pay that on every connect10:40
jamwallyworld: but all the agents which have long-lived only pay it 1x10:40
jamwallyworld: something like 'number of connections in the last X seconds' would be good10:40
wallyworldyep, that would solve the thundering herd issue10:41
wallyworldi can tweak it10:41
jamwallyworld: (arguably we could do per-IP tracking or something, but again, that would be penalizing users that are actively engaging with the system)10:41
jamwe really just want the pushback on agents10:41
jamand we only know that at the Login time10:41
wallyworldagreed, but we don't concretely know what those ip addresses are at that point10:41
wallyworldwe can gues, but....10:41
jamwallyworld: yeah, I don't think we want to do IP based, cause then you have to track all of that10:41
jamI think just doing 'how many have connected in the last X' and slow it down up to 5s is ok10:42
wallyworldso i reckon 5ms per X rate of new connections10:42
wallyworldyep, up to 5s max10:42
jamwallyworld: I'd then also have Login that is going to *reject* an agent to come back later, wait another 5s10:42
jamwallyworld: which means all the people over the current 10 that we are going to reject, get delayed a little bit extra10:43
jamand I'm not apposed to something that delays before Acquire as well10:43
wallyworldjam: so add a pause when limiter.Acquire() returns false?10:44
wallyworldi think delay before is ok too10:44
jamwallyworld: those are the ones that will be reconnecting 3s later10:44
wallyworldok, i can add another apram to NewLimiter()10:44
wallyworldfixed time to pause if a reject happens10:45
jamwallyworld: its not hard to put it just before the "return ErrRetry"10:45
wallyworldyeah, ok10:45
wallyworldjam: so hopefully the net effect of this (pun half intended) is to allow things to come up more controlled without resorting ti IP tables10:46
jamwallyworld: yeah, we need to set up some testing of 'restart times' so we can tune some of these numbers10:46
wallyworldnext thing would be to throttle log connections10:46
wallyworldyeah, testing needed for sure10:47
jamwallyworld: I can probably set wpk on it today10:47
jamhe seemed interested10:47
wallyworldok, i'll finish this work10:47
jamwallyworld: I'm also curious what the net effect would be if you are running in HA10:47
jama given controller is going to push back, but will the others, etc10:47
wallyworldyeha10:47
wallyworldjam: i almost convinced myself those delay params should be configurable, not consts10:47
wallyworldso we can play with the numbers10:48
wallyworldmaybe via env vars10:48
jamwallyworld: well, I would hack them with ENV vars, etc to test it10:48
jamwallyworld: but it also is something that as soon as we know *we* want a knob10:48
jamsomebody else will ask for it10:48
wallyworldright, but we hide that knob10:48
wallyworldthose env vars are not publicised10:48
wallyworldbut we can ask CI to set up a system with lots of xplod charms, get it to steady state, see how it goes, and then kill the controller and see what happens then as well10:49
wallyworldand tweak the numbers10:49
axwwallyworld jam: https://github.com/juju/juju/pull/7465 has updates to support CPU profiling in the introspection CLI, as well as adding support for easily exposing as HTTP11:09
axwwallyworld jam: I started down the road of just modifying the bash code a little bit, but it was very fragile. so ended up with something a bit more comprehensive...11:10
jamaxw: is this a bit too much for a 2.2 at this point? I suppose we aren't changing the actual socket, nor are we changing the scripts that we used to support11:13
jamjust how they connect11:13
jamand possibly exposing a new thing people will ues11:13
jamits nice to not need to 'apt install socat' all the time11:13
jamsmall note 'juju-introspect' or 'jujud-introspect'... not sure11:14
jammyself11:14
jamI guess it is 'juju-run'11:14
jamthough honestly *that* one is mostly a source of confusion11:14
axwjam: the alternatives I can see are: (a) do nothing, (b) use curl, which makes the command more fragile (because of timing issues, starting socat and curl not necessarily having --retry, and other weirdness around socat)11:15
axwjam: IMO, this could wait for 2.2.1. it's possible to do all these thigns already with 2.2, just not in a neat command11:16
jamaxw: so the singlehostreverseproxy is to handle redirecting HTTP to a unix socket?11:17
jamwell abstract domain sockt11:17
axwjam: yep11:17
jamaxw: to check are we changing the raw content output then?11:20
jamyou made a comment about not having the headers11:20
jamwhich sounds good11:20
jambut does mean the actual output of "juju-goroutines > saved.txt" is going to be slightly different?11:20
jam(AFAICT, it actually means you don't have to munge the file before it is actually useful)11:20
axwjam: yes. it's the same except without the HTTP response header11:20
axwjam: right11:20
jamaxw: my concern is anyone whose scripted it may be removing it themselves and we're breaking that11:21
jamthats the sort of "shouldn't do in a .patch' release", I think11:21
jamaxw: I do believe it was a gotcha trying to use things like the heap profile11:22
jamso ultimately better11:22
jambut probably a risk for putting it into rc2, but also a big win for not breaking it in a .patch11:22
axwjam: I'm not aware of anyone interpreting them anyway - are you? not that that's proof or anything, but I am curious. they've always just been handed back to dev IME11:23
jamaxw: well, *I've* used them to run against go tool, and its always been a pain that you have to munge. Its certainly the sort of thing where I'd want us to be careful with compat11:25
jamaxw: and saying "<2.2.0 you need to trim the front, but we do that automatically in 2.2" sounds much better than11:25
jamin '2.2.1'11:25
axwjam: yep, fair point11:25
jamaxw: I'd *like* others to chime in on the "should it be 2.2.0rc2 or 2.2.1"11:25
jambut you have my vote11:26
axwjam: thanks. I will wait for wallyworld and thumper to chime in at least11:26
jama couple small things11:27
jamyou list the symlinks in one list over here, but individually multiple times over there11:27
jamand 'juju-introspection' vs 'jujud-introspection'.. I'm not sure there, either11:27
jamjuju- matches other things, but really we are introspecting a jujud11:27
axwjam: yep, thanks I'm fixing that list. I'm -0 on jujud-introspect because it has a different prefix to the introspection helpers (juju-goroutines, juju-heap-profile, etc.). they're all about jujud too, but I don't think it'd be helpful to users to have two different prefixes for the same class of commands11:31
jamfairy nuff11:31
axwjam: family's home, gtg. thanks for the review11:32
thumperaxw: shipit for 2.2-rc211:32
thumperaxw: I was just considering something like this myself11:32
thumperso yay11:32
axwthumper: okey dokey. I believe the bot is disabled, so how does one do that?11:32
thumperaxw: one asks one of the QA folk to poke the bot manually11:32
axwah I have to run, I'll check back later11:32
thumperaxw: probably need to get balloons to do it when he starts11:33
jamballoons: ^^ https://github.com/juju/juju/pull/746511:33
* thumper should go to bed11:33
jamwe would like to land that for 2.2rc211:33
thumperwell, go do dishes first11:33
thumpernight all11:33
jamthumper: go sleep :)11:33
marcoceppihow can I upgrade to 2.2-rc1 from a previous stable version?20:03
marcoceppi--agent-version=2.2-rc1 says "ERROR no matching binaries available"20:04
marcoceppiI got it upgrading, but how long should an implace upgrade take?20:31
wallyworldmarcoceppi: see the release  notes for rc1 - we split the logs into per model collections so for this upgrade, it can take a whiile20:43
wallyworldthe upgrade may need to split apart up to 4GB of logs20:43
marcoceppiwallyworld: thanks20:47
wallyworldmarcoceppi: i'm guessing it took maybe 5 or 10 minutes?20:48
wallyworldwe should surface a more complete message that just "upgrading" perhaps20:48
wallyworldthis was done to improve the model destroy performance for large numbers of models20:49
marcoceppiwallyworld: I think my upgrade might be stuck, but I have no way of telling21:07
marcoceppiit was started at 48 after the hour21:07
wallyworldwas it a big deploy?21:08
marcoceppidisk space consumption has not changed, and the logs are mostly filled with "login denied, upgrade in progress"21:08
marcoceppi6 machines21:08
marcoceppi1 model21:08
marcoceppibut it was a 2.0.4 -> 2.2-rc121:08
wallyworldshould work though21:09
wallyworldare you able to get a mongo shell and do a db.logs.size() and also a size on the new model logs collection to see if the records are still being copied?21:09
wallyworldthe new logs collection is something like logs.<modeluuid>21:10
marcoceppiwallyworld: how do I get a mongo shell?21:11
wallyworldssh to controller, and then mongo --ssl -u admin -p <oldpassword> localhost:37017/admin --sslAllowInvalidCertificates21:12
wallyworldwhere oldpassword is sudo grep oldpassword /var/lib/juju/agents/machine-0/agent.conf21:12
wallyworldthen once in shell, do a "use juju"21:12
wallyworldthat selects the juju database21:13
marcoceppilet me take a look21:16
=== cargonza_ is now known as cargonza
babbageclunkwallyworld: should I pick up a bug from the release blockers section?21:40
wallyworldbabbageclunk: in release call now, just discussing what needs to be done21:40
babbageclunkok21:40
marcoceppiwallyworld: I get login fialed with that command22:03
marcoceppibut the upgrade completed22:03
marcoceppiso I don't care anymore22:03
wallyworldmarcoceppi: sweet, ok. but we should report better22:05
wallyworldbabbageclunk: HO in standup?22:07
babbageclunkwallyworld: sure22:09
marcoceppiwallyworld: I do have another problem22:16
marcoceppisince the ugprade `juju models` hangs22:17
wallyworldmarcoceppi: ah bum, ok22:21
wallyworldwe haven't seen that22:21
babbageclunk:(22:21
wallyworldcan you turn on debug logging and see what it says?22:21
wallyworldraise a bug for sure with as much detail as possible22:21
marcoceppiwallyworld: it just says connected to ws22:29
wallyworldmarcoceppi: does show-model work?22:31
marcoceppiwallyworld: add and destroy model work22:31
wallyworldshow-model?22:32
marcoceppiwallyworld: nope22:32
marcoceppiwallyworld: http://paste.ubuntu.com/24803880/22:32
marcoceppiwallyworld: it says "connection established" then that's it22:32
thumperwell bollocks22:33
wallyworldmarcoceppi: can you turn on debug logging and provide a snippet from juju debug-log22:33
marcoceppiI think debug logging is on?22:33
wallyworldjuju model-config logging-config="<root>=DEBUG;"22:33
thumpermarcoceppi: juju debug-log -m controller22:34
marcoceppimodel config hangs22:34
thumperthis is a pretty serious regression22:34
wallyworldlook at current logging-config first so you can set it back later. juju model-config22:34
marcoceppimodel-config hangs all together22:34
wallyworldwtf22:34
marcoceppito be fair, two hours ago this was a 2.0-beta18 controller22:35
thumpermarcoceppi: wat?22:35
wallyworldcan you log onto the controller and look at the apiserver.log file22:35
marcoceppi2.0-beta18 -> 2.0.4 -> 2.2-rc122:35
thumpermarcoceppi: I'm not sure beta 18 was upgradable22:35
marcoceppithumper: well, 2.0.4 worked22:35
thumpermarcoceppi: we didn't say upgradable until 2.0-rc122:35
thumperhmm...22:35
thumperin theory, it should work22:36
thumpermarcoceppi: 'juju debug-log -m controller --replay | pastebinit'22:36
wallyworldonce we see server logs, we can deduce what's wrong hopefully22:36
marcoceppiwell now everything is hanging22:36
marcoceppilet me see what is happening onthe server22:36
marcoceppiload of 13, helllooo22:38
marcoceppiokya, model-config works, models doesn't22:39
marcoceppithumper: http://paste.ubuntu.com/24803956/22:44
marcoceppiwallyworld: ^22:45
thumpermachine-0: 18:38:52 DEBUG juju.utils setting GOMAXPROCS to 122:46
thumperhuh?22:46
marcoceppimy hope is I can just "model migrate" this to 2.2.0 and resolve a lot of whatever the hell I did22:46
thumperI wonder why we are seeing so much of this: machine-0: 18:38:54 DEBUG juju.mongo dialled mongodb server at "10.142.0.2:37017"22:47
marcoceppiyou all want ssh?22:49
wallyworldthumper: it appears the api worker can't start22:49
wallyworldmaybe22:49
marcoceppijujud is pegging this controller at 100%22:50
marcoceppibut it's been doing that since 2.0-beta1822:50
marcoceppihappy to give this vm more resources if that's what it takes22:50
thumpermarcoceppi: probably a broken setup...22:50
thumperit shouldn't be doing that22:50
marcoceppithat's what I wanted to go to 2.2, get them perf fixes22:50
thumperheh22:50
thumpermarcoceppi: need to do this "juju model-config -m controller logging-config=juju=debug"22:51
marcoceppiand CMR ,and like all the other good things22:51
thumperthen some debug log over the models call22:52
marcoceppiI've apparently exhausted memeory22:57
marcoceppihttp://paste.ubuntu.com/24804029/22:57
marcoceppiI'mve going to bump up the VM22:58
marcoceppirebooted, more cpu/ mem23:05
marcoceppinow I get this23:05
marcoceppimarco@T430:~$ juju models23:05
marcoceppiERROR cannot list models: upgrade in progress (upgrade in progress)23:05
marcoceppimarco@T430:~$ juju switch controller23:05
marcoceppisilph.io-prod1:admin/test -> silph.io-prod1:admin/controller23:05
marcoceppimarco@T430:~$ juju status23:05
marcoceppiModel       Controller      Cloud/Region     Version  Notes                               SLA23:05
marcoceppicontroller  silph.io-prod1  google/us-east1  2.2-rc1  upgraded on "2017-06-07T21:13:29Z"  unsupported23:05
marcoceppiApp  Version  Status  Scale  Charm  Store  Rev  OS  Notes23:05
marcoceppiUnit  Workload  Agent  Machine  Public address  Ports  Message23:05
marcoceppiMachine  State  DNS            Inst id        Series  AZ          Message23:05
marcoceppi0        down   35.185.85.250  juju-c9c599-0  xenial  us-east1-b  RUNNING23:05
marcoceppimarco@T430:~$ juju models23:05
marcoceppiERROR cannot list models: upgrade in progress (upgrade in progress)23:05
marcoceppicrap23:06
marcoceppihttp://paste.ubuntu.com/24804067/23:06
thumpermarcoceppi: it may well be migrating the logs23:23
thumpermarcoceppi: that will take some time23:23
thumpermarcoceppi: to move 4G of logs on my laptop with an SSD was over 7 minutes23:23
=== blahdeblah_ is now known as blahdeblah
axwveebers: hey, would you please land https://github.com/juju/juju/pull/7465 for 2.2? it has thumper's seal of approval23:52
thumperaxw: we asked veebers to stop making 2.2 special for now23:52
axwthumper: ah ok23:52
=== JoseeAntonioR is now known as jose
veebersthumper: ah yeah, I'll fix that up now, sorry23:53
thumperbut we'll keep an eye on who submist what23:53
thumperveebers: thanks23:53
axwokey dokey23:53
veebersthumper, axw: done it should just go through as per normal (once picked up)23:54
axwveebers: cheers23:54
veebersthumper, axw: any idea what else needs to land for rc2?23:55
axwveebers: azure auth stuff23:55
axwveebers: which has changed since I reviewed it, re-reviewing now23:55
thumperveebers: I'm adding some stuff around state export23:55
thumperveebers: wallyworld is working on a statushistory deletion bug23:56
thumperveebers: possibly wallyworld's connection backoff code23:56
thumperaxw: can I get you to look over that too?23:56
wallyworldbabbageclunk is working on the delete bug23:56
thumperwallyworld: ok, ta23:56
axwthumper: sure23:56
veebersthumper, axw: ack. If you can keep burton and myself in the loop so we know which CI runs to track (and baby) so we're ready to rock and/or roll when needed for release23:57
thumperhmm... dealing with a facade bump where we change the args and return values...23:57
thumperveebers: yep, sure23:58
thumperveebers, wallyworld: we also need to work out why the capped collection overflow didn't stop the agents23:58
thumperit *should* have caused all agents to stop immediately23:58
wallyworlddepends if CPU was overloaded etc23:59
wallyworldagents stop once channel selects are processed etc23:59

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!