/srv/irclogs.ubuntu.com/2017/02/22/#juju-dev.txt

=== thumper-dogwalk is now known as thumper
thumperblahdeblah: I'll let you know, thanks for filing it00:05
thumperexternalreality: Just making some toasted sandwiches, with you in about 15-20min00:05
externalrealitythumper: +1 for toasted sandwiches. Cya in 2000:06
thumperblahdeblah: your comment one seems to be missing the results of the actual calls00:23
blahdeblahthumper: That was intentional; mostly for anyone in the general public who runs into the problem but doesn't have access to the pastebins00:24
thumperblahdeblah: ah00:24
thumperok00:24
mupBug #1662272 changed: Agents stuck with "dependency not available" <juju-core:Won't Fix> <https://launchpad.net/bugs/1662272>00:27
wallyworldthumper: i have a question, got 2 minutes?01:29
thumpergimmie 5?01:29
thumperwallyworld: now is fine01:32
wallyworldsure, HO?01:32
stokachuanyone seen this yet: https://bugs.launchpad.net/juju/+bug/166672201:35
mupBug #1666722: juju 2.1 fails to deploy machines in localhost with lxd 2.9.2 <conjure> <lxd-provider> <juju:New> <https://launchpad.net/bugs/1666722>01:35
stokachunot sure why it was marked incomplete though01:35
anastasiamacstokachu: i've marked incomplete coz i cannot read the pastebin with log, plz attach log as a file :D01:37
stokachuuh01:37
stokachuwell i can read it just fine01:38
anastasiamac:D01:39
stokachuk added01:40
thumperwallyworld: hangouts dropped you01:43
thumpernow I get "can't start call due to an error"01:43
thumperhazaar01:43
wallyworldthumper: yeah, trying to rejoin01:43
wallyworldthumper: i'm back in join, waiting for people to join01:44
wallyworldwell i was01:44
wallyworlddropped me again01:44
thumperI'm still getting error01:44
thumperwallyworld: oh well...01:45
thumperlet's not bother then01:45
wallyworldok01:45
wallyworldyou answered my question, ty01:45
anastasiamacwallyworld: thumper: m looking at some logs from 2.1 today's tip and am seeing 2017-02-21 23:09:40 ERROR juju.state database.go:243 using unknown collection "remoteApplications"01:53
stokachuthink i figured it out01:53
anastasiamaci thought we have dealt with "remoteApplications"?01:53
thumperanastasiamac, wallyworld: all access should be behind a feature flag01:53
anastasiamacstokachu: m seeing01:54
anastasiamacStatus:"provisioning error", Info:"A root disk device must have the \"pool\" property set."01:54
anastasiamacstokachu: what did u figure? :D01:54
stokachuanastasiamac, yea lxd 2.9 requires a pool defined01:54
stokachu  root:01:54
stokachu    path: /01:54
stokachu    type: disk01:54
stokachuso that needs to be set in the profile01:54
stokachuwhich it does by default it looks like01:54
wallyworldit all was behind a flag as far as i knew. there was an issue in 2.001:54
wallyworldin the megawatcher but that was fixed01:54
stokachuanyway anastasiamac you can close that issue01:54
anastasiamacthumper: m fairly certain feature flag was not enabled on this environment01:54
stokachujust requires "pool: default"01:54
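[Editor's note] A minimal Go sketch of the fix stokachu describes: a root disk device (type "disk", path "/") gets a "pool" entry when it lacks one. The function name ensureRootPool and the plain map representation are hypothetical illustrations, not the LXD client API or juju's own code.

```go
package main

// ensureRootPool returns a copy of a profile's device map in which every
// root disk device (type "disk", path "/") has a "pool" property.
// Hypothetical helper: the map layout mirrors the YAML pasted in the chat,
// not a real LXD or juju type.
func ensureRootPool(devices map[string]map[string]string, pool string) map[string]map[string]string {
	out := make(map[string]map[string]string, len(devices))
	for name, dev := range devices {
		// Copy the device so the caller's map is left untouched.
		copied := make(map[string]string, len(dev)+1)
		for k, v := range dev {
			copied[k] = v
		}
		// Only fill in the pool when it is missing on a root disk device.
		if copied["type"] == "disk" && copied["path"] == "/" && copied["pool"] == "" {
			copied["pool"] = pool
		}
		out[name] = copied
	}
	return out
}
```

Called with the "root" device from the paste and the pool name "default", the result carries path "/", type "disk" and pool "default".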
anastasiamacstokachu: could u plz add a note as to what u did for posterity :D01:55
stokachuyea01:55
anastasiamacstokachu: and just to confirm, u did not have any feature flags enabled?01:55
wallyworldthe latest juju 2.1 should set up the profile01:55
wallyworldthere's nothing that conjure up should need to do01:55
wallyworldIIANM01:56
stokachunope no feature flags01:56
anastasiamacwallyworld: stokachu is on the latest (from today) 2.1 :D01:56
stokachuwallyworld, yea i need to update our spells now01:56
stokachuto account for the new way lxd storage works01:56
stokachuadded a note to the bug01:56
anastasiamacstokachu: thnx \o/01:57
anastasiamacwallyworld: I am also seeing failures related to trying to config mongo... http://pastebin.ubuntu.com/24043782/01:58
stokachunp01:58
wallyworldthat doesn't make sense, those kernel params do exist01:59
anastasiamacwallyworld: ¯\(ツ)_/¯01:59
wallyworldis this being done inside a container?01:59
anastasiamacstokachu: ^^02:00
stokachuanastasiamac, what is that from02:00
anastasiamacstokachu: from reading the logs u've attached to the bug... were u inside the container?02:01
stokachuyes02:01
wallyworldthose lone numbers don't match02:01
wallyworldline02:01
stokachuyea im not sure those error messages relate to what i was doing02:02
anastasiamacwallyworld: the full log is in bug 1666722 ... m calling as I see it :D02:02
mupBug #1666722: juju 2.1 fails to deploy machines in localhost with lxd 2.9.2 <conjure> <lxd-provider> <juju:Invalid> <https://launchpad.net/bugs/1666722>02:02
wallyworldbut it implies the latest 2.1 is not being used02:02
stokachuthis is from my snap which i rebuilt several hours ago02:03
wallyworldoh wait02:03
stokachuto make sure i pulled in the 2.9 fixes02:03
wallyworldsorry, i'm looking at the wrong branch02:03
wallyworldsigh, too many tabs02:03
stokachuwallyworld, :D02:03
wallyworldok, so just looked, those "errors" are poor debug messages02:04
wallyworldsigh02:04
wallyworldthey can be ignored02:05
stokachuwallyworld, all good :D02:05
anastasiamacwallyworld: what about "remoteApplications" one?02:14
wallyworldNFI. it's harmless. will need more context to track it down. what was the user doing, what commands were being run etc02:14
wallyworldeverything looks like it's behind the flag02:15
wallyworldsomething is leaking though it seems02:15
stokachuwallyworld, also keep in mind i bumped the logging way up02:15
wallyworldstokachu: yeah, but here juju was logging this as an error02:15
stokachuwallyworld, http://astokes.org/juju/2/api/debugging i do this everytime i need to investigate stuff02:15
wallyworldand it should not have been02:15
stokachuah ok02:15
anastasiamacwallyworld: i believe that stokachu was just bootstrapping at the time of that message02:16
stokachubootstrap actually worked02:16
stokachuit was the deploying of applications02:16
wallyworldyeah, that remote thing is harmless02:16
wallyworldif it showed up after a deploy was run that gives some context02:16
anastasiamacstokachu: m not talking about what operation failed but what operation was run when that log message appeared.. it's in your log anyway02:17
stokachuok02:17
anastasiamacbut since wallyworld is happy there is no impact (and he would know)...02:17
wallyworldor so i think02:17
wallyworldit doesn't appear to from what i've seen02:18
wallyworldjust need to track it down and remove the noise02:18
anastasiamacwallyworld: m not clear why this message would appear if everything is behind a feature flag (regardless of whether it was bootstrap or deploy)02:18
wallyworldexactly02:19
anastasiamacwe have been bitten before by similar things :D02:19
wallyworldthat's my point02:19
anastasiamack. i'll leave it with u, master02:19
wallyworldthere's been a leak somewhere02:19
wallyworldit all should be behind a flag. i've looked through the code and it appears that's the case, but something somewhere is misconfigured02:20
jamperrito666: pong02:43
menn0thumper: tech board?03:02
axwjam: still doing standup? do you have anything to discuss?05:03
jamaxw: brt05:04
anastasiamacaxw: when u get a chance, PTAL https://github.com/juju/juju/pull/701705:12
blahdeblahanastasiamac: How is 1662272 considered non-critical, if restarting machine-0 agent, juju-db, and unit agent doesn't fix it?05:13
blahdeblahSeems like it's 1587644, only with more severe symptoms.05:13
axwanastasiamac: LGTM05:35
anastasiamacaxw: \o/05:36
jamaxw: anastasiamac: wrt persistent storage, what happens if you destroy the model or even the controller?06:14
jamaxw: do we intend to leave *persistent* storage as above Model or Controller scope?06:14
jamaxw: or at least have a way to say "this disk outlives us all"06:14
jamaxw: (came up in a discussion with anastasiamac about 'destroy-controller' and how it interacts with disks)06:15
jamspecifically, if the storage is going to outlive the machines we are killing, we probably need to *try* to do a clean shutdown, so the content on the disks can be in a consistent state.06:15
axwjam: eventually I want to give users a way of disowning storage, but for 2.2 it'll still be owned by the model06:24
jamaxw: what are your thoughts about fast-pathing tear-down. I believe right now we tell everything "I want you to die", and then wait for everything to tear themselves down and fire the Dead hooks06:25
axwjam: indeed that is what we do. what's the problem?06:26
jamaxw: it takes 10 minutes to "juju destroy-controller" when you're throwing everything away06:27
* axw nods06:27
jamaxw: what value is there in triggering "relation-departed" on a machine that is being terminated06:27
axwjam: so I think we want to at least wait for the units to be Dead, because they could interact with external things06:28
jamaxw: its different if you're just killing 1 machine06:28
jambut you also know that you're killing all of its peers06:28
axwjam: we also want to clean up manual machines, because their lifetime is not under our control06:28
axwjam: we could probably fast-path cloud machines, though there is some interaction with storage at least (destroying an instance with cloud storage attached can have negative consequences)06:29
axwIIRC on AWS, destroying an instance with an EBS volume attached can make AWS sad06:30
jamaxw: how so? I thought terminating a machine is natural06:31
axwjam: I just recall instances getting wedged in a state where they wouldn't cleanly terminate, and having to force-terminate them. pretty sure it's documented too06:32
jamaxw: so you have to do "umount" inside the instance before terminate?06:34
axwjam: indeed. http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ebs-detaching-volume.html06:35
axw"You can detach an Amazon EBS volume from an instance explicitly or by terminating the instance. However, if the instance is running, you must first unmount the volume from the instance."06:35
=== frankban|afk is now known as frankban
jamaxw: perrito666: https://github.com/juju/testing/pull/121 is a small tweak to the testing/mgo startup code. On my Mac laptop it only has mongo3.4 and that seems to have changed the content of the "waiting for connections" line.08:57
jamthe tweak is to be slightly less strict, which I'm hoping is still sane. It seems odd that we would have a different 'waiting for connections' that we *shouldn't* treat as mongo being ready08:58
axwjam: LGTM08:59
axwjam: I suspect it's to do with how the logging works, it's probably not skipping a frame like on other platforms. not entirely sure tho09:06
axwlooking at blame in mongo, seems the code that logs that hasn't changed in a way that would explain the difference09:07
axwanyway, it's a safe change09:07
jamaxw: I believe I responded to all of your review comments. Can you look at https://github.com/juju/juju/pull/6988 again?09:20
axwjam: sure09:20
perrito666It is too early to exist09:49
perrito666Jam you have too much faith in me if you expected me to be up and reviewing code at 5:57 am09:50
mupBug #1662272 opened: Agents stuck with "dependency not available" <canonical-is> <juju-core:New> <https://launchpad.net/bugs/1662272>09:52
perrito666jam: ping me when you are around please10:19
jamperrito666: good morning. It was more of a "I don't know who might end up being around, so I'll leave a note for someone to handle in the future"10:19
jamI'm around, just got done with the cloud town hall call10:19
perrito666ill wait for the recorded10:21
* perrito666 started the day with an interesting headache10:21
perrito666jam: I thought about the same thing your mail mentions (re noproxy) last night but the code is confusing; when tracing where the NO_PROXY contents come from they seem to be ultimately lifted from the env, which makes no sense10:22
jamperrito666: there's another one in a couple of hours that is the US based timezones10:23
jamperrito666: so there are a couple of things about NO_PROXY, one is that the "proxyupdater" object actually sets the ENV variables10:24
jamso that things we spawn have them set10:24
jamand we've done some semi-bad things during 'init' times where we potentially race with the values that we're going to be setting in the future.10:25
jamhowever, the concrete "what is in no-proxy" has an answer that I posted in the bug10:25
jamwhere we had a bug about "API server addresses should be in no_proxy"10:25
jamwhich led us to just iterating over APIHostPorts (which can include 127.0.0.1 I'm *pretty* sure)10:25
jamwe can just always add "localhost" to it, and we can consider if we want to add the target machine's known IP addresses as well.10:26
perrito666it does include both localhost address and a third ip which I think is state server10:26
perrito666duh, I was right next to that code looking for a cleaner implementation of no proxy :p10:30
jamperrito666: we have a loop of APIHostPorts which happens that all controller machines have a 127.0.0.1 address10:31
jamso it isn't really by *intent* that we add 127.0.0.1, its more by accident because we are adding "all known addresses for Controllers"10:31
perrito666yes, just landed there, I passed right through it expecting to find a place where we set default values for no proxy instead10:32
jamwe only set those values if you've set anything in no_proxy10:33
jamarguably we should be setting those values either10:33
jama) always10:33
jamb) when any of the *_proxy values are set10:33
jamotherwise you might set http_proxy, but never set no_proxy, and then we're back to leaving the Controllers as being accessed via a proxy10:33
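[Editor's note] A rough Go sketch of the behaviour jam proposes here: merge the user's no_proxy value with the controller addresses and always include "localhost". The name buildNoProxy and the "host:port" string input are assumptions for illustration; juju's proxyupdater and APIHostPorts types are not reproduced.

```go
package main

import (
	"net"
	"sort"
	"strings"
)

// buildNoProxy merges a user-supplied no_proxy value with the hosts of the
// controller API addresses and always includes "localhost".
// Hypothetical helper: apiHostPorts is a plain list of "host:port" strings.
func buildNoProxy(userNoProxy string, apiHostPorts []string) string {
	seen := map[string]bool{"localhost": true}
	for _, entry := range strings.Split(userNoProxy, ",") {
		if e := strings.TrimSpace(entry); e != "" {
			seen[e] = true
		}
	}
	for _, hp := range apiHostPorts {
		host, _, err := net.SplitHostPort(hp)
		if err != nil {
			host = hp // assume a bare host with no port
		}
		if host != "" {
			seen[host] = true
		}
	}
	// Sort so the resulting value is stable between runs.
	entries := make([]string, 0, len(seen))
	for e := range seen {
		entries = append(entries, e)
	}
	sort.Strings(entries)
	return strings.Join(entries, ",")
}
```

Applying this whenever any *_proxy value is set (option b above) would avoid the case where http_proxy is set, no_proxy is empty, and controller traffic goes through the proxy.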
jamto be fair, no_proxy as an env variable seems like a poorly thought out hack that has to be interpreted by every application we interact with10:34
jamall which might have small variations on how they use it10:34
jamand it seems *very* much focused on Domain names, and *not* on IPs10:34
jamand we're pretty heavily abusing it with IPs10:34
perrito666jam: I think no_proxy should be set as a default for config values10:34
perrito666well, I believe proxying is rather a big hack10:35
jamperrito666: we could. Its certainly arguable that what we're doing now is guessing that local traffic shouldn't be proxied, and thus we force our own addresses to not be proxied10:36
jamsome would say we should flag all IPs for all hosts in the model to not be proxed10:36
jamproxied10:36
perrito666jam: is there a limit to the size of no_proxy?10:36
jamperrito666: I've heard of people doing 'export no_proxy=10.0.0.{1..255}' which means it expands to 255 values*10 chars or so, some have asked to do 'export no_proxy=10.0.{1..255}.{1..255}'10:37
jambut I *think* the 65535 version wasn't working10:37
jamwe could certainly *inside juju* support10:38
jam10.0.0.0/8 sort of syntax10:38
jamthe problem there is that when we talk to 'wget' or 'curl' they don't do anything with it10:38
jam(I should test again)10:38
perrito666meh, what will they read?10:38
jamperrito666: let me confirm, but I'm pretty sure they just ignore that one10:39
jamperrito666: the default golang one says "either its a domain suffix, or its an exact IP match"10:39
jamperrito666: curl, at least, doesn't listen to "export no_proxy=192.168.0." or "192.168.0.*" or "192.168.0.0/24"10:42
jamI don't know if there *is* an IP based rule that it would respect10:42
jamit does respect DNS suffixes10:42
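[Editor's note] The rule jam quotes ("either its a domain suffix, or its an exact IP match") can be sketched in Go as below. This is an illustration of the rule as stated in the chat, with a hypothetical function name; it is not the actual code of the Go standard library or of curl.

```go
package main

import (
	"net"
	"strings"
)

// bypassProxy reports whether host matches an entry of a comma-separated
// no_proxy value, where each entry is either an exact IP address or a
// domain suffix. Wildcards, partial IPs and CIDR blocks never match an IP.
func bypassProxy(host, noProxy string) bool {
	host = strings.ToLower(strings.TrimSpace(host))
	for _, entry := range strings.Split(noProxy, ",") {
		entry = strings.ToLower(strings.TrimSpace(entry))
		if entry == "" {
			continue
		}
		if entryIP := net.ParseIP(entry); entryIP != nil {
			// IP entries only match the identical address.
			if ip := net.ParseIP(host); ip != nil && ip.Equal(entryIP) {
				return true
			}
			continue
		}
		// Anything else is treated as a domain suffix.
		suffix := strings.TrimPrefix(entry, ".")
		if host == suffix || strings.HasSuffix(host, "."+suffix) {
			return true
		}
	}
	return false
}
```

Under this rule "10.0.0.5" bypasses the proxy only when "10.0.0.5" itself is listed, while "192.168.0.", "192.168.0.*" and "192.168.0.0/24" match no address, which is consistent with what jam observed from curl.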
perrito666I am a bit worried that we are relying on something half the popular software ignore10:44
jamperrito666: well, we aren't, we only support explicit IP addresses,which appears to be supported everywhere10:57
jamperrito666: also, anastasia helpfully reminded me about https://bugs.launchpad.net/juju/+bug/1488139 https://bugs.launchpad.net/juju/+bug/1615719 https://bugs.launchpad.net/juju/+bug/142165010:58
mupBug #1488139: juju should add nodes IPs to no-proxy list <landscape> <network> <oil> <proxy> <juju:Triaged> <https://launchpad.net/bugs/1488139>10:58
mupBug #1615719: [juju-2.0-beta15] during the bootstrap stage the no-proxy config is ignored <openstack-provider> <juju:Triaged> <https://launchpad.net/bugs/1615719>10:58
mupBug #1421650: allow cidr notation for no-proxy <ci> <cloud-installer> <jujuqa> <proxy> <uosci> <juju:Triaged> <https://launchpad.net/bugs/1421650>10:58
jamwhich are things we should be aware of, but don't have to solve everything in one pass10:58
jamon #1488139 I mentioned I'm worried about the fact that no_proxy then becomes an O(N^2) bug for everyone11:01
mupBug #1488139: juju should add nodes IPs to no-proxy list <landscape> <network> <oil> <proxy> <juju:Triaged> <https://launchpad.net/bugs/1488139>11:01
jamI moved 1421650 to Won't Fix because I don't think we can deviate from interpreting it like the 'standard' that other tools do11:02
jamand I'm tempted to mark 1615719 as Wont fix because I think he's just using it wrong, based on how people assume 'no_proxy' works, but doesn't actually work that way11:02
jammaybe we should validate the value of no_proxy contains only domain suffixes and concrete IP addresses and fail if you pass a wildcard or CIDR notation?11:03
jamthat would probably be more helpful11:03
jamit might make people unhappy with *us* but at least we're preventing them from setting something that won't actually work11:03
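[Editor's note] A hedged Go sketch of the validation jam suggests: accept only domain suffixes and concrete IP addresses, and reject wildcards, CIDR notation and partial IPs. The function name validateNoProxy and the error wording are assumptions, not juju code.

```go
package main

import (
	"fmt"
	"net"
	"strings"
)

// validateNoProxy returns an error if any comma-separated entry is a
// wildcard, a CIDR block, or a partial IP address. Complete IP addresses
// and domain suffixes are accepted. Hypothetical helper for illustration.
func validateNoProxy(value string) error {
	for _, entry := range strings.Split(value, ",") {
		entry = strings.TrimSpace(entry)
		if entry == "" {
			continue
		}
		if strings.ContainsAny(entry, "*/") {
			return fmt.Errorf("no_proxy entry %q: wildcards and CIDR notation are not supported", entry)
		}
		if net.ParseIP(entry) != nil {
			continue // a concrete IPv4 or IPv6 address
		}
		// Only digits and dots, yet not a valid IP: e.g. "192.168.0."
		if strings.Trim(entry, "0123456789.") == "" {
			return fmt.Errorf("no_proxy entry %q: looks like a partial IP address", entry)
		}
	}
	return nil
}
```

Failing early on values such as "10.0.0.0/8" tells the user that the entry would be ignored by other tools, instead of silently accepting it.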
=== frankban is now known as frankban|afk
jamperrito666: btw, if our fix is changing the model-default value, I think that's a 2.2 vs a 2.113:43
perrito666jam: I was going to target 2.2 actually :)13:45
perrito666is a migration required for that? it isnt right?13:46
jamperrito666: migration? probably not, upgrade-step ? Maybe13:51
jamactually likely, since at the least we would have been putting 127.0.0.1 into the field and wouldn't anymore.13:51
perrito666jam: that is what I meant, my django just crept into me :p13:53
perrito666the current default is ""13:53
=== frankban|afk is now known as frankban
redir_holidaymorning juju-dev16:35
redir_holidayme reboots for new kernel16:36
=== redir_holiday is now known as redir
jamevening perrito666, I hope everything is going well17:28
jamhi redir17:28
perrito666jam: yes, listening to CTH17:28
perrito666jam: and also running tests for no_proxy17:29
redirhowdy jam17:29
=== externalreality_ is now known as externalreality
=== frankban is now known as frankban|afk
redirphew email mostly filtered19:12
* redir puts on OCR hat19:13
babbageclunkwallyworld: hey, I'm updating the validation in the model description to know about remote applications.22:55
babbageclunkwallyworld: at the moment it checks that there is a correspondence between the application units and the endpoint units.22:56
babbageclunkwallyworld: There isn't really the same thing with the remote applications, is there? Are there unit settings for the remote units in the local db?22:57
babbageclunkwallyworld: I think I've forgotten most of the Barcelona braindump22:59
wallyworldbabbageclunk: sorry, was in release call, just finished, did you want to chat now?23:05
babbageclunkwallyworld: yes please!23:06
babbageclunkwallyworld: standup HO?23:07
wallyworldsure23:07
wallyworldanastasiamac: 4:30pm is likely to be school pickup so i'll be a few minutes late to meeting23:18
anastasiamacwallyworld: would 5pm b better?23:18
anastasiamacwallyworld: i know there is soccer...23:19
wallyworldyeah, i can squeeze in 523:19
anastasiamaci'd rather u did not squeeze ;) but i'll move the meeting \o/23:22
redirOi, IDM rears its ugly head again23:59
