/srv/irclogs.ubuntu.com/2017/11/16/#juju.txt

R_P_Shey rick_h, just wanted to thank you for the help so far.00:08
bdxis there support for artful series on aws?00:15
bdxIm seeing juju has grabbed a machine http://paste.ubuntu.com/25970976/00:16
bdxbut just doesn't want to start00:16
bdxbeen sitting there for a while, making me think artful hasn't made the cut just yet00:17
bdxby a "while" I mean like 20 mins00:18
=== frankban|afk is now known as frankban
ChaoticMindis the jaas controller being super slow right now or is it just me?09:49
mhiltonChaoticMind, what cloud/region are you seeing problems in?09:50
ChaoticMindaws/eu-central-109:50
mhiltonChaoticMind, thanks I'll try it out and see if I see the same.09:52
ChaoticMindthanks09:52
mhiltonChaoticMind, is there any particular command that's taking it's time for you?09:57
ChaoticMindmhilton: just deploying the bundle took forever (like 3 minutes for a smallish bundle). Setting relationships took like 15 seconds each09:58
ChaoticMindUsually it's about 0.5 seconds09:58
ChaoticMindI made a new model and tried it again now, it seems ok now!09:59
mhiltonChaoticMind: I'll look into it, one of the controllers may be overloaded. Thanks for mentioning it.10:00
ChaoticMindno worries10:01
=== salmankhan1 is now known as salmankhan
=== zeus is now known as Guest84717
=== Guest84717 is now known as zeus
bdxyoyoyo - whats the deal with artful deploys? Will someone `juju add-machine --series artful` on aws and let me know if I'm crazy16:12
bdxoooh shoot, looks like adding a machine of artful worked actually16:14
bdxnm16:14
bdxjeeze16:14
bdxahh, `juju deploy ubuntu --series artful --force` is what is failing16:14
kwmonroebdx: how about juju add-machine --series artful; juju deploy ubuntu --to X --force?16:16
kwmonroejust a shot in the dark16:16
bdxkwmonroe: no, great shot, I actually just did that and it worked16:16
kwmonroesweet16:16
bdxand it looks like what was failing me last evening is now working too, with the `juju deploy ubuntu --series artful --force`16:17
bdxgd16:17
bdxI was experiencing some extreme jitter yesterday on JAAS I think16:18
bdxI was trying to get an artful deploy going for quite a while and it was just failing at machine "pending"16:18
bdxreally strange16:18
kwmonroeha!  yeah, "juju deploy ubuntu --series artful --force" just worked for me too on aws16:18
R_P_Shey, so as part of my ongoing evaluation of juju, I've just created an HA controller.  But how do I specify what subnets to create the ha-instances into?16:34
bdxjam:^^16:35
bdxR_P_S: that is possible with the `--to` directive, its just not documented yet16:36
bdxI think its something like `--to subnet=subnet-<id>`16:37
R_P_Sso $ juju enable-ha --to subnet=subnet-priv1b --to subnet=subnet-priv1c ?16:37
bdxR_P_S: let me see if I can get it, omp16:38
bdxR_P_S: `juju bootstrap aws/us-west-2 --to subnet=subnet-<id> --credential mycred`16:40
bdx^^ worked16:40
bdxI'll see about the HA omp16:40
R_P_Syeah, that worked for the first instance...16:40
bdxinstances are launching faster then I've ever experienced16:40
R_P_Sbut after creating the initial controller, enabling HA put them in random subnets as far as I can tell16:41
bdxI already have a bootstrapped controller16:41
bdxcrazy16:41
R_P_Sincluding moxing public and private subnets for controllers 1 and 216:41
bdxR_P_S: `juju enable-ha  --to subnet=subnet-<id>,subnet=subnet-<id>`16:43
bdxworked seamlessly16:43
bdxR_P_S: would you mind putting some heat on this please https://github.com/juju/docs/issues/212216:44
R_P_SI don't have a github account and I'm at work... I'll need to do that later... as an aside, that ticket doesn't appear to be about enable-ha16:47
R_P_Sso now I'm not sure how to remove the extra HA controllers16:52
R_P_Shttps://jujucharms.com/docs/2.2/controllers-ha16:52
R_P_Sjuju status doesn't have any mention of "has-vote" for the controller model... and "remove-machine" just fails with a message that "machine 2 is required by the model"16:53
rick_hR_P_S: so juju show-controller should mention HA status bits and has-vote I believe16:55
jamesbensonhi all, I'm trying to do a simple LXD conjure-up k8s with help, all-in-one.  But it fails from the get-go.16:55
rick_hR_P_S: you can always remove-machine --force but yea, best to know what's up there.16:55
stokachujamesbenson: whats the issue?16:56
rick_hjamesbenson: bummer, what's the issue? I'm sure folks can get you good to go here.16:56
jamesbensonthanks stokachu and rick_h16:56
jamesbensonI hope so16:56
jamesbensonSo there seems to be a few issues.  Sidenote: I'm doing this from a ubuntu server VM in openstack, xlarge.  Pretty sure that doesn't matter, but just in case16:57
stokachujamesbenson: whats the hw specs?16:57
stokachuram, cpus16:57
jamesbenson8 vCPU, 16GB RAM, 160 GB HD16:58
stokachuok should be fine16:58
jamesbensonubuntu 16. LTS16:59
jamesbensonThese are the commands I do from deploy: http://paste.openstack.org/show/626536/16:59
jamesbensonhttps://snag.gy/nT0LPv.jpg17:00
jamesbensonthat's the latest state...17:00
jamesbensonactually I've tried twice... here's the other: https://snag.gy/Z9cs2x.jpg17:01
jamesbensonthoughts stokachu?17:02
R_P_Sso I'm trying to rollover controllers by simply terminating "bad" ones in aws directly and re-running enable-ha17:15
R_P_Sbut I'm still unable to remove machines that don't show up with ha-status enabled in "show-controller"17:16
R_P_SI upped enable-ha to 7 to test the subnets... it looks like I need to specify the subnets with each enable-ha command :\17:17
R_P_Sbut show-controller lists machines 0,3,4,5,6,7,8 (1,2 were "demoted" accoding to enable-ha output)... but I still get "machine 1 is required by the model"17:18
R_P_Sand one thing I've found is that using --force for remove-machine leaves an orphaned security group17:19
kwmonroejamesbenson: just a guess, but can those units get to the outside world?  i know etcd and k8s charms snap install stuff, so i wonder if they're having trouble getting out.  can you pastebin a "juju debug-log --replay -i etcd/0"?17:22
R_P_Sjuju remove-machine 1 --force17:27
R_P_Sfails17:27
R_P_Sthis bug was opened almost a year https://bugs.launchpad.net/juju/+bug/165803317:31
mupBug #1658033: Juju HA - Unable to remove controller machines in 'down' state <4010> <cpe-onsite> <juju:Triaged> <https://launchpad.net/bugs/1658033>17:31
=== frankban is now known as frankban|afk
bdxR_P_S: downsizing the controller cluster isn't supported17:39
bdxyou have to dump the db and restore to a smaller cluster (I think)17:40
R_P_Scorrect, once an -n N has been specified, it can't be shrunk17:40
R_P_Sbut I'm trying to simulate failure17:40
R_P_Sso I terminated one instances and reran enable-ha to rebuild new ones17:40
R_P_Sbut the terminated onces are still in the list, unable to be removed17:40
R_P_SI'm up to 13 "machines" in the config, with 5/7 ha (currently rebuilding)17:41
R_P_S<controllername>*  controller  admin  superuser  aws/us-east-1       2        13  5/7  2.2.617:41
bdxkwmonroe: I'm hitting it again, I just deployed these and they stood up just fine, tore it down and redeployed and its the artful instances that have been in pending for > 20mins now -> http://paste.ubuntu.com/25975589/17:41
bdxkwmonroe: create a new model on the same controller, then deployed the same charm http://paste.ubuntu.com/25975644/17:52
bdxsee what I'm saying about the inconsistencies ?17:52
kwmonroebdx: i'm in us-east-1, and just verified "juju deploy ubuntu --series artful --force" worked again.  hard to say what's up with it being intermitten.  do a "juju ssh -m controller 0" and sudo grep around /var/log/juju for 'machine-X' to see if there's a provisioner issue.17:52
bdxright17:52
kwmonroeyeah bdx, frustrating for sure.. i'm hoping there's something in the controller log that will be more insightful about an artful provisioning issue.17:53
bdxkwmonroe: http://paste.ubuntu.com/25975668/ - oh man17:55
kwmonroebdx: i haven't seen "failed to start instance (failed to start instance in provided availability zone)" before, and no sign of it in my controllers. however, i'm on 2.3-beta1 so that could be new logging in beta3.18:00
kwmonroebdx: what kind of constraints do you have for redis-cache?  any wicked machine reqs there?18:01
bdxroot-disk, spaces18:01
bdxtesting w/o any constraints18:02
kwmonroebdx: i was hoping you had "instance-type=p3.xxlarge" and i could just say "us-west is simply out of those instance types", but that doesn't seem like the case.18:03
bdxkwmonroe: it was the constraints18:05
bdxI removed them, and wala18:05
kwmonroemust have been spaces, right?  surely not root-disk18:05
bdxI wonder if I'm hitting disk cap on aws18:05
bdxtesting that right now18:05
kwmonroeyeah, make sure you're asking for GB and not PB ;)18:06
kwmonroeothwise RIP your wallet18:06
bdxha, yeah, "G"18:09
bdxso I just logged into aws console and created 10 x 100G ebs volumes18:09
bdxno issues18:09
bdxkwmonroe: https://bugs.launchpad.net/juju/+bug/170646218:11
mupBug #1706462: juju tries to acquire machines in specific zones even when no zone placement directive is specified <cdo-qa> <foundations-engine> <juju:Triaged by ecjones> <MAAS:Invalid> <https://launchpad.net/bugs/1706462>18:11
bdxsee my comment at the bottom18:12
bdxkwmonroe: I'm about to suggest something crazy18:19
bdxhttp://paste.ubuntu.com/25975792/18:19
bdxtaking ^ into consideration18:19
bdxredis-space and ubuntu-space are both deployed with only a "spaces" constraint18:20
bdxthe ubuntu-space didn't have a --series constraint18:20
bdxor bah18:20
bdx--series argument18:20
bdxthe only failures I'm seeing here are when '--series' is specified alongside a spaces constraint18:21
bdxbecause we see from ^ that redis-disk worked, it had '--series artful' and '--constraints "root-disk=100G"18:21
bdxand ubuntu-space worked18:21
bdxwhich had no '--series' arg, but had the spaces constraint18:22
bdxbut the only thing failing consistently18:22
bdxare things deployed to a space that has the '--series'  arg18:23
bdxI'll prove it by specifying '--series' with another series other than artful18:23
bdxhow about zesty18:23
bdxsince we see from ^ that zesty worked w/o a spaces constraint18:24
kwmonroebdx: i don't know enough about juju's zone handling, but happened here with graylog / #38? http://paste.ubuntu.com/25968550/18:25
kwmonroedid graylog have constraints?18:25
bdxno18:26
bdxwell yes18:26
bdxbut that isn't happening because of that18:26
bdxthat happens with every single instance deployed with 2.3beta318:27
bdxit eventually gets past the "failed to start instance (failed to start instance in provided availability zone) " and finds one and eventually starts18:27
bdxI was just posting that to show that its not only maas thats having that issue18:27
bdxok, well I think this verifies my theory http://paste.ubuntu.com/25975824/18:28
bdxI jus deploy the ubuntu-zesty-space18:28
bdxit required the spaces constraint and --series18:28
bdxand it failed similar to the artful18:29
bdxjust stuck pending18:29
bdx#@(*$U(#@*$@#*&18:29
bdxidk18:30
bdxI may as well go back to sleep18:30
bdxsomehow I knew today would be a trying day18:30
kwmonroe:)18:37
kwmonroebdx: i would note that in bug 1706462, that spaces + series repro this easily on aws18:39
mupBug #1706462: juju tries to acquire machines in specific zones even when no zone placement directive is specified <cdo-qa> <foundations-engine> <juju:Triaged by ecjones> <MAAS:Invalid> <https://launchpad.net/bugs/1706462>18:39
bdxkwmonroe: series + spaces only with artful18:40
bdxAND @kwmonroe18:40
bdx^ bug is entirely different then what I'm seeing I think18:40
jamesbensonkwmonroe: sorry for the delay, turkey-luncheon thingy.... that command is giving me a TLS handshake timeout..18:40
jamesbensonkwmonroe: The instance can ping google...18:41
bdxkwmonroe: this verifies that it is only happening with artful http://paste.ubuntu.com/25975887/18:41
bdxkwmonroe: what I'm seeing is the instances stay in pending for only series + space + artful18:42
bdxkwmonroe: what I'm seeing is the instances stay in pending for only series + space + artful18:42
bdx1706462 - failed to start instance (failed to start instance in provided availability zone) within attempt 0, ret18:42
bdxrying in 10s with new availability zone18:42
kwmonroewait bdx, your previous paste shows machine 8 waiting for machine with series zesty: http://paste.ubuntu.com/25975824/18:43
bdxbut then on *a* next attempt, juju will find an instance, and start it, and go on its way18:43
bdxkwmonroe: ah, my bad, yea, that machine started18:43
bdxwhich made me realize, in all cases, its only artful that is the commonality here18:43
bdxwhen used with spaces + series18:44
bdxtry it18:44
bdxoooh, it may be only beta3, let me try this on jaas18:44
kwmonroejamesbenson: how about just "juju debug-log"?  does that give you a tls timeout too?18:48
bdxworks great on jaas http://paste.ubuntu.com/25975933/18:48
bdxkwmonroe: the juju agent never starts, so I don't get any log from those instances18:48
jamesbensonyes18:48
jamesbensonkwmonroe: ^18:48
bdxoooh jamesbenson18:49
bdxmy b18:49
bdxlol18:49
kwmonroe:)18:49
jamesbensonhttp://paste.openstack.org/show/626542/18:49
kwmonroejamesbenson: ooooohhh, i thought you meant the debug-log command wasn't showing any output.18:49
jamesbensonkwmonroe: No, seems to have issues with net/http: TLS handshake timeout...18:50
jamesbensonI'm not sure why that is, though, since easyrsa is able to get active ...18:51
jamesbensonso it must be able to reach out, correct?18:51
kwmonroejamesbenson: easyrsa doesn't snap install anything18:52
kwmonroeetcd and k8s charms do18:52
jamesbensonoh man..18:52
jamesbensonso something with the bridge then18:52
kwmonroeso jamesbenson, i'll bet you all the money in my pockets that if you do a "juju run --unit easyrsa/0 'sudo snap install etcd'", it'll fail18:52
jamesbensonkwmonroe: seems to be just sitting there...18:53
jamesbensonyep, same error18:54
jamesbensonjuju run --unit easyrsa/0 'sudo snap install etcd'18:54
jamesbensonerror: cannot perform the following tasks:18:54
jamesbenson- Download snap "core" (3440) from channel "stable" (Get https://068ed04f23.site.internapcdn.net/download-snap/99T7MUlRhtI3U0QFgl5mXXESAiSwt776_3440.snap?t=2017-11-16T20:00:00Z&h=30ced1b835617d49d8ff4221a62d789f7ca638aa: net/http: TLS handshake timeout)18:54
jamesbensonsorry about the paste there...18:54
kwmonroejamesbenson: to test the tls/http connectivity more generically, do this.. juju ssh etcd/0, then wget https://google.com from the etcd unit.18:54
kwmonroe(make sure it's https)18:55
bdxok, here it is https://bugs.launchpad.net/juju/+bug/173276418:55
mupBug #1732764: series + spaces + artful + juju2.3beta3 = fail <juju:New> <https://launchpad.net/bugs/1732764>18:55
jamesbensonkwmonroe: works.18:55
kwmonroeinteresting18:55
jamesbensonhttp://paste.openstack.org/show/626545/18:56
kwmonroejamesbenson: how about a "sudo snap install etcd" from that same etcd unit?18:56
jamesbensonkwmonroe : nope...http://paste.openstack.org/show/626546/18:58
jamesbensoninteresting...18:59
ryebotjamesbenson: if this is an egress-restricted environment and you're unable to hit the snap store, I can provide you with some steps for installing them manually.18:59
jamesbensonall ports are open....18:59
jamesbensonI'll double check though..18:59
kwmonroebdx: nice detail in 1732764.  interesting that's it's such a specific combo.  also, you may want to s/"spaces=myspace"/"spaces=facebook" in case a more recent social media platform helps.19:01
jamesbensonryebot kwmonroe: iptables are empty, and security group is open all ports in and out.19:01
jamesbensonhttp://paste.openstack.org/show/626547/19:01
jamesbensonThis is the only rule in my iptables -t nat: MASQUERADE  all  --  10.55.234.0/24      !10.55.234.0/24       /* managed by lxd-bridge */19:03
kwmonroejamesbenson: stick some quotes around that url... wget 'https://068ed04f23.site.internapcdn.net/download-snap/99T7MUlRhtI3U0QFgl5mXXESAiSwt776_3440.snap?t=2017-11-16T20:00:00Z&h=30ced1b835617d49d8ff4221a62d789f7ca638aa'19:04
jamesbensonhmm... still shows connected.  But once connected it sits.19:04
kwmonroejamesbenson: how about running "env | grep -i proxy" on that unit.  anything in there?19:05
jamesbensonNO_PROXY=10.55.234.245,127.0.0.1,::1,localhost19:07
jamesbensonno_proxy=10.55.234.245,127.0.0.1,::1,localhost19:07
kwmonroehmph jamesbenson, that seems legit19:09
jamesbenson....so confused....  not a good sign that everything seems legit from you too...19:10
jamesbensonkwmonroe: do you deploy on baremetal or in VM's?  Do you have any script or anything?19:11
kwmonroejamesbenson: by "legit", i meant the no_proxy stuff looks legit :)  if you can't do a "sudo snap install foo" from the unit, juju won't be able to either.19:22
kwmonroejamesbenson: there's a gremlin in there to be sure.  just need to figure out why those unit's can't snap install.19:22
kwmonroejamesbenson: i typically deploy to clouds or localhost (lxd).  not much experience with maas.19:23
jamesbensonwell this is in an openstack VM, so not with maas...19:24
kwmonroeah, right19:24
jamesbensonI know I can deploy using openstack magnum, but want to do it manually...19:25
kwmonroewell jamesbenson, from what i can tell, apt install works and wget works, so it's not like your units are totally locked down.  i'm not sure what's causing snap install to fail.19:25
jamesbensonerror: cannot install "foo": snap "core" has changes in progress19:25
kwmonroesilly rabbit, dont' actually stick 'foo' in there19:26
jamesbenson:-p19:26
jamesbensonhey, didn't know if it was a test option ;-)19:26
jamesbensonansible ping/pong test ...19:27
kwmonroe:)19:27
kwmonroejamesbenson: what does snap changes show?19:27
kwmonroe"snap changes"19:27
kwmonroei'm guessing it's stuck somewhere trying to download the core snap19:27
jamesbensonhttp://paste.openstack.org/show/626550/19:27
jamesbensonyou'll like that..19:28
kwmonroeheh, classy19:28
kwmonroejamesbenson: how about a "snap download etcd"?19:29
kwmonroewe should see the tls error.. just making sure.19:30
magicaltrout"Hello Kubernetes support desk, Kevin speaking, how may I help you today???"19:31
kwmonroephew!  backup arrives.  magicaltrout, meet jamesbenson.  he's having trouble snap installing k8s.19:31
jamesbensonyep, tls error19:31
magicaltrouti have many k8s installations19:31
magicaltrouttoo many19:31
kwmonroemagicaltrout: any on openstack?19:32
magicaltroutsorta19:32
magicaltroutits manual though not openstack cloud provider19:32
jamesbensonmagicaltrout: I've got a ubuntu 16 LTS, VM sitting in openstack.  Security group is completely open.  No iptables rules...19:33
jamesbenson8 vCPU, 16GB RAM, 160 GB HD;  deployed using these commands:  http://paste.openstack.org/show/626536/19:34
jamesbensoncan't seem to install though, giving me tls errors.19:37
magicaltroutokay, jamesbenson your cluster lives inside lxd on nodes on openstack?19:41
jamesbensonyes19:42
jamesbensonVM in openstack, lxd on that VM.19:42
magicaltrouthmm i've not tried that before19:42
magicaltroutif you snap install at vm level does it work?19:42
jamesbensonyeah, did that to install conjure-up19:42
jamesbensonand lxd19:42
magicaltrouthmm19:45
kwmonroejamesbenson: it feels like something about your lxd-bridge is interferring with fetching data from the snap store, but i can't fathom a reason why it would affect snap and not apt or wget.19:47
R_P_SI am having difficulties adding subnets to spaces to ensure instances are deployed in the correct VPC/AZ19:49
R_P_SI get an error "cannot add subnet: no subnets defined" while running19:49
R_P_Sjuju add-subnet 1.2.3.4/5 public subnet-1234567819:50
jamesbensonkwmonroe, magicaltrout: do you have general guidelines/rules/instructions on how you set up lxd, zfs, and the network?19:52
jamesbensonipv6 is disabled...19:53
jamesbensonbut I wasn't sure about the bridge19:53
magicaltrouti've only installed k8s with conjure up on lxd once19:53
magicaltrouti just did whatever it told me19:53
jamesbensonhow do you typically install it?19:53
magicaltrouti have 1 standard aws install and 3 openstack manual provider installs19:54
jamesbensonI'm doing lxd to do some dev with multiple "nodes" in an all in one...19:54
jamesbensonopenstack with magnum?19:55
magicaltroutnope19:55
jamesbensonmanual?19:55
magicaltroutyeah just spin up some nodes19:55
magicaltroutand deploy some stuff to them19:55
jamesbensonusing which method?19:55
magicaltrouthttps://jujucharms.com/docs/2.2/clouds-manual19:56
magicaltroutjust like a small 3 node cluster for k8s dev19:56
stokachujamesbenson: how's your lxd bridge configured19:57
jamesbensonstokachu: any command to detail it?19:57
stokachujamesbenson: lxc network show lxdbr019:58
stokachujamesbenson: easiset to do `lxc network show lxdbr0|pastebinit`19:59
jamesbensonhttp://paste.ubuntu.com/25976235/19:59
stokachudo you have another bridge defined?19:59
jamesbensonno idea about the pastebinit... awesome..19:59
stokachujamesbenson: whats `lxc network list|pastebinit` show20:00
jamesbensonhttp://paste.ubuntu.com/25976248/20:00
jamesbensonhttp://paste.ubuntu.com/25976252/20:01
stokachujamesbenson: yea youve got no bridge defined for lxd to use20:01
jamesbensonokay...20:02
stokachujamesbenson: how'd you create lxdbr0 before?20:02
jamesbensonsudo lxd init --auto20:02
kwmonroejamesbenson: fwiw, we have a generic lxd guide:  https://jujucharms.com/docs/stable/tut-lxd.  might be worth following that and bootstrap on a new node, then "juju deploy ubuntu", then "juju ssh ubuntu/0" and see if a "sudo snap install core" works.20:03
stokachukwmonroe: it's his bridge20:03
stokachuit isn't configured20:03
jamesbensonBRB20:03
jamesbensoncould you send me a few commands?20:04
stokachujamesbenson: `lxc profile show default|pastebinit`20:04
R_P_Sbdx / rick_h: any ideas why add-subnet is not working and complaining about subnets not being defined?20:05
kwmonroestokachu: if the bridge was borked, how did he get this far with kubeapi-load-balancer going active: https://snag.gy/nT0LPv.jpg20:05
kwmonroe(and easyrsa)20:06
stokachuwell for one, his lxd bridge is inet addr:10.55.234.120:06
stokachuand those ip's are different20:06
jamesbensonhttp://paste.ubuntu.com/25976291/20:06
stokachukwmonroe: also thats not output from conjure-up20:07
stokachuso i dont know what he did there20:07
stokachujamesbenson: basically your lxd network bridge is acting up20:08
stokachujamesbenson: i recommend tearing down that setup20:08
stokachujamesbenson: juju kill-controller localhost-localhost20:09
jamesbensonokay20:09
jamesbensonand how do I bring it back up?20:09
stokachuthen delete that lxdbr0 bridge20:09
stokachuone sec20:09
jamesbensonok20:09
jamesbensonthanks 🙏20:09
stokachujamesbenson: then do `sudo brctl delbr lxdbr0`20:09
stokachujamesbenson: let me know when you've done that, and give output of `ip addr|pastebinit`20:11
R_P_Sso I just discovered that by creating a new model, the subnets aren't populated...21:01
R_P_Sthe subnet info is available in the controller and default models, but I want to build a model for each environment21:02
R_P_Show do I populate... or copy the subnet info from one model to the other?21:02
R_P_Sjuju switch default && juju list-subnets -> full subnet output21:07
R_P_Sjuju switch dev-k8s && juju list-subnets -> No subnets to display21:07
kwmonroeR_P_S: does 'juju reload-spaces' while in the dev-k8s model do anything?21:20
kwmonroeR_P_S: on aws, all new models look populated with subnets for me.21:21
R_P_Sreload-spaces appears to not do anything21:24
R_P_Shold on21:25
R_P_Swould reload-spaces be dependent on a vpc-id being specified in the model-config?21:26
jamesbensonstokachu: I think it's easier to reset the VM and start from scratch, no?21:30
stokachujamesbenson: probably21:30
jamesbensonstokachu: So it's rebuilt.21:34
stokachuok so do this, `sudo apt-add-repository ppa:ubuntu-lxc/stable`21:34
stokachu`sudo apt update && sudo apt install lxd lxd-client`21:34
stokachuthen `lxd init --auto` (no sudo)21:35
stokachuthen `lxc network create lxdbr0 ipv4.address=auto ipv4.nat=true ipv6.address=none ipv6.nat=false`21:35
stokachuthen `snap install conjure-up --classic`21:35
stokachuand run conjure-up21:35
jamesbensonsnap needs sudi21:38
jamesbensonsudo21:38
jamesbensonrunning conjure-up21:39
R_P_Sok, turns out you can't just add a VPC to a model after the fact (got errors), as the VPC parameters need to be specified during model creation with --config21:40
jamesbensonstokachu:  oooo something different is happening... getting a good feeling ^_^21:41
stokachujamesbenson: \o/21:41
jamesbensonwhat's the watch command again?21:42
jamesbensongot it21:43
R_P_Sok, now I'm straight up running into this bug :(  https://bugs.launchpad.net/juju/+bug/170487621:50
mupBug #1704876: can't deploy to specific AWS subnets due to `juju add-subnet` fails <add-subnet> <aws> <conjure> <spaces> <subnet> <vpc> <juju:Triaged> <https://launchpad.net/bugs/1704876>21:50
R_P_Show do you delete a space in a model?21:59
hmlspaces can’t be deleted currently.  :-(21:59
R_P_S...21:59
R_P_Sare spaces completely broken? :(  can't delete, can't add subnets to a space... can't do anything with them?  yet they're core to defining where things will be deployed?22:00
hmli never use spaces personally - there are other ways to define how things are deployed22:01
hmldepends on the cloud you’ve bootstrapped22:01
R_P_SI'm following: https://insights.ubuntu.com/2017/02/08/automate-the-deployment-of-kubernetes-in-existing-aws-infrastructure/22:01
R_P_Show would I rewrite this command then to not use spaces?22:02
R_P_Sjuju deploy --constraints "instance-type=m3.medium spaces=private" cs:~containers/etcd-2322:02
hmlah…22:03
hmlyou can just make a space with a different name to use - suboptimal i know -22:03
R_P_Sthe only things I've done different so far are that I'm not using cloudformation (infrastructure preexisting) and creating a model22:04
R_P_SBut how do I use empty spaces?22:04
R_P_Ssince I can't add-subnet to a space?22:04
jamesbensonstokachu : etcd/0 Missing relation to certificate authority.22:04
jamesbensonhttps://snag.gy/5ED6sa.jpg22:05
jamesbensonah, my nginx just became active....22:05
R_P_Sso apparently you need to define your subnets when calling add-space...22:08
R_P_Sdo it once, don't screw it up... and if you ever accidentally re-assign a subnet to a different space, you're screwed?22:09
jamesbensonkwmonroe stokachu : thoughts?  seems like having a similiar issue like before.  the bridge is managed now though22:14
stokachuIf there is no error in juju status then give it time22:16
jamesbensonerror hook failed: "install" ?22:17
jamesbensonfor all etcd22:17
jamesbensonbut keeps on restarting...22:17
stokachuIs this a full VM you're running?22:17
jamesbensonfull?  yes, ubuntu server cloud image...22:17
kwmonroejamesbenson: juju debug-log --replay -i etcd/022:18
kwmonroelet's see if it's still a problem snap installing22:18
kwmonroejamesbenson: alternatively, juju ssh etcd/0 and try a "sudo snap install etcd"22:18
jamesbensonhttp://paste.ubuntu.com/25976912/22:18
kwmonroeinstant regret on the --replay ;)22:19
kwmonroeplease hold while your pastebin loads into ram22:19
jamesbenson:-(22:20
stokachuLooks like a timeout downloading the snaps22:20
jamesbensonyeah22:20
jamesbensonshould I try the `sudo snap download etcd`22:20
stokachusudo snap install etcd22:20
jamesbensontried inside of etcd and got the TLS handshake timeout error22:21
kwmonroeR_P_S: while i'm waiting on jamesbenson to crash my browser, what's your end game here?  i'm really not well versed with spaces/subnets, but i'm curious what people are up to when they have strict space reqs. i know bdx does these space constraints all the time -- i never knew why.22:21
stokachuYea, not a conjure-up or juju issue22:21
stokachuBut an issue nonetheless22:21
jamesbensonkwmonroe: sorry!22:22
stokachuAre you behind proxies?22:22
jamesbensonstokachu: no22:22
kwmonroeno worries jamesbenson -- it's ammo for getting a new rig for the holidays ;)22:22
stokachujamesbenson: may want to post on discuss.snapcraft.io22:23
jamesbensonooo, nice :-) đź‘Ť. I'm running MBP with touchbar...22:23
R_P_Skwmonroe: simple management of VPCs and subnets.  Without this, some of the anti-patterns that juju enables is mind boggling... things like 0.0.0.0/0 SSH ACLs on every instance22:24
kwmonroeyou happy with the touchbar jamesbenson?  i hear mixed reviews (kinda like battlefront 2), where by "mixed" i hear "i hate it" ;)22:24
kwmonroejamesbenson: i see we're back to the tls handshake timeout :/22:25
jamesbensonlol... it's okay... I do need to reset it though, really random hangs and freezes as of late... circle of death for like 10-20 seconds then free's up.22:25
jamesbensondoes snapcraft have an IRC?22:25
kwmonroeoof on the death spiral22:26
kwmonroejamesbenson: you'll get much better response from snapcraft.io, but there is a #snappy freenode channel22:26
R_P_Skwmonroe: at that point, the only way to secure them is to control your subnets with things like public and private... these are very basic security concepts when building AWS infrastructure.22:26
jamesbensonbut I hack the hell out of it, so I probably fugged something somewhere...22:26
jamesbensonback to point though22:27
kwmonroejamesbenson: i shouldn't say "much better", but i know those forums are monitored like crazy.  irc, i'm not sure.22:27
jamesbensonokay22:27
kwmonroejamesbenson: https://forum.snapcraft.io/ is the place22:28
kwmonroedont go there ^^ from your etcd/0 unit because you'll probably get a tls handshake error.22:28
R_P_Skwmonroe: the fact that conjure-up for a kubernetes cluster uses ec2-classic by default is, to be brutally honest, downright scary :(  ec2-classic was deprecated years ago, and should never be used again22:29
R_P_Sdamn, I gotta run to meetings... I'll likely be in meetings until EOD...22:30
R_P_Sonce again, thanks for all the help, I am making progress, but it is much slower than I'd hoped22:30
kwmonroeR_P_S: thx for the insights on your use of subnets/spaces!22:30
kwmonroei'll catch up with you later to dive in more22:31
stokachuLol ec2-classic?22:31
kwmonroeyeah - i dunno what ec2-classic is either, that was the diving part i alluded to ;)22:32
stokachuR_P_S: feel free to elaborate on ec2-classic22:33
jamesbensonkwmonroe: ha, thanks.  Not sure exactly what to post to them..  I suppose just snap is failing in openstack VM with that massive pastebin from earlier..22:36
kwmonroejamesbenson: like you said, back to the point, if you can't "sudo snap install etcd" from the deployed unit, juju won't be able to either.  so step 1 is to figure out why that's failing.  you're probably going to hear 1000 people ask "what are your proxy settings", don't be mad.  whatever's going on is probably a mix of openstack / lxd / snap.22:37
kwmonroejamesbenson: i would just create a new topic that says "snap install fails in a lxd container on an openstack VM"22:38
kwmonroejamesbenson: and the pastebin is good -- but since it has so much juju noise in it, i'd paste the failure that you see from "sudo snap install core" on the etcd unit.22:41
tvansteenburghstokachu: kwmonroe: ec2-classic == ec2 in the days before vpcs were introduced. if you have a sufficiently old aws account, the machines juju provisions will not be in a vpc, which can break things in unexpected ways. the way to get around this is to tell juju which vpc to use. you can do that using bootstrap or model config.22:47
jamesbensonkwmonroe : Posted... lets see what happens.22:55
kwmonroei predict nothing but good things jamesbenson ;)22:55
R_P_Sec2-classic is amazon ec2 before VPCs existed... IIRC, it's not even accessible on accounts that were created in the past few years23:15
R_P_Sec2-classic was the equivalent of one giant public VPC that contained every amazon customer all in one giant internal subnet (split per region)23:16
R_P_Sec2-classic didn't have as many features as "ec2-vpc" either.  examples include: the SG couldn't be changed, and could only have one SG attached to an instance.23:17
R_P_Sec2-classic SGs don't have egress ACLs.  They simply don't exist (non-configurable ALL ALL 0.0.0.0/0 egress)23:18
R_P_Sec2-classic without VPCs, you didn't make subnets either... I can't remember everything though, it's been years since I've done any significant amount of work in ec2-classic.23:19
R_P_Sec2-classic is inherently insecure compared to ec2-vpc23:19
tvansteenburghR_P_S: you can make conjure-up use a vpc, but you need to bootstrap the juju controller or create the model before using conjure-up, then tell conjure-up to use that controller or model23:21
tvansteenburghfor example: juju bootstrap --config "vpc-id=vpc-xxxxxxxx"23:21
tvansteenburghor: juju add-model --config "vpc-id=vpc-xxxxxxxx"23:22
R_P_Sat this point, I have my ha controllers in the VPC i want... kwmonroe was curious about ec2-classic though23:23
R_P_Sat the fact that the AWS account I'm using appears to be old enough to still support ec2-classic meant that a basic/barebones conjure-up created a kubernetes cluster in ec2-classic instead of inside a VPC23:26

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!