[16:02] oops well I had the date wrong in the topic for this... but here goes === blackboxsw changed the topic of #cloud-init to: Reviews: http://bit.ly/ci-reviews | Meeting minutes: https://goo.gl/mrHdaj | Next status meeting: Monday 4/30 16:00 UTC | cloud-init 18.2 released (03/28/2018) [16:04] #startmeeting Cloud-init bi-weekly status meeting [16:04] Meeting started Mon Apr 30 16:04:15 2018 UTC. The chair is blackboxsw. Information about MeetBot at http://wiki.ubuntu.com/meetingology. [16:04] Available commands: action commands idea info link nick [16:04] hi folks, sorry for the mis-representation of when our cloud-init status meeting date. It's time for another episode/update of the happenings in cloud-init. [16:05] Next meeting will be in two weeks: May 7th [16:05] at 16:00 UTC [16:06] The last couple weeks on the upstream side of the house has been a big push to get testing and stability into master for the Ubuntu Bionic release freeze [16:06] ... I'd better start with the topic [16:07] #topic Recent Changes [16:07] The last couple weeks on the upstream side of the house has been a big push to get testing and stability into master for the Ubuntu Bionic release freeze. [16:07] May 7th would be 1 week from today that should be May 14th [16:07] robjo: gah, I did it again. Thank you... glad someone's listening. Next cloud-init status meeting Monday May 14th 16:00 UTC [16:08] #topic #cloud-init Reviews: http://bit.ly/ci-reviews | Meeting minutes: https://goo.gl/mrHdaj | Next status meeting: Monday 5/14 16:00 UTC | cloud-init 18.2 released (03/28/2018) [16:08] ok topic agrees in channel now, so I don't botch it at the end of meeting [16:09] Along with a blitz for stability in Bionic the following changes have been shepherded into tip of master [16:09] - Add reporting events and log_time around early source of blocking time [16:09] [Ryan Harper] [16:09] - IBMCloud: recognize provisioning environment during debug boots. [16:09] (LP: #1767166) [16:09] - net: detect unstable network names and trigger a settle if needed [16:09] [Ryan Harper] (LP: #1766287) [16:09] - IBMCloud: improve documentation in datasource. [16:09] Launchpad bug 1767166 in cloud-init (Ubuntu) "IBMCloud datasource does not recognize provisioning in debug mode." [Medium,Confirmed] https://launchpad.net/bugs/1767166 [16:09] - sysconfig: dhcp6 subnet type should not imply dhcpv4 [Vitaly Kuznetsov] [16:09] - packages/debian/control.in: add missing dependency on iproute2. [16:09] Launchpad bug 1766287 in cloud-init (Ubuntu) "18.04 minimal images on GCE intermittently fail to set up networking " [Undecided,In progress] https://launchpad.net/bugs/1766287 [16:09] (LP: #1766711) [16:09] - DataSourceSmartOS: add locking of serial device. [16:09] [Mike Gerdts] (LP: #1746605) [16:09] - DataSourceSmartOS: sdc:hostname is ignored [Mike Gerdts] (LP: #1765085) [16:09] Launchpad bug 1766711 in cloud-init (Ubuntu Bionic) "cloud-init missing dependency on iproute2" [Medium,Fix committed] https://launchpad.net/bugs/1766711 [16:09] - DataSourceSmartOS: list() should always return a list [16:09] [Mike Gerdts] (LP: #1763480) [16:09] Launchpad bug 1746605 in cloud-init "DataSourceSmartOS needs locking" [Medium,Fix committed] https://launchpad.net/bugs/1746605 [16:09] - schema: in validation, raise ImportError if strict but no jsonschema. [16:09] - set_passwords: Add newline to end of sshd config, only restart if [16:09] updated. (LP: #1677205) [16:09] Launchpad bug 1765085 in cloud-init "DataSourceSmartOS ignores sdc:hostname" [Medium,Fix committed] https://launchpad.net/bugs/1765085 [16:09] - pylint: pay attention to unused variable warnings. [16:09] - doc: Add documentation for AliYun datasource. [Junjie Wang] [16:09] - Schema: do not warn on duplicate items in commands. (LP: #1764264) [16:09] Launchpad bug 1763480 in cloud-init "DataSourceSmartOS list() should always return a list" [Medium,Fix committed] https://launchpad.net/bugs/1763480 [16:09] Launchpad bug 1677205 in cloud-init "cloud-init eats final EOL of sshd_config" [Medium,Fix committed] https://launchpad.net/bugs/1677205 [16:09] Launchpad bug 1764264 in juju 2.3 "bionic cloud-init 18.2 WARNING Juju's 'runcmd' stanza" [High,Triaged] https://launchpad.net/bugs/1764264 [16:10] the general theme has been: new IBMCloud datasource support for cloud-init, SmartOS datasource work by mgerdts, and some json schema improvements [16:12] so background on IBM, is that their support used to be ConfigDrive based datasource only, but there is now some additional support for different IBM boot/provisioning stages, hence a new datasource that can support different boot modew [16:12] *boot modes [16:14] over the last two weeks we've landed an SRU into xenial and artful: 18.2-4-g05926e48-0ubuntu1~16.04.1 and bionic sits at 18.2-14-g6d48d265-0ubuntu1 [16:15] On the SmartOS side, my changes are driven by our adoption of bhyve (moving away from kvm/qemu). qemu provides a dhcp server VMs could fall back to if could-init was missing or misbehaving. bhyve doesn't have that, so I've been working on getting cloud-init to be more stable with the bhyve serial metadata service. [16:15] Also, to our continuous integration on jenkins we now have an additional test for proposed packages in ubuntu for the bionic release to make sure ubuntu doesn't break across pending upgrades [16:15] #link https://jenkins.ubuntu.com/server/job/cloud-init-integration-proposed-b/ [16:16] that integration tests hits the suite of platforms lxd, kvm and ec2 [16:16] excellent mgerdts, and thanks for the blitz on these branches [16:17] looks like there are a few still in our review queue that we'll be able to get through once the dust settles on the bionic release (which should be this week) [16:17] #link https://code.launchpad.net/~cloud-init-dev/cloud-init/+git/cloud-init/+ref/master/+activereviews [16:19] Is now the right time to discuss bug 1765801, or is that later? [16:19] bug 1765801 in cloud-init "network should be optionally reconfigured on every boot" [Undecided,Confirmed] https://launchpad.net/bugs/1765801 [16:20] I think over the last 2 weeks there have been a couple of requests in channel for how someone goes about getting newer cloud init into RHEL7, if anyone on the line today knows the contact point or process for that it'd be helpful. larsks doesn't seem to be around [16:20] mgerdts: probably in about 10 mins. thanks for brining it up [16:20] hopefully less. [16:20] ok I think that's it for recent changes, next topic (in-progress dev, then office hours (and bug discussion)) [16:21] #topic In-progresss Development [16:21] We'll make this one short: [16:22] for ubuntu : bionic just went feature freeze last week, our team has a couple of IBM-related cheanges that we are pulling together for a quick SRU into xenial/artful to handle upgrade path from configdrive -> IBMCloud that we are working on the beginning of this week [16:22] we are also trying to wrap up validation of a Bionic SRU per the following bug [16:22] #link https://bugs.launchpad.net/ubuntu/+source/cloud-init/+bug/1767412 [16:22] Launchpad bug 1767412 in cloud-init (Ubuntu Bionic) "SRU cloud-init 18.2-27-g6ef92c98-0ubuntu1" [Medium,Fix committed] [16:23] which grabs a number of the updates I listed in the last topic [16:23] since Ubuntu tends to sync all changes from tip into each release stream [16:23] Is there any chance the SmartOS changes can piggy back on that IBM SRU [16:24] asked too soon - I see they are mentioned in that bug. [16:25] mgerdts: no worries. good ask. probably not for this IBM SRU into xenial/artful which is going to be an exception to our update rule and only be a single cherry pick, but planning a folllowup SRU in about 2 weeks which will pull all changes from tip into artful/xenial/bionic/chunky releases [16:25] ok [16:25] the cherry pick is to fasttrack it for IBM into xenial with minimal risk. [16:25] and we want to pull in all your changes if we can (and perform additional validation) [16:25] so the next SRU is our target [16:26] Also inprogress is some more Azure work on pre-provisioning that should land shortly: [16:26] #link https://code.launchpad.net/~jocha/cloud-init/+git/cloud-init/+merge/344192 [16:27] as well as some builddeb fixes and network configuration printout fixes from smoser [16:29] smoser and rharper also worked out some issues on specific google regions where cloud-init was getting hit by a race condition. Cloud-init started up before the kernel/udev was able to rename network devices to stable names like ens4 etc, so cloud-init's network configuration written ended up breaking because it represented devices like eth0 etc. [16:29] there are a couple of branches in flight to fix this issue: [16:29] #link https://code.launchpad.net/~smoser/cloud-init/+git/cloud-init/+merge/344181 [16:30] #link https://code.launchpad.net/~raharper/cloud-init/+git/cloud-init/+merge/344198 [16:30] ok I think that's it for in-progress work. So we'll head to office hours so we can chat bugs, branches reviews etc [16:31] #topic Office Hours (next ~30 mins) [16:31] We'll be hanging out here for anyone who wants more eyes on a review, feature discussions or bug triage.... [16:32] well, some of us will be :) a couple of us are at a feature planning conference for the week. [16:33] In https://code.launchpad.net/~mgerdts/cloud-init/+git/cloud-init/+merge/343712 smoser said that he was concerned about how this would interact with eventual network hotplug [16:34] There doesn't seem to be a timeline for network hotplug and the lack of network autoreconfig on reboot is has popped up a couple times in the past week. This is just with a couple early adopters and internal users. [16:34] #link https://code.launchpad.net/~mgerdts/cloud-init/+git/cloud-init/+merge/343712 [16:35] just to track it in the meeting [16:35] So coming up with some mechanism to make this work soon is pretty important to us. [16:35] gotcha, will be sure to do that in the future. [16:35] * blackboxsw reads up on that link [16:35] no worries, I'm pedantic :) [16:36] That's how you got chosen to run the meeting, I suppose. :) [16:36] yeah network hotplug will have a long tail as far as feature develpment (agreed). I believe it's on our charter for this next quarter. but that's what is being discussed this week [16:37] heh on meeting comment ;) too true [16:39] so mgerdts your branch allows metadata to set maintain_network to allow cloud-init to control network configuration each reboot with a True value [16:39] ? [16:39] yes [16:39] if it's not set to true in our metadata, the traditional behavior stays. [16:40] That is, in the default path, any customization that someone does in the guest will not get whacked. [16:41] cloud-netconfig handles hotplug https://github.com/SUSE/Enceladus/tree/master/cloud-netconfig contributions for other distros welcome [16:42] nice reference robjo [16:42] #link https://github.com/SUSE/Enceladus/tree/master/cloud-netconfig [16:43] We currently have no GCE specific information but that is easy enough to add. The GCE guest environment handles this and we use the GCE guest environment code in our images in GCE [16:43] mgerdts: so can a user turn off that feature on an instance once they've already deployed, or is it create-time only [16:43] It can be flipped at any time, in the current implementation. [16:44] current implementation is only in a development branch [16:47] mgerdts: the only things I can see being an issue with the maintain network in cloud-init is that we are adding the cost of another function call && metdata dict parse to look for a signal about maintaining the network. I agree that cloud-init having granularity between is_new_instance vs just re-do network, is something that cloud-init should have. [16:48] we probably need to discuss this too with rharper about what short-term vision we can get to while we await our network hotplug support in cloud-init proper [16:49] I'd tend to agree that waiting on fully baked hotplug solution is probably too long in this case [16:49] as that runway will be at least 2 months I'd think [16:50] ok, I'll take an action item to resolve this if we can by next meeting [16:50] Not only that, but support for it will likely require changes in the host as well. We tend not to do host updates very often, so it could be a year or more after the feature is available in images before it will be useful. [16:51] #action blackboxsw to have discussions w/ team on datasource maintaining network on each reboot per https://code.launchpad.net/~mgerdts/cloud-init/+git/cloud-init/+merge/343712 [16:51] ACTION: blackboxsw to have discussions w/ team on datasource maintaining network on each reboot per https://code.launchpad.net/~mgerdts/cloud-init/+git/cloud-init/+merge/343712 [16:51] thanks [16:51] good topic. [16:52] Is there another place that is good to catch up with larsks or other people that can offer guidance on for redhat/centos? [16:52] let's see, anything else folks want to chat about? stagnant reviews, bugs of interest etc? [16:53] * blackboxsw looks at the last cloud-init community summit attendees list to see if rhel folks have another contacts that was supposed to replace larsks [16:53] Chad, Is it possible that someone from cloud-init team can take a look at https://bugs.launchpad.net/ubuntu/+source/cloud-init/+bug/1766538 [16:53] Launchpad bug 1766538 in cloud-init (Ubuntu) "network customization with cloud-init does not work on Ubuntu18.04 Beta2 Server" [Undecided,Confirmed] [16:55] mgerts, ryan mccabe is a potential contact too, looks like he's not here either today. [16:56] ok, thanks [16:56] hrm, yeah not certain what mechanism is used to get cloud-init updated into RedHat mgerdts. Maybe filing a redhat bug about the request [16:57] mgerdts: https://bugzilla.redhat.com/ maybe [16:58] stanguturi: yes we can, we are trying to sort and understand any bugs against Bionic that we can [16:58] ok, I can try that. [16:58] #link https://bugs.launchpad.net/ubuntu/+source/cloud-init/+bug/1766538 [16:58] Launchpad bug 1766538 in cloud-init (Ubuntu) "network customization with cloud-init does not work on Ubuntu18.04 Beta2 Server" [Undecided,Confirmed] [16:58] blackboxsw: Thanks [17:00] stanguturi: ok, so this is netplan + cloud-init related right? [17:00] blackboxsw: Yes. [17:01] what does network hotplug mean in cloud-init context? [17:02] * blackboxsw tries to remember what vmware datasource does, (like writing files direct to network /etc/network/interfaces.d) [17:02] akik: https://hackmd.io/M1Tae41PQBC7a9qMsurTJw?both is a shared document for comment on hotplug in cloud-init [17:02] #link https://hackmd.io/M1Tae41PQBC7a9qMsurTJw?both [17:03] * blackboxsw looks to see if there was a better doc hrm [17:03] blackboxsw: Oh. But in the case of netplan, why does cloud-init remembers? [17:04] blackboxsw: does it mean that cloud-init stays running, waiting for new network interfaces to appear? [17:04] akik: right, it would mean that you wouldn't have to reboot cloud-init if devices get added at a later time (post-boot) [17:05] cloud-init would listen to some sort of event channel and react, re-write, and apply network config to add new devices [17:06] would it do the same thing as you could do with ansible or puppet? sorry i'm trying to understand why you would do it with cloud-init [17:08] akik: you would try to do it with cloud-init if you didn't want to rely on additional configuration management solutions if the only thing you needed was network config to reflect reality (not full system configuration and system automation) [17:09] * blackboxsw has more puppet/chef background than ansible. [17:09] cloud-init does currently detect and write network configuration based on what the user/cloud-metadata tell us is the proper config for the instance [17:10] i only thought of cloud-init to do the initial configuration [17:10] so it would follow that if the metadata could dynamically tell the instance that network config has changed, cloud-init should probably try to react to that to fix the config to match the updated network configuration [17:11] akik: correct. cloud-init current only handle initial boot config and leaves the rest up whatever mechanism someone uses to update detailed config after that boot [17:11] ok thanks [17:12] akik: and we'd make that feature configurable (handle hotplug:True/False) so if users have other services handling hotplug cloud-init wouldn't collide [17:13] ok I think we're hitting the end of office hours. please feel free to continue discussion, we all poke around here throughout the day as our primary means of communication [17:13] thanks robjo akik stanguturi and mgerdts for the lively discussion [17:13] stanguturi: I'll dig up more info on that bug today [17:13] thank you [17:13] as always notes will be here [17:14] #link https://cloud-init.github.io/ [17:14] #endmeeting [17:14] Meeting ended Mon Apr 30 17:14:19 2018 UTC. [17:14] Minutes: http://ubottu.com/meetingology/logs/cloud-init/2018/cloud-init.2018-04-30-16.04.moin.txt [17:33] Thanks blackboxsw [17:41] blackboxsw: after quite too long... just updated [17:41] https://code.launchpad.net/~smoser/cloud-init/+git/cloud-init/+merge/344784 [17:43] there is one behavioral change there. === shardy is now known as shardy_afk === robjo is now known as rschweikert_phon [18:03] smoser: checking now [18:04] just rebaed and fixed the pylint herror === rschweikert_phon is now known as robjo [18:09] blackboxsw: could you mark https://code.launchpad.net/~smoser/cloud-init/+git/cloud-init/+merge/342334 as approved ? or just make sure you'lre ok with that ? and then i'll queue it into ubuntu/devel [18:13] smoser: ok so you drop the is_ibm_cloud == not_found detection from nocloud in ds-identify now right? [18:20] minor unittest change and +1 on it. I'm testing the updated deb in ibm w/ a nocloud setup to be sure. [18:21] correct. i dropped that (previously) [18:21] the new change is to drop the ibm_cloud detection for the config-drive seed case. [18:21] ok I must've missed that earlier, but I wanted a specifically exercise tha path (as I didn't look at it formerly) [18:24] do we want to queue your merge 344784 too? [18:30] https://code.launchpad.net/~smoser/cloud-init/+git/cloud-init/+merge/342334 is approved, but doesn't it need a rebase now to fix versioning in debian/chanelog [18:32] blackboxsw: yeah... so thats kind of an interesting topic i talked iwth rharper some last week. [18:33] last time i proposed two MPs for merge int ubuntu/* branches (the addition of the isc-dhcp-client and iproute2) [18:33] i did not upate the debian/changelog [18:33] largely because of that. [18:33] but then... debian/changelog didn't get updated... [18:33] i caught those later before the SRU upload. [18:33] i'm not really sure how to handle this on the commits to ubuntu/ branches [18:42] so a commit to debian/changelog with an out of order version won't matter will if we are following up with something >= during our next 'C' package release right? [19:03] blackboxsw: you are corect. we do not want to merge that out of order like that. [19:03] Just validated nocloud and configdrive behavior on IBM: https://code.launchpad.net/~smoser/cloud-init/+git/cloud-init/+merge/344784 can land with minor unittest tweak [19:08] blackboxsw: the line go shorter then original [19:08] ie.. at first i neded the paren [19:08] i'll drop . thanks. [19:08] guessed as much [19:09] pushed update [19:22] smoser: landed in tip [19:27] as far as the dropping of new-upstream-snapshot. it has current merge conflicts with ubuntu/devel, so I'm not sure how to resolve it other than rebasing and rebuilding debian/changelog with a new entry, or manually resolving the conflict and injecting this debian/changelog item where you think best in the most recent changelog section. [19:28] I guess it's easiest in the future if we don't let changes against ubuntu/devel age, and try to resolve them before we pull in any other new-upstream-snapshots against that branch. [19:32] blackboxsw: cat a minute ? [19:32] chat even [19:36] chat with a cat on my way [20:07] https://hackmd.io/jlq3C4qbSgurZ_DZ5GTiuw [20:36] did you guys see #1768118? fwiw, i tried such an upgrade and it worked just fine [20:39] bug #1768118 [20:39] bug 1768118 in cloud-init (Ubuntu) "/etc/netplan/50-cloud-init.yaml fails silently with no dhcp address assigned" [Undecided,New] https://launchpad.net/bugs/1768118 [20:40] not sure why it was filed against cloud-init, it's lacking many details [20:40] thanks ahasenack looking now. [20:40] smoser's on it ahasenack thanks [20:44] https://bugs.launchpad.net/ubuntu/+source/cloud-init/+bug/1766538 here's another network related concern to peek at [20:44] Launchpad bug 1766538 in cloud-init (Ubuntu) "network customization with cloud-init does not work on Ubuntu18.04 Beta2 Server" [Undecided,Confirmed] [20:51] blackboxsw: I don't suspect that's a cloud-init issue, 18.04 desktop nor server are going to have ifupdown installed; I don't think we're getting the whoe picture there [20:52] rharper: yeah we are talking about this now. we're gonna ask for cloud-init collect-logs [20:52] rharper: yeah. [20:52] well, logs won't matter [20:53] and go to bed ;) [20:53] 18.04 has never had ifupdown in it [20:53] so someone's not telling the whole story [20:53] it's also vmware [20:53] yeah agreed [20:53] it absolutely rendered witih netplan [20:53] which has quite a few odd bugs filed w.r.t cloud-init and no vmware datasource, etc [20:53] the logs are clear on that. [20:53] so I suspect testing issue [20:53] hehe [20:53] and multiple boots in that log listed too [20:54] might have started as xenial or something who knows [20:54] yes [20:54] so, a dist-upgrade to 18.04 would make sense [20:54] but the scenario is far from clear in the bug [20:57] mgerdts: https://pythonhosted.org/smartdc/ [20:57] is that something you would recommend ? have experience with ? [20:59] I don't have experience with it, but I can ask around internally to see what people say. There's also #smartos that is pretty active. [20:59] version history that ends about 5 years ago makes me worry. [21:00] rharper: marked incomplete [21:00] mgerdts: fair. i'm basically looking for something to launch an instance with python. none of our tools are node , so i dont want that at the moment. [21:06] smoser: for xenial https://code.launchpad.net/~chad.smith/cloud-init/+git/cloud-init/+merge/344866 [21:09] blackboxsw: i think you cherry-ipcked the wrong thing [21:09] wow typo, pulling again sorry [21:09] 11172924a48a47a7231d19d9cefe628dfddda8bf [21:11] smoser: one engineer reported having success a couple years back, but others remarked that the examples seemed to use obsolete arguments. It doesn't seem as though there's a better maintained alternative. :( [21:12] mgerdts: thanks. [21:12] In particular: "the createmachine example there uses "dataset" for the image argument... using URNs instead of UUIDs." [21:13] blackboxsw: i'll go ahead and grab that [21:13] but it loks like there are some differences between yours and mine [21:14] http://paste.ubuntu.com/p/5NBPqmpBCS/ [21:14] i think that is due to [21:14] a.) ~/.quiltrc http://paste.ubuntu.com/p/M3v6cJDfHg/ [21:14] b.) i probably have newer git and it is picking 8 hash by default [21:14] I repushed, but go4it smoser [21:15] checking [21:15] probably some changes needed to debian/cherry-pick to specify both of those things [21:16] probably right, I did this from a xenial env [21:16] if I did it from my bionic env it probably would have been equal [21:17] well, the 'ab' style comes from ~/.quiltrc [21:17] maybe i can look at making it do the rigiht thing all the time tomorrow [21:22] blackboxsw: uploading. i did take my version, but with your name. [21:22] as soon as build completes it will upload. oh. fudge. failed build. [21:22] fudge. [21:22] namedtuple usage [21:23] http://paste.ubuntu.com/p/99RW7NrSTQ/ [21:24] that commit isit stand alone. [21:24] it needs 6ef92c98c3d2b127b05d6708337efc8a81e00071 [21:25] blackboxsw: i'll be b ack in in 3 hours or so. sorry [21:26] gah, ok [21:27] sounds good [21:32] strange as tox completed for me locally [21:33] ahh can reproduce w/ make deb [21:46] blackboxsw: i think maybe just cherry pick the one you had and the other [21:46] they're both ibm related [21:46] wow 3 hours was fast [21:46] and for ibm internal test, they might want the debug thing. [21:46] sorry pushing that [21:46] yeah. [21:46] i just thought that [21:46] and i have to go afk now. [21:46] that make sense thouigh? 6ef92c98c3d2b127b05d6708337efc8a81e00071 [21:46] is what you need to make the test ru [21:46] run [21:46] i think