/srv/irclogs.ubuntu.com/2018/01/24/#cloud-init.txt

blackboxswhrm... ok seeing we get through cloud-init modules:config stages which means the datasource succeeded00:00
blackboxswhrm have to step away a bit. sorry for the moment dojordan_ will check it out00:01
blackboxswI'll have something on this later00:01
blackboxswsorry I should have tested this again this morning00:01
dojordan_no worries00:01
dojordan_one thing would be great if you could do, would be to change logging level so we can see more info on the serial port00:02
* blackboxsw clicks enable boot diagnostics logging in the UI and clicks reboot on this instance00:07
blackboxswand bailing for dinner00:19
dojordan_same problem on artful00:37
blackboxswsmoser: just pushed a merge proposal into bionic for today. I need a bit more time to triage what gives on Azure :/01:20
blackboxswhttps://code.launchpad.net/~chad.smith/cloud-init/+git/cloud-init/+merge/33651301:20
blackboxswok I'm out for the night. gotta do the bedtime routine with the kiddos. more on azure first thing in my morning01:20
dojordan_thanks for all the help, sounds good01:21
smoserblackboxsw: fudge01:47
smoserhttps://jenkins.ubuntu.com/server/job/cloud-init-ci/725/console01:47
smoser:-(01:47
* smoser fail01:47
smoseri'm fixing and pushing. http://paste.ubuntu.com/26448094/01:58
smoserrharper, powersj blackboxsw if you're still around, to disagree or +1 that.01:58
rharperlooking01:58
smoserrunning tox + centos build && git push upstream HEAD01:58
rharperlooks sane01:59
smoserthanks01:59
rharperbtw, it's a royal pain to launch multi-subnet/ip instances via the console;  also it would be of great help if the ec2 docs would tell you what the format of the instance data is, for example local-ipv4s is a list of some sort of ipv4 addresses;  but is it comma separated, newline, space?  I can't find any examples with my google-fu so trying to get an instance up to check02:17
rharper*finally*02:46
rharpervpc is timeconsuming02:46
blackboxswhab03:00
blackboxswbah03:00
blackboxswresubmitting  the merge proposal03:04
blackboxswwith a new snapshot from master03:07
rharperok, have crude ec2 network metadata to v1 config03:09
* rharper calls it a night03:09
blackboxswnice03:14
blackboxswok new MP against bionic up. thanks for the fix smoser03:18
blackboxswhttps://code.launchpad.net/~chad.smith/cloud-init/+git/cloud-init/+merge/33651403:19
=== shardy is now known as shardy_afk
=== shardy_afk is now known as shardy
=== Guest28399 is now known as mgagne
smoserblackboxsw: on azure...15:33
smoseryou there?15:33
smoserdojordan_: west coasters.15:34
smoser(they stay up  late.. https://finance.yahoo.com/news/exclusive-fitbits-6-billion-nights-sleep-data-reveals-us-110058417.html )15:36
blackboxswhere16:03
blackboxsw:)16:03
blackboxswok azure triage time16:19
=== hrybacki is now known as hrybacki_mtg
dojordan_here now @blackboxsw17:05
blackboxswgood to know you come to work at a reasonable time like the rest of us :) I'm walking through failure path again, as smoser surmised it's likely the cdrom disappearing on us before I cloud-init clean --reboot.... so I'm adding logs etc now and going through that to confirm17:06
* blackboxsw wasn't sure yet why this seemed to work with tip of master though too. 17:07
dojordan_hmm, interesting idea. doesn't waagent copy the ovf_env.xml off of the cd ?17:07
dojordan_FWIW we remove the CD as soon as we get a provisioning message17:09
dojordan_question, shouldn't we be keeping around the ovf-file before rebooting?17:44
dojordan_i think clean --reboot deletes it17:44
dojordan_err, nvm, it lives in /var/lib/waagent/ovf-env.xml17:45
dojordan_@blackboxsw, same problem on xenial17:58
smoserdojordan_: i had asked blackboxsw to edit /etc/cloud/cloud.cfg.d/05_logging.cfg and change the console logging from WARN to DEBUG18:00
smoserand then collect console log18:01
smoserare you able to easily do that too ?18:01
dojordan_yeah18:01
smoserdojordan_: for boot diagnostics18:02
smoserwhich storage account type do you need ?18:02
* blackboxsw is already mid reboot/test on my bionic instance with debug console logs enabled, an azure storage account created and18:03
blackboxswboot logs enabled18:03
blackboxswubuntu@40.70.46.8818:04
smoserssh-import-id smoser ?18:04
blackboxswchecking boot logs now to make sure cloud-init reported correctly on this last clean boot18:04
blackboxswalready done for dojordan and smoser18:04
blackboxswi'm in byobu term18:04
smoserpermission denied18:05
smoserin18:05
blackboxswadded again18:05
blackboxswmust've typod18:05
dojordan_denied18:06
smosertry again18:06
dojordan_cool18:07
blackboxswok let's see here..... checking azure cli now to make sure I could see boot logs18:08
blackboxswbefore rebooting18:08
dojordan_worst case i can always get them :)18:08
blackboxsw az vm boot-diagnostics  get-boot-log --ids /subscriptions/12aad61c-6de4-4e53-a6c6-5aff52a83777/resourceGroups/SRUGRP10/providers/Microsoft.Compute/virtualMachines/my-b118:09
blackboxsw'ascii' codec can't decode byte 0xe2 in position 40610: ordinal not in range(128)18:09
blackboxswhrm oops az cli18:09
blackboxswchecking UI18:09
blackboxswserial log in UI is working for me18:09
blackboxswok18:09
blackboxswold log18:10
blackboxswhttp:pastebin.ubuntu.com/2645316018:10
blackboxswhttp://pastebin.ubuntu.com/2645316018:10
smoserblackboxsw: 'ordinal not in range'18:11
smoser?18:11
blackboxswyeah az cli couldn't decode the boot logs on the machine18:11
smoseris that because az is trying to .decode() the console log ?18:11
blackboxswyeah18:11
smoser:-(18:11
blackboxswso something to file against azure cli when I dig into it :/18:11
blackboxswbut UI works18:11
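[Editor's sketch] The 'ascii' codec error above is what Python raises when bytes outside the ASCII range are decoded as ASCII; 0xe2 is the lead byte of many UTF-8 punctuation characters. The snippet below is only an illustration of that failure mode and one tolerant way around it. It assumes the console log is mostly UTF-8; `decode_console_log` and the sample bytes are hypothetical and are not taken from az cli.

```python
def decode_console_log(raw: bytes) -> str:
    """Decode a serial console log without raising on unexpected bytes."""
    # errors="replace" substitutes U+FFFD for undecodable bytes instead of raising.
    return raw.decode("utf-8", errors="replace")


# Hypothetical log line containing UTF-8 curly quotes (0xe2 0x80 0x9c / 0x9d).
sample = b"cloud-init \xe2\x80\x9cfinished\xe2\x80\x9d\n"

try:
    sample.decode("ascii")
    ascii_failed = False
except UnicodeDecodeError:
    # Same class of error as "'ascii' codec can't decode byte 0xe2".
    ascii_failed = True

decoded = decode_console_log(sample)
```

Replacing bad bytes loses a little fidelity, but for a diagnostics log that is usually preferable to losing the whole log.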
dojordan_ugh, ill make a bug report18:11
blackboxswthanks dojordan_18:12
blackboxswlemme get az cli version18:12
dojordan_can you pastebin the ui logs?18:12
blackboxswhttp://paste.ubuntu.com/26453179/18:12
smoserdojordan_: you're in good company. this week, we've hit.18:12
smoser https://github.com/lxc/pylxd/issues/26818:12
smoserand18:12
smoser https://github.com/boto/botocore/issues/135118:12
blackboxswdojordan_: ui logs is http://pastebin.ubuntu.com/2645316018:12
dojordan_second boot?18:13
smoserblackboxsw: yeah, go for it.18:13
blackboxswdojordan_: smoser 2nd rebooting now18:13
blackboxswok18:13
=== hrybacki_mtg is now known as hrybacki
blackboxswhrm any way to show in cli what power state is on node18:16
dojordan_let me see18:16
blackboxswdojordan_: ok it's looping18:18
blackboxswjust got logs smoser dojordan_18:18
blackboxswlooping on reprovisiondata18:18
blackboxswcopying now18:18
blackboxswnew boot log http://pastebin.ubuntu.com/2645322518:19
blackboxswI'm looking now18:20
blackboxswyeah it's looping on 404 from reprovisioning18:20
=== shardy is now known as shardy_afk
blackboxswso, something triggered that poll which shouldn't have18:21
dojordan_DataSourceAzure.py[INFO]: Creating a marker file to poll imds18:21
dojordan_yup18:21
smoserwell, that part seems like it is functioning as designed.18:21
blackboxswhahah18:21
blackboxsw:)18:21
smoserdojordan_: logging seems extremely verbose if you're expecting this to sit up for 24 hours before use18:21
dojordan_but we won't log debug by default right?18:22
smoserdebug does go to log file, but not to console18:22
smoserlooks like < 1k/second. but that'd add up.18:22
smoserbut thats not the issue.18:23
smoserwhy did we get into the imds18:23
blackboxswso cfg.PreprovisionedVm == True18:23
blackboxswsomething in _extract_preprovisioned_vm_setting returns True18:24
blackboxswwe need to look over that ovf file again18:24
blackboxswI think18:24
dojordan_my guess is the refactoring broke something. the weird thing is it should have been covered by ut18:24
blackboxswyeah I thought so too18:24
dojordan_im re reading my code now and no idea...18:25
blackboxswI'm starting up a 2nd vm now and will run _extract_... on the doc18:26
dojordan_smart18:27
dojordan_we didnt see any of those debugs though...18:28
smoserblackboxsw: smoser@52.151.23.9118:29
smoserif you want18:29
smosertake it18:29
blackboxsw40.79.65.17118:30
blackboxswas well18:30
blackboxsw40.79.65.161 rather18:30
blackboxsw<ns1:PreprovisionedVm>false</ns1:PreprovisionedVm>18:31
blackboxswok,,,, so that should've been interpreted as false18:31
dojordan_oh no18:32
dojordan_bool("false") is true18:32
blackboxswahhahhha18:32
blackboxswohhh right18:32
blackboxswdidn't translate from string type18:32
dojordan_ill push a fix18:33
dojordan_thanks for all the help18:33
blackboxswcheers. gotta go pickup a kiddo from school18:33
blackboxswsee ya in a bit18:33
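[Editor's sketch] The bug found above is that any non-empty string is truthy in Python, so `bool("false")` evaluates to True. The snippet below reproduces that with an element shaped like the one quoted in the log and shows one way to translate the text explicitly. The namespace URI, the wrapper element, and `parse_bool_text` are assumptions made for illustration; this is not the actual fix that was pushed.

```python
import xml.etree.ElementTree as ET

# Strings treated as true; everything else (including "false") is false.
TRUE_STRINGS = {"true", "1", "yes", "on"}


def parse_bool_text(text):
    """Translate element text to a bool instead of relying on truthiness."""
    if text is None:
        return False
    return text.strip().lower() in TRUE_STRINGS


# Hypothetical namespace and wrapper; only the PreprovisionedVm element
# mirrors what was pasted in the channel.
NS = "http://example.invalid/ns1"
OVF_SNIPPET = (
    '<ns1:PlatformSettings xmlns:ns1="%s">'
    "<ns1:PreprovisionedVm>false</ns1:PreprovisionedVm>"
    "</ns1:PlatformSettings>" % NS
)

node = ET.fromstring(OVF_SNIPPET).find("{%s}PreprovisionedVm" % NS)

buggy = bool(node.text)             # non-empty string, so this is True
fixed = parse_bool_text(node.text)  # explicit translation, so this is False
```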
dojordan_@smoser, @blackboxsw, I pushed a fix, and added another UT that would have caught it. Testing now in azure.19:37
dojordan_my thoughts on removing the verbose logging: maybe just log a byte every request of something. Also, do we have a log level that goes to the console by default?19:47
blackboxswdojordan_: warning level is configured to the console by default.19:58
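[Editor's sketch] The split described here (debug goes to the log file, only warnings and above reach the console) can be illustrated with plain Python logging handlers. This is a stand-alone sketch, not cloud-init's actual logging configuration; the logger name and the in-memory streams standing in for the console and the log file are assumptions.

```python
import io
import logging


def build_logger(console_stream, file_stream):
    """Logger with a WARNING-level 'console' and a DEBUG-level 'file'."""
    logger = logging.getLogger("sketch.console_vs_file")
    logger.setLevel(logging.DEBUG)
    logger.handlers.clear()
    logger.propagate = False

    console = logging.StreamHandler(console_stream)
    console.setLevel(logging.WARNING)   # only WARNING and above reach the console

    logfile = logging.StreamHandler(file_stream)
    logfile.setLevel(logging.DEBUG)     # everything goes to the log file

    logger.addHandler(console)
    logger.addHandler(logfile)
    return logger


console_out, file_out = io.StringIO(), io.StringIO()
log = build_logger(console_out, file_out)
log.debug("polling IMDS")
log.warning("something went wrong")
```

Under this arrangement, making a routine message visible on the console means either raising its level to WARNING or lowering the console handler's threshold.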
* blackboxsw wonders about us adding a param in a subsequent branch to url_helper.readurl(quiet=(False|True)); then callers handling retries outside of that could turn down the volume of logs20:04
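[Editor's sketch] The idea floated here, a `quiet` switch so a caller that retries for a long time can suppress per-attempt logging, might look roughly like the following. Everything in it is hypothetical: `poll`, its parameters, and `ListHandler` are illustration only and do not describe url_helper.readurl or any change that was made.

```python
import logging


class ListHandler(logging.Handler):
    """Collects formatted messages in memory so the log volume can be inspected."""

    def __init__(self):
        super().__init__(level=logging.DEBUG)
        self.messages = []

    def emit(self, record):
        self.messages.append(record.getMessage())


def poll(fetch, max_tries, log, quiet=False):
    """Call fetch() until it succeeds or max_tries is exhausted.

    With quiet=True, per-attempt failures are not logged; only the final
    give-up message is.
    """
    last_error = None
    for attempt in range(1, max_tries + 1):
        try:
            return fetch()
        except IOError as err:
            last_error = err
            if not quiet:
                log.debug("attempt %d failed: %s", attempt, err)
    log.debug("gave up after %d attempts: %s", max_tries, last_error)
    return None
```

A caller that expects to wait hours for a response would pass quiet=True and log its own occasional summary instead.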
blackboxsw testing your latest branch now too20:10
dojordan_same here, *fingers crossed*20:18
dojordan_i got permission denied using password auth but at least the ECDSA host key changed on me20:21
smoserssh auth shouldnt be affected. if you get there, it really should let you in20:23
smosernothing would have deleted your keys20:23
dojordan_password would have been redacted in the ovf-env.xml, not sure what that changes20:23
blackboxswI know it's a nit, but changing the log message Start polling IMDS from debug -> warning feels like it really shouldn't be a warning level log20:24
blackboxswmaybe I'm wrong (I know you are probably just trying to get it to show up in default console log configuration)20:25
dojordan_right, im open to other options, but it would be nice to get to the console20:26
smoseri can understand wanting to see something on the console (from the azure platform perspective)20:27
smoserbut it kind of stinks from the users' perspective.20:27
smoserthey have a right to expect WARN in the logs to mean "something went wrong"20:27
smoserbut here nothing in their control actually went wrong.20:28
dojordan_true...20:28
dojordan_im fine reverting it now that we found this bug20:28
smoseryeah. i think that is best for now.20:28
blackboxswdojordan_: one more thing while you are in there.20:28
smoseri have said many times i think python logging lacks level granularity20:28
smoserand cloud-init usage of what *is* there is bad.20:28
blackboxswthere's a util.translate_bool that might be of use in checking that truthy value from ovf file20:29
smoserit seems to me that this should qualify as INFO level20:29
smoserand at some point maybe a concerted effort could get INFO to the console20:29
blackboxswbtw smoser and dojordan_ success ubuntu@40.79.65.16120:33
dojordan_sweet!20:33
dojordan_what distro?20:33
blackboxswdojordan_: bionic, running through xenial now20:34
dojordan_Y20:34
dojordan_cool, I got back in on xenial too20:37
dojordan_just pushed those two changes (correct log level, and util.translate_bool)20:38
blackboxswgreat, xenial looks good for me too.20:47
smoser\o/20:48
blackboxswok, I'll land this when ci completes its vote dojordan_20:50
dojordan_thanks!20:51
blackboxswthanks for "dotting the i's and crossing the t's"20:57
robjosmoser: As touched on in previous discussion platform.linux_distribution() is deprecated in upstream Python and as of version 3.7 is expected to go away, in 3.6 on SUSE it returns an empty tuple, thus useless20:59
robjoI take it in Ubuntu you guys patched Python20:59
robjoanyway, I think we should make a decision if we expand the dependencies to python-distro or if cloud-init gets its own function to determine the distribution21:00
robjothoughts?21:00
robjohttps://github.com/nir0s/distro#distro---a-linux-os-platform-information-api21:01
smoserhm.21:01
smoseri think i'd just want to build my own.21:02
smosers/my/own/21:02
smosers/my/our/21:02
smoseri dont want an external dependency for something as seemingly simple as "figure out if you are on ubuntu, suse, redhat, ...".21:03
smoser:-(21:03
robjofair enough, something like this?21:03
robjoif os.path.exists('/etc/os-release'):21:04
robjo  use it and determine the distro21:04
robjoelse:21:04
robjo  try:21:04
robjo    platform.linux_distribution()21:05
robjoexcept:21:05
robjo......21:05
robjoSound reasonable?21:05
smoseryeah i guess. i'd also like to let the packager easily just set it21:05
robjowell that's the other option, just punt and make the person running setup set the distro then we save the code altogether21:08
smoserwell, i think i'd like it to do the right thing, but if the logic that is there doesnt "do the right thing", then let the packager set it.21:12
smoseri want trunk to "just work" though21:13
robjoOK, I'll see what I can come up with21:13
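[Editor's sketch] The approach outlined above (prefer /etc/os-release, and let the packager set the value outright) could be sketched as follows. The function names, the `override` parameter, and the "unknown" fallback are assumptions for illustration; the sketch leaves out the platform.linux_distribution() fallback from the outline because that call is described above as deprecated, and it is not the code that eventually landed.

```python
import os


def parse_os_release(text):
    """Parse KEY=value lines of an os-release file into a dict."""
    info = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Values may be wrapped in single or double quotes.
        info[key] = value.strip().strip('"').strip("'")
    return info


def detect_distro(path="/etc/os-release", override=None):
    """Return a distro id, preferring an explicit packager override."""
    if override:
        return override
    if os.path.exists(path):
        with open(path, encoding="utf-8") as fh:
            return parse_os_release(fh.read()).get("ID", "unknown")
    return "unknown"
```

A packager whose system is not detected correctly would pass `override` (for example from a setup option) rather than patch the detection logic.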
robjohttps://bugs.launchpad.net/cloud-init/+bug/174523521:24
ubot5Ubuntu bug 1745235 in cloud-init "distribution detection" [Undecided,New]21:24
dojordan_@smoser and @blackboxsw, thank you for all your help landing this PR. When will the nightly azure images contain these changes?23:15
blackboxswheh, was going to ping you that I just landed it :)23:15
blackboxswshould be in bionic tomorrow23:15
blackboxswI'm thinking we will probably SRU in February.... so xenial, artful would have it our next SRU23:16
dojordan_bionic will work for me :)23:21
blackboxswdojordan_: oopsie, sorry I need to propose for merging into bionic23:22
blackboxswI'll put up another merge proposal tonight. we can probably land that tomorrow and it'll be published friday23:22
blackboxswjust landing robjo's btrfs branch too23:22
dojordan_gotcha. Is the bionic branch just a delayed mirror of master?23:22
blackboxswdojordan_: yeah the way we structure bionic publishing is just to mirror all content from master tip23:24
blackboxswfor SRUs into xenial, zesty, artful releases we take a snapshot of tip as well and if some significant behavior change requires attention to retain backward compatibility we carry a small patch to retain behavior in xenial.23:26
blackboxswsince bionic is not officially in feature freeze until March 2018, any change in behavior of cloud-init is given the go-ahead, so snapshots are easy https://wiki.ubuntu.com/BionicBeaver/ReleaseSchedule23:28
blackboxswSRUs into ubuntu series that are 'stable/released' require a bit more work on our end with testing/verification23:29
blackboxswhttps://wiki.ubuntu.com/CloudinitUpdates for our SRU process (TMI I know)23:29
dojordan_got it, this explains a lot. (not TMI :) )23:30
