/srv/irclogs.ubuntu.com/2018/02/08/#maas.txt

mupBug #1748051 opened: [2.4, devel]  <MAAS:New> <https://launchpad.net/bugs/1748051>00:03
mupBug #1748052 opened: [2.4, devel] ] Unable to write to plugin cache /usr/lib/python3/dist-packages/twisted/plugins/dropin.cache: error number 13 <MAAS:Triaged> <https://launchpad.net/bugs/1748052>00:18
mupBug #1748055 opened: [2.4, devel] While commissioning/testing <MAAS:Triaged> <https://launchpad.net/bugs/1748055>00:34
taskermy maas-proxy cache is consuming some disk space. is there an official way to clear out this cache or is "rm -r /var/spool/maas-proxy/*" an acceptable solution?01:36
=== frankban|afk is now known as frankban
tosarajaIs maas itself providing the cloud-init content when deploying a host? I'm trying to deploy a 17.10 host here and I'm getting "404 Not found http://old-releases.ubuntu.com/ubuntu artful-security Release". Well, artful isn't an old-release, so why is it trying to pick it from there?08:17
mupBug #1748187 opened: Only 16.04 Xenial available for commissioning <MAAS:New> <https://launchpad.net/bugs/1748187>13:15
mupBug #1748187 changed: Only 16.04 Xenial available for commissioning <MAAS:Won't Fix> <https://launchpad.net/bugs/1748187>13:54
mupBug #1589140 changed: No WOL option in latest MAAS version for 16.04 <MAAS:Won't Fix> <https://launchpad.net/bugs/1589140>14:24
ejathi .. any update on bug 167372415:38
ejatbug #167372415:39
roaksoaxejat: seems fixed to me15:56
kikoejat, are you seeing it on a modern version of MAAS?16:08
=== frankban is now known as frankban|afk
niedbalskiroaksoax, https://bugs.launchpad.net/maas/+bug/1743144 is affecting HPe machines , not sure why that repository is added by default.19:35
niedbalskiroaksoax, https://pastebin.ubuntu.com/26542719/ fyi19:38
andrew-iiI need to deploy Debian 9, but it seems like `ifenslave` is not included by default. Is there a time in the curtin setup that I can make it install the package before it configures the NICs?19:48
andrew-ii(That is, Debian 9's default ISO doesn't include it. Perhaps I need to make a new Debian cloud installer image from scratch?)19:48
roaksoaxniedbalski: you can disable the config to install third party drivers on the settings21:32
=== TJ- is now known as Guest27109
roaksoaxniedbalski: or you can remove that from the config in /etc/maas/drivers.yaml21:32
niedbalskiroaksoax, yeah, but shouldn't be this disabled per series? as this isn't really available for xenial21:33
roaksoaxniedbalski: not relaly not. We have no way of knowing it is on the repository or not21:33
roaksoaxniedbalski: but since there's options to disable/enable this or remove the use of the driver altogether21:34
roaksoaxi think there's ways to not be affected21:34
=== TJ_Remix is now known as TJ-
roaksoaxniedbalski: in fact, you could even change the repository where you get the drivers from21:34
niedbalskiroaksoax, well, a mention in the documentation is worth then, it took me a few to discover it as by default third party drivers are enabled.21:34
roaksoaxniedbalski: uhmm seems this section as removed from the docs21:36
niedbalskiroaksoax, looks like21:36
niedbalskiroaksoax, now I am hitting 1730456 :-)21:37
roaksoaxniedbalski: what's your rackd.conf and your regiond.conf ?21:39
niedbalskiroaksoax, https://pastebin.ubuntu.com/26543275/21:41
roaksoaxniedbalski: i bet that 10.10.1.7 is not the ip the machines can reach MAAS at21:42
niedbalskiroaksoax, https://pastebin.ubuntu.com/26543312/ .. yes, that's not the address the machines are reaching (192.168.100.0/24)21:46
roaksoaxniedbalski: so you have to options, leave rackd.conf as localhost and update regiond.conf correctly21:47
roaksoaxniedbalski: or mofidy rackd.conf21:47
niedbalskiroaksoax, do you see any evident correlation with the error that I just posted? I wonder if you know something I don't :-)21:48
roaksoaxniedbalski: that one could be a clockskew thing21:49
niedbalskiroaksoax, clock is aligned in both maas/deployed node21:49
niedbalskianyways, I am ntp syncing and modifying regiond accordingly.21:49
roaksoaxyeah seems cloud-init is doing the right thing by fixing the clock skew21:50
roaksoaxniedbalski: did you fix rackd.conf, restarted it and retried ?21:50
niedbalskiroaksoax, probando doctor21:50
niedbalskiroaksoax, https://pastebin.ubuntu.com/26543384/21:59
roaksoaxniedbalski: release the machine, or abort it and try ?22:00
niedbalskiroaksoax, just did it.22:00
roaksoaxniedbalski: it could be due to the clock skew that it cannot authorize22:00
roaksoaxniedbalski: hwat about other mcahines ?22:00
niedbalskiroaksoax, yes, is there something like a cache for tokens or similar?22:00
roaksoaxno, we dont cache tokes22:01
roaksoaxno, we dont cache tokens22:01
roaksoaxeach tim e a new token gets re-generated22:01
roaksoaxniedbalski: that said, is this a commissioning or enlistment ?22:01
roaksoaxniedbalski: err or deployment ?22:01
niedbalskiroaksoax, thats ok, other machines (non hp) works ok, and this started to happen after I disabled the third party drivers22:01
niedbalskiroaksoax, commissioning22:01
roaksoaxniedbalski: the other one could be that its ready user data from the disk instead of the pxe process22:02
niedbalskiroaksoax, the mac address remains the same, i deleted/created the machine with another name/uuid just in case.22:04
roaksoaxniedbalski: yeah lets try that and see what happens22:05
roaksoaxthat's a strange error though22:05
niedbalskiroaksoax, should I wipe out the disks before?22:06
roaksoaxniedbalski: if you could that'd be good22:06
niedbalskiroaksoax, might be the old userdata is being read from the disk22:06
roaksoaxindeed22:06
niedbalskiroaksoax, have you seen something like that?22:06
roaksoaxniedbalski: nope, i personally havent, althjough we did fix a bug long time ago that required a new cloud-init so that it wouldn't read form disk22:07
roaksoaxniedbalski: are your images the latest ?22:07
niedbalskiroaksoax, I think yes, they are in sync with images.maas.io22:07
roaksoaxi wonder if cloud-init could have regressed and no longer listens to this option we send it kernel params22:08
niedbalskiroaksoax, let me see, I will wipeout the disk arrays22:08
niedbalskiroaksoax, which option?22:08
roaksoaxwe send an option on the kernel command line to alwys use the maas datasource22:09
niedbalskiroaksoax, commission with any non-available ppa, (commission fails), remove the ppa and retry commissioning22:11
niedbalskisimplest reproducer.22:11
xygnalroaksoax this memory bug is killing our performance.  our clients are u happy with the delays. how do we step up the level of toubleshooting on 174476522:17
roaksoaxxygnal: at this point, the only thing i can't think of is that's related to you running on top of vmware22:33
roaksoaxxygnal: we have confirmed we have larger maas' (e.g. way many more machines in a single maas), running on hardware that dont exhibit these issues22:33
roaksoaxniedbalski: aha! you weren't giving me enough info. So it is clear now. cloud-init fails to configure the archive, it tels that to maas and the machine gets marked as faled commissioning, the oauth keys expire and you see that in the logs :)22:34
roaksoaxxygnal: on the same version that is22:36
roaksoaxxygnal: so you could run a test by dumping your db, and importing it in a cleanly installed maas of the same version22:36
roaksoaxxygnal: on different underlying hardware22:36
niedbalskiroaksoax, https://pastebin.ubuntu.com/26543565/22:37
niedbalskiroaksoax, ok, so the first failure was ok, then I removed the archive (disabled third party drivers completely) and re-commissioned22:38
roaksoaxniedbalski: yeah that makes sense22:39
niedbalskiroaksoax, ok, failed the commissioning with the same error22:41
niedbalskii wiped out the disk in case it was using disk source22:41
roaksoaxniedbalski: can you show all the cloud-init-output log ?22:45
roaksoaxniedbalski: and all cloud-init.log too22:45
niedbalskiroaksoax, oops :-)22:45
niedbalskiroaksoax, q: does the oauth tokens gets expired in case of failure during any of the cloud-init stages?22:46
roaksoaxniedbalski: yes, so if cloud-init sayas "hey maas i failed to configure this thing you told me to" maas sees the failure message from cloud-init, and marks it failed commissioning22:47
roaksoaxniedbalski: and then you would see those errors that you cant access the metadata22:47
niedbalskiroaksoax, gotcha, let me retry and upload the full cloud-init logs for you to look.22:48
niedbalskiroaksoax, ok, now all went through .. i removed the maas squid cache, the problem was due to a hash mismatch while running the apt update phase on cloud-init, not sure how the proxy ended up that way.22:51
niedbalskiroaksoax, deploying now, lets see :-)22:52
roaksoaxcool22:53
xygnalroaksoax how much memory on these bare metal region controllers you test on?22:57

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!