/srv/irclogs.ubuntu.com/2014/09/30/#maas.txt

rick_h_so are preseeds a way I could suggestion someone use ansible with maas w/o juju?01:11
rick_h_basically a way to run some custom cloud-init when the node is brought up01:12
thetravtrying to learn MAAS. Is there a command to re-image a node?01:35
thetraveverything seems to work properly to get one up and running, but then I do my configuration, break it horrible, and want to start again01:36
thetravs/horrible/horribly01:36
thetravmaybe I can commission it with a different release, then switch it back01:39
thetravseems a bit long winded though01:39
thetravtrying to get MAAS to re-image a machine... so far it seems to just be re-booting it with the existing OS in place.  Anyone know how to get it to start from scratch again?03:42
jtvthetrav: if you just release it, it will be reinstalled on next boot.03:42
thetravhow do I release it?03:43
thetravthere's no button for that in the web ui :P03:43
thetravmaas -h also doesn't lead me to any answers03:43
jtvThere should be, if it's currently allocated to you.03:43
jtvOh, you want to re-commission?03:43
thetravthere is a "stop" button03:43
jtvI fell into the middle of what you were saying earlier, so I'm missing some context.03:43
thetravI think what I was saying earlier is the same03:44
jtvAh yes, older version than what we're working with.  Sorry.  That "stop" button released the node.03:44
thetravor it was meant to be03:44
thetravI'm working with whatever is in the ubuntu package repo03:44
thetravI figured that would be "stable"03:44
jtvWe're developing the next version, and so we're more actively familiar with it.03:44
thetravright03:45
thetravmakes sense03:45
thetravis there a way to do it in the old version?03:45
jtvIf you hit Stop, the node should then go into the Ready state.03:45
thetravyeah, it does do that03:45
jtvGood.  Now, if you re-allocate and restart it again (the Start button in your version), it should reinstall.03:45
thetravre-allocate?03:46
thetravif I click the start button it just powers up the machine.  Still has the existing file system and ubuntu install03:46
jtvYeah.  There's two steps to deploying a machine, which are more clearly distinct in the next version: first you allocate the machine, then you fire it up.03:46
jtvIf it went through Ready and is now in Allocated state, it should _not_ have the same install...03:46
jtvIf this doesn't boot you back into the installer, I suspect it's just not netbooting.03:48
thetravright03:49
jtvQuestion is, if it's not netbooting, how did it install before?03:49
thetravso... one of the things I noticed03:49
thetravin the preseed there's a line that says "turn off PXE netboot"03:50
jtvYeah, that happens at the end when the node's installed and deploying.03:50
thetravand when I go look at the bios in CIMC, PXE is not in the Actual Boot Order03:50
thetravis MAAS supposed to be modifying the boot order?03:51
thetravwhen it shuts down?03:51
jtvWhen the node gets released, it should set it to netboot again.03:51
thetravright03:51
thetravso I think that's not happening right03:51
jtvSounds like.03:51
thetravwhat mechanism is it using to do that?03:51
jtvIIRC it's a parameter to the power command: "come up, and when you do, netboot."03:52
thetravright, so in this case I've configured it to IPMI v203:52
thetravI do recall seeing something about a bug with cisco integrated management controller and MAAS power management03:53
* thetrav searches03:53
thetravso it may be that I just have to manually set it to netboot whenever i restart03:55
jtvThat'd be annoying.03:56
thetravyep03:56
thetravsomething like this: https://bugs.launchpad.net/maas/+bug/130047603:57
ubot5Ubuntu bug 1300476 in maas (Ubuntu) "Unable to setup BMC/UCS user on Cisco B200 M3" [Critical,Fix released]03:57
jtv(You _can_ restart a node that you currently own without MAAS' involvement, of course, and that won't need the change.  But still.)03:57
thetravalthough this is not a B200, it's a C240-M3S03:57
thetravyeah, so I told it to netboot using KVM and it's re-imaging now03:58
thetravwould be nice if I didn't need KVM though03:58
jtvI don't suppose this hardware supports the UCS power method?03:59
thetravI believe UCS requires expensive hardware that we haven't purchased04:00
thetravI don't really know though04:00
thetravI"m a software guy more than a hardware guy04:00
thetravthis space is all pretty new to me04:00
thetravwhen you say UCS do you mean the cisco unified computing system dealy?04:01
thetravor is there another meaning for that achronym?04:01
jtvI thinkthat's the one... Unified Computing and Servers?04:04
thetravyeah, so everything I've read and been told (admittedly by cisco sales guys) about that is that I can't use it without a 25k fabric interconnect04:05
thetravor maybe it's the fabric extender04:06
thetravpoint is, it's some seriously expensive fabric04:06
thetravfor my budget of $004:06
thetravthat bug is not the issue btw04:07
thetravI checked and it has created a maas user for the IPMI stuff04:07
thetravit's just not adjusting the boot settings04:07
thetravat least on server shut down04:07
jtvYeah your problem is a different one from that bug.04:08
jtvI'm not sure it should be adjusting on shutdown — I think it does that on power up.04:08
=== CyberJacob|Away is now known as CyberJacob
=== CyberJacob is now known as CyberJacob|Away
=== MasterPiece is now known as Qarekhani
=== Qarekhani is now known as MasterPiece
=== MasterPiece is now known as Qarekhani
=== Qarekhani is now known as MasterPiece
ram_how to validate if maas is installed correctly on the server? - I do not get the URL <ServerIP>/maas working in my setup11:00
=== jefferai_ is now known as jefferai
=== jfarschman is now known as MilesDenver
=== jfarschman is now known as MilesDenver
rick_h_any maas folks around to help me get through this ssh problem. We're trying to qa updates to quickstart to enable maas support and having some fun15:18
roaksoaxrick_h_: how can we help you?16:00
rick_h_roaksoax: I've got maas running on 3 nucs, we thought we had everything good but keep having issues with ssh and juju from the maas controller to the two nucs it's controlling16:00
rick_h_roaksoax: I'm confused about how the amt/maas control stuff is meant to work so maybe things look setup right but aren't16:01
roaksoaxrick_h_: what are the issues?16:01
rick_h_roaksoax: juju is unable to ssh https://bugs.launchpad.net/juju-core/+bug/1314682 it looks a lot like that bug16:01
ubot5Ubuntu bug 1314682 in juju-core "Bootstrap fails because of virt-manager config" [High,Triaged]16:01
rick_h_roaksoax: so I think my amt control isn't 100% correct16:01
rick_h_roaksoax: so if I can fire a couple of questions maybe it'll lead to something16:01
rick_h_roaksoax: on the amt node, I started out with it setup dhcp, but chnged it to static ip in an effort to make sure the node is always in the same place16:02
rick_h_roaksoax: I entered that into the maas power settings16:02
roaksoaxrick_h_: did you add AMT credentials for each of the nodes and confirmed MAAS power's them on on a juju bootstrap?16:02
rick_h_roaksoax: and when the machine is commissioned, it gets a different ip, is that ok? amt has 10.0.0.101 and comissioned one gets 10.0.0.250?16:02
rick_h_roaksoax: well that's the thing. in maas it's 'start/stop' but that doesn't seem to really control power on or power off?16:03
roaksoaxrick_h_: what version of MAAS are you using?16:03
rick_h_roaksoax: I moved to the daily ppa last night trying to work around a different issue16:03
rick_h_1.6.1+bzr2550+2551+295~ppa0~ubuntu14.04.116:04
rick_h_roaksoax: it's setup at maas.jujugui.org and happy to help give access if it helps in debugging16:04
roaksoaxrick_h_: can you test ppa:maas-maintainers/experimental ?16:04
rick_h_roaksoax: sure thing16:05
roaksoaxrick_h_: who is giving IP Address ot AMT? I'd suggest you configure the IP manually for each AMT host and not on a range that MAAS manages16:05
rick_h_roaksoax: right, that's what I've done. I've hard coded the amt ip now on both nucs16:06
rick_h_roaksoax: and then maas gives the machine a dynamic space ip when it comissions16:06
roaksoaxrick_h_:yes that's fine. The static IP allocation means that MAAS pics an IP and assigns it to the node on *start*16:06
rick_h_roaksoax: ok16:06
rick_h_roaksoax: 1.7 installing now16:07
rick_h_roaksoax: hmm, change to maas_local_settings.py in upgrade there removing all rabbitmq?16:08
rick_h_roaksoax: sent diff in pm if that's expected?16:09
roaksoaxrick_h_: Y16:11
roaksoaxrick_h_: that's expeceted16:11
rick_h_roaksoax: ok cool thanks for sanity check16:11
rick_h_smaller one on pserv.yaml accepted as well16:12
roaksoaxrick_h_: i'll try to handle that automatically16:12
rick_h_ok, new ui loaded :)16:13
rick_h_"Boot image import process not started. Nodes will not be able to provision without boot images. Start the boot images import process to resolve this issue." warning16:13
roaksoaxrick_h_: ah yeah, og to the Images tab16:14
roaksoaxand import images again :)16:14
roaksoaxrick_h_: but the images should be there16:14
roaksoaxrick_h_: we just haven't migrated16:14
roaksoaxyet16:14
rick_h_ah cool yea got it16:14
rick_h_ok, see some more options on the nodes as well16:14
rick_h_roaksoax: so stop == shutdown?16:14
roaksoaxrick_h_: yes! please files bugs if you thinkg that should be changed :)16:15
rick_h_roaksoax: so I could not start/stop the node with the error that the config didn't allow it16:18
rick_h_roaksoax: so I went in to edit the node and in the 'power type' I have a select list with no options16:18
rick_h_roaksoax: so it seems I lost my power info on the node in the upgrade and have no valid types to choose from now16:19
roaksoaxrick_h_: so what if you do: sudo service apache2 restart && sudo service maas-cluster-register restart && sudo service maas-cluster restart16:20
roaksoaxrick_h_: wait a bit16:20
roaksoaxrick_h_: and check under the 'Clusters' tab to see if the cluster is connected16:20
rick_h_roaksoax: running now16:20
roaksoaxrick_h_: once connected, you should get that back16:20
rick_h_roaksoax: rgr16:20
roaksoaxrick_h_: did it work this time?16:26
rick_h_roaksoax: so it grabbed some ips on the wrong network so editing the pserv.yaml and maas_cluster.conf to update those ips and restarting16:26
roaksoaxrick_h_: sudo dpkg-reconfigure maas-cluster-controller :)16:27
rick_h_roaksoax: good to know16:27
rick_h_roaksoax: ok, restarted and connected cluster now16:27
rick_h_roaksoax: ok, so my power config is back and set16:28
roaksoaxrick_h_: ok great, let's try to test this time16:28
rick_h_roaksoax: is the mac addr diff or the same from amt to the 'machine'? (I have them as the same but curious if I should be looking for a diff one)16:28
roaksoaxrick_h_: i think it might be the same16:28
rick_h_machine is off, trying to start it brings up same error "The action "Start selected nodes" could not be performed on 1 node because its state does not allow that action.16:28
roaksoaxrick_h_: i cna't remember.. don't have a NUC in hand now unfortunately16:28
rick_h_roaksoax: all good16:29
roaksoaxrick_h_: Allocate machine first16:29
roaksoaxrick_h_: and then you can start it16:29
rick_h_allocate == commission?16:29
rick_h_it's showing as status of ready atm16:29
rick_h_ah, more details in the new edit details page16:29
rick_h_Failed to query node's BMC — Node could not be queried node-ee9f70b4-48aa-11e4-8a8c-eca86bffcfed (nuc1) amt failed with return code 2: Missing amttool (amtterm package)16:30
roaksoaxrick_h_: commissioning is the stage where MAAS lears about the machine16:30
roaksoaxrick_h_: there you go, sudo apt-get install amtterm :)16:30
roaksoaxrick_h_: good thing that MAAS shows what's going on nowadays :)16:30
rick_h_no kidding16:30
rick_h_never would have realized it didn't already come with that stuff ootb16:30
roaksoaxrick_h_: that's why mark was happy in nuremberg :)16:31
rick_h_oh yay color icon shows off now16:32
rick_h_roaksoax: ok, I've got a power status and the button on the edit to 'check power status' shows it's off correctly16:35
rick_h_roaksoax: but when I go to 'start' I get 'The action "Start selected nodes" could not be performed on 1 node because its state does not allow that action.16:35
rick_h_'16:35
rick_h_roaksoax: from a 'ready' state currently16:35
rick_h_roaksoax: no new error in the edit view, just the same 6min old 'amtterm' missing16:35
roaksoaxrick_h_: yeah you need to *own* the machine first16:39
roaksoaxrick_h_: so *allocate* the machine first and then *start*16:39
rick_h_roaksoax: allocate == commission or acquire or ?16:40
* rick_h_ isn't seeing a allocate button and is feeling a bit like a dumb user16:40
rick_h_roaksoax: ok yea so the 'acquire' gave me the status of 'allocated' so I'd give user feedback of making the terms consistent16:42
roaksoaxrick_h_: agreed, please do files bugs16:42
rick_h_ok, and now I can start and the machine is coming up woot!16:42
rick_h_roaksoax: will do ty16:42
rick_h_roaksoax: ok, more fun. So I managed to acquire and then start. Now I can't shut down or do anything else. It shows green started, and I'ge for tftp request node events17:07
roaksoaxrick_h_: what's the status of the node? Deploying? Deployed?17:07
rick_h_roaksoax: I've tried to "Abort operatoin" and "stop node" and I get The action "Stop selected nodes" could not be performed on 1 node because its state does not allow that action.17:08
rick_h_deploying17:08
roaksoaxrick_h_: ok, I think that's xpected since it is in the process of being deployed17:08
rick_h_but that failed, the machine came up, got a tftp timeout, and now is sitting at the 'reboot or select proper boot device"17:08
roaksoaxrick_h_: humm so it never really started?17:08
rick_h_roaksoax: correct17:08
rick_h_it 'turned on' but never got rolling due to the tftp timeout17:08
roaksoaxrick_h_: did you add an SSH key? can you show output of /var/log/maas/maas.log and /var/log/maas/pserv.log17:09
rick_h_roaksoax: https://pastebin.canonical.com/117948/ and https://pastebin.canonical.com/117948/ respectively17:11
rick_h_roaksoax: yes, ssh key is added to maas17:11
rick_h_roaksoax: tftp error was "PXE-E32: TFTP open timeout17:12
roaksoaxrick_h_: yeah I don't see a PXE boot here: https://pastebin.canonical.com/117949/17:13
roaksoaxrick_h_: is the cluster controller correctly configured to do DHCP/DNS?17:13
rick_h_roaksoax: as far as I'm aware. it's connected, managed interfaces: 1, nodes 1, images synced, with "Manage" set to DHCP & DNS17:14
rick_h_roaksoax: ok, I can release the node and now there's an error where amt disagreed that the machine was going to power off, yet it did17:15
rick_h_https://pastebin.canonical.com/117950/ but it did occur so at least it's not tied up on the broken deploying now17:16
roaksoaxrick_h_: weird... maybe AMT issues?17:16
rick_h_roaksoax: maybe, my first experience with it.17:16
roaksoaxrick_h_: the PXE boot issue i think it has been seen before but that's something we are investigating17:16
rick_h_ok, had to release it twice but now it's actually released, back in a powered off 'black' state17:17
rick_h_roaksoax: ok, cool. Well, this is farther than I was with better debug info. I'll add the second node and see if I can get it to work at all with juju now.17:18
rick_h_roaksoax: thanks for the help and pointer to the experimental stuff.17:18
roaksoaxrick_h_: ok, great17:18
rick_h_I'll file a few bugs on things based on the experience as feedback17:18
roaksoaxrick_h_: awesome! thanks17:19
=== roadmr is now known as roadmr_afk
rick_h_roaksoax: can't add a my second node, does this look like something you've seen? https://pastebin.canonical.com/117958/18:33
roaksoaxrick_h_: interesting... it seems like it would be trying to pxe boot a node but it doesn't fine it18:34
roaksoaxrick_h_: can youplease file a bug and point it to me?18:34
rick_h_roaksoax: will do18:34
rick_h_roaksoax: that's in the add node UI after entering the info to add a new one18:34
roaksoaxrick_h_: ah yes, we spotted a bug when adding via webui18:40
roaksoaxrick_h_: it is in the process of being fixed18:40
rick_h_roaksoax: ok cool18:40
=== CyberJacob|Away is now known as CyberJacob
=== jfarschman is now known as MilesDenver
=== roadmr_afk is now known as roadmr
=== CyberJacob is now known as CyberJacob|Away
=== mjs0 is now known as menn0
=== jfarschman is now known as MilesDenver

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!