/srv/irclogs.ubuntu.com/2014/12/05/#maas.txt

=== CyberJacob is now known as CyberJacob|Away
designatein the maas 1.7 changelog it says maas no longer uses squid-deb-proxy but for some reason it's still getting installed...00:25
designatedoes anyone even monitor this channel?01:18
roaksoaxdesignate: sudo apt-get dist-upgrade should remove it01:51
roaksoaxdesignate: if you did sudo apt-get update it might not have01:51
designateroaksoax: thank you, I will try that.01:54
roaksoaxdesignate: np!01:56
=== CyberJacob|Away is now known as CyberJacob
=== CyberJacob is now known as CyberJacob|Away
thebozzGuys, I need some help. We're trying to comission a Dell R710 server into our newly installed MAAS cluster. However, it fails after turning on and off a couple of times, the GUI only says "Failed to power on node — Timeout after 7 tries ". What could be wrong, where should we look to start fixing this issue?13:22
thebozzThe rest of our cluster are only R720 servers. My boss suspects there could be an issue because the R720s have iDRAC 7, while the 710 has iDRAC 6.13:23
thebozzAnyone in here?14:54
jhobbsthebozz: what version of MAAS are you using?14:55
jhobbssounds like 1.714:55
jhobbsthere is a button on the node page where you can check the power state of the noe14:55
jhobbs*node14:55
jhobbsthebozz: can you try that button and see if it works for the 71014:55
thebozzIt ran successfully, and detected the node as off.14:56
jhobbsso MAAS can reach the node and has good credentials for it14:57
jhobbswhen you say  it fails after turning on14:57
jhobbs                 and off a couple of times,14:57
jhobbshow are you turning it on and off there?14:57
thebozzClicking on "comission node".14:58
thebozzLet me get the relevant logs, maybe there's some useful info in there.14:58
jhobbsis there anything else talking to the BMCs? nagios or something like that14:59
jhobbsor serial over lan15:00
thebozz:/ actually, I have no idea what I'm looking at. I haven't been involved in the deployment other than helping here and there. Is there anything I should look at in the logs to help me debug this?15:01
jhobbswell /var/log/maas/maas.log and /var/log/maas/maas-django.log might be useful, if you can post them15:01
jhobbsif the node is powered off right now, what state is it in in MAAS? Ready? Failed Commissioning?15:02
thebozzFailed Commissioning. Let me do some filtering on those files, I'll try to grab anything that seems relevant.15:03
jhobbscan you try commissioning again, since the power check is working?15:04
jhobbsif that doesn't work, you should try powering on the node manually via IPMI using MAAS's credentials15:05
thebozzMy boss insists he thinks it has to do with every other node having iDRAC 7 while this one has iDRAC 6. Is that relevant at all?15:10
jhobbsit could be15:12
thebozzHere are the logs: http://pastebin.com/aTf05ajh => maas.log ; http://pastebin.com/aQEAn3MR => pserv.log ; maas-django.log didn't have any references to the relevant MAC address. Is there anything else I can use to filter?15:13
thebozzAbout the iDRAC thing... how is it relevant? I don't really understand that.15:13
jhobbsthere was a bug at one point that affected r71015:14
jhobbshttps://bugs.launchpad.net/ubuntu/+source/maas/+bug/128796415:14
ubot5`Launchpad bug 1287964 in MAAS "MAAS incorrectly detects / sets-up BMC information on Dell PowerEdge servers" [High,Fix released]15:14
jhobbsIPMI is quirky - different versions of it react different ways to the same commands sometimes15:14
jhobbsjust minor differences in how the protocol is implemented15:14
jhobbsr710 doesn't look to be ubuntu certified for 14.0415:15
jhobbsr720 is though15:15
thebozzHuh. Then it's worth a shot to do it manually. Will MAAS be able to turn it on and off at will after comissioning?15:15
jhobbswell that depends on why it's not working - if it's not working now, and nothing changes, i wouldn't expect it to change after commissioning15:16
jhobbsoh, r710 is certified too, so it should be working15:16
thebozzThat only makes this even weirder :/15:16
jhobbshttp://www.ubuntu.com/certification/hardware/201404-14939/15:16
jhobbssince it works sometimes and sometimes it doesn't, i would suspect either something else is talking to it via IPMI and using its sessions up, or maybe something at the network layer is bad - duplicate IP addresses maybe?15:18
jhobbsor maybe the firmware on the bmc is out of date?15:18
thebozzThe cluster is on its own subnetwork, pretty much isolated ATM. So yeah, my guess would be network layer issues or firmware.15:20
=== jfarschman is now known as MilesDenver
=== roadmr is now known as roadmr_afk
=== jfarschman is now known as MilesDenver
=== jfarschman is now known as MilesDenver
=== roadmr_afk is now known as roadmr
designateI am trying to bootstrap an environment using maas/juju (latest stable versions of both) but I'm getting the following error: "401 OK (Authorization Error: 'Expired timestamp: given 1417774874 and now 1417800053 has a greater difference than threshold 300')" despite the fact that I have configured an NTP server in MAAS that is reachable by all servers.17:48
=== CyberJacob|Away is now known as CyberJacob
=== roadmr is now known as roadmr_afk
=== roadmr_afk is now known as roadmr
designatesince images are now stored in the maas database, can anyone point me in the direction of modifying the ephemeral image?  I need to add an NTP server because of clock differences causing oauth errors.22:41
=== CyberJacob is now known as CyberJacob|Away

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!