/srv/irclogs.ubuntu.com/2014/06/05/#maas.txt

smoserdesignated, your error is probably true.02:14
smoserit couldn't get a lock because the process running (that never completed) had it locked02:14
smoseryou could strace that process and probably get some more info02:14
=== CyberJacob|Away is now known as CyberJacob
smosermaybe you have outbound internet access blocked ?02:15
designatedsmoser, it's not blocked, maas is not recognizing the domain as being managed even though it's configured correctly and forwarding the dns request to the configured dns forwarder02:19
smoserthe sudo error is not relevant.02:19
smoserit happens any time dns resolution of 'hostname' fails.02:20
smoserwhich is fine.02:20
designatedsmoser, correct but that's what I'm trying to explain.02:20
smoserthats fine though.02:20
smoserif your apt-get update hung, then thats the problem.02:20
designatedapt-get update is hanging because of a dns issue and never completing enlistment.02:20
smoserno. i don't think so.02:20
smoserdns resolution of `hostname` is failing02:20
designatedit is because if i kill the process it finishes enlistment immediately02:21
smosercan you verify it works for anything else ?02:21
smoserand resolution would *fail* not hang.02:21
smoseryou're on the system now?02:21
designatedyes02:21
smoserdo 'ping archive.ubuntu.com'02:21
smoseri think you'll get dns resolution02:21
smoseror if you dont, then yes, dns is the issue.02:21
designatedit will resolve that02:22
designatedapt-get update doesn't succeed because it cannot resolve its own hostname02:22
smoserdoesn't care.02:22
smoserthats not why its hanging. i'm certain of that.02:22
smoseryou can replicate that anywhere like this:02:23
designatedthis process is running: root      1384  0.0  0.0  31064  2380 ?        S    20:13   0:00 /usr/bin/apt-get --option=Dpkg::Options::=--force-confold --option=Dpkg::options::=--force-unsafe-io --assume-yes --quiet update02:23
designatedif i kill it, enlistment will finish02:23
smoserstrace it02:23
smoserwhat is it doing.02:23
smoseror even just tail /var/log/cloud-init-output.log02:23
smosersudo strace -p 238002:23
designated$ sudo strace -p 138402:24
designatedsudo: unable to resolve host maas-enlisting-node02:24
designatedProcess 1384 attached02:24
designatedselect(8, [6 7], [], NULL, {0, 23729})  = 0 (Timeout)02:24
designatedselect(8, [6 7], [], NULL, {0, 500000}) = 0 (Timeout)02:24
designatedselect(8, [6 7], [], NULL, {0, 500000}) = 0 (Timeout)02:24
designatedselect(8, [6 7], [], NULL, {0, 500000}) = 0 (Timeout)02:24
designatedIgn http://archive.ubuntu.com trusty-updates InRelease02:25
designatedErr http://security.ubuntu.com trusty-security Release.gpg02:25
designated  Connection failed02:25
designatedsmoser, mackrel is working with me on this issue02:27
designatedjust letting you know so he can ask questions about the same issue02:27
designatedsmoser, could it be an issue with the proxy server on maas?02:28
smoserdesignated, see, its timing out on a network connection.02:28
smoseryou set a proxy in maas ?02:29
designatedsmoser, no but i thought by default the apt-get requests got proxied through maas02:29
=== CyberJacob is now known as CyberJacob|Away
smoseryou can see if that was correctly written into /etc/apt (grep -r Proxy /etc/apt)02:29
designatedunder maas gui there is an option to configure proxy server, it says if you leave it blank "This will also be passed onto provisioned nodes instead of the default proxy (the region controller proxy)."02:30
smoserit could be an issue then on the squid proxy on the maas region controller02:30
smosertry:02:30
designated/etc/apt/apt.conf.d/95cloud-init-proxy:Acquire::HTTP::Proxy "http://192.168.168.7:8000/";02:30
smoser http_proxy=http://your.maas.ip.addr:3128 wget http://security.ubuntu.com02:31
smoseri suspect that will hang similarly.02:31
smoserer... s/3128/8000/02:31
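A minimal sketch of the same check done with a hard timeout, so a stalled proxy shows up as a failure instead of a hang. It assumes Python 3 is available on the node; `fetch_via_proxy` is a hypothetical helper name, not part of MAAS or squid-deb-proxy.

```python
import urllib.error
import urllib.request


def fetch_via_proxy(url, proxy, timeout=10.0):
    """Fetch url through an HTTP proxy and return a short status string."""
    # An explicit ProxyHandler sends http:// requests to the given proxy
    # (unless a no_proxy setting in the environment excludes the host).
    opener = urllib.request.build_opener(
        urllib.request.ProxyHandler({"http": proxy})
    )
    try:
        with opener.open(url, timeout=timeout) as resp:
            return "ok: HTTP %d" % resp.status
    except (urllib.error.URLError, OSError) as exc:
        # Refused connections, DNS failures, HTTP errors and timeouts
        # all end up here rather than blocking forever.
        return "failed: %s" % exc
```

With the proxy address that shows up in this log, the call would be `fetch_via_proxy("http://security.ubuntu.com/", "http://192.168.168.7:8000/")`.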
designatedResolving your.maas.ip.addr (your.maas.ip.addr)... failed: Name or service not known.02:33
designatedwget: unable to resolve host address ‘your.maas.ip.addr’02:33
designatedshit...just a sec02:33
smoser:)02:33
mackrelsmoser, designated, yes it is stalling.02:33
smoseryeah, thats not going to work :)02:33
designatednoobing it up tonight02:33
designatedjust hangs02:34
designatedConnecting to 192.168.168.7:8000... connected.02:34
designatedProxy request sent, awaiting response...02:34
designatedi wonder how squid proxy got jacked up02:34
mackrelsudo service squid-deb-proxy status squid-deb-proxy start/running, process 100502:38
designatedi show squid-deb-proxy as well as squid3 installed.  are both of these needed?02:39
designatedsquid3 is listening on TCP/800002:40
designatedproxy     1005  0.0  0.1 127620 30736 ?        Ssl  18:30   0:01 /usr/sbin/squid3 -N -f /etc/squid-deb-proxy/squid-deb-proxy.conf02:40
designatedproxy     1233  0.0  0.1 113348 20260 ?        Ss   18:32   0:00 /usr/sbin/squid3 -N -YC -f /etc/squid3/squid.conf02:40
smoseris squid proxy simply blocked on outbound network connections ?02:43
smoserie, from maas system can you hit archive.ubuntu.com ?02:43
designatedsmoser, it's not blocked02:43
designatedsmoser, yes i can02:43
smosercan you try the wget above on the maas region controller ?02:43
smosermaybe just try restarting squid. that doesn't give you warm fuzzies, but i dont know.02:44
designatedsmoser, the wget from the controller succeeds but not when i proxy the request through itself.02:45
designatedmackrel, didn't we have a similar problem in the lab?02:47
smoserright.02:47
smoserso yeah, squid is messed up. did you try restarting it ?02:47
designatedi did02:47
designatedno difference02:47
smoseri know thats a hack.02:47
smoserbut i dont know why it would be hung.02:47
smoseryou can look at its logs for some info.02:47
mackrelyes.  we experienced this issue before but this was working and has deteriorated to this02:49
designatedmackrel, right, no changes were made to squid.  everything was working, then it stopped.  i rebuilt maas from scratch today and we're still seeing this issue.02:52
smoserso see if there is anything in squid error or access logs that gives you any hints02:53
smoserthe only guess i have at this point is that squid's dns resolution is borked. suspecting something to do with maas taking over dns on that system.02:54
smoserbut i dont have a lot of faith in that theory02:54
designated1401936515.205   4723 192.168.168.7 TCP_MISS_ABORTED/000 0 GET http://security.ubuntu.com/ - HIER_DIRECT/2001:67c:1562::15 -02:54
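The access.log line above is in squid's native log format (timestamp, elapsed ms, client, result/status, bytes, method, URL, ident, hierarchy/peer, content type). A rough sketch of pulling the fields apart follows; `SquidEntry`, `parse_squid_line` and `peer_is_ipv6` are hypothetical names used only for illustration. On this line it shows squid going direct to an IPv6 address and the request being aborted after about 4.7 seconds with no response, which might point at IPv6 connectivity from the region controller, although the log alone doesn't confirm that.

```python
import ipaddress
from typing import NamedTuple


class SquidEntry(NamedTuple):
    timestamp: float
    elapsed_ms: int
    client: str
    result: str      # e.g. TCP_MISS_ABORTED
    status: int      # HTTP status, 0 when no response was received
    size: int
    method: str
    url: str
    hierarchy: str   # e.g. HIER_DIRECT
    peer: str        # upstream address squid tried


def parse_squid_line(line):
    """Split one native-format access.log line into named fields."""
    f = line.split()
    result, status = f[3].split("/", 1)
    hierarchy, _, peer = f[8].partition("/")
    return SquidEntry(float(f[0]), int(f[1]), f[2], result, int(status),
                      int(f[4]), f[5], f[6], hierarchy, peer.strip("[]"))


def peer_is_ipv6(entry):
    """True when the upstream peer field is a literal IPv6 address."""
    try:
        return ipaddress.ip_address(entry.peer).version == 6
    except ValueError:
        return False  # a hostname or "-" rather than an IP literal
```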
designatedi don't understand why there are two squid processes running02:54
designatedproxy     8372  0.0  0.1 115640 22480 ?        Ss   20:45   0:00 /usr/sbin/squid3 -N -f /etc/squid-deb-proxy/squid-deb-proxy.conf02:54
designatedproxy     8464  0.0  0.1 113216 19956 ?        Ss   20:48   0:00 /usr/sbin/squid3 -N -YC -f /etc/squid3/squid.conf02:54
smoserwell, yeah, that is kind of silly. :)02:55
smoserbut one of them is squid deb proxy and one is just squid.02:55
designatedmaas uses squid-deb-proxy...no?02:55
smosersquid deb proxy actually i think probably runs on 312802:55
smosererr.02:55
smoseri might be wrong02:55
smoseri am wrong02:55
smoser8000 is squid deb proxy02:55
mackrelsudo netstat -tanpo | grep 312802:55
mackreltcp6       0      0 :::3128                 :::*                    LISTEN      8464/squid3      off (0.00/0/0)02:56
designatedtcp6 not 402:56
designatedboth of them say tcp602:56
mackrel sudo netstat -tanpo | grep 800002:56
mackreltcp6       0      0 :::8000                 :::*                    LISTEN      8372/squid3      off (0.00/0/0)02:56
mackrelone process is listening 3128 and another on 800002:57
smoseryou're saying squid just isn't listening on ipv4 ?02:59
smosertelnet localhost 8000 ?02:59
designatedsmoser, it connects03:00
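A `tcp6` listener on `:::8000` does not by itself mean IPv4 is excluded: on Linux, with the default `net.ipv6.bindv6only=0`, a wildcard IPv6 socket normally accepts IPv4 connections too. And `telnet localhost` may connect over `::1`, so it doesn't say which family worked. A small sketch that tries each loopback address explicitly (`reachable_families` is a hypothetical helper name):

```python
import socket


def reachable_families(port, timeout=2.0):
    """Try a TCP connect to the loopback address of each family.

    Returns a dict such as {"127.0.0.1": True, "::1": False}.
    """
    results = {}
    for family, addr in ((socket.AF_INET, "127.0.0.1"),
                         (socket.AF_INET6, "::1")):
        try:
            with socket.socket(family, socket.SOCK_STREAM) as s:
                s.settimeout(timeout)
                s.connect((addr, port))
            results[addr] = True
        except OSError:
            # Refused, timed out, or the address family is unavailable.
            results[addr] = False
    return results
```

Running `reachable_families(8000)` on the region controller would show whether squid-deb-proxy answers on IPv4, IPv6, or both.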
designateddoes squid get installed as a depend of maas-dns?03:00
mackrellocalhost resolves to ::1 in /etc/hosts, so I imagine when squid starts it binds itself to localhost and resolves to ipv6.  we can probably comment that out, restart squid and it would listen on ipv403:01
smoserprobably not maas-dns. probably maas region controller03:01
designatedmackrel, any ideas?03:04
mackreldesignated, not really.  dns resolution was first step but now it is proxy... seems pretty weird we got three dozen nodes to register before and now kaput03:08
designatedsmoser, thanks for your help.  we're going to have to figure out wth squid is doing03:18
smoserdesignated, sorry couldn't get you past that issue.03:38
=== plars is now known as plars-away
=== vladk|offline is now known as vladk
=== CyberJacob|Away is now known as CyberJacob
=== vladk is now known as vladk|offline
=== vladk|offline is now known as vladk
rvbagmb: I'm marking your fix-commissioning-page-distro-list-bug-1312844 branch "needs fixing".  The problem I describe could have gone unnoticed because the testing coverage is not complete in this area.  Happy to help you with this when you're back from your mini sprint.08:57
gmbrvba: Merci. Yeah, you’re right… I kind of knew I was on a bit of a wing and a prayer tests-wise :)08:57
gmbanyway08:58
* gmb -> sprinting08:58
rvbabigjools: why do you think it's best to set it in start_nodes()?09:02
bigjoolsrvba: lol09:02
bigjoolsrvba: because I figured chaining the jobs would be better, but as you just pointed out we need host entries for other types too09:03
bigjoolsthe other job being the power_on09:03
bigjoolshmmm09:03
bigjoolsI'll add it tomorrow.09:04
bigjoolsin claim_static_ip() I mean09:04
rvbaSounds good.09:04
bigjoolswe have to hope celery gets to it before the power_on :)09:05
rvbaThis is a bit of a gamble.09:06
bigjoolsquite09:06
bigjoolshence my question09:07
bigjoolsit's not so cut and dry09:07
rvbabigjools: we can't reasonably take that risk.09:11
bigjoolsrvba: yeah, and I think in terms of abstraction it makes sense to leave it out09:12
rvbaI would have preferred to have the change in the DB and the setting of the host entry in one place.  Because they are the two sides of the same coin (one part is internal the other is external).09:13
=== vladk is now known as vladk|offline
=== vladk|offline is now known as vladk
=== vladk is now known as vladk|offline
=== vladk|offline is now known as vladk
=== vladk is now known as vladk|offline
=== vladk|offline is now known as vladk
JayJI need help debugging MaaS. The node (VM) fails to Commission12:43
=== vladk is now known as vladk|offline
Jay_I need help to debug MaaS. Anyone?12:49
Jay_I posted the question in askubuntu: http://askubuntu.com/questions/477028/maas-fails-to-commission-nodes12:50
rvbaHi Jay_; thanks for the question.  I'll have a look at the logs you provided in a short while.  I'll get back to you when I do (probably on askubuntu.com).13:11
Jay_rvba: Thank you very much13:14
=== vladk|offline is now known as vladk
Jay_rvba: Please let me know if you need any more logs from the box.13:42
rvbaJay_: I suggest you have a look at the machine's logs (syslog & co) when it fails to get its IP address.  Maybe you'll find a hint as to why it failed commissioning.13:46
Jay_rvba: Do these logs make any sense?13:52
Jay_Jun  4 20:03:56 maas dhcpd: DHCPNAK on 50.50.50.13 to 08:00:27:b9:e6:16 via eth013:52
Jay_Jun  4 20:04:12 maas dhcpd: DHCPDISCOVER from 08:00:27:b9:e6:16 via eth013:52
Jay_Jun  4 20:04:12 maas dhcpd: DHCPOFFER on 50.50.50.58 to 08:00:27:b9:e6:16 via eth013:52
Jay_Jun  4 20:04:12 maas dhcpd: DHCPREQUEST for 50.50.50.13 (50.50.50.3) from 08:00:27:b9:e6:16 via eth0: lease 50.50.50.13 unavailable.13:52
rvbaJay_: the "lease <ip> unavailable" doesn't look too good.13:53
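In the dhcpd lines above the client is offered 50.50.50.58 but then requests 50.50.50.13, and the address in parentheses is, as far as I can tell, the server identifier the client named in its request. If 50.50.50.3 is not the MAAS DHCP server itself, a second DHCP server may be answering on that network. A minimal sketch for spotting this pattern in syslog; the regexes assume the ISC dhcpd message layout shown in this log, and `find_mismatches` is a hypothetical helper name:

```python
import re

OFFER = re.compile(r"DHCPOFFER on (\S+) to (\S+)")
REQUEST = re.compile(r"DHCPREQUEST for (\S+)(?: \((\S+)\))? from (\S+)")


def find_mismatches(lines):
    """Return (mac, offered_ip, requested_ip, server_id) tuples for every
    DHCPREQUEST asking for an address other than the last one offered
    to that MAC."""
    last_offer = {}
    mismatches = []
    for line in lines:
        m = OFFER.search(line)
        if m:
            last_offer[m.group(2)] = m.group(1)
            continue
        m = REQUEST.search(line)
        if m:
            requested, server_id, mac = m.groups()
            offered = last_offer.get(mac)
            if offered is not None and offered != requested:
                mismatches.append((mac, offered, requested, server_id))
    return mismatches
```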
Jay_rvba: That's where I think MaaS is messing up something. Don't know how to proceed as I know little about MaaS13:54
rvbaJay_: can you share the content of the lease file? /var/lib/maas/dhcp/dhcpd.leases13:56
rvbaleases* even13:56
rvbaJay_: The DHCPNAK message also seems to indicate something is wrong with the network config/the DHCP config.13:59
Jay_rvba: Copied here: https://www.dropbox.com/s/fwu0db3orphz2jk/dhcp-leases13:59
Jay_rvba: Appears that MaaS assigns an IP and binds MAC during enlistment. Then it is getting confused during Commissioning!14:00
rvbaJay_: yes, the assignment is made the first time MAAS sees a node.  It should be used throughout a node's lifecycle so that the IP doesn't change.14:04
=== vladk is now known as vladk|offline
Jay_makes sense. I thought so too. However, the VM PXE boots again during Commissioning, which is when it is getting confused14:05
=== vladk|offline is now known as vladk
=== CyberJacob is now known as CyberJacob|Away
magicrobotmonkeywhich tool do you use to change the volume quotas?14:57
magicrobotmonkeycinder!14:59
=== vladk is now known as vladk|offline
designatedblake_r, do you know of a quick way to restart all maas services?16:34
blake_rdesignated: i just normally restart them one at a time16:36
designatedblake_r, so just restart everything in /etc/init/maas-* one at a time?16:51
blake_ryes16:52
blake_rand apache216:52
blake_rand tgt16:52
=== vladk|offline is now known as vladk
designatedblake_r, thank you.17:28
designatedblake_r, do you know of a way to disable maas forcing the nodes to use the maas controller as a proxy?  during enlistment, squid-deb-proxy doesn't seem to be functioning correctly, I've been troubleshooting it for a couple of days now with no success.  I keep getting:18:20
designatedErr http://security.ubuntu.com trusty-security Release.gpg18:20
designated  Connection failed18:20
designatedall of my nodes have direct internet access.18:21
designatedsmoser, who is responsible for working on the squid-deb-proxy portion of maas and can assist in troubleshooting this issue?18:53
smoserwell squid-deb-proxy is just an ubuntu package. maas depends on it. you can file a bug against squid-deb-proxy using 'ubuntu-bug squid-deb-proxy'.18:56
=== roadmr is now known as roadmr_afk
smoserand you can probably turn up debug info in squid18:58
smoserhttp://www.squid-cache.org/Doc/config/debug_options/18:58
designatedsmoser, do you know a way to prevent maas from forcing the nodes to proxy the apt requests?18:59
smoserdesignated, grep through /etc/maas -r for http_proxy or just proxy and see if you see anything19:08
smoseri think it should show up there.19:09
smoserand i think you should be even able to set the proxy in the maas web ui19:09
designatedsmoser, i don't want a proxy19:10
smoserright. i suspect you'll see it set to some value19:10
smoserand you can unset it19:10
=== CyberJacob|Away is now known as CyberJacob
designatedsmoser, I'll try that.  thank you19:19
=== roadmr_afk is now known as roadmr
designatedsmoser, the file /etc/maas/preseeds/commissioning only contains {{preseed_data}}.  Does that get pulled in from 'generic' and 'preseed_master'?20:11
smoserprobably rendered in maas internal.20:12
smoseri'd have to look at it.20:12
smoseri dont really know., anyone know how to globally disable the squid proxy ?20:13
designatedi successfully disabled the proxy server during enlistment and it enlisted perfectly.  now trying to do the same for commissioning20:16
designatedsmoser, I think I found it here: /etc/maas/templates/commissioning-user-data/user_data_config.template20:23
=== vladk is now known as vladk|offline
=== CyberJacob is now known as CyberJacob|Away
