/srv/irclogs.ubuntu.com/2014/01/14/#maas.txt

jtvWhy isn't the packaging building bin/maas-probe-dhcp!?03:29
jtvbigjools: any ideas?  ^03:30
bigjoolsjtv: where should it live do you reckon? cluster?04:47
bigjoolsmaas-dhcp?04:47
jtvI was thinking maas-dhcp.04:47
bigjoolsk04:47
jtvI already have a branch installing it as part of that package.  The problem is getting it built.04:47
jtvBut feel free to write up your own branch; it's not exactly a lot of diff.04:47
bigjoolsjtv: hmmm it needs to go in cluster controller04:52
bigjoolswell doesn't *have* to but makes more sense to me04:53
bigjoolsah sod it maas-dhcp04:53
jtvWell I had been wondering, but you weren't around: did we have a particular purpose in mind for it?04:53
jtvWere we going to run it during install?  Or before starting our DHCP server..?04:54
bigjoolsjtv: show me your diff04:58
jtvI added to debian/maas-dhcp.install:05:02
jtvdebian/extras/maas-probe-dhcp /usr/sbin05:02
jtv(Or various other things I tried).05:03
jtvI also had a lintian-overrides, but I reverted that already.05:03
bigjoolswell you need a bit more than that05:03
bigjoolsMP coming05:03
bigjoolsjtv: https://code.launchpad.net/~julian-edwards/maas/packaging-add-maas-probe-dhcp/+merge/20153805:12
jtvThanks.05:12
* jtv shall review forthwith05:12
bigjoolsbuildout scripts are a PITA05:12
bigjoolsjtv: it's untested!05:12
bigjoolsI figured since you were desperate ...05:12
bigjoolsI can do it if you want05:13
jtvI see that you managed to include bash.  :)05:13
bigjoolssadly05:13
bigjoolscargo culted existing ones05:13
jtvWhat exactly do you mean by untested — "not covered by test suite" or "I haven't tried this yet"?05:13
bigjoolsI haven't tried it05:14
jtvAh.  Well, it'll sort of come naturally with my manual testing, so don't worry about it.05:14
jtvAs long as the package builds...05:14
bigjoolsok05:14
bigjoolswell let's do a test build first05:14
bigjoolsI'll do it on cstack05:15
jtvOK05:17
* jtv reminds self to write wrapper script for uvt-kvm05:17
jtvbigjools: it's surprising to me that you can't just use the maas-probe-dhcp tool that buildout builds for us.05:23
bigjoolsjtv: buildout doesn't run when packaging05:23
jtvAhhh.05:24
bigjoolsit's a development convenience05:24
jtv...Because package-building prefers setup.py over make.05:24
jtvIf we didn't have setup.py, it'd have run "make," and then buildout would have run.05:24
jtvSilly me.05:25
bigjoolsjtv: nearly done testing, so far so good05:44
bigjoolsjtv: and it works05:48
bigjoolsjtv: branch merged06:01
jtvThanks bigjools06:20
rawanghi, does anyone know why, every time I deploy a service, I get "agent-state-info: '(error: cannot run instances: gomaasapi: got error back from server: 409 CONFLICT)'"?  Looking at maas.log, I see an OAuthUnauthorized exception07:17
jtvrawang: the two sound like they're different problems...  MAAS returns 409: Conflict in particular if no nodes are available that match your request.07:20
jtvUnfortunately older versions of juju don't return the full error message.07:20
rawangjtv, there are 3 available servers in maas's pool07:21
jtvAnd you specified no constraints?07:21
rawangjtv, and I have used a tag to bootstrap on one of them07:21
jtvAnd the servers are all in Ready state, I guess...07:21
rawangjtv, well for juju bootstrap, I have constraints07:22
rawangjtv, oh, you remind me: maybe the constraints are still there from when I bootstrapped juju ...07:22
jtvYou could also try simulating juju's request directly against the MAAS API (e.g. using maas-cli) and seeing what the actual error message is.07:23
rawangjtv, oh really, could you please give me an example? :)07:24
rawangjtv, after I remove the constraints, it works, thanks :)07:25
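The direct API check jtv suggests at 07:23 could look roughly like the sketch below. It only builds the command line; it assumes a maas-cli version whose API description exposes a "nodes acquire" operation, and the profile name "maas" and tag "compute" are hypothetical. Bear in mind that a successful acquire really does allocate the node, which then has to be released again.

```python
import subprocess


def maas_acquire_command(profile, **constraints):
    """Build a maas-cli call that asks MAAS to acquire a node.

    Assumption: the installed maas-cli exposes "nodes acquire" and takes
    constraints as key=value arguments (e.g. tags=...).
    """
    args = ["maas-cli", profile, "nodes", "acquire"]
    # Sort the constraints so the resulting command line is deterministic.
    args += ["%s=%s" % (key, value) for key, value in sorted(constraints.items())]
    return args


def run_acquire(profile, **constraints):
    """Run the command and return (exit status, stdout, stderr)."""
    result = subprocess.run(
        maas_acquire_command(profile, **constraints),
        capture_output=True,
        text=True,
        check=False,
    )
    return result.returncode, result.stdout, result.stderr
```

Calling run_acquire("maas", tags="compute") against a logged-in profile should surface whatever message MAAS sends along with the 409, which older juju versions swallow.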
jtvPhew.  :)07:42
jtvbigjools: any chance you could re-test my branch lp:~jtv/maas-test/second-dhcp-check against your hardware?07:47
jtvI'd be particularly interested in these scenarios:07:47
jtv1. A regular, successful run (to see that nothing is broken).07:48
jtv2. There's a dhcpd running, but on the same interface you're running maas-test against.07:48
jtvI'm trying it against scenarios where the network interface has no IP address.07:49
jtvbigjools: scenario 2 needs your updated package inside the VM.  Easiest way to do that I suppose is to edit maastest/maasfixture.py and replace the apt-get with some other way of installing the package.08:08
=== CyberJacob|Away is now known as CyberJacob
rvbajtv: you can test scenario 1 in the lab yourself, that's precisely what the manual job is made for.08:37
jtvrvba: what is "the manual job"?08:47
rvbajtv: I was referring to the email I sent yesterday "Manual maas-test Jenkins job in the lab"08:48
jtvAh, I'll have a look at that.08:48
rvbajtv: with that you can get the lab to test a maas-test branch for you.08:49
jtvGreat.08:49
rvbaThat will test your scenario 1.08:49
bjorneI have a problem: I have tried 12.04 and 13.10 and get the same error09:34
bjorne[21/Dec/2013:00:49:57 +0100] "GET /MAAS/metadata//2012-03-01/user-data HTTP/1.1" 40409:34
=== mwhudson is now known as zz_mwhudson
jtvrvba: looks like that lab run failed...  Do we get the details that the tests attached anywhere?10:22
rvbabigjools: when I comment out the broker's details (in the config), I still get a segfault… so maybe the rabbit connection is not part of the problem after all.10:31
bigjoolsrvba: interesting10:31
bigjoolsrvba: can you run the celeryd itself up with minimal args?10:31
rvbabigjools: that's precisely what I'm doing.10:32
bigjoolscool10:32
bigjoolsso this smacks of a build/dependency problem10:32
bigjoolsrvba: can I ssh into your machine?10:32
rvbaBut it cannot run without some kind of config…10:32
rvbabigjools: sure, hang on.10:32
rvbabigjools: ssh ubuntu@10.55.60.16710:33
bigjoolsrvba: did you add my key?10:34
rvbaYes10:34
bigjoolsit won't let me in10:34
rvbabigjools: ah, I'm logged in as root, just one sec.10:35
rvbaMy bad10:35
rvbabigjools: please try again10:35
bigjoolsstill can't get in10:35
rvbahum…10:36
rvbabigjools: can you try one more time (I'm watching the logs)10:37
rvba?10:37
bigjoolsthere10:37
rvbaNothing10:37
bigjoolsyou gave me the right IP address?10:38
rvbassh ubuntu@10.55.60.16710:38
rvbaYes10:38
bigjoolsPEBKAC10:38
rvbabigjools: when celeryd isn't configured to read MAAS' tasks, it doesn't blow up (!).10:43
bigjoolsrvba: If I run "celeryd" on its own it crashes10:43
bigjoolsno args10:43
bigjoolsrvba: gah remind me how to remove core dump restriction10:44
jtvbigjools: ulimit10:46
bigjoolsjtv: tried it already10:46
bigjools-bash: ulimit: core file size: cannot modify limit: Operation not permitted10:46
jtvHnyug10:46
rvbabigjools: what core dump restriction10:47
jtvStrange thing is, I can do that.10:47
rvba?10:47
bigjoolsrvba: size10:47
bigjoolsyou don't get 'em by default10:47
jtvDid you make it too big perhaps?  Unit of size is 512KB blocks.10:47
bigjoolsjtv: "1" doesn't even work10:48
jtvwha10:48
bigjoolsI used "unlimited", doesn't work10:48
jtvAre you sudo'ing it perhaps?  I guess that might refuse.10:48
bigjoolsulimit -S -c unlimited works10:49
bigjoolsneed the -S10:49
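What "ulimit -S -c" touches is the soft half of the core-file resource limit; the hard half is a ceiling that an unprivileged process can lower but not raise. A minimal Python sketch of the same adjustment, using the standard resource module (the helper name is made up for illustration, and note that setrlimit works in bytes where the shell built-in counts blocks):

```python
import resource


def raise_soft_core_limit():
    """Raise the soft core-file size limit as far as the hard limit allows.

    The hard limit is left untouched, so no special privileges are needed.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_CORE)
    # Setting soft == hard is always permitted; raising hard is not.
    resource.setrlimit(resource.RLIMIT_CORE, (hard, hard))
    return resource.getrlimit(resource.RLIMIT_CORE)


if __name__ == "__main__":
    soft, hard = raise_soft_core_limit()
    unlimited = resource.RLIM_INFINITY
    print("soft:", "unlimited" if soft == unlimited else soft)
    print("hard:", "unlimited" if hard == unlimited else hard)
```

The limit is inherited by child processes, so it has to be set in the process (or shell) that launches the program expected to crash.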
bigjoolsgah no core file still10:49
bigjoolswtf10:49
bigjoolsgrah it's sending it to apport10:51
jtv /o\10:51
rvbabigjools: I was wrong, it's just that sometimes it takes a long time to crash.10:51
bigjoolsrvba: writing the core file no doubt10:51
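The reason no core file shows up even with the limit raised is the kernel's core_pattern setting: when it begins with "|", the dump is piped to a handler program (apport on Ubuntu) instead of being written to the working directory. A small sketch that classifies the setting; the function names are invented here, and the apport pattern in the usage note is only an illustrative value:

```python
def describe_core_pattern(pattern):
    """Classify a kernel core_pattern value.

    Returns ("pipe", handler_path) if dumps are piped to a program,
    otherwise ("file", filename_template).
    """
    pattern = pattern.strip()
    if pattern.startswith("|"):
        parts = pattern[1:].split()
        handler = parts[0] if parts else ""
        return ("pipe", handler)
    return ("file", pattern)


def read_core_pattern(path="/proc/sys/kernel/core_pattern"):
    """Read the current setting (Linux only)."""
    with open(path) as handle:
        return handle.read()


if __name__ == "__main__":
    print(describe_core_pattern(read_core_pattern()))
```

For a value such as "|/usr/share/apport/apport %p %s %c" this reports a pipe to /usr/share/apport/apport; a plain "core" means the file lands in the crashing process's current directory.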
rvbabigjools: I noticed that python-celery (3.1.6-1ubuntu1) is now released in Trusty's cloud archive.  I started a new canonistack instance and installed python-celery.  When I run celeryd there it doesn't crash.  Maybe the package from our ppa is faulty?10:57
bigjoolsrvba: quite likely, as I said, I suspect build or dependency problems10:58
rvbaTrying to install MAAS with python-celery from the cloud archive.10:59
rvba(https://launchpad.net/ubuntu/trusty/+source/celery/3.1.6-1ubuntu1 says it was published 1 hour ago)10:59
bigjoolsrvba: can I upgrade your other instance?10:59
rvbabigjools: sure10:59
rvbaBut I tweaked the configs.11:00
rvbaSo it's better to start from a fresh instance.11:00
=== CyberJacob is now known as CyberJacob|Away
rvbabigjools: arg, same crash with the python-celery from the cloud archive :/11:05
gmbWhy oh why oh why does Canonistack have the shits today?11:06
rvbabigjools: it's really weird, look: http://paste.ubuntu.com/6749871/ vs http://paste.ubuntu.com/6749868/11:07
bigjoolsrvba: did it work with the one in the main archive?11:07
bigjoolsI still see a core11:07
rvbabigjools: two similar Trusty machines, on one celeryd crashed, on the other it did not.11:08
bigjools!11:08
bigjoolsthis is python crashing11:08
* rvba installs maas on the machine where python-celery does not crash…11:09
rvbagmb: what do you mean?11:10
gmbrvba: Instances keep dying on me. lcy01 must be out of RAM. Current one is staying up though, so far...11:11
bigjoolsI gave up with 0111:11
rvbagmb: hum, might be related to the problem we're seeing then…11:12
rvbabigjools: these are all 01 machines11:12
gmbrvba: Hmm, possibly. That said, if your instances are staying up, probably not. Nova is pretty proactive about pausing instances when it runs out of resources.11:13
gmbSo you'd know if it was affecting you.11:14
rvba ls11:14
rvbaSegmentation fault11:14
rvbaI think it's affecting me :)11:14
bigjoolsrvba: I have a clue11:15
bigjoolsit's falling over in librabbitmq11:15
bigjools#0  0x00007fb4eef716ab in amqp_pool_alloc ()11:15
bigjools   from /usr/lib/x86_64-linux-gnu/librabbitmq.so.111:15
bigjoolsso it's not celery's or python's fault at all11:15
rvbabigjools: how did you find out?  In the core dump?11:19
bigjoolsyep11:19
bigjoolsit's rather annoying that someone bumped celery a whole major version11:23
bigjoolsand didn't choose a new package name11:24
rvbabigjools: did you see that running celeryd with strace?11:26
bigjoolsrvba: no I analysed the core11:26
rvbaAh okay, I see it now.11:27
rvbabigjools: so, technically it blew up in librabbitmq1 but this doesn't tell us why exactly, does it?  It might be a wrong interaction with another library.11:30
bigjoolsrvba: amqp_pool_alloc11:30
bigjoolsI suspect some esoteric part of rabbit changed11:30
rvbabigjools: the most recent successful run we had in the lab is this: http://d-jenkins.ubuntu-ci:8080/view/MAAS/job/trusty-adt-maas-manual/4/console11:32
rvbaThe console log gives us all the versions of all the packages used.11:32
bigjoolsrvba: old celery?11:34
rvbaYes11:35
rvbaAnd no trace of librabbitmq.11:35
rvbabigjools: celery's changelog says (for the upgrade to 3.1.6-1 from 2.5.3-4ubuntu1): 'Drop depenceny on python-amqplib, it has been replaced by python-amqp/python-librabbitmq in python-kombu.'  And of course python-librabbitmq depends on librabbitmq1.11:41
bigjoolsright11:41
bigjoolsit needs a build with the -dbg library and then we can get which line of code blew up11:43
bigjoolsmy frantic googling is turning up nothing11:43
rvbaSame here.11:43
bigjoolsI gotta sleep man11:44
bigjoolsgood luck and good night11:44
rvbaYeah, I'll see if Andres can help us.  Maybe by creating debugging packages like you said.11:44
bigjoolsyou'll need andres11:45
rvbabigjools: ^11:45
bigjoolsjust install -dbg ones11:45
bigjoolslibrabbitmq-dbg11:45
bigjoolsand run again11:45
rvbaOh, they exist already?11:45
bigjoolsthen when you get the core file:11:45
bigjoolsgdb /usr/bin/python core11:46
bigjools> bt11:46
rvbaYeah, I know that part.11:46
bigjoolsok11:46
rvbaI'll try that.11:46
bigjoolsok I am going to bed11:46
bigjoolssee you11:46
rvbanight11:47
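The gdb session bigjools describes (load the interpreter together with the core file, then "bt") can also be run non-interactively, which is handy for attaching the backtrace to a bug report. A sketch under the assumption that gdb is installed and the core file is named "core"; the helper names are invented for illustration:

```python
import subprocess


def gdb_backtrace_command(core_path, executable="/usr/bin/python"):
    """Build a gdb invocation that prints a backtrace and exits.

    --batch makes gdb quit after running the commands given with -ex.
    """
    return ["gdb", "--batch", "-ex", "bt", executable, core_path]


def backtrace(core_path, executable="/usr/bin/python"):
    """Run gdb on the core file and return its output as text."""
    result = subprocess.run(
        gdb_backtrace_command(core_path, executable),
        capture_output=True,
        text=True,
        check=False,
    )
    return result.stdout


if __name__ == "__main__":
    print(backtrace("core"))
```

With the matching -dbg packages installed, the frames inside librabbitmq should resolve to source lines rather than bare addresses.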
=== CyberJacob|Away is now known as CyberJacob
=== zz_mwhudson is now known as mwhudson
=== CyberJacob is now known as CyberJacob|Away
=== CyberJacob|Away is now known as CyberJacob
=== mwhudson is now known as zz_mwhudson
