/srv/irclogs.ubuntu.com/2014/04/08/#maas.txt

Kupo24zhey all, trying 14.04 MAAS and getting this error when trying a pxe boot; Exception: {'__all__': [u'Node group interface with this Nodegroup and Interface already exists.']}01:08
Kupo24zany idea?01:08
bigjoolsKupo24z: which log?01:09
Kupo24zmaas.log01:09
Kupo24zlooks like boot images are not importing either, similar error01:09
bigjoolscan you paste the log please01:10
bigjoolsand the pserv.log01:10
bigjoolsand the cluster.log01:10
bigjoolsare you on the latest revision 2202?01:10
Kupo24zhttp://paste.ubuntu.com/7219582/01:11
Kupo24zbeta 201:12
bigjoolswhat is the package version?01:16
bigjoolsmaas package01:16
bigjoolsthe time stamps do not match up on the logs for the operations you say they are doing01:19
bigjoolsI think you have a cluster that is not accepted yet01:19
bigjoolswhat did you do leading up to this error?01:20
bigjoolsupgrade? fresh install?01:20
Kupo24zfresh install from beta 2 iso, updating apt right now01:21
bigjoolsI think the iso had a problem01:21
bigjoolspackaged install from apt is OK01:21
Kupo24zon reboot after update http://paste.ubuntu.com/7219609/01:25
Kupo24zVersion: 1.5+bzr2227-0ubuntu101:25
Kupo24zthis is the output on import boot images: http://paste.ubuntu.com/7219621/01:27
Kupo24zim assuming 'raised unexpected: AssertionError(u'MAAS_URL is not set.' is the cause01:31
Kupo24zhowever in /etc/maas/maas_cluster.conf i can see it there01:31
bigjoolscan you remove maas entirely and re-install from scratch from the archive please01:33
bigjoolsapt-get purge '*maas*'01:33
bigjoolsI don't trust the iso01:33
Kupo24zI get E: Couldn't find any package by regex '*maas*' with that, syntax different?01:33
bigjoolsargh01:34
bigjoolsapt-get purge maas and then I think apt-get autoremove should get rid of the rest01:34
Kupo24zdoes it require a maas user? django.db.utils.OperationalError: FATAL:  password authentication failed for user "maas"01:38
Kupo24zhttp://paste.ubuntu.com/7219650/01:40
bigjoolsurrrr01:44
bigjoolsis this while purging?01:44
bigjoolsroaksoax_: on the offchance you didn't actually go yet, can you look at this? --^01:45
bigjoolsKupo24z: is this while purging?01:45
diadistiso/01:51
diadistisIs it possible to use maas for servers that are not in the same subnet?01:53
bigjoolsdiadistis: yes, you need to install a cluster controller on the subnet01:54
bigjoolseach subnet, I mean01:54
diadistisThe problem I'm facing right now is that we have about 20 dedicated servers in 12 subnets. I tried to use ipxe to boot them but no luck...01:55
bigjoolsdiadistis: http://maas.ubuntu.com/docs/cluster-configuration.html01:57
=== jhobbs_ is now known as jhobbs
=== CyberJacob|Away is now known as CyberJacob
bigjoolsrvba: have you seen bug 130407806:45
ubot5bug 1304078 in MAAS "Endpoint /MAAS/api/1.0/files/?op=list returns HTTP 500 with juju-core." [Undecided,New] https://launchpad.net/bugs/130407806:45
rvbabigjools: just saw it yeah.  I'll have a look now that https://code.launchpad.net/~rvb/maas/migr-bug-1302156/+merge/214667 is up for review.06:45
jtvSpeaking of reviews: I need a few!  One looks huge but is really just moving code.07:01
bigjoolsbtw folks, we don't need 14.10 milestones that have a 1.5 bugtask on 14.0407:07
bigjoolsif it's fixed in 1.5 then it's fixed in trunk07:07
jtvAssuming we backport from trunk to 1.5?  Because there was talk of doing it the other way around.07:10
bigjoolsalways backport07:14
bigjoolsif we chop and change things will get *very* confusing very quickly07:14
jtvOK07:25
jtvAnybody free to review https://code.launchpad.net/~jtv/maas/split-boot_resources/+merge/214670 ?  It's really just moving code, no other changes.07:26
jtvAlthough one obscure function moved "to nowhere."07:26
rvbabigjools: couldn't reproduce the problem from https://bugs.launchpad.net/maas/+bug/1304078.  Will follow up when I get more details.07:28
ubot5Ubuntu bug 1304078 in MAAS "Endpoint /MAAS/api/1.0/files/?op=list returns HTTP 500 with juju-core." [Undecided,Incomplete]07:28
bigjoolsrvba: he's using an old version07:28
rvbabigjools: no, just the cloud archive.07:28
rvbaOn precise.07:29
bigjoolsyes, an old version :)07:29
rvbaWell, precise is still supported.07:29
bigjoolsand old!07:29
bigjools:)07:29
rvbaokay :)07:29
bigjoolsrvba: don't we have CI for that version in the lab?07:29
* rvba checks07:30
* bigjools stops to eat, back in a bit07:30
rvbabigjools: no, what we test in the lab is whatever is in the daily PPA (i.e. the package built from 1.2)07:31
mwhudsonbigjools: btw i can finally try to break maas-on-arm tomorrow i think07:49
bigjoolsrvba: urgh, I guess we should fix that08:00
bigjoolsmwhudson: tip top!08:00
rvbabigjools: we probably should.  I'll ask Diogo to do it.08:01
bigjoolsrvba: thanks08:03
jtvAny reviewers available for https://code.launchpad.net/~jtv/maas/split-boot_resources/+merge/214670 ?  It's a large diff, but it's all moving code, not changing it.09:13
strikovjtv: Hi Jeroen. What's the idea behind make_image_spec() in tests? What's the problem with let say hardcoded arch and release? Thanks!09:22
jtvstrikov: it's a slightly controversial issue.  On the one hand, it's nice to have concrete human-readable strings in the test, and to have them look realistic.  On the other hand, tests should not pass "by accident."09:23
jtvFor example, if a test says "arch=i386" somewhere, it could be that it's hitting something that's broken for every architecture except i386, because that happens to be default somewhere else.09:23
jtvGenerally, with the factory style in our tests, we try to show that the behaviour we want has no implicit dependencies on other setup, configuration, defaults, etc.09:24
jtvAny two items that a test creates are different and unrelated, unless there needs to be some specific connection — and then the test makes it explicitly.09:25
strikovjtv: we're using release-XXXXXX and arch-XXXXXX generators right now, maybe it's better to stick with some more realistic value. But I got your point in general. Thanks!09:25
jtvThose names are a compromise, really.09:25
jtvYou get random, but you also get recognisable.09:26
strikovjtv: True09:26
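A minimal sketch of the point jtv makes above about tests passing "by accident". The names `make_name`, `kernel_path` and `buggy_kernel_path` are hypothetical and invented for illustration; they are not taken from the MAAS code base. The buggy variant silently ignores its `arch` argument, which a test hardcoded to i386 would never notice.

```python
import random
import string


def make_name(prefix, size=6):
    """Return a recognisable but unpredictable name, e.g. 'arch-k3x9qa'."""
    alphabet = string.ascii_lowercase + string.digits
    suffix = "".join(random.choice(alphabet) for _ in range(size))
    return "%s-%s" % (prefix, suffix)


def kernel_path(arch="i386", release="trusty"):
    """Hypothetical code under test: build a path from arch and release."""
    return "/%s/%s/linux" % (arch, release)


def buggy_kernel_path(arch="i386", release="trusty"):
    """Same as kernel_path, but broken: the arch argument is ignored."""
    return "/i386/%s/linux" % release


def path_is_correct(function, arch, release):
    """The check a test would make against either implementation."""
    return function(arch, release) == "/%s/%s/linux" % (arch, release)


if __name__ == "__main__":
    # A hardcoded i386 test cannot tell the two implementations apart.
    print(path_is_correct(buggy_kernel_path, "i386", "trusty"))
    # A generated architecture exposes the hidden dependency on the default.
    print(path_is_correct(buggy_kernel_path, make_name("arch"), "trusty"))
```

The `arch-` prefix keeps the generated value readable in a failure message, which is the compromise between random and recognisable mentioned in the conversation.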
rvbagmb: jtv: The maas-test failure in the lab needs investigation http://paste.ubuntu.com/7220923/10:04
jtv"Bad Request" *after* destroying the VM?10:04
rvbajtv: the ordering of the messages is wrong.10:05
rvbagmb: shouldn't you backport the HWE doc onto 1.5?10:05
rvba(Looks like it's only in trunk.)10:06
=== CyberJacob is now known as CyberJacob|Away
jtvrvba: I wonder if this means that we broke the interaction between the API client and the API somehow.10:08
jtvBecause that looks like the first API request that maas-test makes.10:09
jtvOr, of course, we're just setting a value that is no longer in the config...10:09
jtvMaybe we're setting an unsupported series?10:11
rvbaYeah, that would be my guess too.10:11
jtvMight be nice to dump the response body at that point in maas-test...10:12
rvbajtv: I /think/ the change I just landed will fix the problem…10:15
jtvAh good.10:15
jtvI'll pick up one of those other maas-test bugs then.10:15
rvbajtv: maas-test CI is still failing :/10:25
rvbai.e. my change didn't fix the pb.10:25
jtvrvba: what was the fix that didn't work?10:30
rvbajtv: it's not that it didn't work.  I just didn't fix the CI problem (which I haven't diagnosed properly yet) https://code.launchpad.net/~rvb/maas-test/only-trusty/+merge/21431210:31
jtvOh, it was something you had already and hoped might _also_ fix the problem?10:31
jtvAFAICT there is no real validation of the config value in that set_config call, is there?10:32
rvbajtv: yes, I hoped it would fix the pb but it didn't.  The only validation is that it's a valid series.10:33
rvbaIIRC10:33
jtvWe validate that?  In the set_config call?10:33
jtvI didn't think we did...10:33
jtvI do remember making a change: I moved the commissioning series from the "networking" section of the config to the "Ubuntu" section.10:34
jtvBut that was only in the inline dict in get_default_config, I think — in which case it shouldn't matter, right?10:34
rvbaNo, it shouldn't.10:35
jtvI don't see any kind of validation of the value.10:36
jtvThe traceback also shows us that urllib2 will raise an exception when it gets an error code from the API...  the maas-test code only checks for a non-OK return value.10:38
jtvAren't we adding the server-side logs to the test details though?10:38
rvbaI don't see the logs anywhere.10:39
jtvrvba: maybe that's because the error happens during setUp, and we don't gather the logs yet at that stage.  :/10:40
rvbajtv: I'm debugging it the problem in the lab manually…10:50
rvbas/it//10:50
jtvrvba: meanwhile I'll put up a branch that makes the fixture dump logs if it fails at this point.  It doesn't look very invasive.10:51
jtvrvba: I do wonder if the RPC connection time is an issue for maas-test...10:52
jtvrvba: https://code.launchpad.net/~jtv/maas-test/maasfixture-log-earlier/+merge/21471810:59
jtv ← should help debug that problem10:59
rvbajtv: yep, looks good.11:09
rvbajtv: not sure it will help with our immediate pb though.11:09
jtvIt will activate the logging of server-side information before the fateful API request.11:10
jtvSo as long as the API logs the failure, we'll get it.11:10
jtvBut I do wonder: does maas-test wait for the cluster and region to hook up their RPC?11:11
rvbaI don't think the API will log a failure, it's a validation error.11:11
jtvWe don't know that.11:12
rvbaTrue, let's see.11:12
jtvYeah.11:12
jtvMeanwhile, I have to call it a day.11:12
strikovjtv: I want to come up with a test that generates pretty complex metadata (multiple versions inside the product, each with a specific label and set of subarches). I started to do it in a 'random fields' fashion (as you did TestMain) but feel that it's too much (i had to create a bunch of code to just generate this fields the right way). Any ideas which fields should be indeed random and which one I can hardcode?11:19
jtvstrikov: put the complexity into the unit tests, where it's still controllable.  Otherwise updating the test later becomes a nightmare.11:23
jtvFactory methods can help a lot: "create an X for me with all the values randomised."11:24
jtvThe overall test will show that the parts fit together; the unit tests can put real stress on each of the parts.11:25
jtvThe trick for the tests is to hate whoever writes the code, and try to prove them Wrong in every way possible.  Even when that person is actually you.11:26
jtvIn other words, schizophrenia is one of the most valuable traits in software development.11:26
jtvIf you try to put that sort of thing in the big, end-to-end tests, it inevitably becomes a little arbitrary which corner cases the test does or doesn't exercise.11:27
jtvWith unit tests, it's easier to throw the real nightmares at the code.11:28
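One way to read "create an X for me with all the values randomised" is a factory that randomises every field unless the test pins it. The sketch below assumes a four-field `ImageSpec` tuple and a `make_image_spec` helper with keyword overrides; both are hypothetical stand-ins written for this illustration, and the real helpers in the MAAS tests may look different.

```python
import random
import string
from collections import namedtuple

# Hypothetical stand-in for an image specification; the field names are
# assumptions made for this example.
ImageSpec = namedtuple("ImageSpec", ["arch", "subarch", "release", "label"])


def make_name(prefix, size=6):
    """Return a recognisable but unpredictable name such as 'arch-x7k2pq'."""
    alphabet = string.ascii_lowercase + string.digits
    suffix = "".join(random.choice(alphabet) for _ in range(size))
    return "%s-%s" % (prefix, suffix)


def make_image_spec(**overrides):
    """Build an ImageSpec with random fields, except those the test pins."""
    fields = {field: make_name(field) for field in ImageSpec._fields}
    fields.update(overrides)
    return ImageSpec(**fields)


if __name__ == "__main__":
    # Only the release matters to this imaginary test, so only it is fixed.
    print(make_image_spec(release="trusty"))
```

A unit test then states explicitly which values it cares about, and everything else stays unrelated by construction.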
strikovjtv: Just to make sure that I got you correctly. What do you mean by unit tests -- something which resides in src/provisioningserver/import_image/tests/?11:31
jtvWell yes, but that's not the whole story.  :)11:31
jtvI mean tests that take one small part of the software ("unit"!) and test it in detail, by calling it directly.11:31
* jtv has to go now11:49
jtvstrikov: for examples, have a look at the existing tests in src/provisioningserver/import_image/tests/, but specifically the tests for the lower-level functions, not the test for main().11:51
jtvGood night!11:51
rvbagmb: allenap: time for a tiny review? https://code.launchpad.net/~rvb/maas-test/maas-test-use-trusty/+merge/21475113:39
allenaprvba: otp13:40
gmbrvba: Sure13:40
gmbThat worked out well then :)13:40
allenaprvba, gmb: roaksoax just reminded me that we need to ensure that maas-test’s changelog is up to date, and that each change since the last upload has a bug number attached. Do you know if that’s the case?13:51
gmbEr, nope.13:52
rvbaIt's probably not the case.13:52
rvbagmb: I wonder if you missed my message from earlier… shouldn't you backport the HWE documentation to 1.5?13:57
gmbrvba: Yes, I think I missed that; had some connection problems with IRC…13:58
gmbrvba: Good point. I’ll do that now.13:58
rvbaCool.13:58
allenaprvba, gmb: Do either of you fancy doing it? :)14:16
rvbaallenap: I created a bug for the change I just landed.14:17
rvbagmb: is it normal that the HWE doc isn't linked from the main index?14:19
gmbrvba: Nope, that’s an oversight. I’ll fix it.14:37
rvbagmb: okay.  While you're at it, maybe add a note similar to the one we have in docs/networks.rst to state that this feature is new.14:43
gmbYep14:43
bladernr_so... now that things have changed again (since the last time I updated my trusty maas server) what do I edit to ONLY download trusty boot images (PXE and Ephemeral)14:55
bladernr_is /etc/maas/import_pxe_files still valid (and /etc/maas/import_ephemerals) or is boot_resources.yaml the file now?14:56
bladernr_err... bootresources.yaml14:56
roaksoaxbladernr_: bootresources.yaml from now on15:50
bladernr_roaksoax: yeah, figured that out.  the old files shouldn't have been left if they're no longer honored, IMO... or at least should have been renamed to something like import_pxe_files.unused15:52
bladernr_and just to be sure, it's safe now to delete /var/lib/maas/ephemeral? that would free up 20GB of disk space...15:53
bladernr_I'm guessing yes but wanted to confirm to avoid hosing my server15:53
bladernr_Under this new boot-resources scheme, how long are snapshot dirs kept?16:07
bladernr_if I update daily, and pull in, say, 4GB per day of new images, if there's no garbage collection or whatever, I could quickly run out of disk space, depending on my maas server's setup16:08
allenapbladernr_: You can delete snapshots as you like, just leave the one that’s pointed to by the current symlink. The snapshots are created by hard-linking to files in the cache directory, so, even if you delete all snapshots, a sync should not need to download much, if anything.16:23
bladernr_It does if I'm pulling in x86 and amd64 dailies for development... it's at least 12 images a day if trusty images are spun daily16:24
bladernr_3.5 - 4GB estimated per day, so if I have a cron set to update the images each morning, and no automated garbage collection, and forget cron is running, I could eat up 100GB in 25 days.16:25
bladernr_I must admit though, I REALLY REALLY appreciate that the cluster now tells me exactly what images it has available16:26
bladernr_there are some really great UI changes in the recent updates :D16:26
bladernr_and that leads to another question about how these dirs are handled...  "current" points to the latest snapshot dir.  So lets say I'm pulling down Precise x86 and amd64 images, and trusty daily x86 and amd64 images.  Precise images won't be new each day, so the snapshot dir for today could have only trusty images in it.  Will the new method aggregate those dirs to allow me access to all of the available16:30
bladernr_images?16:30
bladernr_ok, curious...16:52
bladernr_I added precise x86 and amd64 to bootresources.yaml and re-ran the import-pxe-files command.  I have a second snapshot that is 6.8GB vs 3.5 for trusty only.16:53
bladernr_So, question is: did this copy my existing trusty stuff over, or re-download?  That would answer the above question about how existing stuff is handled.16:53
=== roadmr is now known as roadmr_afk
allenapbladernr_: It should have hard-linked to the pre-existing images, so 3.5GB of the 6.8GB in the second snapshot should be the same on-disk data as the first.17:02
allenapbladernr_: But if you discover otherwise, please file a bug.17:02
allenapbladernr_: Btw, glad to hear you like some of the new stuff :) Unfortunately, for now, you’ll have to arrange garbage collection yourself.17:03
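A minimal sketch of do-it-yourself garbage collection along the lines allenap describes: remove every snapshot directory except the one the `current` symlink points to. The directory layout (`snapshot-*` directories next to a `current` symlink) and the function name `prune_snapshots` are assumptions for illustration; check the actual layout on your server before deleting anything.

```python
import os
import shutil


def prune_snapshots(root, current_link="current", prefix="snapshot-"):
    """Delete snapshot dirs under root except the one 'current' points to."""
    keep = os.path.realpath(os.path.join(root, current_link))
    removed = []
    for name in sorted(os.listdir(root)):
        path = os.path.join(root, name)
        # Skip anything that is not a real snapshot directory.
        if not name.startswith(prefix) or os.path.islink(path):
            continue
        if not os.path.isdir(path) or os.path.realpath(path) == keep:
            continue
        shutil.rmtree(path)
        removed.append(name)
    return removed
```

Because snapshots are hard links into the cache directory, removing an old snapshot only frees space for files that no remaining snapshot or cache entry still links to.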
smoserallenap, are you sure?17:09
smoserwhat did you mean.17:09
smoseri think it should clean up after itself.17:09
allenapsmoser: I looked at the code in MAAS and I didn’t see any clean-up. Is it in simplestreams?17:09
smoserbladernr_, "current" is current for everything. they get munged into that dir.17:09
smoseri believe the sync is done with max=117:10
smoserer... wait. thats not relevant.17:10
smoseri thought that it only kept 2 things.17:10
smoseri was pretty sure that oleg did that.17:10
smoserif not then its a critical bug.17:10
allenapsmoser: There’s only one snapshot in the Garage MAAS, so maybe it is cleaning up after all.17:10
allenapOleg doesn’t seem to be around today to ask.17:11
allenapOr he’s EODed.17:11
smoserallenap, and du should not be tricked by hard links.17:13
smoserit should count correctly.17:13
allenapsmoser: Aye, but `du snapshot-1` and `du snapshot-2` run separately will sum up to more than `du snapshot-[12]`.17:17
smoserand in both cases its counting is correct :)17:17
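The `du` behaviour discussed above can be sketched by counting each inode only once. This is an illustration, not how `du` is implemented: `disk_usage` is a made-up helper, and it sums apparent file sizes (`st_size`) rather than allocated blocks.

```python
import os


def disk_usage(*roots):
    """Sum file sizes under roots, counting each hard-linked file once."""
    seen = set()
    total = 0
    for root in roots:
        for dirpath, _dirnames, filenames in os.walk(root):
            for filename in filenames:
                info = os.lstat(os.path.join(dirpath, filename))
                # Hard links to the same data share a device and inode.
                key = (info.st_dev, info.st_ino)
                if key not in seen:
                    seen.add(key)
                    total += info.st_size
    return total
```

Measured separately, two snapshots each report the shared images in full; measured together, the shared images are counted once, so the combined figure is smaller than the sum of the two.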
roaksoaxbladernr_: should be. Please do file bugs for all that stuff you are finding17:38
=== vladk is now known as vladk|offline
=== roadmr_afk is now known as roadmr
bladernr_smoser, allenap thanks, just double checked and it is indeed hardlinking to the older items... (the hard links threw me, I'm so used to everyone using symlinks for that type of work)18:23
=== CyberJacob|Away is now known as CyberJacob
=== vladk|offline is now known as vladk
tych0hi allenap, so r.e. calling the celery job19:42
tych0i am getting a 403, http://paste.ubuntu.com/7223235/19:43
tych0looking at the code, the nodegroup is the only one allowed to call that celery task?19:43
allenaptych0: That sounds about right. Do you have access to those credentials?20:07
=== vladk is now known as vladk|offline
bladernr_Is there any documentation, beyond a couple VERY brief blog posts I've managed to find, that explains in good detail the various options and ways to modify the fast-path install via curtin_userdata?22:26
bladernr_I have a working d-i preseed in maas that I now want to translate into cloud-init-isms and have found bits related to ec2 cloud-config that I'm not sure work in MAAS...22:27
bladernr_good example I haven't found yet, I can add a PPA (thanks to smoser's blog), but not sure how to do things like update apt cache, then install individual packages after passing things to debconf-set-selections22:34
=== CyberJacob is now known as CyberJacob|Away
bladernr_Oh.. so this seems to be working a lot easier than I was thinking it was going to.  Not sure what I was doing wrong before, but adding in things line by line and re-doing the install to see the progress seems to be working well thus far.22:43
bigjoolsbladernr_: if you collect useful info can you please let me know and I will add it to the maas docs23:54
