/srv/irclogs.ubuntu.com/2013/11/08/#maas.txt

freeflyinghttp://paste.ubuntu.com/6379554/01:29
freeflyingmaas stop to update dns, a lot of error like this can be found from maas.log01:30
bigjoolswhat api call is causing that?01:31
freeflyingvery likely update_dns01:34
freeflyingwhen celery-region try to update dns maybe01:35
bigjoolscan you show me the celery-region log please01:35
bigjoolsalso the celery-cluster log01:35
freeflyinghttp://paste.ubuntu.com/6379577/01:35
bigjoolsrndc is failing - it was previously working.  What changed inbetween>01:37
bigjools?01:37
bigjoolsI suspect the cluster is not authorised as well, it does periodic lease updates on the api (but I can't tell for sure that's what's failing as you chopped the timestamp on the maasserver log)01:38
freeflyingi found it fails to reolves some node's name, then checked, found ip has changed, but dns not updated, so restart named, it can't be stopped01:39
freeflyingthen did kill it, and restart it again01:39
bigjoolstry this:01:39
bigjoolsmaas set_up_dns01:39
bigjoolsit'll re-write everything DNS-related01:40
bigjoolsalso show me your cluster celery log please01:41
freeflyingNo handlers could be found for log "metadataserver"01:41
freeflyinghttp://paste.ubuntu.com/6379619/01:43
bigjoolsok so there's failures  earlier on for import_boot_images and restart_dhcp_server,  are those ok now?01:45
bigjoolsalso are you still seeing those api failures?  If so can you look at the apache log file and for the corresponding timestamp of a failure look up what the request was01:46
freeflyingbigjools, they're fine now01:47
freeflyingbigjools, but no idea maas.log grows that bigger01:47
bigjoolssorry that doesn't make any sense, are you saying maas.log is growing?01:48
bigjoolsI need to know if you are still seeing those authorization errors for an api call01:48
freeflyingbigjools, I mean the import_boot image seems fine now, we did it 3 day ago01:48
freeflyingbigjools, there is no error from maas.log since 10:3601:50
freeflyingnow is 10:50 on the server01:50
bigjoolsand is everything working?01:50
freeflyingdon't have new node deployed, so not sure, is it normal that if no new ip in lease file, then celery-region won'r have anything to do01:52
freeflyingbigjools, from celery-region.log, there is no action for a while01:52
bigjoolscorrect, the celery-region just writes out changes to DNS as required01:53
freeflyingbigjools, ic, thanks01:54
bigjoolsas I said above you can force a re-write with the "maas set_up_dns" command.01:54
freeflyingyea01:54
=== freeflying is now known as freeflying_away
=== freeflying_away is now known as freeflying
Azendalebigjools: here's my /var/log/maas/pserv.log file (with just the stuff from today, otherwise it was 1.5 million lines!) http://paste.ubuntu.com/6379827/03:04
=== freeflying is now known as freeflying_away
=== freeflying_away is now known as freeflying
=== freeflying is now known as freeflying_away
bigjoolsAzendale: your pserv log looks fine.  Can you snip the appropriate bits for dhcpd out of syslog?06:05
Azendalebigjools: sure06:10
Azendalebigjools: This time, the VM's booted, but the login prompt has something like "192-168-178-68" as the the host name06:13
Azendalebigjools: I can ssh to the individual node, but of course have no idea of the credentials to login06:14
bigjoolsAzendale: how did you allocate and start it?06:14
Azendalebigjools: juju deploy06:15
bigjoolsthen juju puts your ssh key on it, assuming it installed ok06:15
Azendalebigjools: log as requested http://paste.ubuntu.com/6380436/ (syslog filtered through grep "dhcpd")06:19
bigjoolsAzendale: there's nothing that leaps out to explain why you had problems.06:20
bigjoolsare things very reliable when booting one at a time?06:21
Azendalebigjools: it seems to work just fine when I bring them up staggered a bit06:21
Azendalebigjools: I do see in the log "Dynamic and static leases present for 192.168.178.86"06:22
bigjoolsthat's normal06:22
bigjoolsand it's definitely the dhcp stage that's timing out, not tftp?06:22
Azendalebigjools: I've tried to get this to work so many times, it's hard to be sure for 100% of the time06:23
Azendalebigjools: But I know for sure I have seen cases where the PXE boot never gets an address. That didn't seem to happen in this case because I can ssh to the node06:24
bigjoolsI am going to guess that you saturated your network with udp packets06:24
Azendalebigjools: I'm not sure what would cause the machine to come up with no host name (but still with an IP)?06:25
bigjoolsit's pulled its host name from DNS I expect, which means cloud-init failed06:26
Azendalebigjools: what all does cloud init depend on to succeed?06:26
bigjoolsthere's a ton of stuff that can go wrong06:27
bigjoolshard to see without looking at its log06:27
bigjoolsand if you can't ssh in then  you can't get it06:27
bigjoolswe're working soon on making that easier, but for now you have to use a special ephemeral that hard-codes a backdoor user + password06:28
Azendalebigjools: is the log stored on the harddisk? I might be able to power off the machine and then retrieve it, it's a .qcow2 disk image06:28
bigjoolsyes /var/log/cloud-init/06:28
Azendalebigjools: ok, I'll see if I can get a copy06:29
bigjoolsok06:29
bigjoolsI'm leaving soon so I'll catch you Monday, it's Friday here :)06:30
Azendalebigjools: ok. If you think it would help, I could try running through a few cycles of boot various number of machines, and collect data if that would help. If so, just give me an idea of what information it would be helpful for me to gather06:32
bigjoolsnot sure tbh, if it's a networking problem there's not much I can do06:33
bigjoolsare you on 100M, GigE?06:33
Azendalebigjools: it's virtio tap interfaces to a linux bridge on the host06:36
Azendalebigjools: http://paste.ubuntu.com/6380463/ cloud init log06:37
bigjoolsah ok06:41
bigjoolswell that log looks like it booted without maas asking it to boot06:42
Azendalebigjools: hm, weird. I have MaaS set up with the virsh option for power management. The machine would have been triggered to turn on (via virsh) by maas when I ran juju deploy. Wouldn't MaaS expect the machine to reboot automatically after installing?06:45
bigjoolshmm ok06:46
bigjoolsyes it reboots and then installs juju06:46
Azendalebigjools: so, if I'm keeping you, just let me know06:49
bigjoolsI am going now actually06:49
Azendalebigjools: ok, I guess I'll talk to you Monday06:50
bigjoolsyep - let's continue this06:51
bigjoolsbye06:51
jtvrbasak, do let me know if you want help adding the ssh-key-file option to uvtool!06:57
=== CyberJacob|Away is now known as CyberJacob
rbasakjtv: it's in and build in the PPA07:58
rbasakbuilt07:58
rbasakjtv, rvba: though --ssh-public-key might more more consistent than --ssh-public-key-file.07:59
* rbasak doesn't know07:59
=== freeflying_away is now known as freeflying
jtvrbasak: great news, thanks!  The "-file" does make it nicely clear that this is not a key ID.08:43
gnuoyHi, I want to configure bonding on a server under maas control the port I'll be pxe booting via is part of the bond. Any advice on how to go about this ? I can add a script to setup the bonding with the pressed config but it'll be to late then I think.09:00
gnuoy...as the switch will be configured to bond the interfaces right from the start09:00
gnuoyDo I need to doctor the initrd image ? or is there a whole world of pain waiting around the corner ?09:01
=== freeflying is now known as freeflying_away
jtvGrrr why can't I do self.useFixture(TempDir()) in KVMFixture!?10:40
jtvAttributeError: 'KVMFixture' object has no attribute '_cleanups'10:40
jtvallenap: it sounds like the sort of thing you'd know all about...  ^10:40
jtv(And __init__ and setUp do their upcalls, so it's not that)10:41
jtvArrrrgh I have it.  The fixture is never set up.10:48
allenapjtv: Was the call in __init__?10:51
allenapThat dosnae work; needs to be in setUp().10:51
jtvYeah, it was just a test that skipped setUp().  Sorry!10:58
rbasakallenap: got time at some point to talk about the node subarch field thing?11:02
allenaprbasak: Sure. I'm also reading your release combos email right now, so we could talk about that. I need to refresh my mind about the subarch thing. 1145 okay?11:03
rbasakallenap: Sure, thanks!11:04
allenaprbasak: Cool.11:04
gnuoyI've have a Global Kernel Parameter set but one of my nodes its not showing it on the display screen and it doesn't seem to be being passed when the node boots. Any ideas how I can fix that ?12:11
gnuoyall nodes are in the "ready" state12:18
gnuoyI've tried removing the kernel option and adding it back in again12:18
gnuoymaas-cli seems to returns the correct value12:21
gnuoy$ maas-cli maas maas get-config name=kernel_opts12:21
gnuoy"console=ttyS1"12:21
rbasakgnuoy: I think (but am not sure) that the kernel parameters there are for enlistment, commissioning and firing off the installer. For the deployment itself, I think you need to preseed. I'm not sure about what you need to do for fast-path, though.12:23
gnuoyrbasak, preseed is fine, but its very odd that the gui isn't showing the kernel params for just one node12:24
rbasakgnuoy: ah. I didn't realise you meant that it was missing from the gui. I'm not sure about that, sorry.12:25
gnuoyrbasak, ok, thanks for the preseed suggestion.12:26
rbasakgnuoy: no problem. Note that I might be inaccurate. My knowledge is a little dated.12:28
gnuoyrbasak, ok, noted. :-)12:29
rvbagmb: could you please have a look at https://code.launchpad.net/~rvb/maas-test/vm-network4/+merge/194522 if you have time today?  I'd like to get that landed before my long weekend if possible so that you guys will have it on Monday.14:33
gmbrvba: Sure, I'll take a look now.14:33
rvbagmb: well, okay, I also want Jeroen to deal with all the conflicts this will create with his branch ;)14:33
gmblol14:34
rvbaTa.14:34
gmbrvba: Approved with a couple of comments.15:11
rvbagmb: Thanks!15:34
gmbrvba, allenap: I'm almost done for the day; I thought I'd get to jtv's branch but didn't; it's here should either of you want to round out your friday nicely... https://code.launchpad.net/~jtv/maas-test/ssh-key/+merge/19448815:42
allenapgmb: Okeydoke. Have a good weekend dude.16:03
gnuoyIf anyone was following along at home with regards to my earlier problem with installing with bonded nics, in the end initial registration and commissioning worked fine, with the nics on the switch in an active channel-group but the install went dark almost immediately after the initrd was loaded. I got around this by shutting down the second nics interface on the switch.17:22
mthaddongnuoy: I'd like to make sure that's documented somewhere - allenap, how would we get something like that into the maas docs?17:27
gnuoymthaddon, ok, I'll follow that up17:31
mthaddonthx17:32
mthaddonsave someone else your pain :)17:32
allenapmthaddon, gnuoy: The docs at http://maas.ubuntu.com/docs are generated from rst files in lp:maas/docs. Add something there - any of us will be more than happy to help - then propose a merge.17:36
allenap(That's the docs/ directory in lp:maas.)17:36
gnuoyallenap, lovely, thanks17:36
mthaddonthx17:37
allenapgnuoy: Adding docs to MAAS entitles you to a bit wet kiss. Unfortunately, as the Red Squad representative in Norfolk, I would be the one authorised to deliver your reward.17:38
gnuoyallenap, I'll settle for a big wet pint17:39
allenapgnuoy: Deal.17:39
allenapgnuoy: Or you can wait for gmb to visit the county? Take your pick.17:40
gnuoyI'll wait !17:40
allenapThink of all those bristles...17:41
=== Spideyman is now known as Spideyman_afk
=== Spideyman_afk is now known as Spideyman
allenapgnuoy: I've just replied to your weird kernel options bug.18:08
allenapgnuoy: Have a good weekend!18:09
=== Spideyman is now known as Spideyman_afk
=== Spideyman_afk is now known as Spideyman
=== freeflying_away is now known as freeflying
=== freeflying is now known as freeflying_away
=== CyberJacob is now known as CyberJacob|Away

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!