/srv/irclogs.ubuntu.com/2009/02/12/#ubuntu-kernel.txt

praveen_i want to know abt trace flag and i need some pointers\04:08
=== praveen_ is now known as shadow_fax
lesshastehi11:21
lesshasteI am trying to produce some debugging info for kernel panics I get the whole time11:22
lesshasteis this the right way to run kexec?11:22
lesshastesudo kexec -p /boot/vmlinuz-2.6.27-11-generic --initrd /boot/initrd.img-2.6.27-11-generic --append="root=UUID=7d9f52a6-3632-41cc-aa41-67534f3cdd23 ro11:22
lesshasteno one in?11:32
lesshasteit seems the root= option is wrong for some reason11:33
lesshasteI get11:33
lesshasteCannot open `--append=root=UUID=7d9f52a6-3632-41cc-aa41-67534f3cdd23': No such file or directory11:33
mnemoyou got an unmatched " in there11:40
lesshastemnemo, yes sorry.. sudo kexec -p /boot/vmlinuz-2.6.27-11-generic --initrd /boot/initrd.img-2.6.27-11-generic --append="root=UUID=7d9f52a6-3632-41cc-aa41-67534f3cdd23"11:41
lesshastegives11:41
lesshasteCannot open `--append=root=UUID=7d9f52a6-3632-41cc-aa41-67534f3cdd23': No such file or directory11:41
lesshastemnemo, so I must just have it completely wrong11:42
lesshastehi Wellark 11:44
mnemoso you have one kernel that gives you panic and one that works?11:44
lesshastemnemo, no.. I have one that panics every day or so11:44
lesshastemnemo, I don't have one that obviously doesn't panic11:44
mnemousually you do "kexec -l kern_img"11:44
lesshastemnemo, right.. did you see my attempt at the command line above?#11:44
mnemoyeah thats the thing, you dont have any -l  ??11:45
lesshastemnemo, no I am using -p11:45
lesshastemnemo, but check out11:45
lesshastehttp://www.mjmwired.net/kernel/Documentation/kdump/kdump.txt11:45
lesshastethe instructions are to do11:45
lesshaste  kexec -p <dump-capture-kernel-bzImage> --initrd=<initrd-for-dump-capture-kernel> --append="root=<root-dev> <arch-specific-options>"11:46
lesshasteso my line is sudo kexec -p /boot/vmlinuz-2.6.27-11-generic --initrd /boot/initrd.img-2.6.27-11-generic --append="root=UUID=7d9f52a6-3632-41cc-aa41-67534f3cdd23"11:46
lesshasteclearly I have the root= part completely wrong11:46
lesshastebut I don't understand as that is what I have in menu.lst for grub11:46
lesshasteand cat /proc/cmdline gives me  cat /proc/cmdline 11:47
lesshasteroot=UUID=7d9f52a6-3632-41cc-aa41-67534f3cdd23 ro quiet splash crashkernel=64M@16M11:47
mnemoI think the --append part is fine... I think you forgot some other part so that its assuming that "--appendBLAH" is the filename11:47
mnemobut check out "man kexec"11:48
mnemoit says "kexec -l img" but for -p it doesnt say "kexec -p img" it just says "kexec -p"11:48
lesshastemnemo, sudo kexec -l /boot/vmlinuz-2.6.27-11-generic --initrd /boot/initrd.img-2.6.27-11-generic --append="root=UUID=7d9f52a6-3632-41cc-aa41-67534f3cdd23"11:48
lesshasteCannot open `--append=root=UUID=7d9f52a6-3632-41cc-aa41-67534f3cdd23': No such file or directory11:48
mnemoi dunno, I have never used of kexec before so im really the wrong person to ask :) :)11:48
lesshasteit doesn't make any differnce11:48
lesshasteI have the syntax wrong11:48
mnemolesshaste: jaunty will have automatic ops crash reporting at least11:50
mnemobut I guess that's a long time to wait just to report it ;o11:50
lesshastemnemo, you see in the man page it says " Passing  the  exact  contents  of11:50
lesshaste       /proc/cmdline into command-line-options is the  safest  way  to  ensure11:50
lesshaste       that correct values are passed to the rebooting kernel.11:50
lesshaste"11:50
lesshasteso I have11:51
lesshaste sudo kexec -l /boot/vmlinuz-2.6.27-11-generic --initrd /boot/initrd.img-2.6.27-11-generic --append="root=UUID=7d9f52a6-3632-41cc-aa41-67534f3cdd23 ro quiet splash crashkernel=64M@16M"11:51
lesshasteand that is wrong apparently!11:51
lesshastegrrr!11:51
mnemotry --command-line=BLAH instead of --append=BLAH then?11:51
IntuitiveNippleAre you missing "--args-linux" before --append ?11:54
loolTheMuso: Hey, I checked the build failures of linux-ports on various arches; it seems we want to enable makedumpfile on more arches: kexec-tools is needed first on armel lp #323634, I don't know about hppa, powerpc and amd64 needs the bdep to be updated from makedumpfile [i386] to makedumpfile [!armel]11:56
looland I don't know about ia6411:56
lesshasteIntuitiveNipple, it's not in the compressed kernel version at http://www.mjmwired.net/kernel/Documentation/kdump/kdump.txt11:56
TheMusolool: makedumpfile is only available for i386 powerpc ia64 and lpia, ports does not build any armel kernels.11:57
TheMusolool: ia64 makedumpfile fails, same with powerpc11:57
TheMusolool: haven't investigated why, just want them to build for now.11:57
lesshasteIntuitiveNipple, ok this is just weird11:57
lesshastesudo kexec -l /boot/vmlinuz-2.6.27-11-generic --initrd /boot/initrd.img-2.6.27-11-generic --args-linux --append="root=UUID=7d9f52a6-3632-41cc-aa41-67534f3cdd23 ro quiet splash crashkernel=64M@16M"11:57
lesshasteCannot open `--args-linux': No such file or directory11:57
lesshastehi Lure 11:58
mnemolesshaste: you should use "--initrd=BLAH" instead of "--initrd BLAH"11:59
lesshastegaaaaaah!11:59
TheMusolool: again we don't build amd64 kernels with ports11:59
lesshastethanks :))11:59
mnemonp :)11:59
loolTheMuso: Ok11:59
lesshasteblimey that took a long time11:59
loolTheMuso: I had in mind that you could probably keep a very similar bdep as the main kernel which should reflect arches where makedumpfile should theoritically be available and only disable it in the rules, but it's fine like that; thanks12:01
loolTheMuso: Also, I was curious whether you plan to move the tree to ubuntu/ like the other in-archive trees?  or is this reserved to a special team?12:04
TheMusolool: ubuntu/ is only for the kernel team afaik, I don't have access to it. In addition, the kernel team don't want very much to do with ports, so I don't know what will be happening about where the official prts tree will end up.12:05
TheMusoSo I am keeping it under my own namespace currently.12:05
loolTheMuso: Is there a way to share access to this tree to others (I think NCommander also contributed to the git tree?!?)12:06
TheMusolool: unless everyone has ubuntu/ access, the easiest way is to create one's own tree, and ask me to merge/pull changes I guess. I know NCommander has done ports work before, but that was for intrepid/early jaunty, and ports is now maintained somewhat differently (how the lpia kernel was maintained before it was included into mainline jaunty).12:07
loolTheMuso: Understood12:07
TheMusolool: If things work out with me maintaining ports in my own time, and if others want to help work on ports, there may be grounds to making a team that has access to something like ubuntu-ports or something similar.12:09
lesshastehow do ubuntu kernel people normally try to diagnose kernel panics?12:23
lesshastehi Lure 12:44
lesshastehi abogani 13:17
rtgamitk: where are you on the config file hierarchy changes and the ARM config file updates?13:23
rtgI thought I'd take a stab at getting LPIA into the Jaunty archive today.13:24
amitkrtg: Almost done13:31
amitkrtg: 10 mins to finish build testing13:31
rtgamitk: no rush, I've plenty of other minutia to keep me busy13:33
amitkrtg: it is an abi bumper13:36
amitki guess it will be for lpia in any case13:36
rtgamitk: I had planned on bumping ABI anyway 'cause I want to tuurn on CRDA13:38
IntuitiveNipplertg: possible regression in Intrepid (ath5k); you might want to cast an eye over it: bug #32723713:38
ubot3Malone bug 327237 in linux "Kernel 2.6.27-11 in 8.10 has no WiFi support" [Undecided,New] https://launchpad.net/bugs/32723713:38
amitkrtg: changes compiled successsfully and pushed out to git13:39
rtgIntuitiveNipple: contact smb_tp as he is responsible for maintenance and regressions.13:39
rtgamitk: thanks13:39
IntuitiveNippleokay13:39
smb_tpIntuitiveNipple, I look at it13:40
IntuitiveNipplesmb_tp: thanks.13:40
maxbHmm... madwifi and ath5k both loaded in that intrepid bug - is that badness?13:44
rtgyep - madwifi borks the hardware. there is a pending bug fix which includes blacklisting madwifi13:45
smb_tprtg, So was madwifi not there before?13:46
rtgit ought to have been, but its kind of racy.13:46
maxbThe wifi%d line looks a bit odd too - %d not being interpolated?13:47
IntuitiveNipplemaxb: yeah, and that is generated by ath9k :p13:48
IntuitiveNipple(drivers/net/wireless/ath9k/core.c:1127)13:48
maxbThe pending fix leaves things jockey-able, right? I have some hardware which works with madwifi but not ath5k13:48
amitkrtg: are you looking at https://bugs.edge.launchpad.net/ubuntu/+source/ixp4xx-microcode/+bug/328188 13:58
ubot3Malone bug 328188 in ixp4xx-microcode "please include firmware from ixp4xx-microcode in the linux-firmware package for armel" [Undecided,New] 13:58
rtgamitk: not yet. you wanna do it?13:59
amitkrtg: not sure what needs doing...13:59
rtgamitk: ok, I'll get to it as I think I'm assigned the bug14:00
amitkjust add the firmware to the jaunty/firmware directory?14:00
amitkrtg: ^14:00
rtgand make sure the license/copyright stuff gets added to the package14:00
* amitk looks at how the firmware udeb is created14:01
smb_tpIntuitiveNipple, Actually the message comes from lrm:ubuntu-restricted/madwifi/ath/if_ath.c:44614:04
IntuitiveNipplesmb_tp: really? oh, duplicates then! I thought it seemed strange every module trying to grab it :)14:05
smb_tpThey are quite similar. :)14:05
IntuitiveNippleMy brain's mush... didn't go to bed last night so only just functioning right now 14:06
rtgsmb_tp: uh, the bug I was thinking about is https://bugs.edge.launchpad.net/ubuntu/+source/linux/+bug/208137, but its the same solution for madwifi v.s. ath5k.14:06
ubot3Malone bug 208137 in module-init-tools "Add vt8623fb to blacklisted framebuffer drivers" [Medium,In progress] 14:06
rtgIn fact, shouldn't jockey handle this?14:07
smb_tprtg, So, the solution would be to blacklist madwifi and jockey should handle/load it if required?14:09
rtgsmb_tp: actually, if they just ran jockey it ought to do the right thing.14:10
tjaaltonany idea why the qla2xxx driver takes one minute to initialize?14:31
tjaaltonon jaunty14:31
tjaaltonthe default rootdelay is 30 seconds, so unless I change it the boot fails14:32
tjaaltonthis is a blade server with a dual-HBA adapter14:33
tjaalton-adapter14:33
tjaalton[   62.940070] qla2xxx 0000:13:00.0: Firmware image unavailable.14:34
tjaaltonmaybe that has something to do with it?14:34
rtgtjaalton: but it eventually works?14:36
tjaaltonyes14:37
rtgtjaalton: well, start a bug report and attach dmesg. maybe we can eyeball it from the driver source14:37
tjaaltonrtg: yeah, started already14:38
rtgtjaalton: I've got a server that is totally wedging, it looks like igb. It started yesterday after a BIOS update and perhaps an update to the latest Jaunty -7 kernel.14:39
rtgI should try nomsi14:40
tjaaltonrtg: this has happened since I started testing multipath-booting (late december)14:40
rtgtjaalton: ah, nothing really new then. have you bugged upstream?14:41
tjaaltonrtg: haven't had time yet, fixed multipath-tools first so it actually boots up without manual intervention14:42
rtgbiab, have to get close to my server in another room14:42
tjaaltonit could be that the slow initialization is normal in this case, since there are so many paths to the device (8)14:43
tjaaltonrtg: filed bug 32855014:49
ubot3Malone bug 328550 in linux "qla2xxx takes ~one minute to initialize per device" [Undecided,New] https://launchpad.net/bugs/32855014:49
tjaaltonwtf? apt-getting the source for linux-image-2.6.28-7-generic fetches the meta-package14:49
mnemoi told you yesterday timo :)14:50
tjaaltonmnemo: yes, maybe our mirror was the one that was outdated :)14:50
tjaaltonsince I couldn't reproduce it then14:50
mnemohehe yea14:51
mnemoanyway I worked around it with:14:51
mnemoapt-get source linux=2.6.28-7.2014:51
tjaaltonthanks14:51
tjaaltonrtg: do you know the contact point for qla2xxx upstream?14:53
rtgtjaalton: doesn't appear to be in the MAINTAINERS file. You might try sending an email to James Bottomley14:57
tjaaltonrtg: actually, Andrew Vasquez looks like a better match :)14:58
tjaaltonrtg: ok, got a reply already. they'd like it not to complain about the firmware :)15:20
tjaaltonbut I'm not sure if the initramfs has it or not15:20
tjaaltonupdate-initramfs complains as well15:24
smb_tptjaalton, Actually I would suspect that getting linux-meta as the source for linux-image-2.6.28-7-generic is sort of the right thing15:31
tjaaltonsmb_tp: but that only happens as of a few days ago15:33
tjaaltonor yesterday15:33
tjaaltonthe docs still suggest to do that when trying out patches15:34
tjaaltonanyway, the qla2xxx still complains even if I copy the firmware in /l/f/2.6.28-7-server15:35
smb_tpWhich is rather strange. That linux-image-2.6.28-7-generic is a meta-package itself and the source of that is linux-meta15:35
tjaaltonno it isn't :)15:36
tjaaltonbut maybe it's the new apt that does this15:36
tjaaltonthe source of the image is 'linux', and the source for it is 'linux-meta'15:37
tjaaltonso the resolver is different now15:37
smb_tptjaalton, Hrm, yeah. You are so right. It just sounded like the linux-restricted-modules trap I fall into so often15:38
tjaaltonhehe15:38
tjaaltonsigh, the qla2xxx driver still complains even if I copy the fw to /l/f15:49
tjaaltonbbl15:50
Kanortg: http://kanotix.com/files/kernel/unused-patches/2.6.28-ubuntu-qc-usb-compile-fix.patch16:44
Kanodont forget this one16:44
Kanoa x-fi driver would be interesting too16:46
Keybukok17:42
Keybuksomething is definitely, absolutely, officially wrong with I/O in 2.6.2817:43
Kanowhats your problem?17:59
KeybukKano: every single bootchart shows almost continuous IO, with one core continually in I/O wait, and no process claiming it18:06
Keybuknot limited to any particular machine either18:06
Kanoah18:06
Kanoand 2.6.27 or .29 rc is different?18:06
Keybukthe intrepid kernel does not show it18:06
rtgKeybuk: can you try the server kernel? Its got a different I/O sched setting, e.g., deadline 18:57
Keybuksure18:58
elmoor just boot with elevator=deadline19:04
tjaaltonrtg: any idea why the qla2xxx driver still complains about the missing firmware even if I copy it to /lib/firmware (and repack the initramfs)?19:04
rtgelmo: well, there are some other differences as well, like the time slice quanta19:05
elmortg: ah, ok19:05
rtgtjaalton: not off the top of my head. are you sure the name is right?19:05
tjaaltonrtg: checking19:06
tjaaltonrtg: yep, correct name19:08
rtgtjaalton: uh, gimme a bit. I'm in the middle of horking in 2.6.28.519:11
tjaaltonrtg: sure19:11
Keybukrtg: mad I/O still shows up with -server19:14
rtghmm, that sucks.19:14
rtgq:q!19:15
rohanhttps://wiki.ubuntu.com/KernelTeam/SuspendResumeTesting --> the script provided here for stress testing suspend/resume is meant to be run on which distribution?19:39
rohanor is it distro agnostic?19:39
rtgapw:  ^^19:44
apwrohan, it mostly relies on pm-suspend existing.  it has one gnome specific test which uses dbus to initiate a suspend for a more complete test19:46
apwbut that test not working is not fatal, and occurs on kde19:46
rohanapw: ok, so i can use it on kubuntu 8.04?19:46
apwthe failure reporting to launchpad only occurs on the latest jaunty installs and is tied into apport19:46
apwi have run it on 8.04, it will not report back to launchpad automatically on failure there, but still is expected to work19:47
apwif it does not contact me and i'll see what i can do to make it so do19:47
rohanok, thanks19:47
apwnp19:47
rtgtjaalton: can you build a kernel? strip the DEBUG2 macro from line 2711 of drivers/scsi/qla2xxx/qla_os.c19:48
tjaaltonrtg: sure, and it shouldn't take long since the beast has eight cores20:05
rtgtjaalton: it looks like that _has_ to be the failure point.20:05
rtgtjaalton: you could also modprobe it with ql2xextended_error_logging=120:06
rtguse break=top to catch it in initramfs20:06
tjaaltonrtg: if it was set to use the serial port for console, I could. I'll try building the kernel first20:08
rtgk20:08
tjaaltonhmm, so remove those two lines?20:09
rtgtjaalton: no, unwrap the DEBUG2 macro so that it prints something20:10
tjaaltonoh, heh20:10
tjaaltonlet's see20:10
rtgthis is in the qla2x00_request_firmware() function, just to make sure we're looking at the same sources.20:11
tjaaltonyep20:11
tjaaltonrtg: [   63.010090] scsi(0): Failed to load firmware image (ql2400_fw.bin).20:45
tjaaltonrtg: note that the image is not normally on the initrd at all20:46
tjaalton..and actually I didn't remember to copy it manually either20:46
rtgtjaalton: doh! that might have something to do with it. I wonder if there is an option to build it in. lemme check20:47
tjaaltonupdate-initramfs complains about the firmwares20:47
tjaaltonW: Possible missing firmware /lib/firmware/2.6.28-7-server/ql2500_fw.bin for module qla2xxx20:49
tjaaltonetc20:49
rtgtjaalton: thats cause it should be /lib/firmware/ql2500_fw.bin20:50
tjaaltonrtg: that's where they actually are20:51
tjaaltonI'll repack the initrd with the firmware in /l/f20:51
tjaaltonand according to dmesg it really is waiting one minute for the firmware21:00
tjaaltonrtg: same error message with the firmware in /lib/firmware21:05
rtgtjaalton: ok, I'll dig deeper, but its gonna have to wait until tomorrow.21:07
tjaaltonrtg: take your time :)21:08
tjaaltonrtg: qlogic guy says that redhat has a similar problem related to udev, and he'll get back to me later22:21
=== Guest65438 is now known as mxab
=== mxab is now known as maxb
tjaaltonrtg: got a reply; "This is an issue with the initrd (initramfs) infrastructure not supporting the request_firmware() interface."22:47
tjaalton"Basically, the 60 second lag time is the request_firmware() call timing out due to udev being unable to satisfy the call to load firmware"22:48
rtgtjaalton: seems like we ought to be able to get initramfs to do the right thing.23:39

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!