/srv/irclogs.ubuntu.com/2008/10/16/#ubuntu-kernel.txt

pgranerdavidm: pong, sorry eating00:08
davidmpgraner, my laptop is now able to reproduce the bug very well 8 out of 10 boots.00:09
pgranerdavidm: ok, did the workaround solve it?00:09
davidmDo you want it?  I can fedex it to you or tim since it now is in text mode boot00:09
davidmworkaround does solve it.00:09
pgranerdavidm: its definitely a race, its just a matter of narrowing down what with the 3945 driver00:10
davidmIt also alwayst gets past USB boot sequence so you can use serial console if you need it.00:10
pgranerrtg: ping... see above00:10
davidmI have about 20 minutes to drop off time here.00:10
pgranerdavidm: does it have a real serial port?00:11
davidmnope, just USB00:11
davidmBut USB is getting up before the hang from what I see.00:11
davidmAnd as I say I'm in text mode so you can printk debug with it.00:11
davidmusplash is hard off00:12
pgranerdavidm: is it hard hung, i.e. does the caps lock toggle on and off?00:12
davidmhard hung00:12
davidmno caps lock, no alt/printscreen t or 9 all locked solid00:12
pgranerdavidm: can you live without it for a few days? If so send it to rtg this way he has one that repros it00:13
davidmI'm off on Friday so I have no issue giving you unit until Monday00:13
pgranerdavidm: ok send it to rtg, you have the addr?00:13
davidmI need address to do so00:13
pgranerdavidm: via email00:14
davidmOK00:14
davidmI'll fedex for early delevery00:14
pgranerdavidm: you have mail00:15
davidmOK00:16
davidmOK have address will try to get FedEx to take it.  I'll be off line use phone to get me if need be00:21
emmawhat type of error would this be: FATAL error inserting battery /lib/modules/..../acpi/batery.ko): no such device.01:04
NCommanderemgent, offhand, I'd say a bad one. It sounds like your kernel modules went and ran away01:17
NCommanderer, emma 01:20
emmaThat's trouble.01:20
TheMusoTo me it sounds like the module in question can't find the hardware its designed to work with.01:21
emmaI got it from aptitude while installing xserver-xorg in the cli only mini.iso on a computer that's quite old.01:21
NCommanderemma, is it a laptop?01:21
* NCommander notes that should have been the obvious first question01:22
emmaThe odd thing is, besides that fatal error, xserver-xorg runs.01:22
NCommanderWell, the fatal error sounds like its being caused by modprobe and not xorg01:22
emmaNope not a laptop. This is an old dino-puter, I got it for fun and experimenting with linux. :) It's a Dell Optiplex GX1, Pentium III.01:22
emmaAt install it said it had to force aspci01:23
emmaacpi that is.01:24
=== emma is now known as joe-the-plumber
=== joe-the-plumber is now known as emma
khaeruHello?04:28
mdzanyone online with an iwl3945 to debug?09:31
NCommanderSorry, iwlagn here :-/09:52
NCommanderI might be able to dig one up mdz 09:52
NCommandermdz, speaking of things to debug, do you know what is needed to get a kernel.u.c account?09:53
amitkNCommander: it helps to be a regular contributor first before requesting an account.09:56
amitkNCommander: what are you planning to work on?09:56
NCommanderlinux-ports kernel09:56
NCommanderI want to start work on rebasing to 2.6.27 in a side tree so once jaunty is available, and a kernel team member can sign off on it, it can become the jaunty kernel. I'm sorta irked at the age of the -ports kernel09:57
amitkNCommander: why can't that be done by just pulling the current ports tree and doing it locally? Once you have something to show, there won't be a reason _not_ to give you an account :)09:58
NCommanderAt the moment, the 2.6.25 tree is someone fracked 09:58
NCommander*somewhat09:58
NCommanderI dunno what happened to it, but git explodes during a rebasing attempt09:58
* NCommander did have one of the kernel gurus also try it to rule out user error ;-)09:58
NCommanderso its more grab linus's tree, and start it again09:59
amitkNCommander: 2.6.25 -> 26.27 is enough of a big jump that some attempts at rebase might fail because some patches become obsolete and cause conflicts. So we are better of discarding those patches. Trying 'git merge' to 2.6.27 final might get you there quicker. Then we can figure out the actual 'diff' and recreate a new tree.10:01
NCommanderWell, normally I'd agree10:01
NCommanderBut its failed on trival patches as well10:01
NCommanderITs like git can't see that this patch applies cleanly10:02
NCommander(on the rebasing)10:02
NCommanderI personally felt it might be easier to just take the clean 2.6.27 drop, drop debian on it, then pop off each patch individually (there are maybe 15-20 tops)10:02
NCommander(the debian folder that is)10:02
amitkNCommander: i agree. 'git format-patch -o /tmp <commit id>' will help you export all ubuntu/ports patches on top of Linus' tree10:03
* NCommander nods10:04
NCommanderPretty much what I was thinking10:04
ograamitk, http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=497200 is there a way that we get that driver in before final ? 10:06
amitkNCommander: if you came back with such a tree, someone would have a easier time recommending you for a kernel.u.c account. :) But we will be glad to help while you do this...10:06
* NCommander wishes bzr-git existed10:07
NCommanderOh well10:07
ograNCommander, not only you :)10:07
NCommanderI just wanted a place where I could upload my tree more or less10:07
NCommanderI don't have a machine where I canpush a git repo to10:08
ograNCommander, easy to solve btw, you just need to convine linus to switch upstream :)10:08
NCommanderOuch10:08
NCommanderI personally wish he switched to monotone10:09
* NCommander likes mtn10:09
amitkogra: one thing at a time... mobile kernel first :)10:13
ograamitk, indeed, that would only be for post freeze, no hurry :)10:13
amitkogra: when is freeze?10:14
NCommanderamitk, well, I'm fairly new to kernel development, but the -ports kernel has seen no love pre-hardy when it was forked from the normal one, and I think if we're going to have usable ports, this needs to change :-/10:14
ograamitk, slangasek said something like "morning-ish UTC" 10:15
amitkNCommander: agreed. we discussed this in kernel team meeting on tuesday10:15
amitkogra: today?10:15
ograyes10:15
NCommanderamitk, well, I'm a -ports user, REVU is on a sparc box. Its really pathetic that all the ports die because no one can maintain the kernel :-/10:16
ograamitk, it obviously didnt happen yet10:16
ograamitk, whoops it happened right now when i was typing the above 10:16
ograheh10:17
* amitk groans10:17
ogra(see -devel)10:17
LimCorehi, in 2.6.27 (ubuntu 8.10, amd64) - compared to 8.04 on same box (acer laptop single core), CPU usage is awesome (98% in C3 when idle, was 1% on 8.04 bue to probably kernel problem)10:26
LimCorethis rocks10:26
LimCorehowever, now one thing causes wakeups (70 per second)10:27
LimCorei915@pci:0000:00:02.010:27
LimCoreI didnt see that one on 8.04... any ideas?10:27
LimCoreoh ok.  this is only when desktop effects are active.  I guess this is because it uses OpenGL mode...10:28
mdzNCommander: if you have a 3945 system, debugging for bug 263059 would be greatly appreciated10:55
NCommanderI'll see if I can dig one up10:55
=== asac_ is now known as asac
=== mdz_ is now known as mdz
rtgNg: did you catch this little factoid re e1000e corruption? http://kernel.ubuntu.com/git?p=ubuntu/ubuntu-intrepid.git;a=commit;h=67d9b90a1c844bf1c6daaffd2c60561fc8c445f714:52
Ngrtg: yeah, I was gonna ask if we're going to take that :)14:52
NgI know basically nothing about ftrace, but it makes me wonder if it's possible it's causing other subtle weirdnesses14:53
rtgNg: test building now.14:53
rtgNg: its possible. I'm getting David M's laptop this morning 'cause it exhibits Jane's i3945 related crash. Its really weird and racy.14:54
amitkrtg: 5 reboots without a hand on the 3945 now15:52
amitks/hand/hang15:52
rtgmjg59: what happens if you access an I/O mapped register after pci_disable_device() ? I imagine its somewhat HW dependent.15:53
Ngrtg: is your test build going to yield a handy .deb? If so I could do more reboot testing of my weird boot crash/corruption tonight15:53
mjg59rtg: In principle, I don't think the device should decode it15:53
Ngbut that's purely me hoping and wishing that it will magically go away with ftrace ;)15:53
rtgNg: actually, I think its worthy of an upload.15:53
Ngfair enough15:53
amitkNg: I'll have a .deb soon if you are on amd64-generic15:53
Ngnewp, i38615:55
NgI like my proprietary media to work ;)15:55
jdstrandamitk: fwiw, I've got a similar issue with ipw2200, and I got 9 successful boots with the 10th a hang15:56
amitkjdstrand: bug #?15:56
rtgamitk: can you instrument your i3945 driver such that you can tell when the rf-kill handler is called?15:56
jdstrandbug #28440615:57
rtgmjg59: so you think it would wedge?15:57
rtgi.e., stop the PCI bus.15:57
mjg59No, it should just cause an aborted write15:59
mjg59/shouldn't/ wedge it15:59
jdstrandrtg: I don't seem to have a way to enable rf_kill. Fn+F5 will toggle it to 1, but it goes to 0 on reboot. I can disable it in the bios, but then ipw2200 isn't detected/loaded during boot16:01
pgranerBenC: ping16:01
amitkrtg: mdz already sprinkled some printks around rfkill_init() this morning. The .ko is attached to the bug.16:01
* amitk back in 15 mins (dinner)16:01
rtgamitk: I don't have hardware yet. It should get delivered soon.16:01
amitkrtg: I have a kernel compile going with FTRACE disabled. I'll tell after that.16:02
rtgamitk: pull from the repo. I've already committed taht stuff16:02
mdzamitk: so you can reproduce the bug now?16:05
mdzjdstrand: I have a T42 I can test, any hints on reproducing bug 284406?  upgrading it to intrepid at the moment isn't practical, but I can test a CD16:11
jdstrandmdz: not really-- just patience. I've gone as many as 9 consecutive successful reboots16:12
jdstrands/reboots/boots/16:12
jdstrandmdz: I did have associate=0 in modprobe.d/ due to another bug that was fixed in 7.11, but removing that file makes no difference16:13
amitkrtg: yes, I already pulled and building the kernel now. Almost done16:15
amitkmdz: yes, I can reproduce, but very infrequently. Once in 10 tries or so...16:16
amitkrtg: I'm not having too much luck reproducing after adding debug to the kernel cmdline16:34
mdzjdstrand: at what point did this regression get introduced?16:35
mdzasac,all: is there any clarity on whether bug 259157 needs to be fixed in userland or kernel?16:41
jdstrandmdz: I am trying to ascertain that. I *definitely* know I never saw this in hardy. I thought I saw it in 2.6.26-5.17 once this morning, but am trying to reproduce that currently16:42
jdstrandmdz: unfortunately, this is not a machine I actively use, so I may have well been hitting good boots all along in the cycle16:43
jdstrand(though I seem to remember at least one time before the latest kernel that I had a hang, but didn't have time to troubleshoot)16:43
mdzBenC: what is the resolution to be for 246269 (uvesafb)?16:45
amitk*sigh* 20 reboots now without a hang.16:47
mdzrtg: did the hardware arrive?16:47
rtgmdz: yep - cloning /home16:48
mdzrtg: great16:48
mdzrtg: did it hang when you booted it up?16:48
mdzI'm wondering if maybe it's tickled by what sort of APs happen to be nearby, if any16:48
rtgmdz: on conf call, I'll be done in a bit.16:51
asacmdz: there is no userland solution for atheros ... the previous we had used the wpasupplicant madwifi module16:51
asacwhich doesnt work anymore i think16:51
asacmdz: (thats for ath) ... for orinoco the solution could be user land, but the code changed considerably in NM and i havent received a single complained about it (and therefore no testers)16:52
asacmdz: so to answer your question: atheros -> fix in kernel (though at least some appear to work with latest drivers); orinoco -> maybe userland16:53
* jdstrand finally got smart and created /etc/rc2.d/S19rebootme so he doesn't have to hand hold his laptop through endless reboots16:57
BenCmdz: we uploaded a new kernel with vesafb as default17:08
mdzBenC: we reverted from uvesafb to vesafb?17:08
BenCmdz: right17:10
mdzBenC,rtg,amitk,pgraner,cjwatson,slangasek: targeted kernel bug review on its way to your mailbox17:18
rtgmdz: if the rf-kill switch is enabled, then the AP should have no influence17:18
rtgohh ah, locked up.17:19
mdzrtg: good point, several of us tried that17:22
rtgmdz: on another note, I upload Intrepid LBM about 4 hours ago. need to get Steve to release it.17:23
rtgmdz: hmm, fixing this bug it gonna be difficult if I can't _ever_ get David laptop to finish booting. 5 for 5 on boot hangs17:24
mdzrtg: he's managed to get it very reliable17:28
mdzrtg: his boot process is modified from stock (he's not waiting for udevtrigger to finish before continuing)17:28
mdzinit=/bin/sh should get you in of course17:29
rtgmdz: how does that effect rf-kill processing? I assume its udev tha does that as a result of ACPI events.17:29
mdzrtg: the normal boot process does essentially "udevtrigger; udevsettle"  davidm's just runs udevtrigger and keeps going.  so it would mean that there are other init scripts running in the background while udev is still loading modules17:31
mdzit probably gets to runlevel 2 or so before it's done loading modules17:32
=== paran_ is now known as paran
davidm8 out of 10 will lock17:32
davidmmdz i think you put udevsettle back17:33
rtgdavidm: I think I'm 10 for 10. init=/bin/sh never gives a prompt, I had to use break=top17:33
mdzdavidm: I did, but then you took it out again last I checked17:34
davidmyou may want to boot a cd17:34
davidmnope i never touched it 17:34
mdzrtg: so it's possible that it's one of the startup scripts in rcS.d after S10udev which tickles the bug17:34
davidmthat made the lock happen earlier17:35
paranwhere have the linux-image-debug-* packages gone in intrepid?17:38
amitkrtg: will wireless backports happen before release? if so, I'll skip bug #17:41
amitkrtg: will wireless backports happen before release? if so, I'll skip bug #284354 for now.17:42
rtgamitk: I'm not sure yet. I've been to busy with other stuff to get back top it.17:42
rtgI have committed the current compat-wireless, but it didn't seem to work, e.g., would not connect.17:43
jdstrandmdz: fyi-- 31 consecutive successful boot with 2.6.26-5.17-generic. moving on to 2.6.27-2.3...17:51
mdzjdstrand: don't tell me, put it in the bug :-)17:56
jdstrandmdz: I did :)17:56
mdzparan: bug 25390417:56
amitkrtg: since you are having more luck than I at reproducing the 3945, could you add the following debugging lines: https://pastebin.canonical.com/10277/18:16
rtgamitk: what's the dump_stack() gonna show you?18:20
paranmdz: thanks, so they are on ddebs? why? you really need kernel debug symbols to use things like oprofile or systemtap that are both in the normal archive :(18:21
amitkrtg: the call stack, I got a freeze (only twice yet) right before that. I was hoping to get something out before the freeze18:23
rtgamitk: gimme a bit. I rebuilt the module without rf-kill. its altered the problem enough that david's laptop appears to boot more reliably. no hnags after 4 tries.18:25
amitkrtg: sure, take your time. I am about to put this on an automatic reboot loop and call it a day.18:28
rtgamitk: later...18:28
Kanohi, did you see that 2.6.27.1 hotfix?18:59
laga_Kano: nobody in here follows upstream development18:59
laga_you should have learned that by now19:00
Kanowell but that disables the root cause of the e1000 problem19:00
Kanoi would say it is very important19:00
laga_i think an intel developer has submitted that patch to the kernel-team list19:01
Kanodisable CONFIG_DYNAMIC_FTRACE due to possible memory corruption on module unload19:01
Kanoyou do not explicit need that patch,but at least disable it19:02
Kanothe patch would not hurt however19:03
rtgalreay committed to the Intrepid repo19:04
rtg*already19:04
Kanois config changed?19:04
rtgthe Kconfig change is the patch19:04
Kanook19:04
Kanowill compile new snapshot then19:05
crimsunugh, we really need something like hal-info for HDA quirks.19:45
mjg59Being able to set the mappings from userspace would help a great deal19:49
mjg59As would being able to parse the .inf files19:49
solarionis there any hope of getting the elantech touchpad driver into intrepid?19:58
solarionIt always goes quiet when I say that.  :/20:07
amitksolarion: the ditro was frozen today and we release in 2 weeks. So nobody wants to add another driver at this late stage.20:10
amitk*distro20:10
solarionamitk: the request's been in since Hardy, and likely before20:10
solarionhttps://bugs.launchpad.net/ubuntu/+source/linux/+bug/12377520:11
solarionyes, this bug is clearly over a year and a quarter old20:13
amitksolarion: but clearly at that time it was an out of tree driver with no future. So it made it to the -mm tree later. What happened next? Do you know the story?20:14
solarionI've no idea20:14
solarionI just want to disable tap-to-click and enable scroll areas.  :)20:15
amitksolarion: I suggest you right to the author here:  http://arjan.opmeer.net/elantech/ and ask him why the driver is still outside the tree after 3 major kernel versions.20:20
amitks/right/write20:20
solarionamitk: good point20:21
solarionthere.  Sendified.  :)20:25
amitklet us know what you hear back20:29
jdstrandwell, it took 36 tries (!) to get the boot hang (bug #284406) with 2.6.27-3.4, but ony 6 with 2.6.27-2.3... gotta love race conditions :(20:29
rtgjdstrand: https://bugs.edge.launchpad.net/ubuntu/+source/linux/+bug/263059/comments/14820:36
rtgits likely they are the same hang.20:36
jdstrandrtg: excellent-- is it worthwhile to retest 2.6.26, or do you have enough to go on at this point?20:37
rtgjdstrand:  I'm gnarling into that function, and I've solicited some advice from the wireless guys (no response yet)20:37
jdstrandrtg: well, I've got this thing on a loop so I can let it churn away...20:38
rtgjdstrand: instrument the ipw2200 driver so that it prints some locators, preferably just before and just after the call to ieee80211_register_hw()20:39
jdstrandrtg: can you paste what you used for iwl3945?20:42
rtgjdstrand: just do 'printk(KERN_INFO DRV_NAME " %d\n",__LINE__);' where DRV_NAME is something like 'ipw2200'20:43
jdstrandrtg: ok, easy enough20:43
amitkrtg: i've been looking at the link-order angle of this bug.20:51
amitkbut since 3945 is a module, that doesn't apply20:51
amitkbut is the order in which userspace triggers these module loads fixed? and if so, where is it?20:52
rtgamitk: oh, defintely. David was able to get it more often by messing with udev settling20:53
amitkwhere is keybuk when we need him?20:54
rtgamitk: see /etc/init.d/udev. comment out the 'udevadm settle' clause in the restart option.20:54
solarionamitk: looks like it might be in 2.6.2820:56
solariontoo late for intrepid, unfortunately20:56
amitksolarion: first thing in jaunty though :) Did he mention why it took this long?20:56
solarionamitk: no, but I didn't really ask that; I asked what was keeping it from going in-tree20:57
amitkok20:57
* solarion digs for the linux-kernel archive posting20:58
solarionhttp://www.uwsg.iu.edu/hypermail/linux/kernel/0810.2/0404.html20:58
amitknice21:02
rtgamitk: I've chased it as far as the rtnl_lock() in ieee80211_register_hw()21:03
amitkrtg: the two times I did get a freeze, I saw all messages from ieee80211_init_rate_ctrl_alg()21:11
amitkrtg: interestingly, https://bugs.edge.launchpad.net/ubuntu/+bug/263059/comments/128 finished pci_probe on every hang21:12
amitk^ using mdz's instrumented modules21:12
rtgamitk: i just had it lock up _after_ the i3945 module init completed.21:12
amitkok. that makes more sense.21:14
amitkor that is what I am seeing the bug reports21:14
amitkrtg: comment out udevadm settle in restart or start option?21:26
rtgamitk: mdz commented out the clause in restart (on David's machine).21:40
amitkhuh21:45
amitkrtg: do you have then X60?21:53
rtgamitk: T6022:20
=== TheMuso_ is now known as TheMuso
=== laga_ is now known as laga

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!