/srv/irclogs.ubuntu.com/2008/03/28/#ubuntu-kernel.txt

=== \sh_away is now known as \sh
[07:36] <abogani> cking: Are you around?
[07:36] <cking> abogani: Hi there..
[07:38] <abogani> cking: About Bug #177895
[07:38] <ubotu> Launchpad bug 177895 in linux "Kernel 2.6.24-2 causing ~1000 wakeups by "Rescheduling Interrupts"" [Medium,In progress] https://launchpad.net/bugs/177895
[07:38] <abogani> cking: Interesting commit is 62fb185130e4d420f71a30ff59d8b16b74ef5d2b in mainline.
[07:38] <cking> Yes, I am very familiar with this one :-)
=== doko_ is now known as doko
[07:39] <cking> Yep - apparently it removes two earlier commits
[07:39] <abogani> So I leave this bug to you :-)
[07:39] <cking> ..well I had a look at it and I was not sure if these commits are actually in our kernel
[07:39] <abogani> It isn't a trivial merge.
[07:40] <abogani> cking: Do you have hw that clearly exhibits this bug?
[07:41] <abogani> cking: What is your TZ?
[07:41] <cking> abogani: I have a Centrino Duo on which I see 200-300+ Rescheduling Interrupts, but not 5000+ as some are seeing
[07:41] <cking> cking: My TZ is UTC. I start early :-)
[07:42] <abogani> cking: so Good Morning! :-)
[07:42] <abogani> cking: Mainline gives the same result on my Centrino Duo!
[07:43] <cking> abogani: Back to commit 62fb185130e4d420f71a30ff59d8b16b74ef5d2b... I believe it reverts 58e2d4ca581167c2a079f4ee02be2f0bc52e8729 and 6b2d7700266b9402e12824e11e0099ae6a4a6a79
[07:44] <cking> abogani: however, I was unsure if these commits were actually made to our current kernel tree
[07:44] <cking> abogani: have you investigated this to any depth yourself?
[07:44] <abogani> cking: I have already cherry-picked it and my laptop works perfectly.
[07:45] <cking> abogani: do you have a patch that I can pick from so that I can give it a run through?
[07:45] <abogani> cking: But I don't have hw that clearly exposes this bug, so I'm not sure that this fixes it... :-(
[07:45] <abogani> cking: Do you prefer git or email?
[07:46] <cking> abogani: yes this is the same kind of problem I am facing too. And the patch is a bit "radical" as it does touch a lot of the important parts of the scheduler
[07:46] <abogani> Agreed
[07:46] <cking> abogani: git - if that's OK, but email if that's easier for you.
[07:47] <cking> abogani: I have some tests with powertop and so forth that I can apply to see what's going on.
[07:47] <abogani> cking: OK, I'll push it to my git tree. Please give me a few minutes...
[07:48] <cking> abogani: The main thing is to see if the rescheduling interrupts are legitimate - a lot of them really are due to correct load balancing
[07:48] <cking> abogani: so it will take me 3-4 hours to shove in some diagnostics and see if the fix really is OK
[07:49] <cking> abogani: OK - much appreciated - this saves me a lot of work. :-)
=== asac_ is now known as asac
[09:16] <abogani> cking: Sorry, I don't know why but my git doesn't work anymore! :-(
[09:17] <cking> abogani: Ah. Problem :-(
[09:18] <abogani> cking: Mhhhh .gitignore conflict
[09:27] <abogani> No way to understand why git stopped working...
[09:34] <cking> abogani: perhaps I should try the patch on my git tree and toy with the fix that way
[09:34] <abogani> cking: Are you a registered user on Freenode?
[09:35] <cking> abogani: I thought so - why?
[09:35] <abogani> cking: I'm trying to send you a file
[09:36] <cking> abogani: perhaps try Emailing it to colin.king "at" ubuntu.com
[09:37] <cking> abogani: meanwhile I'll check my Freenode and IRC settings
[09:38] <abogani> cking: Mailed. I sent you my adaptation of the cherry-pick fix
[09:38] <cking> abogani: many thanks!
[09:39] <abogani> Sorry but my git is completely broken ... :-(
[09:39] <cking> abogani: ..I know the feeling, when it breaks, it *really* breaks - and usually at the worst time.
[09:40] <abogani> :-)
[09:41] <abogani> Murphy docet ;-)
[09:46] <amitk> abogani: what seems to be the problem with git?
=== setmora is now known as eradicus
[09:49] <abogani> amitk: Hi Amit! I cherry-picked a commit from a remote tracked repo, used git-mergetool and git-status, and all was OK. Then I executed git-commit and everything disappeared. git-log and git-status show nothing! The work seems completely lost!
[09:49] <abogani> Seems to me an index problem...
[09:49] <abogani> s/index/git index/
[09:52] <amitk> abogani: interesting. And you weren't working on a branch?
[09:54] <abogani> amitk: No. Just created with 'git-checkout -b'
[09:55] <amitk> abogani: that is a branch
[10:09] <cking> abogani: Hi again..
[10:09] <abogani> cking: Hi
[10:10] <cking> can you gzip the patch and re-send it to me as a gzip attachment? My mail client is silently putting in white spaces that cause git-apply to fail :-(
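The request above works because a gzip attachment is carried as opaque binary data, so a mail client has no chance to rewrap lines or alter whitespace inside the patch. A minimal sketch of that round trip, assuming Python's standard gzip module; the patch fragment and the function names are hypothetical and only illustrate that tabs and trailing spaces survive byte for byte:

```python
import gzip


def pack_patch(patch_bytes: bytes) -> bytes:
    """Compress a patch so it can travel as a binary attachment."""
    return gzip.compress(patch_bytes)


def unpack_patch(blob: bytes) -> bytes:
    """Recover the exact original bytes of the patch."""
    return gzip.decompress(blob)


# Hypothetical patch fragment with a leading tab and a trailing space,
# the kind of whitespace a mail client might silently rewrite.
SAMPLE_PATCH = (
    b"--- a/kernel/sched.c\n"
    b"+++ b/kernel/sched.c\n"
    b"@@ -1,1 +1,1 @@\n"
    b"-\told line \n"
    b"+\tnew line\n"
)

restored = unpack_patch(pack_patch(SAMPLE_PATCH))
```

After unpacking, `restored` is identical to `SAMPLE_PATCH`, so a tool such as git-apply would see the patch exactly as it was written.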
=== rikai__ is now known as rikai
[10:11] <cking> ..by the way I can reproduce the 10,000+ Rescheduling Interrupts quite predictably now - so it will be straightforward to test the patch.
[10:14] <abogani> cking: The fault is my webmail!
[10:15] <abogani> cking: Mailed.
[10:15] <cking> abogani: no worries - thanks again for the speedy response!
[10:16] <abogani> cking: How can you reproduce the bug? Are any special actions necessary?
[10:17] <cking> abogani: It's a case of getting the right suite of apps to generate loads of timer actions and get enough free CPU cycles so that both cores are incorrectly balanced...
[10:17] <cking> ...then one can see the scheduler working hard to try to rebalance the load and cause zillions of IPI events.
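The Rescheduling Interrupts figures quoted in this discussion can be sampled without powertop. A minimal sketch, assuming an x86 SMP kernel that exposes a "RES:" row in /proc/interrupts with one counter per CPU; the function names are hypothetical, and this is not the diagnostic code cking refers to:

```python
import time


def res_counts(interrupts_text):
    """Return the per-CPU counters from the RES: row of /proc/interrupts text."""
    for line in interrupts_text.splitlines():
        fields = line.split()
        if fields and fields[0] == "RES:":
            counts = []
            for token in fields[1:]:
                if not token.isdigit():
                    break  # reached the trailing description text
                counts.append(int(token))
            return counts
    return []  # no RES: row found (assumption: kernel does not report it)


def res_rate(before, after, seconds):
    """Rescheduling interrupts per second, summed over all CPUs."""
    return sum(a - b for a, b in zip(after, before)) / seconds


def sample_rate(path="/proc/interrupts", seconds=1.0):
    """Read the counters twice, `seconds` apart, and return the rate."""
    with open(path) as handle:
        before = res_counts(handle.read())
    time.sleep(seconds)
    with open(path) as handle:
        after = res_counts(handle.read())
    return res_rate(before, after, seconds)
```

Calling `sample_rate()` under different application loads would give a rough per-second figure to compare before and after applying a patch.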
[10:17] <Ng> cking: is that going to be easily fixable for hardy? :)
[10:18] <abogani> Not easily...
[10:19] <cking> ..I hope so.. it will take a few hours of analysing the IPI events under different loads before I can say. I think it's not easily fixable..
[10:19] <cking> ...I've been looking at an answer on and off for a few weeks on this one - and tinkering with the scheduler is risky for Beta
[10:20] <Ng> yeah
[10:20] <Ng> just curious because my laptop seems to do several hundred wakeups a second for that
[10:20] <cking> After a lot of consideration I believe a lot of the rescheduling interrupts are legitimate.
[10:21] <cking> A lot of apps do very frequent timer events .. they wake up and the scheduler tries to spread the load across cores
[10:23] <cking> For laptops with Centrino Duo cores it is better to spread the load across cores so that they are busy rather than have one core running at full speed
[10:23] <amitk> cking: what happens if IRQ balancing is disabled? the interrupts only go to the core on which the app is running?
[10:25] <cking> amitk: Not sure if this is just IRQ balancing. My understanding is that the scheduler is rebalancing the load across cores in probably too aggressive a manner
[10:25] <cking> ..OR...
[10:25] <cking> ..that we can now see the IPI events because the Rescheduling Interrupts are more visible
[10:27] <cking> amitk: Any wise input on this?
[10:28] <amitk> cking: not really. Though I wonder if the rebalancing behaviour will change if the scheduler is changed.
[10:29] <cking> amitk: looking at all the scheduler tweaking in 2.6.25+ I am very concerned that this type of issue has a lot of life in it..
[10:31] <cking> amitk: my big worry is that the problem may be resolvable for Hardy, but need a lot more work for later kernels.
[10:32] <cking> ..and also the scheduler is not a trivial piece of code to tinker with - if one goofs up, one goofs up big time.
[10:32] <amitk> cking: how so?
[10:32] <cking> amitk: sorry.. I misunderstand: how so what?
[10:34] <amitk> cking: why would it be harder to fix for later kernels? It should be fixed upstream, right?
[10:36] <cking> amitk: true
[10:38] <cking> I'm just concerned with a fix now in Hardy LTS which makes the scheduler diverge from upstream in a (perhaps significant and) unmaintainable way
[10:40] <amitk> cking: i see you point now. But with LTS, it is unlikely that we will upgrade to a newer kernel.
[10:40] <amitk> *your
[10:42] <cking> amitk: indeed.. but I'm always keen to reduce risk.. especially wrt key kernel components.
[10:43] <amitk> cking: agreed
[11:01] <cking> Hi BenC
[11:02] <BenC> hey
[11:02] <cking> another day, another bug :-)
[11:04] <BenC> hehe
[11:09] <BenC> cking, amitk: If either of you wants to follow up on my work on cpu1 losing cpufreq capa, I found some interesting bits after some debug last night
[11:10] <BenC> after suspend, when bringing up cpu1, there are some acpi errors about unknown op codes
[11:10] <cking> BenC: Sounds like something meaty that I could get my teeth into..
[11:10] <BenC> bringing cpu1 down/up before suspend/resume works fine (the cpufreq symlink returns in sysfs)
[11:11] <BenC> doing so after suspend resume repeats the acpi errors though
[11:11] * amitk gives up the bug to cking reluctantly - not :)
[11:12] <BenC> mjg59: Does the acpi subsystem reread the acpi tables after suspend/resume, or rely on its internal copy?
[11:12] <cking> BenC: Sounds like https://bugs.launchpad.net/ubuntu/+source/linux-source-2.6.22/+bug/183033
[11:12] <ubotu> Launchpad bug 183033 in linux-source-2.6.22 "Intel Core 2 Duo - Resume from suspend, CPU Frequency Scaling is gone on CPU1" [Undecided,Confirmed]
[11:12] <BenC> cking: That's the one
[11:12] <mjg59> BenC: They're never re-read
[11:12] <mjg59> Hm. In principle, SSDTs can be loaded and unloaded at runtime.
[11:13] <cking> mjg59: Not sure what happens in reality though. I will look into it.
[11:13] <mjg59> BenC: Curious. acpidump and iasl -d should let you figure out where that object is meant to live
[11:15] <BenC> mjg59: Then my first guess is that the internal copy is getting corrupt somehow
[11:16] <BenC> maybe acpidump before and after suspend will confirm that
[11:16] <mjg59> BenC: acpidump will dump direct from the hardware, not the kernel's representation
[11:16] <mjg59> /proc/dsdt /might/ give you the kernel version of the dsdt
[11:17] <mjg59> /proc/acpi/dsdt, that is
[11:17] <mjg59> BenC: What hardware are you seeing this on?
[11:17] <mjg59> A d630 has literally just turned up at my front door, so I can poke it there
[11:17] <cking> mjg59: This bug also occurs on my Lenovo 3000N200 Centrino Duo
[11:17] <BenC> mjg59: Lots of systems
[11:18] <cking> BenC: so at least I can dig into the bug on my hardware straight away
[11:19] <mjg59> Ok. I'm not seeing it on my HP.
[11:19] <BenC> I reproduced it on two of my dell laptops
[11:19] <Ng> fwiw, not seeing it on my thinkpad
[11:20] <BenC> hoping this isn't a BIOS bug
[11:20] <BenC> or if it is, hopefully we can work around it somehow
[11:22] <BenC> mjg59: Should acpidump and /proc/acpi/dsdt match?
[11:22] <mjg59> BenC: The dsdt hunk of acpidump should, I believe, yes
[11:22] <mjg59> Certainly after passing them through iasl -d
[11:23] <BenC> mjg59: Are the CPU related bits we are concerned about in the dsdt?
[11:23] <mjg59> BenC: Not sure. If you disassemble the dsdt and find the _PCT methods, then yes :)
[11:24] <BenC> mjg59: Thanks :)
[11:24] <BenC> cking: Ok, sounds like you've got plenty to go on...I expect a patch in upstream kernel and a full report when I return in 3 hours :)
[11:24] <mjg59> Otherwise, probably in an SSDT
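mjg59's suggestion amounts to disassembling the DSDT with iasl -d and searching the resulting text for _PCT declarations. A minimal sketch of that search step, assuming the disassembly is already available as text and that _PCT appears as either a Method or a Name declaration; the sample DSL fragment and the function name are hypothetical:

```python
import re

# Matches declarations such as "Method (_PCT, 0, NotSerialized)"
# or "Name (_PCT, Package (0x02)".
PCT_DECL = re.compile(r"\b(Method|Name)\s*\(\s*_PCT\b")


def find_pct(dsl_text):
    """Return the 1-based line numbers that declare a _PCT object."""
    return [
        number
        for number, line in enumerate(dsl_text.splitlines(), start=1)
        if PCT_DECL.search(line)
    ]


# Hypothetical fragment of disassembled ASL, for illustration only.
SAMPLE_DSL = """Scope (_PR.CPU0)
{
    Method (_PCT, 0, NotSerialized)
    {
    }
}
"""

pct_lines = find_pct(SAMPLE_DSL)
```

An empty result for the DSDT would point towards the SSDTs as the place to look next, as the last line above suggests.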
[11:24] <cking> BenC: Of course ;-)
[11:25] <BenC> See you guys in a bit
[11:45] <adinc> if i download the packaged ubuntu kernel sources, where does it install them?
[12:09] <abogani> adinc: apt-get source linux-image-2.6.24-12-generic will install source in current dir
[12:19] <adinc> abogani: no it did install to AAAAAAAAAAAAAAAAAAAAAAAA/AAAAAABBBBBBBBBBB
[12:19] <adinc> pardon to /usr/src
[13:53] <adinc> can i disable CGROUPS on a running kernel, or do i have to disable it in the kernel configuration before compiling?
[13:55] <rtg> adinc: it's a compilation config macro, so I don't think you can change the way the scheduler works at runtime.
[13:55] <adinc> is this essential in ubuntu kernel? or can i safely disable it
[13:58] <rtg> adinc: you can set it however you like. the current value is targeted to desktop single user environments.
[13:58] <adinc> where can i get the complete config file for the hardy packaged kernel?
[13:59] <adinc> the config file which is in /boot is unfortunately not complete
[13:59] <rtg> adinc: install a headers package, then look for a .config in /usr/src/linux-headers*
[14:00] <adinc> which one is complete linux-headers-2.6.24-12 or linux-headers-2.6.24-12-generic
[14:00] <adinc> generic
[14:00] <adinc> ...
[14:01] <rtg> adinc: alternatively you could 'debian/rules prepare-generic', then look in debian/build/build-generic for a .config for that specific flavour.
[14:01] <adinc> rtg: no this can't be the complete .config file since in the .config in linux-headers the IWL3945 module is missing, i mean it is not set
[14:02] <mjg59> adinc: iwl3945 comes from linux-ubuntu-modules
[14:02] <mjg59> Not linux-image
[14:02] <rtg> adinc: that's because iwlwifi is built in LUM, not the kernel.
[14:02] <adinc> so how do i get a complete config file
[14:02] <mjg59> adinc: That is a complete config file
[14:03] <adinc> i don't understand
[14:03] <rtg> adinc: it's the complete kernel config.
[14:03] <mjg59> adinc: The Ubuntu kernel does not include iwl3945. It's in a separate package.
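The "is not set" state adinc describes is how a kernel .config records a disabled option, which is different from the option being absent from the file. A minimal sketch of checking one option, assuming the usual .config line forms ("CONFIG_X=y", "CONFIG_X=m", "# CONFIG_X is not set"); the function name and the sample contents are hypothetical:

```python
def config_state(config_text, option):
    """Return the value of `option`, "not set", or None if it never appears."""
    for line in config_text.splitlines():
        line = line.strip()
        if line == f"# {option} is not set":
            return "not set"
        if line.startswith(option + "="):
            return line.split("=", 1)[1]
    return None


# Hypothetical .config excerpt, for illustration only.
SAMPLE_CONFIG = """CONFIG_SMP=y
# CONFIG_IWL3945 is not set
CONFIG_EXT3_FS=m
"""

iwl_state = config_state(SAMPLE_CONFIG, "CONFIG_IWL3945")
```

Here `iwl_state` is "not set", which matches the explanation above: the kernel config is complete, and the driver is simply built from a separate package.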
[14:03] <adinc> but the kernel needs to be prepared in order to accept this module, doesn't it
[14:03] <mjg59> adinc: No
[14:04] <rtg> adinc: have you read https://wiki.ubuntu.com/KernelMaintenance?
[14:04] <adinc> i've read several wiki pages, but let me see if i know this as well
[14:04] <adinc> i'm thankful for any information
[14:07] <adinc> this would mean that i could compile this particular module only with the kernel headers, but without the source. only this particular module, is this right?
[14:08] <adinc> rtg: thank you for the link, here it tells that the config files are concatenated into one file
=== \sh is now known as \sh_away
[16:50] <tseliot> BenC: did you review my patch?
[17:32] <lamont> BenC: which version of things was that vmmon, I wonder... no networking here.. FTL
[18:26] <infinity> BenC: Is the patch from #201591 queued for the next kernel upload, by any chance?
[18:26] <infinity> BenC: I'd love to be able to see myself type in consoles. :)
[18:36] <rtg> infinity: http://kernel.ubuntu.com/git?p=ubuntu/ubuntu-hardy.git;a=commit;h=4cbe826672e05ace08a4848e53135d652772feae will appear with the next upload.
[18:38] <infinity> rtg: Danke.
[18:38] <infinity> rtg: (For the record, despite the bug title, it's a regression for all *fb drivers, not just atyfb... I have the same bug on vesafb)
[18:41] <rtg> infinity: you are correct, the commit title is disingenuous, but the code is correct.
[18:42] <infinity> rtg: Yeah, I know the code is correct, the heads up was just for the Debian changelog to be accurate. :)
[18:43] <rtg> infinity: I'll try to remember when I do the upload.
[18:46] <infinity> rtg: Is there a timeframe on that?  There's nothing scarier for me than running a devel release without consoles. :)
[18:49] <rtg> infinity: by next Friday for sure, perhaps sooner.
[19:00] <cradek> rtg: thanks, that patch is the one that fixed my system too
[19:04] <BenC> rtg: I'm doing an upload this evening
[19:04] <BenC> or, soon after on the weekend
[19:15] <tseliot> BenC: did you review my patch (or read my email)?
[19:17] <rtg> BenC: good. I was gonna ask Steve if there was any reason I shouldn't do an upload today.
[21:30] <deb> Am i connected
[21:31] <deb> I have downloaded 8.04 and found it is not a viable distro for me, and many others.
[21:31] <deb> The reason is the lack of internet apps
[21:31] <alex_joni> you do know that on the CD there's only a tiny amount of installed packages
[21:32] <alex_joni> there are about 18-19000 other apps in repositories, which you can easily install
[21:32] <deb> Pardon?  There are more apps on the CD
[21:32] <deb> ?
[21:33] <deb> Oh, that. That is my point.  As presently setup, I cannot get to the repositories with 8.04
[21:33] <deb> Reason is the internet access apps are so skimpy.
[21:33] <infinity> Define "internet access apps".
[21:34] <infinity> Also, please define it in another channel (#ubuntu, perhaps?), this is A) not a support channel, and B) dedicated to kernel discussion.
[21:34] <deb> I have two locations and two different ISP's that I connect through - as setup I cannot connect from either with 8.04
[21:35] <deb> One location has a wireless network and the main computer on it has a D-Link DWA-552 wireless card, which 8.04 does not recognize (There is a GPL driver for it because Sabayon uses it)
[21:35] <deb> There is also no "ndiswrapper."
[21:35] <deb> So I can use the Windows driver
[21:36] <deb> The other location uses PPPOE -complete with user name and password - for each log on, there are no apps to allow me to do that (they are available as PCLinuxOS uses them)
[21:37] <alex_joni> then use PCLinuxOS
[21:37] <alex_joni> as infinity pointed out, this is a place to discuss kernel development, which you obviously aren't interested in
[21:37] <deb> A less than "sensitive" answer
[21:38] <infinity> We ship ndiswrapper and pppoeconf, both installed by default no less.
[21:38] <deb> No I am not interested in kernel development, but your folks on the desktop sent me here
[21:39] <deb> Where are they, I looked but did not find them.
[21:41] <infinity> deb: pppoeconf is in /usr/sbin, ndiswrapper modules are provided by the default kernel setup, though you might need ndiswrapper-utils-1.9, if you need the userspace apps.
[21:43] <soren> ndisgtk is.. Oh, he buggered off.
[21:43] <soren> *shrug*
[21:44] <infinity> It's always unfortunate when we lose a "IS THIS THE KERNEL CHANNEL, GIVE ME FREE SUPPORT NOW, UR DISTRO SUCKS, U ALL SUXORS!!" type.
[21:46] <alex_joni> infinity: you forgot "I HAVE NO CLUE WHAT I'M DOING, BUT THERE IS SOMETHING WRONG WITH WHAT YOU DID"
[21:46] <infinity> alex_joni: I'm fine with people lacking clue but, yes, I agree that it's irksome when they blame their lack of clue on us clearly having broken their computer from afar.
[21:47] <alex_joni> and sometimes even meaning to do that (breaking their computers)
[21:54] <zul> if everyone based their stuff off of pclinuxos then the world would be a happier place
[22:02] <alex_joni> how do you guys debug machine crashes.. if there's nothing in syslog?
[23:32] <desrt> hai!  i can has kernel hacker?
[23:41] * desrt requires patch application
