/srv/irclogs.ubuntu.com/2012/02/27/#ubuntu-kernel.txt

jMCgI'd argue one of the basic operations of an OS is to boot. If it can't do that, then something's going wrong.01:01
jMCgWonderful, after installing the kernel from oneiric-proposed, it still doesn't boot.01:33
ppisatimoin07:55
=== smb` is now known as smb
smbmorning08:20
* apw waves to smb08:24
* smb waves back08:25
ppisati*reboot after dist-upgrade08:40
jMCghttps://bugs.launchpad.net/ubuntu-release-notes/+bug/818177/ (#128)09:05
ubot2`Launchpad bug 818177 in udev "boot failures as /dev is not transferred to /root (because 'udevadm exit' times out waiting for a deadlocked worker)" [High,Fix released]09:06
jMCgSo, after six hours it's still dead-locked, yes.09:07
smbA lockup that long does not sound like the initial problem09:48
apwhenrix, morning10:18
apwjMCg, what are you seeing exactly, and are you using lvm10:19
henrixapw: hey, morning10:25
ckingsmb, http://awaseconfigurations.wordpress.com/2011/11/21/automated-ubuntu-release-upgrade/ 10:29
smbcking, Ah thank, guess I have to fold that into doing it through do-release-update. At least that has a frontend option as well iirc. It just did not explain much. And the dpkg env variable theoreticall should pass it as well...10:36
* smb will reset the vm and try10:36
* cking wishes smb luck10:37
* smb needs to re-adjust coffee levels first, though10:37
jMCgapw: the kvm's installed on an LVM partition. It doesn't use lvm by itself (I even uninstalled the lvm2 and mdadm packages from the VM images)10:46
jMCgWhat I pasted is what I see in the end, of course I can paste the entire boot process.10:46
smbjMCg, And the problem is the kvm guest that does not boot? 10:47
jMCgsmb: it doesn't finish booting. It starts, which is great progress!10:48
smbAt least that could make it less likely related to the initial problem. Is that guest running with the graphical splash screen or a server install coming up in text mode? 10:53
smbNot sure whether one can send kvm magic sysrq-keys... That might be one way to find out what is blocking...10:54
jMCgsmb: I'm on the serial console thingy, I don't think it's got any splash stuff installed. (It's based on my ISP's minimal image, I don't think that has any splash stuff)10:55
smbjMCg, Hm, ok so a blind stab would be to try press 's' when it hangs. 10:56
smbThere had been cases in the past when mountall was waiting for something but did not print a message when there was no plymouth10:56
jMCgOh... plymouth is installed, wonder why10:56
jMCgmountall depends on plymouth. that explains it, I guess.10:57
jMCgii  plymouth-theme-ubuntu-text       0.8.2-2ubuntu28                         graphical boot animation and logger - ubuntu-logo theme10:58
jMCgOkay, waiting for the 61 timeout to finish.10:59
jMCgsmb: pressing s does nothing.11:02
jMCgJust as a reminder: I'm booting the kvm, I'm connected to it via serial console11:03
smbjMCg, Ok, but I guess you have something like console= in the kernel arguments, otherwise you would not see anything in that serial console...11:05
jMCgsmb: yeah.11:06
jMCghttp://sprunge.us/VFch11:06
jMCgnomodeset, cute.11:06
jMCgWell, it was late and I was desperate.11:07
apwjMCg, ok the fixes that went in on the bug you mention fixed the issue triggering that bug, therefore you must have a different if similar issue, so it'd be good to have your own bug, as noone is going to look at that closed one11:07
jMCgI didn't realize it was closed.11:08
apwyeah i had a system i could actually reproduce that specific issue and we found root cause and fixed that one11:08
apwi believe at least11:09
jMCgapw: okay. So, what do I call this bug, and how do I provide more info?11:09
* apw is thinking11:10
apwi would title it something like 'boot hanging at "<whatever the last line of console output>" to start with11:10
apwand i would like to see the whole log at least on that bug11:10
smbQuestion would be to find out where it is hanging/waiting. So like apw said and having the dmesg from the serial console11:11
jMCgvirsh start ; virsh console > log should be easy to give you that.11:11
smbprobably add a debug to the kernel arguments11:11
smbI mean "debug" as an option11:12
jMCgs/nomodeset/debug/11:12
smband what was that other one for startup11:12
smbkeep the nomodeset11:12
apwyep11:12
apwand add --debug as well11:13
smbat least prevents the kernel from trying fancy graphics where you don't really need11:13
apwwhich gets to upstart in case we are getting that far11:13
jMCgI don't have a graphics card configured :)11:13
smbAh, ok. Probably won't make a difference but neither does hurt then11:14
apwjMCg, yeah i _think_ we are making it into the main root11:14
smbapw, initcall_debug?11:14
apwwill make his buffer be very big, so initially i think i'd leave that one off11:14
jMCgWell, there's some ugly formatting.11:15
apwso i think its getting to real root, as we have run and completed 'init-bottom' which kills udev off11:16
apwand then we have udevd starting some time later, over 1s, which i believe should be only occuring in real root11:16
apwso you definatly have a different error, as the error on that bug, we get stuck in init-bottom11:16
jMCgHeisenbug!11:17
jMCgSweet. When I add debug --debug, it doesn't get post-start.11:17
apw'post-start' ?11:17
jMCghttp://sprunge.us/igSh11:17
apwbah i hate serial ports which lose charactes11:18
apw[    3.8 console-setup main process (347)   3.893235] init: console-state changed from spawnost-start11:18
jMCgThe best part is that this is an emulated serial port ;)11:18
apwok so that mess has enough in it to tell us we got as far as upstart11:18
jMCgI'm gonna send it a shutdown command and see what that does.11:21
jMCghttp://sprunge.us/SIQC not much.11:22
apwso it responded, and shutdown as far as i can see11:23
apwit ran rc scripting at least to try and kill stuff11:23
jMCgSo from what I gather, we're past mounting / in the right place.11:26
apwlong past that yes11:26
apwas soon as all the init: stuff comes out in the --debug mode, we are running that in the real root11:27
apw[    1.7 linear personality registered for level [    1.74271path personality registered for leve[    1.827 (vda): mounted filesystem with ordea mode. Opts: (null)11:27
apwLoading onfiguration from /etc/init.conf11:27
apwthat is the disk coming online and us flipping over11:27
jMCgSo, what's the next point to get stuck?11:28
apwjMCg, heh there are no sensible points to get stuck11:29
jMCgI'd argue that getting stuck at all in a boot isn't very sensible, but then I've passed the point of sensibility a long time ago.11:29
smbjMCg, Just to be sure this is not something there. If you can extract the /etc/fstab from the lv you use as root11:30
apwjMCg, well what i am saying is there are no know points to get stuck11:30
jMCgThe job of an OS is to manage resources. If I spend 4 days trying to get it to boot, that's a terrible waste of my resources :-/11:30
apwjMCg, yep and we're trying to help11:31
apwyou asked where is the place i expect a hang, i told nowhere11:31
jMCgapw: thanks11:31
jMCgSo, /etc/fstab from the kvm?11:31
apwjMCg, ok so advice is to boot with --verbose instead of --debug and add --no-log in case11:31
apwthat might get us less output so we can read it11:31
jMCghttp://sprunge.us/fNUY11:32
jMCgNew command line for the kernel:11:33
jMCg    <cmdline>root=/dev/vda ro serial=tty0 console=ttyS0,38400 nomodeset debug --verbose --no-log</cmdline>11:33
jMCgsweet! That looks readable!11:34
apwyeah that looks fine11:34
apwthats something at least11:34
jMCg http://sprunge.us/BiSi11:34
smbHm11:37
smbjMCg, Have you sshd running11:37
smbOr other thing, is there a /etc/init/ttyS0.conf to actually bring up a login session on the serial console?11:37
jMCgsmb: last time I checjked, there was one. But I'll check again.11:40
* ppisati -> out for lunch11:41
smbjMCg, No, I guess then it is still there11:41
smbJust going through my memory of things that had been failing and looking similar11:42
jMCggah.11:42
smbThe virtual fs stage seems to be waiting11:42
apwjMCg, ok ... can you try creating /etc/init/procps.override in the image containing the word 'manual'11:43
apwand try booting it again11:43
jMCgapw: first, I'll re-dd ttyS0.conf11:43
jMCgWhich magickally got lost since last week.11:43
jMCgSo, right now, I feel like punching myself in the face.12:01
jMCgapw: thank you.12:01
apwso was that a missing ttyS0 config ?12:02
apwjMCg, ^12:02
jMCgSo, between the real issue, which hung up the domains, and my ghost issue, I (accidentally) axed ttyS0.conf and have been wondering why it doesn't boot ever since.12:02
apwsmb, you win a cookie12:02
* smb munches12:02
jMCgThe real issue was fixed with the udev/and or kernel upgrade (in the Host), this here should be fixed by head-desking ;)12:03
jMCgfreaky. This thing actually works. O_o12:03
jMCgI appear to have done a pretty decent job creating my vm image. Except for that ttyS0.conf thing :D12:04
apwheh we hope most of the time12:04
smbUnfortunately no login tty and hanging rather look similar from a console. ;)12:04
jMCgRFE: put a ttyS0.conf in by default12:05
apwthats kinda image specific, as i don't have a S0 most of the time12:05
jMCgNeither do I, except with KVMs and servers.12:06
apwsadly even in virtual ones it differes depending if its kvm or xen12:06
jMCg:-/12:06
smbYeah, at least differently named12:06
jMCgThere needs to be a unified way of managing servers (real or virtual ones) if there's no ethernet connection, or the OS is otherwise not up and running.12:07
jMCgOh, and it should be cheap and secure.12:07
apwa nice utopia indeed12:08
jMCgYeah.12:09
smbAt least that way makes people earn their money by knowing where to hit with the hammer... ;)12:09
* henrix will be back in ~20m12:35
hertonppisati, do you know someone who can verify bug 927526 against latest ti-omap4 proposed?13:01
ubot2`Launchpad bug 927526 in linux-ti-omap4 "missing support for some LIRC devices" [Undecided,Fix committed] https://launchpad.net/bugs/92752613:01
ppisatiherton: community only i guess13:37
ppisatiherton: personally i don't have any IR receiver/transmitter (maybe my tv remote?!?!?! :) )13:37
hertonppisati, ok, will try pinging reporter on that bug13:38
=== bladernr_afk is now known as bladernr_
ppisatithe sticking edge on the other screen is just annoying14:17
=== yofel_ is now known as yofel
apwppisati, on the other screen ?14:46
* ogasawara back in 2014:48
ppisatiapw: yes, in a dual screen setup, on the left i've email + terminal (stacked) and on the right i've chrome14:54
ppisatiapw: and when i move to the right one, i find it to be a *royal* pain to go back to the left one14:55
ppisatiapw: since i've to rush the mouse pointer else it gets stuck over there14:55
ppisatiit's just innatural14:55
apwppisati, yep i hate it too, the gluey ness is configurable, in ccsm14:57
smbppisati, Maybe you need to think of it as like you have to break through the monitors edges to get to the other screen... :)14:57
ppisatiapw: i just tried U3d, i'll probably go back to 2d: less resources, and it doesn't have this feature  :)14:57
ppisatismb: or probably i should stop going over there altogether14:58
apwheh, well its good someone tests 2d, as it is supported14:58
manjoherton, https://bugs.launchpad.net/linux/+bug/925552 verification done 15:23
ubot2`Launchpad bug 925552 in linux "[12.04] Broadcom Bluetooth device (Vendor=0a5c ProdID=21f3) not supported" [High,In progress]15:23
hertonmanjo, yep, everything is ok now. I released oneiric for testing this morning15:24
manjooh cool thanks a ton 15:24
* cking grabs a coffee15:56
* henrix follows cking15:57
pgranercking, ping16:01
ckingpgraner, pong16:01
pgranercking, hey on your sandybridge laptop are you seeing high fan useage recently? 16:02
ckingpgraner, nope, it's running relatively quietly16:02
pgranercking, powertop is estimating its using 9.4w to run it 16:02
pgranercking, that seems like a lot of juice16:03
ckingI am running with UEFI as the default16:03
ckingpgraner, ah, well, powertop results may be a bit iffy if you run them for too short a period. use my powerstat tool instead and see what it reports16:03
pgranercking, ok, powertop has been running for over an hour (box is on battery)16:04
ckinglet me look up my latest power usage stats16:04
ckingI was getting ~8.8W on a default clean install with the latest Precise kernel on Saturday16:05
ckingw/o RC6 it was consuming  ~17W16:06
ckingpgraner, you could try: https://wiki.ubuntu.com/Kernel/PowerManagement/IdentifyingIssues16:07
cking(best to try power-usage-report, about 1/2 down the wiki page)16:08
ckingthe X220i that I've got is an i3-2350M with BT disabled too16:10
amitkcking: some scary (as in wrecking the scheduler -scary) bed-time reading for you in case you haven't seen this yet: http://lwn.net/Articles/482344/ , https://wiki.linaro.org/WorkingGroups/PowerManagement/Doc/HMPscheduling16:11
ckingamitk, eek, more data ;-)16:12
amitkcking: lots more data :) , but if you have an input, I'd love to hear it16:13
ckingamitk, no "fix the broken apps" items on the action points then?16:13
amitkcking: -EMAKEPROGRESS16:14
ckingamitk, how about "why do so many apps do such low poll() timeouts and causes dozens of unnecessary wakeups? ;-)" 16:14
pgranercking, ok powerstat show avg power use at 10.3w 16:15
pgranercking, powertop lies, lies, lies16:15
ckingpgraner, well, powerstat applies some sneaky sliding window averages, so we are comparing apples vs pears so to speak16:16
amitkcking: that was solved by powertop, but I've been thinking about a new cgroup type to catch these sorts of apps in automated testing - multi-stage cgroup that only delivers a certain percentage of wakeups to the apps at each level. At L1 you get 100% of your wakeups, if you wake up too many times, you get demoted to L2 where you only get 75% of your wakeups, so on and so forth...16:17
* amitk waves to pgraner 16:18
ckingamitk I was thinking more like the OOM style killer for badly behaving apps that do poll() with a zero timeout ;-)16:18
* pgraner waves back at amitk16:19
ckingamitk, sometimes wakeups are legitimate, eg. perhaps audio/video requires it on AV sync. however, sometimes it is just badly written apps16:20
amitkcking: which is why OOM would be a big hammer. I'd like to run then entire application stack in this new cgroup type and see how many end up in L2, L3, go correlate with what the ongoing activity was, and if necessary go fix them.16:22
amitks/run then/run the/16:22
ckingamitk, yep, I was being joking about OOM ;-)16:22
amitkcking: heh :)16:22
Amozhi cking :) about the RC6 testing, intel_reg_dumper doesn't return any rc6 register, is this normal?16:24
Amozlooks like it's the same for all of us..16:24
ckingAmoz, yep, it is consistently "none"16:25
Amozand that's why you check it, I suppose...16:25
Amozcking, have you checked the results recently? if so, do you think rc6 will be enabled for precise? 16:26
ckingAmoz, it is our intention to16:27
ckingogasawara, I see you added to the RC6 power testing the request for intel_reg_dumper data - everyone is getting "None" for that 16:28
Amozcking, could other i915 kernelparams interfere with the results?16:30
AmozI ran my test with fbc=1, lvds_downclock=116:30
ckingAmoz, potentially, so best to test w/o the other i915 options so we can compare like for like in this table. 16:30
Amozcking, I'll rerun my test then16:31
Amozwithout other params16:31
ckingbut it is also valid to test with fbc=1 etc as these may save a little bit more power, but we are not going to use these by default for precise because of the associated risk that they may cause other bugs 16:31
* tgardner reboots to test AA patches16:35
ckingAmoz, thanks for testing!16:37
ogasawaracking: hrm, lemme double check.  I'd gotten that from eugeni.16:38
Amozcking, no thank you for making my computer perform better!16:38
* cking likes to point out this one wasn't his work at all ;-)16:41
ckingamitk, that's a lot of info to digest, I may get back to you later on those links16:42
ogasawaracking: RC6 state output via intel_reg_dumper is only available in intel-gpu-tools v1.2, and it looks like we've only got v1.1 in Precise.  I think I'll just remove it from the wiki for now and let eugeni know.16:58
ckingogasawara, ah, that explains it, I couldn't see what I was doing wrong on Saturday when I ran this and got no output.16:58
amitkcking: sure thing, it was meant for you bed time light reading anyway ;)17:00
ckingamitk, the synthetic simulation of smart phone apps is going to be hard to do - that requires a lot of data gathering and some deep analysis to understand different use-cases correctly.17:05
amitkcking: we want to start with simple models first which I suspec will be similar to a desktop usecase model (except for the input sources) e.g. web browsing, mp3 playback, video playback, etc.17:07
ckingamitk, yep, chose some misbehaving apps to ;-)17:08
Amozcking, updated, almost the same results17:27
ckingAmoz, thanks, that was to be expected, we just wanted to see if it fixed any existing issues or threw up any new ones - thanks for testing - it is good to see that your data does not vary much at all between test - that's a good sign!17:29
Amozcking, yeah I know. especially the low std deviation is good I assume :)17:32
ckingyes, that makes me very confident ;-)17:33
Amozcking, is there anything else one can do to exercise the rc6 code?17:36
Amozand while I'm at it, how can one start contributing to the kernel team?17:38
ckingAmoz, do you normal work on the desktop, watch videos, etc and see if you can spot any issues.17:38
cking(for rc6)17:38
ckingAmoz,  there is always a load of bug triaging and stuff - perhaps talk to jsalisbury about this17:40
jsalisburyAmoz, We can *always* use help triaging :-)17:41
AmozI guess everyone need to start out with the triaging stuff before doing development :)17:41
Amozjsalisbury, I figured17:41
jsalisburyAmoz, there are some good documents available at:17:42
jsalisburyAmoz, https://wiki.ubuntu.com/Kernel/BugTriage17:42
AmozI've read some17:42
Amozstill doesn't make me a good triager17:42
Amoz:(17:43
Amozbut I suppose one need to know everything there17:43
jsalisburyAmoz, Not everything.  Basically just start of with one bug at a time17:43
jsalisburyAmoz, Testing is also another area that is of great help.  Especially if you can reproduce a bug easily.17:44
Amozjsalisbury, I guess stacktrace dumps are very helpful for crashes etc?17:46
jsalisburyAmoz, sure, when we can get them.17:46
Amozjsalisbury, so what bugs should be triaged? undecided - new, unknowns - new ?17:49
Amozjsalisbury, if something's in the wiki, just direct me there17:49
jsalisburyAmoz, we have a bug bot that checks for all required information and then changes to "Confirmed".  Those are the bugs that need to be triaged.17:51
jsalisburyAmoz, this is a good link for the various bug states:17:52
jsalisburyAmoz, https://wiki.ubuntu.com/Kernel/BugTriage/BugStates17:52
jsalisburyAmoz, the #ubuntu-bugs channel on Freenode is a good place to go to.  There is also some good info at:17:54
jsalisburyhttps://wiki.ubuntu.com/BugSquad/KnowledgeBase17:54
jsalisburyAmoz, The previous wiki link I mentioned has lots of info, but is focused on all Ubuntu packages, not just linux package.17:56
Amozjsalisbury, thanks, already aware of them17:56
jsalisburyAmoz, cool17:56
ogasawarajsalisbury: was just reading your status... bug 914161, have you had them test a more recent Precise kernel?  We disabled CONFIG_INTEL_IOMMU_DEFAULT_ON a while ago due to issues it was causing.17:57
ubot2`Launchpad bug 914161 in linux "Linux 3.2 freezes system on FUJITSU ESPRIMO P7935" [Medium,Confirmed] https://launchpad.net/bugs/91416117:57
jsalisburyogasawara, I will confirm he is running 3.2.0-17.2717:59
ogasawarajsalisbury: ah nm.  just saw their recent comment.17:59
ogasawarajsalisbury: based on their comment, you can close it Fix Released.17:59
jsalisburyogasawara, I'll still confirm what he means by "Latest" kernel17:59
ogasawarajsalisbury: he gave uname -a output in comment #68, shows 3.2.0-1717:59
jsalisburyogasawara, ahh, cool.18:00
ogasawarajsalisbury: we intend to keep iommu disabled by default18:00
jsalisburyogasawara, great, I'll mark as fix released and update the bug.18:00
Amozare there any sources for the bot scripts you're using?18:02
jsalisburyogasawara, thanks18:02
jsalisburyAmoz, The following wiki describes them:18:04
jsalisburyhttps://wiki.ubuntu.com/Kernel/kteam-tools18:04
brycehAmoz, https://code.launchpad.net/bugbot18:04
Amozjsalisbury, bryceh, thanks18:05
=== kentb-out is now known as kentb
=== jjohansen is now known as creckets
=== creckets is now known as crickets
=== crickets is now known as jjohansen
htorquejsalisbury: hi! bug 808384 - i'm not seeing it with the current mainline kernel - do you think it's worth it testing again with a vanilla kernel 3.2 with ubuntu's config?18:46
ubot2`Launchpad bug 808384 in linux "[drm:drm_mode_getfb] *ERROR* invalid framebuffer id (FujitsuSiemens Amilo Pi 2530 Hotkeys)" [Medium,Triaged] https://launchpad.net/bugs/80838418:46
=== tgardner is now known as tgardner-lunch
jsalisburyhtorque, Or possibly the latest precise kernel.  I'll post a link in the bug.18:58
jsalisburyhtorque, ahh, wait, you already tested 3.2.0-17-generic, correct?18:58
htorqueyes18:58
htorquejsalisbury:  i'm building the 3.2.7 vanilla with 3.2.0-17-generic's config now18:59
jsalisburyhtorque, ok, can you post your results to the bug.  19:00
htorquesure, will do19:00
apwhtorque, the mainline kernels are built ubuntu's configs ?19:18
htorqueapw: oops, i bookmarked http://kernel.ubuntu.com/~kernel-ppa/mainline/daily/ - never knew there are older kernels available too !:-)19:22
htorquethanks, that should save some time ;-)19:22
* cking -> EOD19:23
* apw gets something to work as intended instead of exploding ... sounds like time to quit to me19:25
jjohansenapw: nice, run don't walk don't walk, if you look back it might break19:32
htorquejsalisbury: also got the warning/error message with the 3.2.7 mainline kernel, so it sounds like this has been fixed upstream, right?19:54
jsalisburyhtorque, sounds like it.  Are you able to test the latest v3.3-rc5 kernel?  It is available from:20:02
jsalisburyhttp://kernel.ubuntu.com/~kernel-ppa/mainline/v3.3-rc5-precise/20:02
htorquejsalisbury: can test it, though it's older than the daily one i tested20:04
jsalisburyhtorque, if the bug exists there, we will want to open a bug upstream.20:05
htorqueok, on it20:06
=== tgardner-lunch is now known as tgardner
jsalisburyhtorque, great, thanks.20:06
htorquejsalisbury: doesn't seem to be in 3.3-rc5 either :-)20:11
jsalisburyhtorque, but it does happen in 3.2.7 mainline?  That would indicate a bug in 3.2 stable.20:12
jsalisburyhtorque, that was fixed in 3.320:12
htorqueright, it's there in 3.2.7, it's gone in 3.3-rc520:13
jsalisburyhtorque, thanks for testing.  I'll do some upstream searches and see if the fix in 3.3 has been queued up for 3.2 stable.20:16
htorquejsalisbury: np! verified the results on a second system and also the new 3.2.8 seems affected.20:28
jsalisburyhtorque, great, thanks. 20:29
jsalisburyhtorque, I posted some additional comments to the bug report.21:19
htorquejsalisbury: the ftrace commit is also in 3.2.821:20
htorqueso that's not it21:20
htorqueit seems fixed with 3.3-rc1 but that's a lot of commits...21:21
jsalisburyhtorque, do you see any additional messages in syslog when you reproduce the error?  21:21
htorqueno, it's just "[drm:drm_mode_getfb] *ERROR* invalid framebuffer id"21:21
jsalisburyhtorque, ok, I'll do some more investigation21:22
jsalisburyhtorque, is anything written to your /var/log/Xorg.0.log log file?21:39
htorquejsalisbury: yes! http://paste.ubuntu.com/859764/21:41
jsalisburyhtorque, thanks, that give some more to search on.21:43

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!