/srv/irclogs.ubuntu.com/2012/10/11/#ubuntu-kernel.txt

=== BenC__ is now known as BenC
=== chrisccoulson_ is now known as chrisccoulson
=== smb` is now known as smb
jibelsmb, I'm still affected by bug 1021471 with kernel 3.5.0-17.28 and lxc07:31
ubot2Launchpad bug 1021471 in linux "clone() hang when creating new network namespace (dmesg show unregister_netdevice: waiting for lo to become free. Usage count = 2)" [High,Confirmed] https://launchpad.net/bugs/102147107:31
smbjibel, And you are not using wl?07:31
jibelsmb, just by creating a new container and shutting it down with poweroff from inside the container07:32
jibelsmb, no, r816907:32
smbjibel, and the count is 2? I did try to do the same you did in my tests and those worked07:33
jibelsmb, count is 107:33
smbjibel, Then I think it sounds like a similar thing that Clint still sees07:34
jibelsmb, thne this is the log I get when I try to start a container http://paste.ubuntu.com/1272737/07:35
jibelsmb, do you want another report ?07:36
smbjibel, I would suggest you and him get in touch. Not sure he already filed the new bug report or not. And you could get the debugging kernel from my people page.07:36
smbWell the hung task is because of the net namespace not being cleaned in some way. That seems to be the same07:37
smbjibel, At least it seems to be related with certain drivers or net hw. As Clint said he would have no problems when not using wl (not sure what he was using instead from my head)07:38
jibelsmb, I talked to him yesterday evening and he had not fully tested the latest kernel and didn't filed a new bug report. 07:38
jibelsmb, he's using b4307:38
smbjibel, So then feel free to go ahead and create one and let him know.07:38
jibelsmb, ack, thanks. 07:38
smbjibel, debug kernel would be the smb2 at people:~smb/clonetst07:39
smbIt is not the latest but close enough07:39
jibelok, I'll give it a try07:40
smbjibel, I expect it to break in the same way but be more verbose on it. So a syslog from that boot is telling a bit more07:41
smbjibel, One thing would be of interest. Are you using ipv6 in some way (beyond the stuff that basically always is there)07:42
smbcd07:44
jibelsmb, I don't use IPv607:45
smbjibel, ok, thanks07:45
apwsmb, do we still carry that poweroff change for containers, or is that gone now07:56
smbapw, I would hope that it is upstream by now... but nothing knowing for sure07:56
apwand do i surmise that if you take networking down locally it goes away?07:57
smband if me remembers correctly that was just changing the signal to its init process in some way07:57
apwyeah i think it was07:57
smbSo teardown of network would be outside the container07:58
* smb scratches head07:58
smbwell at least the other namespace07:58
smbBut maybe I understand what you try getting at. If the interface would not be shut down correctly inside and still have some refs. The only thing is it seemed to work ok on my own tests08:00
apwjibel, is this is a lab system we can get access to?h08:02
apwas it is reproing the issue and we cannot08:02
apwjibel, or indeed is it a vm08:03
jibelapw, it's on my machine at home, I don't do it in the lab otherwise, I cannot shutdown the server remotely08:04
apwhmmm.  can you file a bug with the reproduction steps if there isn't one, as this needs debugging 08:05
smbjibel, When you report the bug, can you please be verbose on the steps you take to set up lxc, so I will be doing exactly the same08:05
smberr what apw said08:06
smbapw, stop saying what I am thinking08:06
apwtime for more coffee08:06
apw:)08:06
jibelapw, sure, I trying on another freshly installed machine to find a reproducible test case.08:06
smb:)08:06
=== henrix_ is now known as henrix
jibelok, bug reproduced on another machine, the steps are not exactly clear as I had to start/shutdown the container twice, and it seems to require network activity from inside the container before shutdown. I'll try again to narrow it down.08:15
smbCould be the one difference to what I did08:16
=== henrix is now known as henrix_
=== henrix_ is now known as henrix
jibelsmb, So ... I have a minimal test case with your kernel, I can file a bug :)09:06
smbjibel, Then you shall do so. ;)09:06
cjwatsonAnyone know if we ever got anywhere about escalating bug 1040557 to Samsung BIOS folks?09:11
ubot2Launchpad bug 1040557 in ubuntu-cdimage "UEFI boot live-usb bricks SAMSUNG 530U3C,np700z5c laptop" [Critical,Confirmed] https://launchpad.net/bugs/104055709:11
* smb does not09:13
apwcjwatson, not heard anything myself09:17
cjwatsonDid we even escalate it?09:17
apwcjwatson, i don't think i know that even09:18
apwogasawara, do you know if the above samsung issue was escalated?09:18
cjwatsonI sent mail about it to canonical-uefi and got not much09:20
apwcjwatson, no indeed.  and i am sure this will be getting more common too09:23
apwcking was saying he did two in in a day, just by setting efi variables repeatedly09:24
apwand a different vendor too09:24
apwsmb, there is an interesting networking fix posted sans-explanation to u-k-t09:25
smbapw, Saw it, ignored it for a bit...09:25
smbapw, Hm, the bug report explains it better09:27
apwyeah i don't think there is any doubt that the fix is sane09:29
cjwatsonapw: Sure, I just don't want people to be able to say that laptops are getting bricked because we ignored the reports09:29
apwcjwatson, i wonder if we can blacklist them09:31
smbapw, Yeah, so far it did not seem to be in things Dave Miller has sent to stable. And there is usually little to predict what he will send. Plus the meaning of this is upstream is relative...09:34
jibelsmb, bug 106543409:34
ubot2Launchpad bug 1065434 in linux ""unregister_netdevice: waiting for lo to become free. Usage count = 1" after LXC container shutdown" [Undecided,Confirmed] https://launchpad.net/bugs/106543409:34
smbjibel, Ok, thanks. I will assign it to me09:35
smbjibel, So simple, just transferring ~300MB before interrupting it... Now do I really want to look at *that* syslog...?09:38
=== doko_ is now known as doko
jibelsmb, ok, maybe less, want me to try with ping instead ?09:39
smbjibel, If it is possible to see with causing less network traffic it would help to make out some trees in the wall of forest that the additional debugging produces.09:41
jibelsmb, ack, I'll chop some trees09:42
smbjibel, Muchly appreciated, thanks09:43
jibelsmb, 50MB is enough :)09:49
jibelsmb, back in ~20min, I'll attach a smaller syslog with the debug kernel09:49
smbjibel, Argh, that would be even more than last time09:49
smbOh you mean 50M transfer... 09:50
jibelyeah, I transfered 50MB then stopped09:50
jibelI didn't reproduce with 10 and 20MB09:50
smb*sigh* Ok, but *sigh*09:51
jibelbut I don't know if duration is involved or amount of data09:51
smbI guess it has to be some form of net usage. Sure, could be the time a connection is up or how often things are transferred. We will maybe see...09:53
=== amitk is now known as amitk_
=== amitk_ is now known as amitk__
jibelsmb, it's the amount of data10:26
jibelI did the following tests: 1) download 1MB at 5kbps and 2) 50MB at 100Mbps10:27
jibelthe second test failed not the first10:27
apwthose wo don't take the saem time do they ?10:28
jibelno they don't, the second was much faster10:29
apwok10:29
* henrix -> lunch11:29
MCR1Why is Kernel 3.5.6-quantal only available for i386 ?11:35
ogasawaracjwatson, apw: re: 1040557 and escalating to Samsung BIOS folks...I know cking asked vanhoof to put a request out to some of his guys for contacts.  I unfortunately don't know what the outcome was there.12:44
apwogasawara, ok we should follow up with the hoof then today and check12:44
ogasawarartg: care to take a quick peek at git://kernel.ubuntu.com/ubuntu/ubuntu-quantal-lbm.git13:03
ogasawarartg: I realized last night we never threw lbm together for quantal13:03
rtgogasawara, ack, looking13:05
rtgogasawara, I'm wondering if we need an LBM for Quantal. We've kind of fallen off doing the backport packages since we've started doing whole kernels.13:06
ogasawarartg: it's a good question.  I haven't heard anyone screaming for a 3.6 compat-wireless stack, which is all that's really provided in the quantal lbm that I rolled.13:07
ogasawarartg: I was curious what hurdles we'd face trying to upload post release in the event we ever did want lbm for quantal.13:08
rtgogasawara, I'd be inclined to just add cw-3.6 to precise and call it good.13:08
rtgogasawara, I don't think we did cw for maverick/natty/oneiric did we ? mostly just lucid.13:10
rtgor if we did, perhaps we should change that trend.13:10
ogasawarartg: I think we did have lbm for those older releases, /me double checks13:11
rtgogasawara, even if we did (which I suspect you're right about), I think its time to change. 13:12
ogasawarartg: works for me13:12
rtgat least for the short term releases13:12
ogasawarartg: I'm just gonna remove that ubuntu-quantal-lbm repo on zinc then, just to avoid any confusion13:13
rtgogasawara, I was just gonna say that :)13:13
rtgogasawara, you could just about cherry-pick your work in the Precise LBM13:14
rtginto*13:14
ogasawarartg: yep, will do13:15
cjwatsonThese days I suspect doing it post-release wouldn't be a particular headache if you did need to13:18
ogasawaracjwatson: thanks, good to know13:23
=== amitk is now known as amitk-afk
* ogasawara back in 2014:56
=== kengyu_ is now known as lexical
ppisatibrb15:17
rtgapw, I'm looking at jk's last comment re: "PATCH 5/5] efivarfs: efivarfs_fill_super() ensure we clean up correctly on error". Would it be sufficient to call efivarfs_kill_sb() just before 'fail:' ?15:38
alexblighWhat would be the recommended way to install the current quantal kernel on precice? (wget the .deb and install produced an unbootable system - not looked into why yet)15:50
rtgalexbligh, thats not a packaging problem. doing just what you did does work.15:52
=== amitk-afk is now known as amitk
apwrtg, i don't thnk it is necessary because we are already ripping down the sb on the way out, will reply16:00
rtgapw, I think efivarfs_kill_sb() gets called indirectly anyways if mount_single() gets an error return from fill_super(), so I think its OK, i.e., no resources are orphaned.16:07
apwrtg, yeah essentially efivars_kill_sb() should be called as part of the tear down ...16:07
apwyeah we concur16:07
rtgapw, so, I've got these applied to Quantal branch. shall I post 'em or are you already ahead of me ?16:08
apwrtg there was a small bit of porting work in the original patches16:09
apwto do with nameidata going away after quantal16:09
rtgI've haven't built yet, but they were all clean cherry-picks16:10
apwright and actuall its not clean when you compile the is a warning16:10
apwrtg, i'll push these as i have test this combination16:12
rtgapw, OK16:12
apwrtg ok pushed16:17
rtgapw, I'll let ogasawara know that I reviewed them in case she's feeling cranky about stuff appearing in her tree post Beta2 :)16:19
ogasawara:)16:19
apwrtg, heh perhaps i should get you to put acks on them :)16:22
rtgapw, will do.16:22
apwogasawara, i think we are expecting to put that in the first sru currently16:29
ogasawaraapw: ack16:29
rtgapw, repushed. I've done my officious rubber stamping for the day.16:30
apwrtg, did you add jk's extra patch?17:01
rtgapw, which one was that ?17:02
rtgI mostly looked at your cleanup patches17:02
apwefivarfs: Implement exclusive access for {get,set}_variab...17:02
rtgapw, that wasn't one of the patches in your push17:03
apwno i just noticed it on our kernel-team@ list17:04
apwwill review and test17:04
rtgapw, I must have deleted it already17:04
rtgapw, are you sure it was on the kteam list ? I don't see anything from jk for Sept or Oct17:06
apwoh no its not, i missread, i have it cause i am on CC:17:11
rtgapw, LKML then ?17:11
apwyeah, thats what confused me, it looked enought like kernel-team to confuse me17:12
alexblighrtg, belated thanks17:12
apwrtg, i'll pull it and test it17:19
rtgapw, I noticed a pull request for the signed modules patch set17:22
apwrtg cool17:25
* rtg -> lunch17:34
slangasekmjg59: hi, any chance you've gotten a look at that shim patch of mine?17:40
mjg59slangasek: Sorry, not yet17:44
mjg59Give me 20 minutes or so?17:44
slangasekmjg59: that works fine, thanks17:44
apwrtg, well i can still read variables with that locking patch applied18:00
=== yofel_ is now known as yofel
* henrix -> EOD18:04
=== henrix is now known as henrix_
penalvchHello everyone. I am trying to bisect bug 980279 and have gotten stuck. My progress is documented at http://pastebin.com/Heqd1JVN . What would be the next step?18:41
ubot2Launchpad bug 980279 in linux "BUG: soft lockup - CPU#5 stuck for 22s! [xfce4-sensors-p:1873]; EIP is at generic_exec_single+0x66/0x80" [Medium,Triaged] https://launchpad.net/bugs/98027918:41
rtgpenalvch, instead of bisecting between non-linear tags, you might be better off trying to figure out which stable update caused your regression.18:50
rtgthe various stable releases are pre-built at http://kernel.ubuntu.com/~kernel-ppa/mainline/18:51
penalvchrtg thank you for responding. This is my first bisect. So your suggestions, while seemingly helpful, have gotten me lost already. :( What I have found was that the last good kernel was 3.2.0-14-generic, and the first bad kernel was 3.2.0-15-generic.18:56
penalvchrtg, are you suggesting to map the the regressions to the mainline kernel releases via http://kernel.ubuntu.com/~kernel-ppa/info/kernel-version-map.html and git bisect the mainline kernel?18:58
rtgpenalvch, no, I'm suggesting that you first narrow down which stable update (if any) caused your problem. if you examine the changelog you'll notice that the kernel was rebased 3 times against stable releases 3.2.3 through 3.2.519:00
penalvchrtg, when I run git log --oneline Ubuntu-3.2.0-14.23..Ubuntu-3.2.0-15.24 I see rebase once between the two: 4d41bd7 UBUNTU: [Config] Rebase to v3.2.5 . What are the other two rebases?19:06
rtgpenalvch, I was looking in debian.master/changelog which indicates the rebases that occurred between 3.2.0-14.23 and 3.2.0-15.2419:07
rtglooks like I got one too mant19:08
rtgmany*19:08
penalvchrtg, ok glad you found that changelog. This was my original problem. I could not find the debian.master/changelog in the ubuntu-precise folder generated by executing git clone git://kernel.ubuntu.com/ubuntu/ubuntu-precise.git . I looked for it via find and locate, no luck. What is the location of this changelog?19:09
rtgoh, maybe not. looks like we skipped one of the stable update cycles and went from v3.2.3 to v3.2.519:09
rtgpenalvch, you can't find it? debian.master/changelog is the exact path.19:10
penalvchrtg, unfortunately I do not find this path.19:11
rtgpenalvch, I don't what to tell you. If you've correctly cloned this repository, then it can't _not_ be there.19:12
penalvchrtg, ok. Looks like I have a corrupt?! git repo clone as per http://kernel.ubuntu.com/git?p=ubuntu/ubuntu-precise.git;a=tree;h=44aedcec26f76c9a8d4ef7db73cf83047ad988b9;hb=907e57fa9bdc74415eaa69d8b1229ca20d3876b2 . I'll try recloning it. Thanks for your help!19:16
mjg59slangasek: I'd kind of prefer the shifted down code to be in the if statement rather than adding another goto19:21
mjg59slangasek: But other than that, looks fine19:21
slangasekmjg59: ok, thanks for the review19:27
slangasekmjg59: would you like me to clean that up and resubmit, or do you want to amend it on your side?19:28
mjg59slangasek: I'm hacking other bits at the moment, so I'll do it19:29
mjg59But feel free to ship that version19:29
slangasekyeppers19:29
* ogasawara lunch19:47
hallynsmb: do you have any thoughts on bug 1065589 ?  (whether the kernel should send a uevent when a netdev is moved to a new netns)19:51
ubot2Launchpad bug 1065589 in lxc ""initctl list" shows 11974 instances of network-interface-security after two days of uptime" [Medium,Triaged] https://launchpad.net/bugs/106558919:51
hallynprobably i need to go ask Eric Biederman what he thinks19:52
develtechhi20:10
develtechi have question regarding Crypto API in linux kernel20:10
penalvchrtg, quick follow up. What I found is that my git repo was not corrupted. Instead, when I execute the following, debian and debian.master are wiped out: git bisect start Ubuntu-3.2.0-15.24 Ubuntu-3.2.0-14.2320:22
penalvchrtg, I would cd ubuntu-precise immediately prior to git bisect start...20:23
rtgpenalvch, thats because there is no linearity between those 2 tags. the first bisect commit is _before_ Ubuntu-3.2.0-14.23 was introduced. In fact, its before any Debian packaging was applied.20:25
penalvchrtg, ok. How would one workaround that issue in this case?20:26
rtgpenalvch, you can restore by using 'git fetch origin;git fetch origin master;git reset --hard FETCH_HEAD'20:26
penalvchrtg, I executed verbatim: 'git fetch origin;git fetch origin master;git reset --hard FETCH_HEAD' and now the missing files are back. What would be the next step?20:28
rtgpenalvch, then, like I said, figure out which stable update caused the regression by installing one of the packages found at http://kernel.ubuntu.com/~kernel-ppa/mainline/, e.g., http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.2.4-precise/linux-image-3.2.4-030204-generic_3.2.4-030204.201202031635_amd64.deb20:28
rtgpenalvch, I'm outta here for the day. perhaps ogasawara or bjf can give you some advice. otherwise, contact jsalisbury who can do the bisect for you.20:29
komputesHi Ubuntu Kernel Folks! What is the recommended way to boot 12.04 on an EFI system?20:29
penalvchrtg, ok. Thanks for your help.20:30
bjfkomputes, the same way you boot on a non-efi system20:30
komputesbjf: I know of an EFI system which will not boot after installation.20:31
bjfkomputes, would you like to elaborate on that a little?20:32
komputesbjf: Lenovo D3020:33
komputesbjf: Tried reinstalling ubuntu with both EFI Boot and Bios-grub partition.20:33
komputesIf sw RAID (fakeraid) is set 0+1 it does not boot20:34
komputesIf sw RAID (fakeraid) is set 1 it boot occasionally20:34
slangasekso why do you believe this is an EFI problem, instead of a fakeraid problem?20:34
komputesIf sw RAID (fakeraid) is set 10 installation is successful, but ubuntu will not boot.20:35
xnoxkomputes: can you please spell out what raid you have? do you mean mdadm or something like Intel Matrix Storage / dmraid?20:35
komputesslangasek: because I'm not too familiar with booting from EFIBoot or biosgrub partitions20:35
xnox(possibly other "technologies")20:35
slangasekwhat do you mean by "biosgrub partition"?20:36
komputesxnox: Serial Attached SCSI controller [0107]: Intel Corporation Patsburg Dual 4-Port SATA/SAS Storage Control Unit [8086:1d68] (rev 06) 20:36
komputesslangasek: that's exactly what I said20:36
xnoxkomputes: thanks.20:36
slangasekkomputes: I can read what you said, but I have no idea what you mean20:36
komputesslangasek: I'm quite confused as well20:37
slangasekbooting with GRUB under BIOS doesn't involve any special partitions20:37
komputesindeed, but this motherboard uses EFI20:37
slangasekare you booting it in EFI mode, or in BIOS compat mode?20:37
komputesEFI mode I believe20:38
komputesfstab seems to show overlayfs being use as a source for root (/)20:38
slangasekkomputes: you can verify this by booting the installer and looking for the presence of /sys/firmware/efi20:38
komputesthat confused me too20:38
komputesslangasek: will do20:39
slangasekif you have that directory, you're booted EFI; if not, you're booted BIOS compat.  And if you're booted EFI, you might want to try booting in BIOS compat again to cross-check that this is actually an EFI problem20:39
slangasek(booting and installing in)20:39
komputesslangasek: cool, thanks for the advice20:40
xnoxdid the kernel auto-removal / cleanup blueprint happened?21:36
hallynsmb`: fwiw looking at the source it looks like the net-device-{removed,added} uevents SHOULD be getting sent, I'm still digging.21:56
hallynbut upstart certainly doesnt seem to get it22:11
hallynok, gotta run.  hopefully will figure this out tomorrow22:11
slangasekxnox: nope, infinity sat on it the whole cycle :)22:29
=== lifeless_ is now known as lifeless

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!