/srv/irclogs.ubuntu.com/2011/04/29/#ubuntu-kernel.txt

=== yofel_ is now known as yofel
=== solarion is now known as Solarion
=== michaelh1 is now known as michaelh1|away
=== smb` is now known as smb
loolapw: Hey there; for LP #732046 I had requested inclusion of nls_cp437 in the virtual kernel, but unfortunately it isn't enough and iso8859-1 is needed as well; I've confirmed that it is the only missing bit though; would you mind adding fs/nls/nls_iso8859-1.ko to debian.master/control.d/virtual.inclusion-list?10:13
ubot2Launchpad bug 732046 in linux "Missing filesystem modules in -virtual package" [Undecided,Fix committed] https://launchpad.net/bugs/73204610:13
smblool, You won't be very successful in reaching apw today. They got a royal wedding going on over there and got the day off10:14
smbI will have a look10:14
loolah right10:15
loolHow could I forget about the royal wedding10:15
smbStill better than to forget you own one (or any anniversary of it) :)10:15
loolhaha10:15
loolsmb: The above likely affects oneiric, natty and maverick   :-/10:16
loolI'm testing natty at the moment10:16
smblool, Ok, let me know any results, I'd start with Maverick patches10:17
jjohansensmb: at this rate we should just get rid of the inclusion list10:18
smbjjohansen, Well, I think it is still better than to include everything10:19
jjohansensmb: well until we slowly have every driver asked for anyways10:19
jjohansensmb: I almost think its should be an exclusion list10:19
smbI at least believe there was still enough of missing ones.10:20
smbThough its true that at least filesystems might be on a "could be needed" list in general10:21
jjohansensmb: yeah, I think a hybrid approach some parts inclusion, some parts exclusion would work better10:23
smbjjohansen, Do we have some place to externalize/save that though for UDS-O?10:24
smbthought I meant10:24
jjohansenhrmm, we could drop it in misc blueprint10:24
smbjjohansen, I really should look by now, would we have a session between us and the server-team discussing any feature / idea / needs they got for us?10:26
jjohansensmb: I don't think we do yet, though we should10:27
jjohansensmb: updated the misc blueprint10:27
smbOk. Thanks. Hm, yeah. I check to open that server-kernel blueprint. Though before I try to find out whether something possibly exists.10:29
jjohansensmb: yeah lets ping the server team and see what they have first10:31
loolsmb: Right, same issue on natty10:31
smblool, ok, I check whether the same patch applies to both or whether I need slightly different ones10:32
hallynI'd like to see commit 332ab16f830f59e7621ae8eb2c353dc135a316f6 (add refcounting for lower inodes for ecryptfs) go into natty-updates.  Should I open a bug?14:20
hallynor just beg and promise a beer at dublin sprint?14:21
smbhallyn, A bug is better (though nobody will turn down any beers)14:21
hallynheh, maybe i'll do both14:22
hallynok i'll submit, thanks14:22
smbhallyn, If you got the report, a quick mail about patch and report to team-ml increases the chances of not being overlooked14:22
hallynsmb: thanks, will do14:23
=== dimmortal1 is now known as dimmortal
smbjjohansen, Ok, so I just opened up https://blueprints.launchpad.net/ubuntu/+spec/other-kernel-o-server-requirements 15:06
jjohansensmb: thanks15:07
smbhggdh, smoser Daviey ^ Idea would be to add any loose/open ends to the whiteboard.15:11
=== bjf[afk] is now known as bjf
hggdhsmb: roj15:12
ppisatiericm|ubuntu: ping15:15
ericm|ubuntuppisati, pong15:15
ppisatiericm|ubuntu: hi Eric15:15
ppisatiericm|ubuntu: in the past, didn't you use jtag on the dove board?15:15
ericm|ubuntuppisati, yep15:16
ericm|ubuntuppisati, which you need their xdb thing15:16
ppisatiericm|ubuntu: cool, which hw/sw did you use?15:16
ericm|ubuntuppisati, it's called XDB or Extreme blah blah blah, cannot remember exactly15:16
ericm|ubuntuppisati, you have to ask Marvell for the software and a usable license15:16
ericm|ubuntuppisati, it's commercial :-/15:16
ppisatiericm|ubuntu: i've a flyswatter with ARM TI 14 and 20 pins interface15:17
ppisatiericm|ubuntu: uhm, i hoped someone tried openocd on it15:17
ericm|ubuntuppisati, that possibly won't work - a DIY-ed wiggler?15:17
ericm|ubuntuppisati, I had15:17
ericm|ubuntuppisati, didn't work very well15:17
ppisatiericm|ubuntu: which profile did you test?15:17
ericm|ubuntuppisati, couldn't stop the cpu15:18
ericm|ubuntuppisati, I cannot remember15:18
ppisatiericm|ubuntu: ok, if suddenly you remeber it, please let me know15:18
ericm|ubuntuppisati, there were 2 or 3 - all tried but failed15:18
ppisatiericm|ubuntu: ah ok15:18
ericm|ubuntuppisati, the only way is to use their XDB, and even so - it didn't work very well15:18
ppisatiericm|ubuntu: no, it's ok15:19
ppisatiericm|ubuntu: i;m tracking down a bug on ARM v715:19
ericm|ubuntuppisati, ok15:19
ericm|ubuntuppisati, and it's dove related?15:19
ericm|ubuntuspecific?15:19
ppisatiericm|ubuntu: and i can reproduce it on every kernel and hw platform we have (ti-omap3/4)15:19
ppisatiericm|ubuntu: in my opinion it's a stack corruption so i wanted to test how that code path worked on another arm platform15:19
ppisatiericm|ubuntu: to double check my findigs15:20
ericm|ubuntuppisati, ah ok15:20
ericm|ubuntuppisati, cool - good luck then :-)15:20
ppisatiericm|ubuntu: unfortuntely it's an asm routine that is first relocated before the new kernel is kexeced15:20
ppisatiericm|ubuntu: so plain printk won't help here15:20
ppisatithat's it15:20
ppisatitrying to narrow the window :)15:21
ericm|ubuntuppisati, so it's kexec related?15:21
ppisatiericm|ubuntu: kexec15:21
ppisatiyep15:21
ericm|ubuntuppisati, I'm having trouble with kexec on this imx53 board as well15:21
ppisatiarm v7?15:21
ericm|ubuntuppisati, yet do not have time to look into that further15:21
ericm|ubuntuppisati, it would be good if you could find something15:21
ppisatiwait15:21
ericm|ubuntuppisati, so kexec doesn't work on TI omap3/4?15:21
ppisatiyep15:21
ppisatibroken in every release since...15:22
ericm|ubuntuppisati, there were some issues but I believe they were addressed15:22
ppisatilucid? but lucid was a tech showcase15:22
ericm|ubuntuppisati, kexec should be ok on marvell dove15:22
ppisatiericm|ubuntu: yep, works well there15:22
ericm|ubuntuppisati, we had it fixed years ago15:22
ppisatiericm|ubuntu: but IIRC it's v6 over there15:22
ppisatiericm|ubuntu: yeah yeah, that's why i'm tryiong to debug the code there15:22
ericm|ubuntuppisati, not really sure if v7 was enabled15:23
ericm|ubuntuppisati, there was an issue that L2 wasn't flushed before disabled15:23
ericm|ubuntuppisati, so the memory content is corrupted for the 2nd kernel15:23
ppisatiericm|ubuntu: anyway, read here for omap 3/4:15:23
ppisatihttps://bugs.launchpad.net/ubuntu/+source/linux-ti-omap4/+bug/76824915:23
ubot2Launchpad bug 768249 in linux-ti-omap4 "kexec panic: external abort on non-linefetch (0x1008) at 0x03510000" [Undecided,New]15:23
ppisatiactually there are at least 2 bugs here15:24
ericm|ubuntuppisati, I'm having exactly the same bug on this i.mx53 board15:24
ppisatifirst, when we are to juno to the relocation code, the memory location we end up is wrong15:24
ppisatisecond, if i force a jump to the correct location, it panics later15:24
ppisatii dind't dig into the second one, since first i would like to know what's wrong the machine_kexec()15:25
ericm|ubuntuppisati, let me look into that bug15:25
ericm|ubuntuppisati, interestingly enough - though I propose to flush L2 before kexec, the existing code doesn't seem to address this issue yet15:25
ppisatianother thing that i didn't add to that bug report is that if i interleave a flush_cache_all() in the second part of machine kexec() (after 121 and before 124)15:26
ppisatithe memory location where we jump is the correct one15:26
ppisatiso, IMO there's something rotten in:15:26
ericm|ubuntuppisati, so on my i.mx53 board, kexec was actually able to jump to the 0x70008000 which is the place where the 2nd kernel starts15:26
ericm|ubuntuand panic somewhere at 0x7000800c or so15:26
ppisatiuhm15:26
ppisatiand the relocation code is there, right?15:27
ericm|ubuntuppisati, be aware that flush_cache_all() doesn't necessarily mean flush L215:27
ericm|ubuntuppisati, what do you mean by relocation code?15:27
ericm|ubuntuppisati, that small block of code who moves the kernel to the memory start?15:27
ericm|ubuntubefore the real jump?15:27
ppisatiin machine kexec() first we copy the routine arch/arm/kernel/relocate_kernel.S::relocate_new_kernel() to reboot_code_buffer15:28
ericm|ubuntuppisati, that's one thing I have concern, not really sure if the relocation code did it right, I just need a debugger to examine the memory content15:28
ppisatithen we jumop to that routine that it actually load/move the new kernel15:29
ppisatiadd a printk15:29
ppisatiah no15:29
ericm|ubuntuppisati, that won't work15:29
ppisatiyou can't check the last part...15:29
ppisatiright15:29
ppisatiyes, you need a jtag too15:29
ericm|ubuntuppisati, the only reliable way to check that is to use jtag15:29
ericm|ubuntuppisati, exactly15:29
ppisatiyep15:29
ericm|ubuntuppisati, but I believe the issue has the same root cause as the omap one15:29
ppisatiif it's arm v7, i think so15:30
ppisatinow i just took a break from this bug beacseu i was wasting too much time on it15:30
ppisatibut i'll be back shortly15:30
ericm|ubuntuppisati, yeah - go outside and have some fresh air15:30
ppisatiand having another arm target working (jtag-wise) would be a huge help15:30
ericm|ubuntuppisati, does your omap come with a debugger/software?15:31
ppisatiericm|ubuntu: i'm applying CVEs, is it the same as a walk outside? :)15:31
ppisatiericm|ubuntu: nope, i have my own jtag stuff15:31
ppisatiwait15:31
ppisatihttp://www.tincantools.com/product.php?productid=16134&cat=251&page=115:32
ppisati50 bucks15:32
ppisatiand then you can get cables/adapters for ARM and MIPS target15:32
ppisatiand it's natively supported by openocd15:32
ppisatiit's really nice15:32
ppisatiif you compare it to a Lauterbach15:32
ppisatithat costs thousands15:32
ericm|ubuntuppisati, nice15:33
=== jjohansen is now known as jj-afk
hertontgardner: something strange, there is a notification on bug 764758 that it got fixed on oneiric, but the patch is relevant for natty only16:01
ubot2Launchpad bug 764758 in linux "Revert now uneeded "x86, hibernate: Initialize mmu_cr4_features during boot" on Natty" [Undecided,Fix committed] https://launchpad.net/bugs/76475816:01
hertonah now I see, patch was wrongly applied on oneiric16:03
hertonsorry, didn't noticed this earlier16:04
hertonthe revert is only for stable (natty)16:04
tgardnerherton, it shold have both the patch and the revert, right?16:04
defiantredpillDoes anyone know where i can find the list of patches in the ubuntu kernel?16:04
tgardnerherton, nm, not the revert.16:04
hertontgardner: yep, the revert is because the troublesome commit was reverted on stable, but kept on 2.6.39-rc + the fix16:05
tgardnerdefiantredpill, git is your friend. You can start here: http://kernel.ubuntu.com/git16:06
hertonso 2.6.39 needs the fix, while the fix is not needed anymore on 2.6.38.x series16:06
tgardnerherton, so I think I hear you saying Oneiric is OK ?16:06
defiantredpillthx16:06
hertontgardner: yes, we shouldn't apply the revert on it16:06
hertonjust explaining better: a fix for another long standing bug was made and applied on 2.6.39, with cc stable, and was applied to 2.6.38.x too16:08
hertonbut this fix introduce the hibernate regression16:09
herton*introduced16:09
hertona fix was made to 2.6.39, but stable guys decided to revert the commit on 2.6.38.x16:09
hertonso 2.6.38.x doesn't need the fix anymore (so the revert I posted)16:10
hertonwhile we don't need to do anything for 2.6.3916:10
tgardnerherton, right, so I think we're OK.16:10
hertonyes, but we shouldn't revert the fix on oneiric, so we need to revert the revert on it :)16:12
tgardnerherton, how about if I just make it disappear?16:13
* smb 's mind boggles16:13
smbI know it is a rebase tree anyway16:13
tgardnerThis one, right: UBUNTU: SAUCE: Revert "x86, hibernate: Initialize mmu_cr4_features during boot"16:13
tgardnersmb, shit disappears from master-next dev trees all the time16:14
smbtgardner, well as I understand herton it got pushed out in a oneiric upload but meh16:16
herton_tgardner: yep, this one (in case the reply didn't went, connection dropped here)16:16
tgardnerherton: its as if it never existed.16:17
tgardneryou gotta love rebase :)16:17
herton_tgardner: yeah, saw here now on a new fetch it's ok, thanks16:18
=== herton_ is now known as herton
smbtgardner, bjf Updated the cve-tracker with a mass rename update from jdstrand. Before changing things, be sure to pull16:38
bjfsmb, wonder if this will bust apw's report16:40
smbbjf, Hm, dunno. I probably write a note to him (and probably myself to remind him of the note on Tuesday)16:41
bjfsmb, well know as soon as the report cron runs again, top of the hour i think16:42
bjfs/well/we'll/16:42
smbOh, wait maybe I get confused. Was that a report _for_ apw or one generated _by_ him?16:43
smbAnd what report exactly anyway... *sigh*16:44
* smb looks longing towards the nice cool can of cider in the fridge...16:45
bjfjcastro, there is a uds-o conflict on monday at 10 with "Improve Suspend and Hibernate Kernel Debug" and the "Kernel Monday Roundtable" one is "hardware" the other is "other"16:47
bjfjcastro, is there a "simple" way the summit system can figure out the conflict?16:48
jcastrobjf: yeah manually scheduled roundtables don't get resolved, you can just click edit and drag it out of that slot16:49
jcastrobjf: I check those a bunch so if it starts to do that I move them, but if you run into that you can just fix it16:49
bjfjcastro, i don't want to change the roundtable, i want to have the other session autoscheduled to another slot16:49
jcastroright16:50
jcastroI just moved it16:50
jcastroyou're good16:50
bjfjcastro, thanks16:50
bjfjcastro, if there were a way to say that we were "required" to be at the roundtable, would that help?16:51
bjfjcastro, is there somewhere else besides this channel that you'd like me to bring these to your attention ?16:55
jcastrobjf: yeah but the thing is the required thing needs a blueprint, so all I do is kick out sessions for tracks that are running roundtables16:56
jcastroit ends up being less work16:56
jcastrobjf: how you're doing it is fine. 16:56
bjfjcastro, ok, same issue with Tuesday 9:00 with the "Ubuntu support for Intel UEFI firmware"16:57
JFo<-lunch, bbiab17:03
jcastrobjf: fixed17:09
bjfjcastro, thanks17:09
=== jj-afk is now known as jjohansen
=== JanC_ is now known as JanC
Kanohi jjohansen , didnt you want to fix the 32 bit daily builds?17:49
jjohansenKano: ah yes, sorry I haven't gotten to looking at those yet17:50
Kanothe last 32 bit kernel is rc4...17:50
Kanoalso did somebody notice that dvb-c has a much worse signal with .39 compared to .38?17:51
jjohansenKano: unlikely .39 hasn't been the focus for the last few weeks17:51
Kanowell all my scripts work with .39 kernel. every nv/fglrx17:52
=== eagles0513875 is now known as zz_eagles0513875
=== allison_ is now known as wendar
keesbjf: I'm confused about CVE-2011-1182. it's marked as "released" (without versions) and "ignored" (without a reason). what's up with that CVE?18:45
ubot2kees: ** RESERVED ** This candidate has been reserved by an organization or individual that will use it when announcing a new security problem.  When the candidate has been publicized, the details for this candidate will be provided. (http://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2011-1182)18:45
bjfkees, for Lucid, Maverick and Natty the patches have already been applied via stable upstream releases18:47
keesbjf: ah-ha, okay. I'll have to find out which -security version they appear in.18:47
bjfkees. for Hardy, let me go back and look18:48
tgardnerkees, apw has been working on a script to match stable updates and CVE commits. 18:48
keesbjf: for hardy, if the code isn't vulnerable, "not-affected" is fine. "ignored" implies it's vulnerable, but for some reason we're not patching it.18:48
keestgardner: excellent!18:48
bjfkees, bug 772543 shows when the patches came in18:48
ubot2Launchpad bug 772543 in linux-mvl-dove "CVE-2011-1182" [Undecided,In progress] https://launchpad.net/bugs/77254318:48
bjfkees, that may have been the case that i tagged it incorrectly, but i need to go back and look again18:49
keesbjf: okay, I see the upstream/stable version notes. I'll have to figure out where those are with respect to our published -security versions, update the old USNs, etc.18:50
bjfkees, some of those are probably in -proposed right now18:50
bjfkees, so "released" may have too strong a term18:50
keesah! in that case, it should be "pending" (since they're not released)18:50
keesyeah :)18:50
keesthis way I can examine the "pending" CVEs when writing up the USN (in the case of the CVE not showing up in the changelog, etc)18:51
keesanyway, this is part of the grey area to discuss at UDS :) thanks!18:51
keesoh! also, I've just EOL'd karmic, so you might want to pull in our u-c-t tree to see those changes so you can ignore karmic from now on. :)18:52
bjfkees, ok, those signals didn't exist in hardy, so that should have been "not-affected" i'll fix that up18:52
keesbjf: perfecto, thanks!18:52
* bjf -> lunch / errand18:58
=== bjf is now known as bjf[afk]
* tgardner --> lunch18:59
=== jjohansen is now known as jj-afk
* jj-afk -> lunch20:01
loolsmb: Just a note that I tested a rebuilt natty kernel with the iso8859-1 module added under a natty ec2 instance and it solved the problem20:34
serge_two xfs snafus on natty in one day.  I'm getting suspicious20:49
=== bjf[afk] is now known as bjf
cwilson_So I have a long running copy of a file that is just over 4TB from one internal drive to a firewire drive mounted on the same system, transfer speeds start out in the 20-30 MB/s range but then after 24 hours it drops down to 5-8MB/s and then after another day or so the transfer drops down again to around 50KB/s.  The big difference I see process wise after the first drop is the kernel flush process for the target drive being in D state all the time a21:11
cwilson_rsync process that I'm using for the copy being in 'S+' state.21:11
ohsixwhat filesystems are involved?21:12
cwilson_At the begining of the process when the speeds are high I am seeing both the flush process in D state but also the rsync process in D state, could this be related to how the kernel is flushing the file out?21:12
cwilson_I am reading from an XFS filesystem and writing to an XFS filesystem21:12
ohsixah, i don't have anything to add; i've just seen something like that but highly exaggerated with ntfs-3g as a destination21:13
cwilson_cool, thanks21:13

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!