/srv/irclogs.ubuntu.com/2010/06/25/#ubuntu-kernel.txt

erichammondtgardner: (Took a meeting break)  Yes, I should be able to take care of what is needed, though RightScale keeps changing the best way to set up their integration software with Ubuntu on EC2.00:00
keesogasawara: 9a7e849 boots ok00:26
ogasawarakees: cool, I expected it to pass looking at the commit00:27
keesI guess it's 9908ff736adf261e749b4887486a32ffa209304c causing the problem then?00:27
ogasawarakees: yah, that's what the bisect is pointing to00:28
ogasawarakees: k, I'm going to try revert just that commit and build a final test kernel00:29
keesogasawara: sweet00:29
tgardner-afkccheney, http://kernel.ubuntu.com/~rtg/mainline/4f570f995f68ef77aae7e5a441222f59232f2d0e-maverick/linux-image-2.6.32-020632rc3g4f570f9-generic_2.6.32-020632rc3g4f570f9_amd64.deb00:44
=== kamal-away is now known as kamal
ogasawarakees: this is the latest 2.6.35-6.7 Maverick kernel with the 9908ff7 reverted - http://people.canonical.com/~ogasawara/lp597075/2.6.35-6.7-9908ff7/00:59
ccheneytgardner-afk, thanks00:59
ogasawarakees: I'm still building the mainline kernel with the commit reverted00:59
ccheneytgardner-afk, looks good01:05
ccheneybe back in about 30m01:05
ogasawarakees: http://people.canonical.com/~ogasawara/lp597075/2.6.35-rc3-9908ff7/ - that's the latest 2.6.35-rc3 mainline kernel with 9908ff7 reverted01:12
keesogasawara: lp597075/2.6.35-6.7-9908ff7 did not boot :(01:16
keestrying the mainline next...01:16
ogasawarakees: ugh, was afraid of that01:16
keesogasawara: :( lp597075/2.6.35-rc3-9908ff7 did not boot either.01:23
ogasawaraboo01:23
keesogasawara: this is really odd.  are there maybe multiple problems?01:23
ogasawarakees: I'm thinking it's a commit outside of i915 that might be triggering the i915 issue01:23
keesoh, the bisection was targetting only changes in i915, but there were other commits between it?01:24
ogasawarakees: yep01:24
keesgotcha.  craps.01:24
keeswell, if you can show me the commnds you're running, I can take this over maybe.  I have access to the builders.01:25
* ccheney back01:25
ogasawarakees: ooh, have access to tyler?01:25
ccheneytgardner-afk, ok yea your 4f57f9 kernel was good01:25
ogasawarakees: that's what I was using as the build box01:25
keesogasawara: yup, I'm on tyler01:27
ogasawarakees: git clone git://kernel.ubuntu.com/ubuntu/kteam-tools.git01:28
keesdone01:28
ogasawarakees: cp /home/ogasawara/kteam-tools/kteam-tools/mainline-build/mainline-build-one to /home/kees/kteam-tools/kteam-tools/mainline-build/mainline-build-one01:29
ogasawarakees: It's just a tweaked version of the script to use the tyler chroots01:30
tgardner-afkogasawara, did you swipe the one I've been using on emerald?01:30
keesogasawara: okay (removed extra /kteam-tools from path, but good now)01:31
ogasawaratgardner-afk: nope, saw you'd added a README to the repo and then tweaked the script to make it work for me on tyler01:31
tgardner-afkI'm gonna spend tomorrow making them work on any machine, not just zinc01:31
ogasawarakees: oops, copy and paste error01:31
ogasawaratgardner-afk: that'd be awesome, I found it really useful doing these mainline bisects for kees01:32
tgardner-afkogasawara, yep, I've been doing 'em for 2 days now01:32
ogasawarakees: so I did a mkdir lp59707501:33
=== tgardner-afk is now known as tgardner
ogasawarakees: just to keep things separate01:33
erichammondtgardner: The currently published Maverick kernel on EC2 is 2.6.32-305.9.  Do you know if the "scheduler fixes" you mentioned are available in that version or would we need to wait for the next release?01:33
keesogasawara: okay01:33
ogasawarakees: then in lp597075 I cloned the mainline git tree, git clone --reference /usr3/ubuntu/linux-2.6.git/ /usr3/ubuntu/linux-2.6.git/01:33
keesokay, done01:34
ogasawarakees: cd linux-2.601:34
ogasawarakees: then run the following to build `/home/kees/kteam-tools/mainline-build/mainline-build-one <sha1> maverick`01:35
keeswhich sha1 do I want to start with?01:35
tgardnererichammond, hmm, I have not backported the EC2 kernel from Maverick. All I have are the standard kernels. I may have mislead you thinking that a standard kernel would work, but thats only for UEC, not Amazon, right?01:35
ogasawarakees: ah, so in a separate linux-2.6 tree I kept track of the bisect01:36
ogasawarakees: git bisect start v2.6.35-rc2 v2.6.35-rc101:36
erichammondtgardner: We are only interested in Amazon EC2 kernels as this is where the application servers run.01:37
keesseparate tree? ooooh.  on tree doing the bisect, the other tree doing the builds?01:37
ogasawarakees: right01:37
ogasawarakees: I just liked being able to read the bisect output uninterrupted01:37
keesogasawara: and now I just get to do a full-blown bisect without limits01:37
tgardnererichammond, right. I don't think jjohansen has a working Maverick kernel for EC2 yet, correct?01:37
ogasawarakees: I think I could replay it, but am too lazy to look it up01:37
tgardnerogasawara, you know it keeps a log? .git/BISECT_LOG 01:38
ogasawarakees: yep :(  looked like around ~1500 commits between rc1 and rc201:38
ogasawaratgardner: there is?01:38
jjohansentgardner, erichammond: yes I am trying to get the EC2 kernel for Maverick up01:38
keesogasawara: so the sha1 I give is the one git bisect spits out?01:38
keesi.e.:01:39
kees6$ git bisect start v2.6.35-rc2 v2.6.35-rc101:39
keesBisecting: 373 revisions left to test after this (roughly 9 steps)01:39
ogasawarakees: yep01:39
kees[b1413357d924792e2e332dcb6b712a7fb2a5fb25] fbdev: fix frame buffer devices menu01:39
kees^^^ that?01:39
erichammondtgardner, jjohansen: I saw a note about that in the latest server-team meeting summary, but I was able to get the Maverick alpha-1 AMI running on EC2.01:39
ogasawarakees: yah, so use b1413..01:39
ogasawaratgardner: well, I'll be damed.  /me makes a note01:39
jjohansenerichammond: right01:39
keesogasawara: does mainline-build-one correctly maximize CPU parallelization in the build01:39
kees?01:39
tgardnererichammond, my understanding from jjohansen is that it still fails in some zones01:39
ogasawarakees: not sure about that, but it'd crank em out pretty fast01:39
tgardnerogasawara, kees: it does01:40
jjohansentgardner, erichammond: specifically the pv-ops stuff fails in some zones, the full Xen has other bugs, and problems01:40
keesfatal: 'maverick' does not appear to be a git repository01:40
keesogasawara: I seem to be missing something01:40
ogasawarakees: shoot, forgot to have you add the remote01:41
keesah-ha!01:41
erichammondtgardner, jjohansen: For this Asterisk test, we only need the 64-bit kernel in us-east-1, but it sounds like it will need to wait anyway.01:41
keesogasawara:  git remote add maverick /usr3/ubuntu/ubuntu-maverick.git/ ?01:41
ogasawarakees: git remote add maverick git://kernel.ubuntu.com/ubuntu/ubuntu-maverick.git01:41
jjohansenerichammond: try me again tomorrow, I have another kernel I will be testing soon01:41
tgardnerkees, for i in dapper hardy jaunty karmic lucid maverick; do git remote add $i git://kernel.ubuntu.com/ubuntu/ubuntu-$i.git; done01:42
keesoh, not the local one?01:42
ogasawarakees: you could probably use the local one too01:42
ogasawarakees: it's sync'd regularly01:42
tgardnerkees, it doesn't make much difference after the first pull01:42
keesogasawara, tgardner: cool, now it's rockin' :)01:43
ogasawarakees: sweet01:43
tgardnerkees, tyler is a wuss, you oughta try emerald.01:44
tgardneror tangerine01:44
keestgardner: I was doing my Yama builds on tangerine.  woof.01:44
tgardnertangerine and emerald are basically the same machine01:44
keesI wonder how much ccache would help when doing a bisect01:45
tgardnerkees, I found on emerald and tangerine it makes about a 10 second diff over a full 3 arch pass on Lucid01:46
tgardnerso, not much01:46
keesholy cow, that's not so great.  :P01:46
ogasawarakees: ok, I gotta bail for a bit.  keep me posted how the bisect goes.01:48
keesogasawara: cool; thanks!01:51
ccheneytgardner, another build failure showed up01:51
tgardnerccheney, copying now01:52
ccheneyok01:52
ccheney150e6c6 ?01:52
tgardnerccheney, http://kernel.ubuntu.com/~rtg/mainline/150e6c67f4bf6ab51e62defc41bd19a2eefe5709-maverick/linux-image-2.6.32-020632rc3g150e6c6-generic_2.6.32-020632rc3g150e6c6_amd64.deb01:53
ccheneytgardner, good01:58
ccheneytgardner, er it works i mean :)02:00
tgardnerccheney, Bisecting: 2 revisions left to test after this02:00
ccheneygreat :)02:01
jjohansenback on later02:01
=== jjohansen is now known as jjohansen-afk
keesogasawara, tgardner: should I expect this thing to work on tangerine?  it only seems to build on tyler.02:08
keesoh, er02:08
kees/home/kees/kteam-tools/mainline-build/mainline-build-one: line 128: dch: command not found02:08
keesheh02:08
tgardnersee  mine stuff in emerald:~rtg/kteam-tools/mainline-build02:09
tgardnerkees, see my*02:09
tgardnerkees, its in a transitional phase. I'm gonna fix it tomorrow so that it works everywhere02:10
keestgardner: okay, cool02:10
* kees runs away for dinner02:10
tgardnerccheney, http://kernel.ubuntu.com/~rtg/mainline/e00ef7997195e4f8e10593727a6286e2e2802159-maverick/linux-image-2.6.32-020632rc5ge00ef79-generic_2.6.32-020632rc5ge00ef79_amd64.deb02:20
ccheneytesting02:20
ccheneytgardner, works02:27
tgardnerccheney, ack02:27
ccheneylooks like it must be cc56f7de7f00d188c7c4da1e9861581853b9e92f, the other commit is supposedly ps3 specific02:28
tgardnerccheney, probably, but lets play it through02:29
ccheneyyea02:30
tgardnerccheney, this one has a good vibe though.02:30
ccheneytgardner, there is a v2 of the patch, if i understand what is on lkml that was posted on 5/2902:36
ccheneymay be related to the bug we are seeing, if it is indeed that patch causing the problem02:37
tgardnerccheney, I'll have it in 5 minutes02:37
ccheneyok02:37
tgardnerccheney, http://kernel.ubuntu.com/~rtg/mainline/cc56f7de7f00d188c7c4da1e9861581853b9e92f-maverick/linux-image-2.6.32-020632rc5gcc56f7d-generic_2.6.32-020632rc5gcc56f7d_amd64.deb02:43
ccheneytesting02:44
benjamim Anyone can tell me if the current maverick kernel present on the PPA (2.6.35-5) already has vga-switchero compiled as default ?02:47
RAOFYes.02:47
RAOFIt deos.02:47
benjamimthanks RAOF02:48
benjamimi'm trying to switch to my RADEON graphics with "echo DDIS > /sys/kernel/debug/vgaswitcheroo/switch"02:49
benjamimbut i receive a blackscreen02:49
benjamimis the command right ?02:49
RAOFI forget.02:50
RAOFFrom what I understand, that's highly unlikely to work properly if X is running.02:51
ccheneytgardner, failed test, i am going to verify e00ef79 is actually good and can run instances to be sure its the cc56f7d02:51
benjamimyep, after type that command i restart X02:52
benjamimbut the screen just go black02:52
benjamimOnly the intel graphics are working02:52
RAOFbenjamim: I'd probably grep the git log for the switcheroo commit to see what to do.02:53
tgardnerccheney, this next kernel is the absolute proof since that last commit is now skipped (if it doesn't fail)02:53
ccheneytgardner, ok02:54
benjamimRAOF, could you tell what is the file that the switchero log is stored ?02:55
RAOFbenjamim: It's not - it's in the VCS history.  You'd need to clone the kernel's git tree.  Alternatively, go to git.kernel.org and use the search box in Linus' tree.02:56
benjamimoh, i did not know that, I'll take a look.02:58
benjamimthanks for help RAOF02:58
ccheneytgardner, yea e00ef79 ran an instance so its good, i saw a different type of failure but we think its a different unrelated bug02:59
ccheneythe other bug is probably due to eucalyptus not doing error checking03:00
tgardnerccheney, http://kernel.ubuntu.com/~rtg/mainline/f21121cde6e617b90cd03ce083652ca543004dc2-maverick/linux-image-2.6.32-020632rc5gf21121c-generic_2.6.32-020632rc5gf21121c_amd64.deb03:08
ccheneytesting03:08
ccheneytgardner, looks good so far03:12
tgardnerccheney, play it all the way, just like the others.03:13
ccheneytgardner, its still running, just giving early feedback :)03:15
ccheneytakes around 15m to run the test 10 times03:16
* ccheney has to go to the grocery store, bbia 1hr03:16
ccheneyso far passed the test 3/3 with 7 left to try03:17
tgardnerccheney, k, see you in a bit03:17
tgardnerccheney, back in the AM04:27
ccheneyi got swallowed by walmart05:16
ccheneysorry for being gone so long :-\05:16
* ccheney needs to remember to look at the list his wife writes up before leaving to make sure it is detailed enough to buy05:17
jk--1) "items"05:25
=== kamal is now known as kamal-away
smbMorning08:14
ckingGood morning smb! Welcome to another wonder filled day08:16
smbcking, Too true08:16
smbcking, Just experienced the wonders of democratic bit reading. :-P08:16
LLStarksapw or ogasawara: any chance that enable-multiple-ring-buffer.patch can get backported for maverick?08:16
* cking wonders what "democratic bit reading" is08:17
smbLLStarks, Not apw and ogasawara will not be around till later08:17
LLStarksokay08:17
LLStarksany other kernel managers that i should speak to?08:17
smbcking, You read 5 times and let the majority decide whether its 0 or 108:17
ckingsmb, not sure that's entirely a good idea. a dictatorship suits me better. as long as I'm in control ;-)08:18
smbLLStarks, Depends on the urgency and the patch. And as I still wait for my mail to sync I don't even know whether apw has written something about this patch08:19
smbcking, In my case democracy seems the better choice as that allows the EDID to be ok even with some monitor switch in between08:20
LLStarksit's for vaapi support on intel08:20
ckingsmb, I get your drift now08:20
LLStarkskinda stupid to not have it since  libva is now in ubuntu archives08:20
LLStarkshttp://intellinuxgraphics.org/h264.html08:21
smbLLStarks, Have you sent the patch to kernel-team@lists.ubuntu.com with some short explanation?08:21
LLStarksi'm not on the mailing list08:21
RAOFThat ended up missing 2.6.35, did it?  It's been on the intel-gfx lists for some time - I kinda thought it made it into Linus' tree.08:22
LLStarksmade 2.6.35 from what i've heard08:22
smbLLStarks, You could still send to it. Though it then take a bit of time for it to get moderated to the list08:22
LLStarksnot sure though08:22
RAOFThen rejoice, for Maverick will have 2.6.3508:22
LLStarkslike i said, i'm not sure if it made the cut08:23
LLStarksah. it did. http://permalink.gmane.org/gmane.comp.video.dri.devel/4685508:24
RAOFgit log says that its in 2.6.35-rc3, so it's in Maverick now.08:24
LLStarkshopefully everything else required falls into place.08:24
RAOFThe easiest form of backporting!08:24
smbRAOF, Very true08:25
smbRAOF, Btw, just out of interest. There have been a few bugs about invalid EDIDs. Do you know whether they are all over adapter space or mainly intel?08:25
RAOFThe EDID processing code is in the shared drm code.  EDID processing recently got more strict about the checksum being accurate (ie: actually checking the checksum, rather than completely ignoring it).08:27
smbRAOF, Right the EDID code itself is. But the implementation of the bit banging functions are driver specific. In my case intel_i2c.c08:28
smbRAOF, And I seem to experience flakyness when the monitor cable is not directly connected to the laptop08:28
smbRAOF, But as of this mornings "democratic" approach it seems to work. With mostly a 4:1 vote for the bit being 008:29
RAOFUrgh :)08:29
smbOh yeah. :)08:29
smbEspecially as the i915 code has so many bit values defined that I am completely lost about their intention08:30
RAOFI'm not sure how I'd tell if a bad EDID on a bug was due to the monitor or the i2c interface.  It's not something I've regularly asked reporters about.08:30
RAOFAnd there are plenty of monitors with bad EDIDs to go around.08:31
smbYes, that might be specific to my case. The interesting part would be the number of messages. If the remainder is always the same, I guess the EDID /monitor is at fault08:32
smbIf, like in my case, the remainder always changes, there is flakyness08:32
RAOFYeah.  I *have* noticed some changes in that sort of area for i915 but not radeon or nouveau.08:33
smbThat would sort of match my experience that all radeon (I think I have not tried nvidia) laptops when connected to that same monitor switch would get the right EDID but not that i915 based netbook08:37
smbAnd as it drove me nuts (because I mainly work on that netbook and it would only allow the high mode after much cursing and tricking around) I played around with that yesterday and today08:38
RAOFI think it might be at least in part because Intel chips are entirely integrated, so they rely on third parties to actually wire stuff up.  ATi and nVidia both either (a) are discrete cards, wired up by people whose job it is to wire up discrete cards, or (b) on motherboards wired up by AMD or nVidia.08:39
smbMaybe also because compared to radeon i2c code which only seems to set unset a bit in the registers, the intel implementation has dir bits and a mask and val bits and a mask and I think I really need to talk to somebody that can actually make sense of that. :) I'll try Eric, maybe he can tell me what all of this means08:42
ckingsmb, http://kernel.ubuntu.com/git?p=ubuntu/ubuntu-hardy.git;a=commit;h=13f8acafe67361d7afda05c23a0408820e0cb46811:12
=== ghostcube_ is now known as ghostcube
=== ayan-afk is now known as ayan
tgardnerccheney, http://kernel.ubuntu.com/~rtg/mainline/3b2e8e02ca5a31b6f8db8de05becb613d819622a-maverick/linux-image-2.6.35-6-server_2.6.35-6.7_amd64.deb13:35
ccheneytgardner, thanks, testing13:48
ccheneytgardner, seems to be working, doing full test and going to run instance to be sure13:58
tgardnerccheney, ok, I've been looking at the patch. most of the changes are just pointer guards, but there is one change that could affect the behavior.13:58
ccheneyok13:59
ccheneytgardner, the pad block corrupted message doesn't appear but it seems to not run the instance, i am going to try it again and see if it will work14:28
ccheneytgardner, its possible there is more than one bug, i am going to look into finding other ways to verify if the image was registered properly14:29
=== sconklin-gone is now known as sconklin
tgardnerccheney, a different behavior then the previous successful runs?14:34
ccheneywith this kernel it appeared to register the image properly but then when i tried to launch an instance using the image it did not work, both worked before, i may need to retest your last kernel from last night to make sure it was working for running instance, but i am pretty sure it was14:35
ccheneyi slept since then so i have forgotten a bit :-\14:35
tgardnerccheney, retry 2.6.35-5.6 to verify it still produces the PAD error, then I'll build a kernel based on that version but with the offending commit reverted.14:37
ccheneyok will do14:37
ccheneytgardner, it ran this time, i think its probably an issue with eucalyptus that caused it to not work the first attempt14:40
tgardnerccheney, does it pass the 10 loop test?14:40
ccheneyi'll try a few more times to make sure it consistently is working, but i wouldn't be surprised if it was euca for that part14:40
ccheneytgardner, yea the 10 loop test passed, that just tests the registration for the pad bug14:41
ccheneythis is still on 2.6.35-614:41
ccheneythe step i do after testing the pad bug is gone is to run the instance which seemed to hang the first attempt but worked on the second14:41
ccheneywill get back to you soon about it, talking to the guys about it14:41
tgardnerccheney, just to be clear, you're running the kernel with cc56f7de7f00d188c7c4da1e9861581853b9e92f reverted?14:42
ccheneyi am running http://kernel.ubuntu.com/~rtg/mainline/3b2e8e02ca5a31b6f8db8de05becb613d819622a-maverick/linux-image-2.6.35-6-server_2.6.35-6.7_amd64.deb14:42
tgardnerccheney, k, I'll add that to the bug report14:43
ccheneytgardner, ok14:43
ccheneytgardner, yea it seems to fix it, it only failed the first time of running instance and i don't have a great track record of instances running on my box anyway due to other weird eucalyptus bugs14:48
ccheneyso i would consider the test to be successful14:48
ttxtgardner: what are our options wrt A2 ? would you consider using that "current minus problematic patch" kernel for A2 ?14:48
ttxs/patch/commit/14:49
tgardnerttx, um, I don't completely understand _why_ that patch causes a failure. As far as I can tell it merely implements more rigorous error checking, which the Java machine may be tripping over. I'm still of the opinion that you should release with a Lucid kernel 'cause I'm not yet willing to advise that we revert that commit for the master kernel.14:53
ttxtgardner: hm...14:54
tgardnerttx, Linus has been gone for 2 weeks, so there may well be fixes for this already in the pipeline. I'll bug the authors about it.14:56
ttxtgardner: I thought there would be more value in testing a 2.6.35 kernel in the A2 milestone than to revert to a Lucid one.14:56
ttxtgardner: there have been some patches proposed by the same author that seem to "fix" things, ccehney found them14:56
tgardnerttx, perhaps I read a different thread. Which one did you read?14:57
ttxNo clue if they fix anything relevant for us though14:57
ttxmight be worth generating a test kernel with them and see14:57
* ttx looks at reference14:57
tgardnerttx, I read this one http://marc.info/?l=linux-fsdevel&m=127488511324250&w=2 which was inconclusive14:58
ttxtgardner: there are a few others at http://marc.info/?a=119980668900012&r=1&w=2 but I have no clue if they are more conclusive14:59
tgardnerttx, which is why I thought I'd contact the authors, 'cause I also have no clue15:00
ttxtgardner: if you're more comfortable with shipping a Lucid kernel for Maverick A2 than the modified 2.6.35 one, we should go that way15:01
ttxI'm missing historical referenbce to see what most acceptable for an A215:01
ttxtgardner: maybe that's a subject for the release meeting in one hour15:02
tgardnerttx, what is the use of testing a Maverick kernel in UEC if we _already_ know it doesn't work. It seems to me that it'll preclude testing of other UEC features.15:03
ttxtgardner: agreed, by "modified 2.6.35 one" I was thinking the one that ccheney just tested15:04
ttxthe issue being it affects more than just UEC, but all servers15:04
tgardnerttx, well, I'm not ready to recommend that ogasawara revert that commit. I assume that the A2 release _must_ use the kernel from the archive, correct?15:05
ttxtgardner: that's my understanding as well15:05
ogasawaratgardner: and I'd prefer not to have to spin a new kernel, I uploaded what I hoped to be our final A2 kernel last night15:05
tgardnerogasawara, yep, agreed15:06
ogasawaraand it's cutting it close to do another upload15:06
ttxtgardner: if A2 uses the one from the archive, how can you make it "ship the Lucid kernel" ?15:07
ttxanother solution is to just releasenote the fact that for UEC, you should revert to a Lucid kernel due to a yet-to-be-investigated bug15:08
tgardnerttx, I guess its a matter of seeding? The Lucid kernel _does_ live in the archive, just in a different releas.15:08
ttxtgardner: ah ok.15:08
ccheneyi'm not sure if you can seed something not in your pocket15:09
ccheney but that would be a question to ask someone like cjwatson probably15:09
ttxogasawara: I'm unsure we should just prevent non-UEC server use cases from testing the 2.6.35 kernel in the A2 milestone15:09
ccheneyif we can't and we want to use the old kernel we can probably get it copied to maverick15:09
ttxdocumenting the workaround for the UEC users sounds like the best move here15:09
ccheneyhmm actually if it has the same source package name then probably the only way to do it is by telling users to install (but i may be wrong)15:10
* tgardner gets breakfast. biab.15:10
ttxtgardner, ogasawara: i'll mention the issue in the release meeting since the release team should decide on what they prefer to have in A215:11
ttx(and will confirm what our options are, if we have more than one)15:11
ogasawarattx: ack15:12
ccheneyttx, yea our only options might be to either respin or to document in the a2 release notes, but i don't know if they have some magic they can perform on the image outside of regular seed generation15:12
cndis there a way to keep my system at 10.04, but somehow tell apt to upgrade my kernel from maverick?15:15
cndso far I've just been manually installing kernels15:15
ttxccheney: that's what I'll ask at the release meeting15:16
tgardnercnd, you could run the LTS backport.15:16
cndtgardner: can you remind me whether that differs in any way from the real maverick kernel?15:17
cndconfig options?15:17
tgardnercnd, should be identical.15:17
cndtgardner: are they now produced at the same time as the maverick kernels, or do they lag?15:17
tgardnercnd, the only diff is that the backport is built with the Lucid toolchain15:18
tgardnercnd, obviously it lags because its dependent on maverick, but usually no more then a day or so15:18
cndok15:18
cndthanks15:18
tgardnercnd, it depends somewhat on the crazy shit ogasawara has done to maverick. I'm working on the backport right, but apparmor is giving me some grief.15:19
cndheh15:20
ccheneytgardner, anything else you would like me to test? whenever the other patch gets reviewed by Linus I can test it out as well15:31
tgardnerccheney, it'll likely take a few days.15:32
ccheneytgardner, no problem just let me know when you are ready15:32
tgardnerccheney, just to be clear, this PAD error is emitted by a node host kernel when it is attempting to unpack and load a KVM guest, right?15:42
JFotgardner, is this bug on your radar? bug 59790415:42
ubot2Launchpad bug 597904 in linux (Ubuntu Maverick) (and 1 other project) "No video on beagleboard with 2.6.35 kernels. (affects: 1) (heat: 10)" [Critical,New] https://launchpad.net/bugs/59790415:42
tgardnerJFo, it should get on mpoirier's radar15:42
JFok15:43
JFompoirier, you around?15:43
mpoiriersure am15:43
ccheneytgardner, its emitted when attempting to uec-publish-tarball on the walrus server15:43
JFok, have you seen that bug I just pointed to tgardner?15:43
mpoirierhold on, looking at it...15:43
JFok15:43
ccheneytgardner, the node doesn't do anything with it at all, the node is the system running the kvm instance that you start using the image off the walrus server15:43
ccheneytgardner, i hope that was clear, i can try to reword a bit better if it wasn't :)15:44
tgardnerccheney, clear as mud. explain in a language that upstream will understand15:44
ccheneytgardner, ok15:44
mpoirierJFo: I am aware of the issue - been present since I joined the company.15:44
JFocool15:45
JFojust worried as I got mail indicating it was critical, etc.15:45
ccheneytgardner, so this bug shows up on the server running 'walrus' which is the equivalent of Amazon S3 if you know what that is?15:45
ccheneytgardner, reading my uecglossary page :)15:45
mpoirierJFo: they are all critical right now...15:45
JFoand they set milestone to Alpha2 and the kernel freeze is today15:45
mpoirierJFo wont' happen.15:45
ccheneytgardner, so its running on the box not running the node controller, so not the one running the images in VMs15:45
ccheneytgardner, when you start an instance (VM) the software provides the image you tell it to use via 'walrus' S3 to the node controller to use as the base image to run for the VM15:46
mpoirierJFo: we need to coordinate with the mobile team.15:46
JFoyeah, I agree15:46
ccheneytgardner, our bug is failing when attempting to register the image (make it available for use) to the 'walrus' S3 service15:46
JFojust wanted to verify that no one had given them the idea that it was possible :)15:47
tgardnerccheney, more fundamentally, this is the walrus host kernel attempting to decrypt a file?15:47
ccheneytgardner, to explain better than that i may need to defer to ttx15:47
ccheneytgardner, yes15:47
JFothanks for looking mpoirier 15:47
ccheneytgardner, so not a vm kernel, actual bare metal kernel15:47
ttxtgardner: in fact it's walrus calling a Java decryption routine15:47
tgardnerccheney, and your 10 cycle test simply runs that decryption 10 times?15:48
mpoirierJFo.  I'll have a chat with tgardner on how to proceed with the workload and what to prioritize first.15:48
ccheneyttx, i couldn't find where bouncycastle or openjdk-6 is actually using sendfile but i may be lacking my grep skills15:48
ttxwhen used against large files, ultimately it fails with that pad block error15:48
ccheneytgardner, essentially, my script attempts to register the image which does various things unknown to me including the decryption15:48
tgardnerttx, ccheney: ok, that seems to be the fundamental issue.15:48
ccheneytgardner, and repeats that 10 times, yes15:48
ccheneyttx: i did notice even when it does work it mentions something about the file being too large in the euca logs15:49
ccheneyttx, i don't know if you saw that message before15:49
ttxI suspect the JVM does things that are no longer well supported by the kernel15:49
ccheneysome cache error message i'll see if i can find it, might be in the bug report, will check15:50
ttxsmoser pointed at  java.nio.channels.FileChannel transferTo using sendfile15:50
JFompoirier, ok15:50
ccheneyhmm the cache message seems not in the bug report, i'll add it in case its useful15:50
ccheneyttx, do you happen to know what package that is in? seems not in the main openjdk-6 source15:51
* ttx looks into his mighty java-Contents file15:51
* ccheney doesn't know enough about java to know how to locate where it would be15:52
ttxopenjdk-6-jre-headless rt.jar java.nio.channels.FileChannel15:52
ccheneyif its a bug in java we can get doko to look into it15:52
smoserit would seem unlikely to me that java would utilize and expose something that isn't standard.15:52
ccheneyhmm weird15:52
ccheneyi grepped over openjdk-6 source15:52
tgardnerttx, do you think you could develop a simple reproducer? Something that an upstream developer could use to stimulate the bug without having to have all of the UEC stuff?15:52
ttxccheney: didn't the eucalyptoids provide us with that ?15:53
smoserits possible that its a java bug, but i would think, given 2 years of function and then breakage in linux and expectation of java to run cross unix, that its kernel regression.15:53
ccheneyttx, iirc someone said there is a huge reproducer in java attached to the bug15:53
tgardnersmoser, its clearly kernel regression (or at least a change in behavior)15:53
ttxright, it's just unclear why15:53
ccheneyyea 134MB test program15:53
tgardnerah, I see it.15:54
ccheneyits either regression or undefined behavior that was relied on15:54
tgardnerccheney, If I can reproduce it, then I can likely narrow down the exact line in the commit that is causing this issue15:55
ccheneytgardner, yea, looking to see if i can find the sendfile code used15:55
ccheneyumm we have patches in the FileChannel area15:56
* ccheney hopes we didn't cause this bug ourselves15:56
ccheneyactually hmm no those aren't ubuntu patches but something else15:56
* ccheney upstream source looks a bit odd15:56
ttxccheney: the bug is occurring on RedHat too15:57
ttxccheney: so it's not "just us"15:57
ccheneyoh ok15:57
ttxit's anyone with 2.6.34+15:57
tgardnerttx, do you have a reference to the RedHat report?15:57
ccheneyyea the patches are from icedtea, which i think is similar to ooo-build for java15:57
ccheneyi still don't see a reference to sendfile though15:57
ttxit's a forum post at eucalyptus.com15:57
ttxhttp://open.eucalyptus.com/forum/decrypting-image-exception15:58
ccheneymaybe it gets generated in some way15:58
* ccheney is going to write up the preliminary information to the mailing list in a few minutes15:58
=== virtuald_ is now known as virtuald
* ccheney still isn't on the list apparently, or hasn't gotten the response back yet16:00
ccheneyah ha my openjdk-6 source wasn't fully unpacked it has a bunch of extra stuff plus a huge tarball in the dir16:01
ccheneyno wonder i saw no reference to sendfile in my grep16:02
ccheneynow i found it16:02
tgardnerogasawara, the only item for us in the release meeting is the 'pad block corrupted' issue. You're aware of the status changes, i.e., we've narrowed it down to a single kernel commit?16:02
ccheneyjdk/src/solaris/native/sun/nio/ch/FileChannelImpl.c16:03
ogasawaratgardner: yep, got it all written up to paste16:03
tgardnerogasawara, cool. on top of things as always :)16:03
ccheneyits definitely special cased for solaris vs linux16:04
ccheneyso not fully standard across *nix anyway16:05
ccheneyhttp://pastebin.com/z5juRcCB16:05
=== kamal-away is now known as kamal
=== bjf[afk] is now known as bjf
cndapw: ogasawara: I want to grab the latest kernel ddeb, but the amd64 version is missing16:45
cndI see that the previous kernel, 2.6.35-5, is also missing amd64 ddebs16:46
ogasawaracnd: Is it still building?16:46
ogasawaraamd64 that is16:46
tgardnercnd, its still building16:46
cndoh16:46
tgardnerccheney, https://bugs.edge.launchpad.net/ubuntu/+source/linux/+bug/588861/comments/10 doesn't reproduce the bug.16:46
ubot2Launchpad bug 588861 in linux (Ubuntu Maverick) (and 4 other projects) ""pad block corrupted" error when trying to register an image with 2.6.34 kernel (affects: 1) (heat: 12)" [High,In progress]16:46
ccheneytgardner, oh ok16:47
cndogasawara: tgardner: launchpad says it's been done building for 2 hours16:47
ccheneytgardner, i'll grab it and see if it happens with that for me16:47
tgardnerccheney, dang, it would be nice to a have a simple reproducer16:47
tgardnercnd, it takes awhile for the publisher  to run, plus things may be starting to freeze for A216:48
ccheneytgardner, i'm writing an email to upstream about it so maybe they will know how to make a simple test case16:48
cndtgardner: ok, I suppose I'll chill for a bit16:48
ccheneyer upstream eucalyptus i mean not kernel16:48
tgardnerccheney, hmm, this is gonna be a kernel issue I think.16:48
tgardnerunless the java engine is just borked16:49
ccheneytgardner, yea i don't know the code well enough to be able to tell16:50
tgardnerccheney, its in openjdk-6-jdk, right?16:50
ccheneytgardner, yes sendfile is in that, if you download it and unpack the source there is a large tarball with the actual source inside of that16:51
tgardnerworking on it....16:51
ccheneyso double packed debian source16:51
ccheneythats how i managed to overlook the sendfile call earlier16:51
ccheneytgardner, i attached a pastebin link of the file using it to the bug report earlier16:52
ogasawarajjohansen: just fyi, I went ahead and marked the two apparmor A2 work items you had as done (sync distro apparmor with upstream version, and update compatibility patch)17:01
jjohansenogasawara: thanks, I meant to do that17:01
ogasawarajjohansen: was selfish really, I just wanted our burn down charts to look good for the release meeting :)17:01
jjohansenhehe :)17:02
* ogasawara bails for a few hrs17:04
ogasawaratgardner: ^^ deflect any fires should there be any17:05
GrueMasterJFo: Any suggestions on how to get apport data through a serial console on a system that doesn't have a released kernel?  Re: bug 59790417:08
ubot2Launchpad bug 597904 in linux (Ubuntu Maverick) (and 1 other project) "No video on beagleboard with 2.6.35 kernels. (affects: 1) (heat: 10)" [Critical,New] https://launchpad.net/bugs/59790417:08
JFoGrueMaster, not off the top of the head17:09
GrueMasterThen QUIT MARKING OMAP BUGS AS INVALID.17:10
GrueMasterthank you.17:10
JFoGrueMaster, before you start yelling... probably best to give an example where I did17:10
GrueMasterI just did.17:11
JFoGrueMaster, get your facts straight17:11
JFoI set it incomplete17:11
JFobecause I asked for info17:12
JFoand for the record, and your general fund of knowledge17:12
JFoI don't take people yelling at me very well17:12
JFoespecially when they arewrong17:12
GrueMasterSystem isn't booting.  Best info we are able to gather is through minicom logs.17:12
JFoplease refer to my boss pgraner if you have issues\17:12
GrueMasterThis isn't an x86 system that has multiple tons of ways to collect this info.17:13
GrueMasterAnd because we need to get it working, we are getting hand-built kernels prior to release (which apport considers invalid).17:13
JFoI know that, hence my question as to whether it was possible17:13
GrueMasterNope.  Not possilble.17:14
JFolike I said, get your facts straight17:14
JFothen fine, add a comment to the bug to that effect, if necessary and we can move on17:14
JFosee how easy that could have been?17:14
* smb moves on to boc and we. See you next week17:21
GrueMasterJFo: sigh.  I owe you an apology.  I'm letting my frustrations of trying to get this image working get the best of me.  I shouldn't have taken it out on you like that.17:28
JFoapology accepted. If you need to vent to me about something like that in the future, may I suggest a private window? :)17:29
* JFo steps away for lunch17:30
tgardnermpoirier, you around?17:31
mpoirierI am.17:31
mpoirierwhazup ?17:31
tgardnermpoirier, did Lucid omap3 have video? Is this a maverick regression?17:31
keesogasawara, tgardner: this bisect script rocks.  :)17:31
ogratgardner, maverick regression17:32
mpoirierlet's mumble...17:32
tgardnerkees, If I can get clear of UEC I'll try and make it better17:32
GrueMastertgardner: Yes, Lucid works fine on omap 3.  In fact, I tried the latest 2.6.35 kernel on that image as well.17:32
keestgardner: works great for me already.  :)17:33
keesogasawara: I seem to be zeroing in on the middle of a mess of i915 commits again... we'll see how it goes.17:33
GrueMasterI have just been informed that there is now a 2.6.35-6 release kernel (not a hand rolled one).  I'm going to pull that and retry it just to verify.  Might even get some apport data.17:34
* JFo crosses his fingers17:37
sconklinkees: I'm about to leave for the day, but if you end up sending email, I'm interested in anything having to do with i91517:45
keessconklin: okay, sure.  are you seeing problems too?17:45
sconklinkees: I'mnot, but I don't know which problems, release, or hardware you're talking about - I'm just trying to keep up with i915 in general17:46
keessconklin: okay, cool.  I have a Q35 intel mainboard with onboard G35 (?) that since -rc2 hangs on boot.  I seem to be the only person with this problem.  :P17:53
kees(pci id 8086:29b3)17:53
sconklinkees: Hmm. Hang is not good. I'm not paying much attention at all to maverick recently, been working on lucid stable updates. Have you tried the simple things like disabling kms?17:54
keessconklin: booting with "nomodeset" you mean?17:55
sconklinkees: yes17:55
keesyeah, that doesn't work.  :)17:55
keesthe only way I could get past the initramfs was to blacklist i915 _and_ intel_agp.17:55
sconklinkees: that's a new one on me17:55
sconklinsorry17:55
keesthen once into root fs, they weren't blacklist any more (initramfs bug) and udev immediately loads them and hangs the system.17:55
keesand if I blacklist them on the rootfs, when X starts, it ignore the blacklist and loads them again.  it's been a really fun few days.  ;)17:56
maks_AGP needs to be built in.17:58
maks_we got heavily hit by having it modular afaik upstream is not properly testing modular agp17:58
maks_  * [alpha, amd64, i386, amd64, powerpc] Make all AGP driver built-in to17:59
maks_    workaround race-condition between DRM and AGP.17:59
maks_it was directly asked by airlied.17:59
sconklinkees: ^^18:01
keesoh, lovely18:02
keesogasawara: ^^18:03
ccheneytgardner, yea their test case works for me even on the broken kernel, so its not good enough test apparently18:03
ccheneytgardner, i'm going to try to modify it to use the same image i was previously using to see if that makes a difference18:03
tgardnerkees, the race is a generic problem. we had a macro that would allow you to force a symbolic dependency, but ogasawara ripped it out of Maverick 'cause it wasn't getting used anywhere.18:04
keestgardner: er, so, is my problem not due to this?  /me isn't sure what to test next.18:04
tgardnerkees, I'm not sure. Try changing the config to build-in AGP18:05
keeswill do18:06
GrueMasterJFo: No joy here.  18:10
GrueMaster*** Problem in linux-image-2.6.35-6-omap18:10
GrueMasterThe problem cannot be reported:18:10
GrueMasterThis is not a genuine Ubuntu package18:10
tgardnerccheney, I'm building an instrumented kernel that might give me enough information to indicate which part of that patch is causing problems,.18:10
ccheneyok18:10
GrueMasterI might be getting this due to bug 595949.18:12
ubot2Launchpad bug 595949 in linux-meta-ti-omap (Ubuntu Maverick) (and 3 other projects) "linux-meta-ti-omap depends on the wrong binary kernel in maverick (affects: 1) (heat: 564)" [Low,In progress] https://launchpad.net/bugs/59594918:12
=== sconklin is now known as sconklin-gone
tgardnerGrueMaster, this is omap3, right?18:14
GrueMasteryes18:14
tgardnerGrueMaster, k, lemme have a look18:14
GrueMasterIt may be possible that this is because the kernel was just released and not on my mirror yet.18:15
tgardnerGrueMaster, AFAIK it hasn't completed building yet. ogasawara upload the meta package before amd64 or armel were complete18:16
GrueMasterI have the kernel.  It was at https://edge.launchpad.net/ubuntu/+source/linux/2.6.35-6.7/+build/181123918:17
GrueMasteror am I missing something?18:17
tgardnerGrueMaster, what meta-package are you using? It should be linux-omap or linux-image-omap18:17
GrueMasterThis is a hand rolled image initially running 2.6.34-5.  I downloaded linux-image-2.6.35-6-omap from the above link.18:18
tgardnerGrueMaster, ok, so why do you think you're hitting bug # 595949 ?18:19
GrueMasterHas it been fixed yet?18:20
GrueMasterMy image was created 6/22.18:20
tgardnerGrueMaster, yes, but I just noticed that there might still be  a bogus package in the archive.18:20
GrueMasterAnd I haven't seen anything kernel related with apt-get update.18:21
GrueMasterhmmm18:21
tgardnerGrueMaster, lemme chat up some archive admins about this18:21
tgardnerGrueMaster, 'dpkg -l|grep ti-omap'18:25
GrueMasternone.18:26
tgardnerGrueMaster, then you don't have the meta package installed?18:26
GrueMasteras I said earlier, it is broken (or was on 6/22 when this image was created).18:26
tgardnerGrueMaster, um, try 'dpkg -l|grep omap'18:27
GrueMasterii  linux-image-2.6.34-5-omap       2.6.34-5.14               Linux kernel image for version 2.6.34 on OMA18:27
GrueMasterii  linux-image-2.6.35-2-omap       2.6.35-2.3                Linux kernel image for version 2.6.35 on OMA18:27
GrueMasterii  linux-image-2.6.35-6-omap       2.6.35-6.7                Linux kernel image for version 2.6.35 on TI 18:27
tgardnerGrueMaster, what happens with 'sudo apt-get install linux-image-omap' ?18:28
GrueMaster"apt-cache search ti-omap" only lists the linux-headers for 2.6.33|34|35 and omap4 meta packages.18:29
tgardnerits not ti-omap, just omap18:29
GrueMasterIt wants to revert to 2.6.35-5-omap kernel.  18:30
psusianyone care to apply the proposed patch in bug #59548918:32
tgardnerGrueMaster, after you've done an update? 'apt-get update' ?18:32
ubot2Launchpad bug 595489 in linux (Ubuntu) (and 1 other project) "lvm snapshot causes deadlock in 2.6.35 (affects: 1) (heat: 6)" [High,In progress] https://launchpad.net/bugs/59548918:32
GrueMasterJust a sec.  Updated mirror, now updating apt.18:33
tgardnerpsusi, it ain't gonna happen for A2, but I've milestoned it for A318:34
psusidoh18:35
tgardnerpsusi, is Eric gonna get it into .35 ? Linux won't be back until next week.18:35
tgardnerLinus*18:36
psusihe fired off the patch to the mailing lists and I guess it's waiting for Linus to apply18:36
psusibut since kernel freeze starts today, I think we're going to have to apply it ourselves?18:36
tgardnerpsusi, ok, this issue won't get lost now that its milestoned. I would prefer to get the fix direct from Linus' tree18:37
psusiyes, but that won't happen because of the kernel freeze will it?18:37
tgardnerpsusi, we don't freeze the kernel until the official 2.6.35 is released18:37
psusiohh... I thought I read that today begins kernel freeze...18:38
tgardnerthis is just a momentary freeze in order to get A2 out the door18:38
psusiahhh18:38
tgardnerso there time yet18:38
tgardnerthere is*18:38
psusithen hopefully it will get into .35 final... though Eric said there is another deadlock possible, but I have yet to hit it18:39
psusihe's working on tracking that down now18:39
tgardnerccheney, http://kernel.ubuntu.com/~rtg/mainline/ae0f36f0b964caf916c2ffc9f84b28c0f91c18a2-maverick/linux-image-2.6.35-6-server_2.6.35-6.7_amd64.deb18:40
tgardnerccheney, do a 'sudo dmesg -c' right before you start your test18:40
keesmaks_, sconklin-gone, tgardner: building with intel_agp built-in did not fix it.  the bisect has resulted, finally, in this one: f1befe71fa7a79ab733011b045639d8d809924ad18:41
keesI'm building a maverick kernel with that reverted now...18:41
tgardnerkees, you have an older i945?18:41
keesyes18:41
keesG35, though, not G3318:42
tgardnerkees, you should ask Kyle about this. He did a bunch of i945 stuff a few years ago.18:42
keestgardner: I figured I'd poke Anholt since I think he's in my timezone.  (if I can find him)18:43
* tgardner lunches for a bit18:43
GrueMastertgardner: Ok, I am officially giving up trying to get apport data on this omap bug.  I have followed all the steps on https://help.ubuntu.com/community/ReportingBugs based on a headless system, but apport is now trying to collect data on my desktop system (different arch altogether).18:47
maks_I'd like to know what the fix for #554569 is18:54
maks_the linked rh and freedesktop bugs all point to different patches18:54
maks_oh the freedesktop bug finaly points to something interesting cool.18:56
* bjf will be back in a bit18:59
=== bjf is now known as bjf[afk]
tgardnerccheney, any update? 19:03
ccheneytgardner, not yet, sorry was at lunch19:10
ccheneytgardner, no response from my email and still looking into how to do the test with my own image19:11
tgardnerccheney, well, just rerun the original UEC failure case with this latest kernel so I can get some output from that patch19:12
ccheney2.6.35-6 ?19:12
tgardnerccheney, http://kernel.ubuntu.com/~rtg/mainline/ae0f36f0b964caf916c2ffc9f84b28c0f91c18a2-maverick/linux-image-2.6.35-6-server_2.6.35-6.7_amd64.deb19:13
ccheneyoh nm i see you sent me a new one19:13
ccheneytgardner, appears broken, is that expected?19:19
ccheneyi'll see how it does over all 10 tests but it already has one failure19:19
tgardnerccheney, yes, I expect it to be broken. I want the dmesg output19:20
ccheneyoh ok19:20
ccheney[  211.296957] /home/rtg/kern/maverick/kern-64/ubuntu-maverick/fs/read_write.c:849 should this happen?19:20
ccheneyi'll get the whole log once its done19:20
tgardnerccheney, I only need the results of one pass. _Before_ you start it do 'sudo dmesg -c' to clear the kernel buffer19:21
ccheneyoh oops, i need to redo it then19:21
ccheneyok as soon as it finishes the first pass i will get it to you19:22
ccheneyall i saw was once for one pass: [  365.268713] /home/rtg/kern/maverick/kern-64/ubuntu-maverick/fs/read_write.c:849 should this happen?19:23
ccheneythat was the entire output of dmesg during the first pass of the run19:24
tgardnerccheney, yeah, its some debug, but I think its harmless19:24
ccheneyoh that wasn't my question it was this "[  365.268713] /home/rtg/kern/maverick/kern-64/ubuntu-maverick/fs/read_write.c:849 should this happen?" exactly19:24
ccheney:)19:24
tgardnerccheney, oh, yes. its the debug I put in.19:24
ccheneyok19:25
ccheneytgardner, so what does the special output mean? :)19:30
tgardnerccheney, hang on, I'm spinning one more 19:30
ccheneyok19:30
tgardnerccheney, 10 minutes or so19:30
ccheneyok19:31
tgardnerccheney, http://kernel.ubuntu.com/~rtg/mainline/0a87a0c1b12f56bd556fd4506041966717c87fb1-maverick/linux-image-2.6.35-6-server_2.6.35-6.7_amd64.deb19:45
tgardnersame drill as before19:45
ccheneyok will test19:46
ccheneytgardner, i may have to disappear for an extended time in the next week, if so contact kirkland for how to proceed, i don't want to go into details in public19:46
ccheneytgardner, most of the server team knows the details though if it does happen so they can fill you in19:47
tgardnerccheney, ack19:47
ccheneytgardner, will be a medical emergency if it happens19:47
tgardnerccheney, you haven't disappeared already, have you?20:01
ccheneyno sorry20:04
ccheneyhad to talk my father in law to fill him in on my wife's status20:04
tgardnerccheney, I'm about EOD, but wanted to ponder the results of that last patch20:05
ccheneyok let me see if it finished running20:05
ccheneydoh i ran off to tell him before i rebooted and started it20:05
ccheneyit should take about 3-4m20:05
ccheneytgardner, [  129.980324] /home/rtg/kern/maverick/kern-64/ubuntu-maverick/fs/read_write.c:849 ffffffff816256e0 (null)20:11
ccheneyrepeated 20 times20:11
ccheneymost within a second, first was 50s before i'll pastebinit20:11
ccheneytgardner, http://pastebin.ubuntu.com/455139/20:12
tgardnerccheney, ack20:12
tgardnerccheney, hmm, I wonder if I restore the original lines there if it'll make a diff. you gonna be around for another 20 minutes?20:13
ccheneyyea20:14
tgardnerk, gimme 15 minutes20:14
ccheneytgardner, i'll be around the rest of the day unless emergency happens20:14
ccheneyi'll do my best to alert everyone to that case if time permits20:14
jjohansen-> Lunch20:18
tgardnerccheney, http://kernel.ubuntu.com/~rtg/mainline/afef312909fa10e603a05e030b2ee2feebde8d9f-maverick/linux-image-2.6.35-6-server_2.6.35-6.7_amd64.deb20:33
ccheneytesting20:34
tgardnerthis should be the last one.20:34
=== bjf[afk] is now known as bjf
tgardnerccheney, I left the debug in along with the restored code. It might just work.20:38
ccheneyok will see what happens20:38
ccheneytgardner, we got a bit of bad news, probably more to follow in the next week, but i'm still around today20:45
tgardnerccheney, I'm definitely outta here after this test.20:45
ccheneyok20:46
ccheneyjust one result from 1 loop20:46
ccheney[  193.874005] /home/rtg/kern/maverick/kern-64/ubuntu-maverick/fs/read_write.c:849 ffffffff816256e0 (null)20:46
tgardnerccheney, did the PAD error show up?20:47
ccheneyoh i forgot to check that, sorry :-\20:47
ccheneyno i think it is working20:47
ccheneyi'll rerun20:47
ccheneyi didn't clear my euca logs20:47
ccheneyit looks like it worked though20:47
ccheneyyea it worked, i checked the timestamp from my script vs the timestamp of the last error on the log20:48
ccheneyhaven't run an instance yet, but it seems to no longer show pad corrupted20:48
ccheneyyea no more pad corrupted error20:48
ccheneywhat did you change?20:49
tgardnerccheney, cool. I'll send a note to upstream about it later tonight. Add your results to the LP report.20:49
ccheneyok20:49
tgardnerccheney, I restored the 2 lines of code that made any substantive difference in the behavior of the patch20:49
ccheneyok20:52
tgardnerccheney, upstream email sent. I'm outta here.21:15

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!