mdz | jbarnes: did albert23's i915 debugfs stuff net any clues? | 00:00 |
---|---|---|
jbarnes | it seemed ok but I've got to go through each instruction with the docs next to me | 00:00 |
bryce_ | are the docs online? | 00:02 |
jbarnes | yeah | 00:02 |
jbarnes | at intellinuxgraphics.org | 00:02 |
bryce_ | ok thanks, I'll link to them | 00:02 |
jbarnes | trying to refine my debug patch a little | 00:02 |
mdz | ogasawara is working on building a module with jbarnes' patch for the jaunty kernel | 00:05 |
jbarnes | I'll be posting a new one shortly | 00:05 |
ogasawara | jbarnes: should I just wait for the new one? | 00:06 |
jbarnes | yeah give me a few minutes please | 00:06 |
mdz | bryce_: did you have any luck reproducing the bug with the script? | 00:10 |
bryce | oh yeah let me try that first | 00:10 |
bryce_ | 1280 * 4: syntax error in expression (error token is "1280 | 00:16 |
mdz | bryce_: it doesn't work under metacity | 00:17 |
mdz | it seems to configure workspaces differently, and I haven't bothered to make it work there | 00:17 |
bryce | aha | 00:18 |
bryce | I started it up, and *then* activated compiz | 00:18 |
bryce | it definitely is unhappy | 00:18 |
bryce | ok, froze | 00:18 |
bryce | ok, tooltime | 00:18 |
jbarnes | ogasawara: ok just posted an updated patch to the lp | 00:19 |
jbarnes | man my machine is failing hard | 00:20 |
ogasawara | jbarnes: ok cool | 00:20 |
mdeslaur | unfortunately, I can't help. I have an nvidia card. | 00:25 |
mdz | mdeslaur: oh, sorry, you were a false positive in my search | 00:27 |
jbarnes | oops had a register address wrong in that last patch | 00:31 |
jbarnes | not harmful, and that reg wasn't very interesting anyway | 00:31 |
ogasawara | jbarnes: I can fix it up really quick if it'll help | 00:32 |
jbarnes | IPEIR should be 0x2064 rather than 0x2088 | 00:33 |
ogasawara | jbarnes: np, I'll update it | 00:33 |
jbarnes | cool | 00:33 |
jbarnes | I'm checking the other ones now | 00:33 |
bryce | yay, got a freeze dump for you jbarnes | 00:44 |
jbarnes | oh yay :) | 00:44 |
mdz | it's after midnight here, I'm going to have to pick this up again tomorrow | 00:44 |
mdz | thanks so much to all of you for working this issue | 00:45 |
bryce | night mdz, thanks for your help as well | 00:45 |
mdz | please remember to update the bug report so that others can pick up where you leave off | 00:45 |
jbarnes | mdz: ok later, thanks | 00:45 |
bryce | jbarnes: http://launchpadlibrarian.net/25686110/freeze_dump.txt | 00:46 |
mdz | Buffer size too small in MI_STORE_DATA_INDEX (2 < 3) | 00:48 |
bryce | yeah | 00:49 |
bryce | interesting | 00:49 |
jbarnes | that might just be junk after the end of the ring | 00:49 |
jbarnes | the head pointer should indicate where the hang happened | 00:49 |
bryce | MI_NOOP ? | 00:51 |
jbarnes | yeah just padding | 00:51 |
jbarnes | lots of padding :) | 00:51 |
bryce | 3DSTATE_DRAWING_RECTANGLE sounds interesting | 00:52 |
jbarnes | bryce: see if you can get i915_interrupt too | 00:54 |
jbarnes | it's one of the debugfs files | 00:54 |
bryce | path? | 00:54 |
bryce | nevermind | 00:54 |
bryce | Interrupt enable: 00020053 | 00:56 |
bryce | Interrupt identity: 00000000 | 00:56 |
bryce | Interrupt mask: fffcdfac | 00:56 |
bryce | Pipe A stat: 00040000 | 00:56 |
bryce | Pipe B stat: 00000206 | 00:56 |
bryce | Interrupts received: 43680 | 00:56 |
bryce | Current sequence: 517905 | 00:56 |
bryce | Waiter sequence: 517991 | 00:56 |
bryce | IRQ sequence: 517746 | 00:56 |
bryce | jbarnes: uploaded tarball of all the files to the bug | 00:57 |
jbarnes | cool thanks | 00:58 |
jbarnes | bryce: those files are empty :p | 01:03 |
bryce | bah | 01:03 |
bryce | huh, you're right | 01:03 |
bryce | cp: reading `0/i915_batchbuffers': Cannot allocate memory | 01:04 |
jbarnes | bryce: wanna join #intel-gfx? anholt has questions | 01:05 |
kirkland | bryce: hey | 01:14 |
kirkland | bryce: i just upgraded mesa to 7.4-0ubuntu3 | 01:14 |
kirkland | bryce: is that the one thought to provide the 965+compiz fix? | 01:14 |
bryce | kirkland: yes | 01:14 |
kirkland | bryce: fwiw, i upgraded to that rebooted, reran mdz's repro.sh script; still hangs X | 01:14 |
bryce | ok | 01:15 |
bryce | bugger | 01:15 |
bryce | kirkland: there is now some additional debug tools I've packaged for ubuntu | 01:15 |
bryce | https://edge.launchpad.net/~ubuntu-x-swat/+archive/x-freeze-test | 01:15 |
bryce | requires installing a new kernel and new libdrm | 01:15 |
kirkland | bryce: okay, what are we trying to get out of this? | 01:17 |
bryce | kirkland: register dumps that the intel guys can grok to figure out what's gone wrong | 01:17 |
bryce | I posted my stuff on bug 359392 just now | 01:17 |
ubottu | Launchpad bug 359392 in xserver-xorg-video-intel "[i965] X freezes starting on April 3rd" [Critical,Triaged] https://launchpad.net/bugs/359392 | 01:17 |
bryce | a second set might give extra insight | 01:17 |
kirkland | bryce: okay, gimme a moment | 01:18 |
kirkland | bryce: doesn't look like intel-gpu-tools - 1.0-0ubuntu1 has built yet ... | 01:24 |
bryce | god I hate ppas | 01:24 |
bryce | there was an older ~pre2 there which is identical | 01:24 |
bryce | I guess ppa supersedes packages at point of upload, rather than waiting until the new thingee has built? | 01:25 |
bryce | anyway, looks like amd64 has built, so presumably i386 is nearly done | 01:25 |
kirkland | k, amd64 is what i need | 01:26 |
kirkland | bryce: which of the libdrm debs do i need? | 01:29 |
bryce | kirkland: if you add the apt sources.list lines from the ppa into your /etc/apt/sources.list, and do apt-get install intel-gtp-tools it will pull in exactly what it needs | 01:30 |
kirkland | bryce: perfect, thanks. | 01:30 |
kirkland | bryce: rebooting into this jankey environment :-) | 01:33 |
bryce | :-) | 01:33 |
kirkland | bryce: okay | 01:42 |
kirkland | bryce: X hung again, but slightly differently | 01:42 |
kirkland | bryce: i still have control of my mouse | 01:43 |
bryce | yeah that's typical | 01:43 |
kirkland | bryce: but keyboard is unresponsive | 01:43 |
kirkland | bryce: oh? hmm, i didn't rember that | 01:43 |
kirkland | bryce: okay, where do you want the freeze_dump.txt ? | 01:43 |
bryce | bug 359392 | 01:43 |
ubottu | Launchpad bug 359392 in xserver-xorg-video-intel "[i965] X freezes starting on April 3rd" [Critical,Triaged] https://launchpad.net/bugs/359392 | 01:43 |
kirkland | bryce: k | 01:44 |
bryce | maybe snag also the files in /sys/kernel/debug/dri/0/i915* while you're at it | 01:44 |
kirkland | bryce: sure | 01:47 |
kirkland | bryce: done | 01:49 |
=== rickspencer3 is now known as rickspencer3-afk | ||
bryce | kirkland: thanks! | 01:49 |
kirkland | https://bugs.edge.launchpad.net/ubuntu/+source/xserver-xorg-video-intel/+bug/359392/comments/102 | 01:49 |
bryce | kirkland: go ahead and restore your system to normal, I think that'll do it for now. | 01:49 |
ubottu | Ubuntu bug 359392 in xserver-xorg-video-intel "[i965] X freezes starting on April 3rd" [Critical,Triaged] | 01:49 |
kirkland | https://bugs.edge.launchpad.net/ubuntu/+source/xserver-xorg-video-intel/+bug/359392/comments/103 | 01:49 |
bryce | god that bug's getting long :-) | 01:50 |
kirkland | bryce: yeah, i'm not jealous, i promise | 01:50 |
* kirkland thinks bryce has one of the hardest jobs in ubuntu :-) | 01:50 | |
pwnguin | how much of the -x team's work is writing code versus testing / filing reports for intel? | 01:51 |
pwnguin | seems like a lot of what goes on is wrangling patches from various sources | 01:51 |
bryce | kirkland: heh thanks | 01:52 |
bryce | pwnguin: pretty much | 01:52 |
bryce | pwnguin: most of the X coding is limited to reworking upstream patches to apply to our stuff, or little quirks or enable/disable options and stuff | 01:52 |
bryce | pwnguin: the bug load is just too high to be able to spend much time doing coding for any one particular issue | 01:53 |
bryce | unless it's either really important, or really trivial | 01:53 |
pwnguin | heh | 01:54 |
pwnguin | indeed, it kinda seems like the worst thing that can happen to a bug is to be triaged medium | 01:55 |
bryce | I have been doing a fair bit of scripting type coding, for support tools and so forth | 01:55 |
pwnguin | low priority bugs seem to have tiny fixes | 01:55 |
jbarnes | ok I have to go home now too.. I'll leave my machine running (still hasn't hung with mdz's test script even after suspend/resume) | 01:55 |
bryce | jbarnes: any leads from the info collected so far? | 01:56 |
bryce | pwnguin: to be honest I mostly ignore the priorities on bugs | 01:56 |
jbarnes | my best guess is that we're actually looking at a few bugs here | 01:56 |
jbarnes | but until we can accurately reproduce things we won't know for sure | 01:57 |
bryce | hah, my gut was right | 01:57 |
bryce | jbarnes: yes I've sensed since the start that there are more than a couple bugs going on | 01:57 |
=== eric__ is now known as LLStarks | ||
bryce | pwnguin: what I do is skim for bugs with patches, git commit ids, backtraces, fixed-upstream, needs-sponsoring, or other indications that the bug is "ripe". I'll pick those off regardless of priority set | 01:58 |
LLStarks | bryce what is this delicious new ppa you've created? | 01:58 |
bryce | LLStarks: https://edge.launchpad.net/~ubuntu-x-swat/+archive/x-freeze-test | 01:58 |
LLStarks | i see | 01:58 |
ogasawara | bryce, jbarnes: I've attempted testing with the updated patch, but my system is locking up hard | 02:23 |
ogasawara | bryce: could you give it a try and see if you get the same? | 02:23 |
ogasawara | bryce: I've posted a comment to the bug | 02:23 |
ogasawara | bryce: I gotta drop off, but will be checking back in later. feel free to email if you need anything else. | 02:42 |
=== rickspencer3-afk is now known as rickspencer3 | ||
=== rickspencer3 is now known as rickspencer3-bbl | ||
=== rickspencer3-bbl is now known as rickspencer3 | ||
Mirv | tjaalton: I wasn't either able to crash ith mem=1G, ran rotate-forever for the night | 07:43 |
Mirv | tjaalton: what about resolutions btw, I've 1400x900, does your and dustin's machine have same screen? | 07:47 |
Mirv | oh, I guess I shouldn't update compiz since now all 2a02:s are blacklisted | 07:49 |
tjaalton | Mirv: I have a 10x7 screen | 08:16 |
=== Sarvatt_ is now known as Sarvatt | ||
Mirv | ok, there is simply nothing to differentiate non-crashing 2a02 rev 0c:s from crashing ones :( | 08:27 |
Mirv | or maybe if taking a hard look at lspci, but still | 08:27 |
tjaalton | I'll try the script now | 08:29 |
tjaalton | heh, dholbach has a X61s (very similar to X61) and no problems | 08:30 |
tjaalton | maybe it's the models they sell in the US that have problems :) | 08:31 |
Mirv | maybe it's 110V that causes the crashes ;) | 08:33 |
Mirv | then we could disable compiz based on country ;) | 08:33 |
tjaalton | ooh, must be.. power drainage | 08:33 |
tjaalton | sweet, now it crashed | 08:37 |
tjaalton | running repro.sh | 08:37 |
tjaalton | it got to the fourth workspace | 08:38 |
Sarvatt | do the crashing ones have dual channel memory by any chance? | 08:40 |
tjaalton | this is a laptop, dunno | 08:41 |
tjaalton | sigh, for some reason reboot doesn't actually reboot, it'll just bounce back to runlevel 2 | 08:41 |
bryce | I got frozen again | 08:41 |
bryce | this time I did it the old fashioned way, just used it for several hours | 08:42 |
bryce | this was after updating to today's mesa | 08:42 |
bryce | I expected it would freeze, but what was curious was that it took 4 hours to do | 08:42 |
bryce | anyway, I posted the intel_gpu_dump and etc. data onto the bug report | 08:43 |
bryce | it seems to be considerably different from the output I got when freezing with the repro.sh script | 08:43 |
bryce | so I wonder if that means they're independent bugs | 08:43 |
bryce | jbarnes: ^ is that a correct conclusion to draw? | 08:44 |
bryce | or can the same root bug cause differing intel_gpu_dump output? | 08:44 |
tjaalton | heh, after shutdown compiz refuses to work | 08:46 |
Mirv | tjaalton: 2a02 was blacklisted... | 08:47 |
tjaalton | gah.. | 08:47 |
bryce | there is a config thingee to force it | 08:47 |
Mirv | but right, I had missed repro.sh, I have it running now though not getting a crash | 08:47 |
Mirv | I just edited /usr/bin/compiz and commented out the 2a02 line | 08:47 |
bryce | sudo gedit .config/compiz/compiz-manager | 08:48 |
bryce | and add this line: | 08:48 |
bryce | SKIP_CHECKS=yes | 08:48 |
bryce | we should probably recommend testers enable compiz that way, rather than editing the script | 08:48 |
tjaalton | I edited the wrapper like Mirv | 08:49 |
tjaalton | last time it hung before finishing the first cycle, but not this time | 08:49 |
tjaalton | I'll let it run for a while | 08:49 |
Mirv | should the repro.sh be visibly able to switch between ws 4 and 0? for me the visible thing is that it stays at workspace 0, but I just cannot change the workspace since it always rotates back | 08:50 |
Mirv | oh, later comment, excepts 1x6 workspace | 08:53 |
bryce | yeah has to be 1x6 | 08:53 |
bryce | are you two on the 2.6.30 kernel with the intel_gpu_tools stuff installed? | 08:54 |
tjaalton | you have to have 1x6 grid | 08:54 |
tjaalton | ok, hung again | 08:54 |
tjaalton | not yet | 08:54 |
tjaalton | this is stock jaunty | 08:54 |
tjaalton | what ppa was it again | 08:54 |
tjaalton | ? | 08:54 |
Mirv | http://kernel.ubuntu.com/~kernel-ppa/mainline/v2.6.30-rc2/ | 08:54 |
bryce | https://edge.launchpad.net/~ubuntu-x-swat/+archive/x-freeze-test/ | 08:54 |
Mirv | and not yet | 08:55 |
Mirv | oh right the intel_gpu_tools | 08:55 |
bryce | I've tried to make the directions fairly paint by number but let me know if I missed anything | 08:55 |
tjaalton | heh, intel_gpu_top is funny | 09:01 |
tjaalton | every task is 100% when it's hung | 09:01 |
tjaalton | well, makes sense I guess | 09:01 |
Mirv | tjaalton: so you managed to crash easily with repro.sh? are you still using the mem=1G? I'm not getting a crash but using also full 2G now again. | 09:03 |
tjaalton | Mirv: not using mem=1G and it crashes, yes | 09:04 |
Mirv | hmmh | 09:04 |
=== Sarvatt_ is now known as Sarvatt | ||
tjaalton | Mirv: are you running the stock mesa? | 09:06 |
=== Sarvatt_ is now known as Sarvatt | ||
Mirv | tjaalton: yes, ubuntu3 now, earlier was that ubuntu2~bug359392 | 09:19 |
bryce | those two should be basically identical | 09:20 |
Mirv | I've done a couple of suspends as well while running the script, no problems | 09:22 |
tjaalton | ok I got a dump | 09:26 |
tjaalton | next I'll try without patch 104 | 09:29 |
tjaalton | no difference | 09:37 |
Mirv | still running smoothly, ca. 10 suspend-resumes done as well | 09:45 |
tjaalton | next I'll build mesa 7.3 and try repro.sh with it | 09:49 |
mdz | morning folks | 11:09 |
tjaalton | morning mdz | 11:10 |
mdz | tjaalton: it appears the mesa patch was not to blame for this particular set of bugs | 11:11 |
tjaalton | mdz: no, but neither was the mesa upgrade ;) | 11:12 |
tjaalton | 7.3 froze immediately | 11:12 |
tjaalton | just finished testing it | 11:12 |
tjaalton | repro.sh is probably a bit too ruthless | 11:13 |
mdz | yes, it's useful to have a case which freezes, but I worry that it may not be the same bug which people encounter in normal usage | 11:13 |
tjaalton | right | 11:14 |
mdz | the visual artifacts are the same, though | 11:14 |
mdz | I think it's reasonably likely | 11:14 |
mdz | it sounds like there is one bug which is fixed by reverting the mesa patch, but it is not the one experienced by most people in the bug report | 11:47 |
ogra | mdz, i'm pretty sure i saw the issue way earlier | 11:48 |
ogra | like around the berlin sprint | 11:48 |
mdz | ogra: which one is "the" issue? | 11:48 |
mdz | we are certain at this point that there are multiple bugs which can cause X to freeze | 11:49 |
ogra | for me thats the alt-tab animation being delayed by about two seconds, workspace switching being very slow, freezes every two days | 11:49 |
ogra | when i had enabled UXA (which i did to overcome the first two) i could see my swapspace being eaten, after two/three days my system hit OOM | 11:50 |
mdz | ogra: those are at least two separate bugs | 11:51 |
ogra | right, and UXA isnt relevant atm anyway, i switched that off yesterday | 11:51 |
ogra | but i remember that my X exposed issues in berlin already | 11:52 |
ogra | and it didnt change much since | 11:52 |
tjaalton | the delayed animations one is just generic slowness with the -intel driver jaunty has | 11:58 |
tjaalton | and UXA has memleaks, probably affects EXA too but not as much | 11:59 |
ogra | yeah, i still see zero swap used since i switched off UXA | 12:02 |
ogra | running constantly my system used to use about 2G swap after a day before ... | 12:03 |
ogra | the genric slowness came together with the freezes here though | 12:03 |
tjaalton | that's weird | 12:04 |
tjaalton | sure it's not after turning UXA off? | 12:04 |
mdz | ogra: it's changed a lot since then actually, including a whole new upstream mesa | 12:04 |
ogra | i turned off UXA yesterday, i had my first freezes shortly after the sprint | 12:04 |
ogra | and turned on UXA about two to three weeks after the sprint | 12:05 |
* ogra turns off all important apps and runs the testscript now | 12:07 | |
Sarvatt | for what it's worth, i'm running mesa 7.5 (with only 03 patch in the debian/ubuntu series), xserver-xorg-video-intel 2.7.99/libdrm-2.4.9 on 2.6.30-rc2 with the latest drm-intel-next stuff from a few hours ago and UXA (with and without KMS) is still freezing with alot of compositing activity on my 945GME. With the ubuntu packages I had hard freezes usually when alt-tabbing or opening a new gnome-terminal but with my packages the mouse i | 12:10 |
Sarvatt | s still moveable but the same things trigger it. updating just libdrm/-intel/kernel alone had the same exact crashes as ubuntu packages, the mesa upgrade is the only thing that changed it at all. | 12:10 |
ogra | hmm, glxgears eats a complete CPU core ... it feels like i have no GL accel at all and its software rendering | 12:14 |
ogra | and shouldnt it constantly switch workspaces ? | 12:15 |
ogra | it only did one run and left me on the first then | 12:16 |
tjaalton | you have a 1x6 grid? | 12:17 |
ogra | nope, all default | 12:17 |
ogra | 4 workspaces | 12:17 |
tjaalton | you need to have that | 12:17 |
tjaalton | one row | 12:18 |
ogra_babbage | yup, that helped | 12:21 |
* ogra_babbage is impressed how good pulse copes ... not the slightest coppyness with the video sound | 12:22 | |
ogra_babbage | *choppyness | 12:22 |
albert23 | With Virtual 2048 2048 in display subsection in xorg.conf I could run the test script without freezing for more then an hour | 12:41 |
albert23 | After removing that etting X froze again in less then 2 minutes | 12:41 |
albert23 | I originally used it to solve the slow cube rotation. Effect is that you get more EXA offscreen memory | 12:42 |
albert23 | For me EXA offscreen memory goes up from 19 to 49 MB | 12:44 |
jbarnes | bryce: it's possible we're seeing different gpu dumps but still the same root cause | 16:56 |
jbarnes | bryce, mdz: ran the test script all night with stock jaunty bits from yesterday, no hang, even did a suspend/resume shortly after starting it | 17:06 |
jbarnes | unfortuantely I didn't have the right number of virtual desktops so I've got to start it over this morning :/ | 17:07 |
Mirv | jbarnes: do you see any sense in albert23's suggestions of Virtual setting (increasing EXA offscreen memory)? at least that would be worth trying also by others who experience the hang easily | 17:48 |
Mirv | albert23: if you don't mind, I quote you in the bug report? | 17:48 |
Mirv | (oh well, these are publicly logged channels anyway) | 17:49 |
Mirv | so quoted :) | 17:49 |
ogra_babbage | tell them also that sitting in front of your screen for to long while the script runs might harm your stomack | 17:52 |
ogra_babbage | :) | 17:53 |
Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!