/srv/irclogs.ubuntu.com/2012/06/19/#ubuntu-x.txt

=== wgrant_ is now known as wgrant
brycehRAOF, tjaalton: any concerns if I upload the mesa 8.0.3 merge to quantal now?02:01
RAOFI'm good with that.02:01
brycehalrighty02:02
brycehup it goes02:03
tjaaltonbryceh: yeah, thanks05:31
mlankhorsthm, does xorg 1.12 have the signal safe stuff?08:58
jcristauno08:59
jcristauiirc08:59
tjaaltonnot even a point-release?09:06
tjaaltonhmm was it the security fix or something else?09:06
tjaaltongot it now, apparently still under testing09:08
mlankhorstbut yeah updating my laptop now, might as well do a thorough sru of https://bugs.launchpad.net/ubuntu/precise/+source/xserver-xorg-input-synaptics/+bug/941953 and confirm it in quantal first :)09:18
ubottuLaunchpad bug 941953 in xserver-xorg-input-synaptics (Ubuntu Precise) "Xorg crashed with SIGSEGV in WriteToClient() with buf = 0x100000000 from ProcXIGetProperty()" [High,Triaged]09:18
mlankhorstlooks like my desk is getting too small already :)09:20
seb128mlankhorst, hey09:45
mlankhorsthey09:52
seb128mlankhorst, how are you?09:54
mlankhorstim good, you?09:54
seb128mlankhorst, I'm good thanks09:54
seb128mlankhorst, is there any reason you are not on #ubuntu-desktop? ;-)09:54
mlankhorstwoops must have left at one point and never rejoined09:54
seb128mlankhorst, ok, Laney is having login issues and we were looking for xorg expertise ;-)09:55
seb128is Xephyr segfaulting on precise for others as well?10:16
tjaaltonhow do you use it?10:18
seb128tjaalton, Xephyr :110:18
seb128DISPLAY=:1 somecommand10:18
seb128rather10:20
seb128- Xephyr :1 as my user10:20
seb128su testuser10:20
seb128DISPLAY=:1 gnome-settings-daemon10:20
seb128which leads to a10:20
seb128Backtrace:10:20
seb1280: Xephyr (xorg_backtrace+0x37) [0xea1107]10:20
seb1281: Xephyr (0xd02000+0x1a2e8a) [0xea4e8a]10:20
seb1282: (vdso) (__kernel_rt_sigreturn+0x0) [0x25640c]10:20
seb128every time10:21
jcristauget a debug build and use gdb?10:21
seb128no xserver-xephyr-dbgsym and xserver-xorg-core-dbg doesn't include it :-(10:25
seb128I guess I will need to rebuild xorg10:25
seb128http://pastebin.ubuntu.com/1048865/ is the non debug bt10:25
tjaaltonI ran xterm inside xephyr, and exiting it caused a similar segfault10:27
tjaaltonI think we have several of these reported against xserver10:29
seb128tjaalton, do you have a number I can track and maybe milestone for 12.04.1? ;-)10:30
tjaaltonbug 100962910:31
ubottuLaunchpad bug 1009629 in xorg-server (Ubuntu) "Xorg crashed with SIGSEGV in DeliverRawEvent()" [High,Confirmed] https://launchpad.net/bugs/100962910:31
seb128tjaalton, thanks10:32
jcristaupossibly https://lists.debian.org/debian-x/2012/05/msg00240.html?10:35
jcristausadly no bt in that bug though10:36
tjaaltonin the lp one? right10:36
tjaaltonbut does look similar10:36
jcristauyeah looks the same10:37
jcristauthere's a revert in 1.12-branch that fixes it10:37
tjaaltonoh10:37
jcristau58dfb13953af71021317b9d85230b1163198f03110:37
tjaaltonI'll check it out10:37
tjaaltonuh, there was an sru to _add_ that code10:39
tjaaltonseb128: can you reproduce it with -0ubuntu10.1?10:39
tjaaltonif you can find it..10:39
seb128tjaalton, let me try, do I need to downgrade only xserver-xephyr?10:40
tjaaltonseb128: maybe so, I think the code is builtin10:40
seb128tjaalton, https://launchpad.net/ubuntu/+source/xorg-server/2:1.11.4-0ubuntu10.1 has the binaries if you want btw10:40
tjaaltonyeah I'll try it as well10:41
seb128tjaalton, yeah, no segfault with that version10:41
tjaaltonconfirmed10:42
tjaaltonbad cnd :)10:43
tjaaltonwonder if there was another fix upstream for the original bug10:43
seb128cnd, no cookie for you!10:43
tjaaltonthanks jcristau for the pointer10:43
tjaaltonseb128: I added a note to the bug10:45
seb128tjaalton, thanks10:45
seb128tjaalton, who should be assigned to this bug? we want to make sure that regression is fixed before 12.04.110:46
mlankhorstseb128: pick me?10:47
seb128mlankhorst, done, thanks ;-)10:47
mlankhorstwe'll probably move to 1.13 though10:48
tjaaltonnot for precise10:48
RAOFThat's *totally* SRUable. No new features at all!10:49
tjaalton:)10:49
mlankhorsthehehe :P10:49
tjaaltonmaybe the original bug would need to be reopened as well10:49
tjaaltonbug 96884510:49
ubottuLaunchpad bug 968845 in xserver-xorg-input-synaptics (Ubuntu Quantal) "bcm5974 touchpad doesn't work after S3 on MacBookAir" [Medium,Fix released] https://launchpad.net/bugs/96884510:49
seb128while you guys are around10:52
seb128could somebody look at https://bugs.launchpad.net/ubuntu/+source/xorg-server/+bug/96289210:52
ubottuLaunchpad bug 962892 in xorg-server (Ubuntu) "Xorg crashed with SIGABRT in __assert_fail_base() unless clear compiz/unity settings" [High,Triaged]10:52
seb128it's getting quite some dups on launchpads and on errors.ubuntu.com10:52
tjaaltonwhat a messy bug10:57
tjaaltonlooks like the duplicate bot isn't that helpful there10:58
tjaaltonsome have fglrx loaded, mostly intel though10:58
mlankhorsttjaalton: the original report is a assert(0) in intel ddx10:58
mlankhorstso ignore the fglrx ones..10:58
tjaaltonright10:58
mlankhorstseb128: I'm going to take a look :)10:59
seb128mlankhorst, thanks10:59
tjaaltonyeah, i'll reboot instead10:59
seb128tjaalton, mlankhorst: you can check the reports on errors.ubuntu.com as well they might have useful infos11:01
seb128open http://errors.ubuntu.com, entry xserver-xorg-core in the entry and select "month" in the combo11:01
seb128it's the first bug listed, if you click on the function you have the list of individual reports11:02
mlankhorst00:02.0 VGA compatible controller [0300]: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller [8086:0116] (rev 09) (prog-if 00 [VGA controller])11:02
mlankhorstspecific error seems to be assert(0) when it can't identify the generation of bufmgr_gem->pci_device11:02
mlankhorstbut it should be recognising it as gen6 if I'm reading it right..11:10
tjaaltonit's also a hybrid system, like many on the bug11:29
tjaaltonof the dupes11:29
mlankhorsti noticed, but running updates now to test11:30
mlankhorsthttps://bugs.launchpad.net/ubuntu/+source/xorg-server/+bug/984189 is not hybrid though11:40
ubottuLaunchpad bug 962892 in xorg-server (Ubuntu) "duplicate for #984189 Xorg crashed with SIGABRT in __assert_fail_base() unless clear compiz/unity settings" [High,Triaged]11:40
mlankhorsthm getting a different bug instead11:43
tjaaltonyeah not all dupes had hybrid11:45
mlankhorstsome early corruption in xserver-xorg11:46
mlankhorstawesome :)11:46
tjaaltoni've unduped 98418911:50
tjaaltonlooks like another snb crasher, probably fixed already11:50
mlankhorstdamageRegionProcessPending11:57
tjaaltonyou can repro it?11:59
mlankhorstI'm hitting another bug12:01
mlankhorstdamageRegionProcessPending for some reason12:01
mlankhorstI'll see if I can attach valgrind again12:02
* mlankhorst looks for the signal patches again12:04
mlankhorstyay, worth it14:04
mlankhorst==1651==  Address 0xdfdfdfdfdfdfdff7 is not stack'd, malloc'd or (recently) free'd14:04
tjaaltoni sent three drm/i915 commits to stable@, makes at least my ivb stable :)14:05
mlankhorstso it seems my crash is caused by freed memory14:05
mlankhorstmy X server is called 'exec /usr/bin/valgrind --leak-resolution=high --malloc-fill=ef --free-fill=df /usr/bin/Xorg "$@" &> /home/mlankhorst/nfs/vg' :)14:07
mlankhorsthttp://pastebin.com/xg2yfXKs14:08
cndtjaalton, seb128, mlankhorst: only Jeremy Huddleston saw the crash on OS X, so I didn't bother with reverting the added patch14:11
cndwe will need to cherry-pick a couple of patches that fix it cleanly, IIRC14:11
seb128cnd, it happens every time you use Xephyr and close a client14:11
cndfun14:12
mlankhorstweird.. I'll try upstart xf86-video-intel14:12
cndmlankhorst, did you chat with RAOF about synaptics?14:13
tjaaltoncnd: btw, is the precise xserver missing some input fixes from 1.12.x? can't recall what it maps to14:13
tjaaltoni mean _other_ fixes ;)14:13
cndtjaalton, it maps to 1.12.1 + the patches in debian/patches14:13
mlankhorstcnd: ugh some other issues popped up in between, I wanted to reproduce it on this laptop first with valgrind but it dies in a new way I haven't seen before14:14
cndfor input14:14
tjaaltonoh right14:14
tjaaltonyeah14:14
mlankhorstI'll try the other one14:14
cndmlankhorst, do you have a macbook?14:14
mlankhorstnope14:14
cndmlankhorst, then there's a low chance you'll be able to reproduce it easily14:15
cndI can only reproduce it on a macbook air14:15
mlankhorstcnd: k14:15
mlankhorstwhich issue specifically?14:16
cndmlankhorst, when you close the lid, the screen interacts with the touchpad and causes many dancing touches14:17
cndand then the device is disabled for suspend14:18
cndthe touches aren't disposed of properly14:18
cndso on resume, some touches may be "stuck" as active14:18
mlankhorstah k14:21
seb128I guess you can probably fake that playing with the lid contact on a normal laptop :p14:23
mlankhorstwas thinking the same14:24
mlankhorstjust echo mem > /sys/power/state14:24
mlankhorstwait.. why does irssi have tab completion for that?14:24
Prf_JakobSo yeah, you guys need to blacklist the AMD SI driver from Unity.15:33
Prf_JakobNo cayman15:33
mlankhorsthm, just running X with valgrind is providing plenty of amusement..16:05
mlankhorstbryceh/RAOF: If we decide to push x 1.12, can we upstream the signal safe patches too and force some testing with valgrind?16:51
brycehmlankhorst, some patches can be upstreamed (and are on my todo list), but a few are not going to be acceptable upstream16:52
mlankhorstI mean, I was tracking why i915 was refusing to log in only to notice that upstream added another regression on top :s16:53
mlankhorstand seeing how many different bugs in x org are memory based it would be nice to have as feature..16:57
tjaaltonhow do you force testing with valgrind?-)16:59
mlankhorstcreate a shell script  X2 with contents:17:00
mlankhorstexec /usr/bin/valgrind --leak-resolution=high --malloc-fill=ef --free-fill=df /usr/bin/Xorg "$@" &>> /home/mlankhorst/nfs/vg.$(hostname)17:00
mlankhorstlook for any suspicious read/write errors or crashes17:00
mlankhorstbryceh: hm any thoughts on this ? http://pastebin.com/qFpTpMzx17:21
bryceh    drawableDamage(pDrawable);17:25
brycehhmm17:25
mlankhorstdf is my valgrind free-fill17:26
mlankhorstbut from whatI can tell it should only nuke that drawable if refcount drops to zero, which it did.17:30
brycehI wonder if it's caused by this cast:17:31
bryceh    return (char *) (*privates) + key->offset;17:31
mlankhorstdon't think so, it just looks like that's how it registers private data into it17:32
brycehor something.  the "Invalid read of size 8" errors are complaining about differences in variable sizes17:33
mlankhorstI think it's simply reffing the damage after the pixmap was freed, but I don't see how..17:33
seb128mlankhorst, I'm not a valgrind expert but --leak-check=full might help to stop where it was freed17:35
seb128mlankhorst, not sure if --leak-resolution=high does the same17:35
seb128mlankhorst, I just know that the one we listed is on our standard set of flags for desktop debugging17:35
mlankhorstseb128: no that's on exit17:35
seb128ok17:35
mlankhorst       --leak-check=<no|summary|yes|full> [default: summary]17:36
mlankhorst           When enabled, search for memory leaks when the client program finishes. If set to summary, it says how many leaks occurred. If set17:36
mlankhorst           to full or yes, it also gives details of each individual leak.17:36
mlankhorstI need to set --track-origins=yes though17:36
mlankhorstmore slowdown :)17:36
brycehmlankhorst, yeah not sure what's going on there.  If you got a reproducible case, might chat with ickle17:38
mlankhorstbryceh: yeah seems to happen on upstream intel too17:39
mlankhorstit's annoying since it blocks login17:41
mlankhorsthm just to be sure I'll try without valgrind patch17:58
mlankhorstbryceh: hm, at this point I'm not even 100% sure it's intel specific, I'll cut down on other options18:18
mlankhorsthttps://bugs.freedesktop.org/show_bug.cgi?id=51240 added :)18:34
ubottuFreedesktop bug 51240 in Driver/intel "[i915] crash in damageRegionProcessPending on login" [Normal,New: ]18:34
mlankhorsthm was afraid of that, some changes between x 1.11 and x 1.1220:28
Sarvattmlankhorst: i dont even see that commit in xserver at all20:32
mlankhorstSarvatt: well with the x 1.12 I uploaded to x-staging ppa things work, so I guess there's probably some truth to it20:33
brycehSarvatt, hey I'm looking at the gpu lockup udev rule.20:35
brycehcurrently we trigger on ERROR=1 from the kernel.  there is also a RESET=1 which apparently happens later but still prior to the reset20:35
brycehone of the Intel guys suggested moving to RESET=1 might eliminate the false gpu lockups20:36
Sarvattsounds right to me, we did it on ERROR=1 before to grab an intel_gpu_dump of the actual crash before but its automatically captured in debugfs now20:53
brycehright20:53
brycehSarvatt, ok...  I'll test it on a couple systems but expect to have it in within the week; ping me if you spot anything wonky... I think you tend to notice gpu weirdness before anyone else here :-)20:54
mlankhorstnight all :)20:59
* bryceh waves21:12

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!