/srv/irclogs.ubuntu.com/2012/06/19/#ubuntu-x.txt

=== wgrant_ is now known as wgrant
bryceh	RAOF, tjaalton: any concerns if I upload the mesa 8.0.3 merge to quantal now?	02:01
RAOF	I'm good with that.	02:01
bryceh	alrighty	02:02
bryceh	up it goes	02:03
tjaalton	bryceh: yeah, thanks	05:31
mlankhorst	hm, does xorg 1.12 have the signal safe stuff?	08:58
jcristau	no	08:59
jcristau	iirc	08:59
tjaalton	not even a point-release?	09:06
tjaalton	hmm was it the security fix or something else?	09:06
tjaalton	got it now, apparently still under testing	09:08
mlankhorst	but yeah updating my laptop now, might as well do a thorough sru of https://bugs.launchpad.net/ubuntu/precise/+source/xserver-xorg-input-synaptics/+bug/941953 and confirm it in quantal first :)	09:18
ubottu	Launchpad bug 941953 in xserver-xorg-input-synaptics (Ubuntu Precise) "Xorg crashed with SIGSEGV in WriteToClient() with buf = 0x100000000 from ProcXIGetProperty()" [High,Triaged]	09:18
mlankhorst	looks like my desk is getting too small already :)	09:20
seb128	mlankhorst, hey	09:45
mlankhorst	hey	09:52
seb128	mlankhorst, how are you?	09:54
mlankhorst	im good, you?	09:54
seb128	mlankhorst, I'm good thanks	09:54
seb128	mlankhorst, is there any reason you are not on #ubuntu-desktop? ;-)	09:54
mlankhorst	woops must have left at one point and never rejoined	09:54
seb128	mlankhorst, ok, Laney is having login issues and we were looking for xorg expertise ;-)	09:55
seb128	is Xephyr segfaulting on precise for others as well?	10:16
tjaalton	how do you use it?	10:18
seb128	tjaalton, Xephyr :1	10:18
seb128	DISPLAY=:1 somecommand	10:18
seb128	rather	10:20
seb128	- Xephyr :1 as my user	10:20
seb128	su testuser	10:20
seb128	DISPLAY=:1 gnome-settings-daemon	10:20
seb128	which leads to a	10:20
seb128	Backtrace:	10:20
seb128	0: Xephyr (xorg_backtrace+0x37) [0xea1107]	10:20
seb128	1: Xephyr (0xd02000+0x1a2e8a) [0xea4e8a]	10:20
seb128	2: (vdso) (__kernel_rt_sigreturn+0x0) [0x25640c]	10:20
seb128	every time	10:21
jcristau	get a debug build and use gdb?	10:21
seb128	no xserver-xephyr-dbgsym and xserver-xorg-core-dbg doesn't include it :-(	10:25
seb128	I guess I will need to rebuild xorg	10:25
seb128	http://pastebin.ubuntu.com/1048865/ is the non debug bt	10:25
tjaalton	I ran xterm inside xephyr, and exiting it caused a similar segfault	10:27
tjaalton	I think we have several of these reported against xserver	10:29
seb128	tjaalton, do you have a number I can track and maybe milestone for 12.04.1? ;-)	10:30
tjaalton	bug 1009629	10:31
ubottu	Launchpad bug 1009629 in xorg-server (Ubuntu) "Xorg crashed with SIGSEGV in DeliverRawEvent()" [High,Confirmed] https://launchpad.net/bugs/1009629	10:31
seb128	tjaalton, thanks	10:32
jcristau	possibly https://lists.debian.org/debian-x/2012/05/msg00240.html?	10:35
jcristau	sadly no bt in that bug though	10:36
tjaalton	in the lp one? right	10:36
tjaalton	but does look similar	10:36
jcristau	yeah looks the same	10:37
jcristau	there's a revert in 1.12-branch that fixes it	10:37
tjaalton	oh	10:37
jcristau	58dfb13953af71021317b9d85230b1163198f031	10:37
tjaalton	I'll check it out	10:37
tjaalton	uh, there was an sru to _add_ that code	10:39
tjaalton	seb128: can you reproduce it with -0ubuntu10.1?	10:39
tjaalton	if you can find it..	10:39
seb128	tjaalton, let me try, do I need to downgrade only xserver-xephyr?	10:40
tjaalton	seb128: maybe so, I think the code is builtin	10:40
seb128	tjaalton, https://launchpad.net/ubuntu/+source/xorg-server/2:1.11.4-0ubuntu10.1 has the binaries if you want btw	10:40
tjaalton	yeah I'll try it as well	10:41
seb128	tjaalton, yeah, no segfault with that version	10:41
tjaalton	confirmed	10:42
tjaalton	bad cnd :)	10:43
tjaalton	wonder if there was another fix upstream for the original bug	10:43
seb128	cnd, no cookie for you!	10:43
tjaalton	thanks jcristau for the pointer	10:43
tjaalton	seb128: I added a note to the bug	10:45
seb128	tjaalton, thanks	10:45
seb128	tjaalton, who should be assigned to this bug? we want to make sure that regression is fixed before 12.04.1	10:46
mlankhorst	seb128: pick me?	10:47
seb128	mlankhorst, done, thanks ;-)	10:47
mlankhorst	we'll probably move to 1.13 though	10:48
tjaalton	not for precise	10:48
RAOF	That's totally SRUable. No new features at all!	10:49
tjaalton	:)	10:49
mlankhorst	hehehe :P	10:49
tjaalton	maybe the original bug would need to be reopened as well	10:49
tjaalton	bug 968845	10:49
ubottu	Launchpad bug 968845 in xserver-xorg-input-synaptics (Ubuntu Quantal) "bcm5974 touchpad doesn't work after S3 on MacBookAir" [Medium,Fix released] https://launchpad.net/bugs/968845	10:49
seb128	while you guys are around	10:52
seb128	could somebody look at https://bugs.launchpad.net/ubuntu/+source/xorg-server/+bug/962892	10:52
ubottu	Launchpad bug 962892 in xorg-server (Ubuntu) "Xorg crashed with SIGABRT in __assert_fail_base() unless clear compiz/unity settings" [High,Triaged]	10:52
seb128	it's getting quite some dups on launchpads and on errors.ubuntu.com	10:52
tjaalton	what a messy bug	10:57
tjaalton	looks like the duplicate bot isn't that helpful there	10:58
tjaalton	some have fglrx loaded, mostly intel though	10:58
mlankhorst	tjaalton: the original report is a assert(0) in intel ddx	10:58
mlankhorst	so ignore the fglrx ones..	10:58
tjaalton	right	10:58
mlankhorst	seb128: I'm going to take a look :)	10:59
seb128	mlankhorst, thanks	10:59
tjaalton	yeah, i'll reboot instead	10:59
seb128	tjaalton, mlankhorst: you can check the reports on errors.ubuntu.com as well they might have useful infos	11:01
seb128	open http://errors.ubuntu.com, entry xserver-xorg-core in the entry and select "month" in the combo	11:01
seb128	it's the first bug listed, if you click on the function you have the list of individual reports	11:02
mlankhorst	00:02.0 VGA compatible controller [0300]: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller [8086:0116] (rev 09) (prog-if 00 [VGA controller])	11:02
mlankhorst	specific error seems to be assert(0) when it can't identify the generation of bufmgr_gem->pci_device	11:02
mlankhorst	but it should be recognising it as gen6 if I'm reading it right..	11:10
tjaalton	it's also a hybrid system, like many on the bug	11:29
tjaalton	of the dupes	11:29
mlankhorst	i noticed, but running updates now to test	11:30
mlankhorst	https://bugs.launchpad.net/ubuntu/+source/xorg-server/+bug/984189 is not hybrid though	11:40
ubottu	Launchpad bug 962892 in xorg-server (Ubuntu) "duplicate for #984189 Xorg crashed with SIGABRT in __assert_fail_base() unless clear compiz/unity settings" [High,Triaged]	11:40
mlankhorst	hm getting a different bug instead	11:43
tjaalton	yeah not all dupes had hybrid	11:45
mlankhorst	some early corruption in xserver-xorg	11:46
mlankhorst	awesome :)	11:46
tjaalton	i've unduped 984189	11:50
tjaalton	looks like another snb crasher, probably fixed already	11:50
mlankhorst	damageRegionProcessPending	11:57
tjaalton	you can repro it?	11:59
mlankhorst	I'm hitting another bug	12:01
mlankhorst	damageRegionProcessPending for some reason	12:01
mlankhorst	I'll see if I can attach valgrind again	12:02
* mlankhorst looks for the signal patches again		12:04
mlankhorst	yay, worth it	14:04
mlankhorst	==1651== Address 0xdfdfdfdfdfdfdff7 is not stack'd, malloc'd or (recently) free'd	14:04
tjaalton	i sent three drm/i915 commits to stable@, makes at least my ivb stable :)	14:05
mlankhorst	so it seems my crash is caused by freed memory	14:05
mlankhorst	my X server is called 'exec /usr/bin/valgrind --leak-resolution=high --malloc-fill=ef --free-fill=df /usr/bin/Xorg "$@" &> /home/mlankhorst/nfs/vg' :)	14:07
mlankhorst	http://pastebin.com/xg2yfXKs	14:08
cnd	tjaalton, seb128, mlankhorst: only Jeremy Huddleston saw the crash on OS X, so I didn't bother with reverting the added patch	14:11
cnd	we will need to cherry-pick a couple of patches that fix it cleanly, IIRC	14:11
seb128	cnd, it happens every time you use Xephyr and close a client	14:11
cnd	fun	14:12
mlankhorst	weird.. I'll try upstart xf86-video-intel	14:12
cnd	mlankhorst, did you chat with RAOF about synaptics?	14:13
tjaalton	cnd: btw, is the precise xserver missing some input fixes from 1.12.x? can't recall what it maps to	14:13
tjaalton	i mean _other_ fixes ;)	14:13
cnd	tjaalton, it maps to 1.12.1 + the patches in debian/patches	14:13
mlankhorst	cnd: ugh some other issues popped up in between, I wanted to reproduce it on this laptop first with valgrind but it dies in a new way I haven't seen before	14:14
cnd	for input	14:14
tjaalton	oh right	14:14
tjaalton	yeah	14:14
mlankhorst	I'll try the other one	14:14
cnd	mlankhorst, do you have a macbook?	14:14
mlankhorst	nope	14:14
cnd	mlankhorst, then there's a low chance you'll be able to reproduce it easily	14:15
cnd	I can only reproduce it on a macbook air	14:15
mlankhorst	cnd: k	14:15
mlankhorst	which issue specifically?	14:16
cnd	mlankhorst, when you close the lid, the screen interacts with the touchpad and causes many dancing touches	14:17
cnd	and then the device is disabled for suspend	14:18
cnd	the touches aren't disposed of properly	14:18
cnd	so on resume, some touches may be "stuck" as active	14:18
mlankhorst	ah k	14:21
seb128	I guess you can probably fake that playing with the lid contact on a normal laptop :p	14:23
mlankhorst	was thinking the same	14:24
mlankhorst	just echo mem > /sys/power/state	14:24
mlankhorst	wait.. why does irssi have tab completion for that?	14:24
Prf_Jakob	So yeah, you guys need to blacklist the AMD SI driver from Unity.	15:33
Prf_Jakob	No cayman	15:33
mlankhorst	hm, just running X with valgrind is providing plenty of amusement..	16:05
mlankhorst	bryceh/RAOF: If we decide to push x 1.12, can we upstream the signal safe patches too and force some testing with valgrind?	16:51
bryceh	mlankhorst, some patches can be upstreamed (and are on my todo list), but a few are not going to be acceptable upstream	16:52
mlankhorst	I mean, I was tracking why i915 was refusing to log in only to notice that upstream added another regression on top :s	16:53
mlankhorst	and seeing how many different bugs in x org are memory based it would be nice to have as feature..	16:57
tjaalton	how do you force testing with valgrind?-)	16:59
mlankhorst	create a shell script X2 with contents:	17:00
mlankhorst	exec /usr/bin/valgrind --leak-resolution=high --malloc-fill=ef --free-fill=df /usr/bin/Xorg "$@" &>> /home/mlankhorst/nfs/vg.$(hostname)	17:00
mlankhorst	look for any suspicious read/write errors or crashes	17:00
mlankhorst	bryceh: hm any thoughts on this ? http://pastebin.com/qFpTpMzx	17:21
bryceh	drawableDamage(pDrawable);	17:25
bryceh	hmm	17:25
mlankhorst	df is my valgrind free-fill	17:26
mlankhorst	but from whatI can tell it should only nuke that drawable if refcount drops to zero, which it did.	17:30
bryceh	I wonder if it's caused by this cast:	17:31
bryceh	return (char ) (privates) + key->offset;	17:31
mlankhorst	don't think so, it just looks like that's how it registers private data into it	17:32
bryceh	or something. the "Invalid read of size 8" errors are complaining about differences in variable sizes	17:33
mlankhorst	I think it's simply reffing the damage after the pixmap was freed, but I don't see how..	17:33
seb128	mlankhorst, I'm not a valgrind expert but --leak-check=full might help to stop where it was freed	17:35
seb128	mlankhorst, not sure if --leak-resolution=high does the same	17:35
seb128	mlankhorst, I just know that the one we listed is on our standard set of flags for desktop debugging	17:35
mlankhorst	seb128: no that's on exit	17:35
seb128	ok	17:35
mlankhorst	--leak-check=<no\|summary\|yes\|full> [default: summary]	17:36
mlankhorst	When enabled, search for memory leaks when the client program finishes. If set to summary, it says how many leaks occurred. If set	17:36
mlankhorst	to full or yes, it also gives details of each individual leak.	17:36
mlankhorst	I need to set --track-origins=yes though	17:36
mlankhorst	more slowdown :)	17:36
bryceh	mlankhorst, yeah not sure what's going on there. If you got a reproducible case, might chat with ickle	17:38
mlankhorst	bryceh: yeah seems to happen on upstream intel too	17:39
mlankhorst	it's annoying since it blocks login	17:41
mlankhorst	hm just to be sure I'll try without valgrind patch	17:58
mlankhorst	bryceh: hm, at this point I'm not even 100% sure it's intel specific, I'll cut down on other options	18:18
mlankhorst	https://bugs.freedesktop.org/show_bug.cgi?id=51240 added :)	18:34
ubottu	Freedesktop bug 51240 in Driver/intel "[i915] crash in damageRegionProcessPending on login" [Normal,New: ]	18:34
mlankhorst	hm was afraid of that, some changes between x 1.11 and x 1.12	20:28
Sarvatt	mlankhorst: i dont even see that commit in xserver at all	20:32
mlankhorst	Sarvatt: well with the x 1.12 I uploaded to x-staging ppa things work, so I guess there's probably some truth to it	20:33
bryceh	Sarvatt, hey I'm looking at the gpu lockup udev rule.	20:35
bryceh	currently we trigger on ERROR=1 from the kernel. there is also a RESET=1 which apparently happens later but still prior to the reset	20:35
bryceh	one of the Intel guys suggested moving to RESET=1 might eliminate the false gpu lockups	20:36
Sarvatt	sounds right to me, we did it on ERROR=1 before to grab an intel_gpu_dump of the actual crash before but its automatically captured in debugfs now	20:53
bryceh	right	20:53
bryceh	Sarvatt, ok... I'll test it on a couple systems but expect to have it in within the week; ping me if you spot anything wonky... I think you tend to notice gpu weirdness before anyone else here :-)	20:54
mlankhorst	night all :)	20:59
* bryceh waves		21:12

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!