/srv/irclogs.ubuntu.com/2010/01/15/#ubuntu-x.txt

Sarvatthttp://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=0e253fdb3b5739fd8514f617ec582762bcfaea4800:01
Sarvatt:/00:01
Sarvattpasted that in the wrong place, sorry00:03
persiaGood day.  I was pointed at the plymouth FTBFS, and it dies in ./configure with "configure:11953: error: Package requirements (libdrm libdrm_intel libdrm_radeon libdrm_nouveau) were not met" for powerpc and armel (the only architectures from the FTBFS list I can test).01:49
persiaDoes anyone know what modules should replace that, or am I in the wrong place (drm and X are linked in my head for some reason)01:50
Sarvattit's the libdrm_intel thats the problem for sure, only builds on amd64/i386 for obvious reasons. libdrm-nouveau/radeon build on all arches. it was added here http://cgit.freedesktop.org/plymouth/commit/?id=f2048af97dcc862dbbb587a0cc2546ddbdbd2b0c02:13
persiaSarvatt: So basically either libdrm-intel needs to be ported to other architectures, or ./configure needs to know not to look for it on those architectures?02:14
Sarvattwould have to conditionalize the intel drm renderer build in the plymouth build system I think?02:15
persiaThat avoids porting :)02:15
persiaAnd if I wanted to use plymouth with SiS, I'd need to do something like the patch you pasted to enable that?02:16
Sarvattit needs a KMS driver to work that way :( you could use a vesa FB though02:18
persiaHrm, that's probably beyond me then :)  I do have a powerpc with radeon, so maybe I can test with that to cover just the don't-use-intel-for-sparc/powerpc/armel/ia64 patch.02:19
persiaThanks a lot for the guidance.02:20
Sarvattredhat might have done the work already, might be worth poking around their CVS02:20
SarvattKMS isnt enabled in the ubuntu powerpc kernel last I checked, I build my own for my ibook02:21
persiaheh.  First google result on "RedHat CVS" is an article entitled "CVS is out, Subversion is in"02:21
Sarvattfedora cvs sorry :)02:21
persiaSarvatt: It's probably not enabled due to lack of testing.  If it works, submit a patch to the ubuntu-kernel team.02:22
Sarvattpoking around there now too02:22
Sarvattno luck, they must just not use it on powerpc02:24
Sarvattor they build libdrm_intel on other arches02:24
persiaYeah.  BuildRequires: pkgconfig(libdrm_intel)02:24
Sarvatti dont think it would really hurt anything to just build libdrm-intel1 on linux-any?02:26
persiaLet me see if it works on powerpc02:26
Sarvatti mean, libdrm-nouveau and libdrm-radeon are linux-any and would never be used on most other arches02:26
persiaWhy linux-any rather than any?  The current list is "amd64 i386 kfreebsd-amd64 kfreebsd-i386"02:29
persiaWhy wouldn't nouveau or radeon be used on other arches?  I have a radeon in my powerpc box, and I know several were sold with nVidia cards.02:31
Sarvattoh, whoops :) really though it doesnt make sense for it to build other places since its limited to integrated graphics, at least the others can come in pci/agp form02:31
persiaMy powerpc radeon certainly looks like integrated graphics (older MacMini)02:31
persiaSarvatt: Good call.  libdrm-intel indeed seems to compile cleanly for at least powerpc.  That seems the easiest way forward (and those people who have intel graphics cards in their sparc/powerpc/armel/ia64 boxes can file bugs if they encounter issues)02:46
Sarvattfedora does build libdrm-intel on powerpc, just looked in their packages02:47
persiaExcellent: there's precedence, which always helps in convincing someone to upload.02:48
persiaI have a suspicion that it's a completely untested codepath, due to lack of hardware, but I don't imagine that it matters because of the same lack of hardware.02:49
Sarvattthey cant ever have intel graphics in their powerpc/sparc/arm/ia64 is the problem02:50
Sarvattwell ia64 maybe but it wouldnt use libdrm-intel02:50
persiaWell, that just happens to be true today.  I do have an ARM box with an Intel processor (but intel suggested an ATI video adaptor back then).02:50
bjsnidersince when does ubuntu support powerpc?02:51
persiabjsnider: Since warty or hoary or something.02:51
bjsnidernot officially02:51
bjsniderit was dropped02:51
persiaUm, only kinda.  Not like hppa was dropped.02:52
persiaSo packages still build, and it still shows up on all the QA reports, and there are still images on cdimage.ubuntu.com and thousands of users.02:52
SarvattI still use it and appreciate it working still at least on powerpc :D02:52
persiaAnd a bunch of people fix powerpc-specific bugs.02:52
persiaIt's not unless a port gets really unpopular or is just broken that it gets dropped.02:53
persiaFor instance, hppa had 3 users left when it went away, and all of them agreed.02:53
Sarvattgranted hardy was the last release I could ever install from disk on my ibook because of the ide-cd changes02:53
persiaSarvatt: Is that a general hardware support issue, or something that could be fixed?02:53
Sarvattnothing 6 hours of dist-upgrades doesn't fix though :D02:54
Sarvattit was over my head when I looked into it last so I'm not sure, was just a problem with the cd installers because the drive works fine after its installed02:57
Sarvatthttp://ucho.ignum.cz/fedora/linux/development/ppc/os/Packages/libdrm-2.4.6-6.fc11.ppc.rpm is the fedora libdrm I looked at to see libdrm-intel1 was being built on powerpc there though03:00
persiaWell, on the bright side, once 10.04 is out, it should be a one-step dist-upgrade.03:00
persiaAnyway, bug #507765 filed to port libdrm & ideally fix plymouth ftbfs.03:49
ubottuLaunchpad bug 507765 in libdrm "Please build libdrm-intel for ports architectures" [Undecided,New] https://launchpad.net/bugs/50776503:49
=== ripps is now known as ripps|sleep
superm1bjsnider, would you mind adding a conflicts on your nvidia karmic 190 and later packages to the mythtv version in the karmic archives or less?  the mythbuntu autobuilds PPA is compatible with your stuff, but the stuff karmic launched with isn't and we get people popping into #ubuntu-mythtv with some weird stuff that keeps coming up and is leading to that06:53
akaiholaThe page https://wiki.ubuntu.com/X/Troubleshooting/Freeze contradicts itself. It claims that a backtrace is a non-symptom, but lists "[mi] EQ overflowing" and X freeze as a relevant problem. A backtrace is included with such log messages.07:55
akaiholaAlso, there are no instructions about what to do in case of a "crash, not freeze" under the subheading "Non-Symptoms".07:56
akaiholaOtherwise the page is fantastic! I'd like to help improve it further, but I have insufficient knowledge to decide what information is correct.08:00
akaiholaAlternatively, what is a good place besides IRC to discuss Wiki pages, if I don't want to join the mailing lists? File a bug in Launchpad?08:05
Ngso I had a weird thing last night where I plugged my phone into my laptop and something about USB went a little bit mental and started flapping, and X segfaulted09:36
Nggoing by the kernel messages it seems like all USB devices were appearing and disappearing09:36
Ngvery fast I mean09:37
Ngbut whatever the problem with my USB is, that shouldn't make X crash :)09:38
Ngapport doesn't seem to have caught anything though, which is a bit weird09:41
Nghmm I wonder if it's https://bugs.freedesktop.org/show_bug.cgi?id=2564009:47
ubottuFreedesktop bug 25640 in Server/general "Reattaching USB keyboard causes double free" [Critical,Assigned]09:48
Ngthat one sounds a whole bunch more reproucible though09:48
tseliotNg: did you try this patch? http://lists.freedesktop.org/archives/xorg-devel/2010-January/004908.html10:03
Ngtseliot: not yet, I don't have as predictable a reproduction scenario as the fedora bug, but the circumstances are similar10:06
Ng(something about my USB went a bit strange when I plugged my phone into my USB hub and all the USB devices started flapping)10:06
NgI really hate USB, maybe I should switch to bluetooth keyboards/mice ;)10:07
seb128bug #50723910:09
ubottuLaunchpad bug 507239 in xorg-server "Xorg crashed with SIGSEGV in FatalError()" [Medium,New] https://launchpad.net/bugs/50723910:09
seb128is that something being worked?10:09
sebnerseb128: heya, can I ask you something (little bit offtopic)? I'm wondering if the split between in the indicator session makes sense (again, 1 icon more) + what's the plan for the future. Isn't that useless afford since Ubuntu 10.10 will ship with Gnome3 and gnome-shell with it's own indicator system?10:15
seb128sebner, who said we will ship gnome-shell?10:15
sebnerseb128: wondering as it's in the archive anyways and will be part(?) of GNOME3?10:16
seb128nothing decided10:16
seb128it's far to be ready10:16
seb128and we need something which works on non accelerated videos10:16
seb128netbooks10:16
seb128old computer10:16
seb128remote display10:16
seb128etc10:16
sebnersure but I'm pretty sure the old (current look) will be outdated or not maintained at some point at the future and gnome-shell is the future imho10:17
seb128the indicator are new technologies and use dbus10:17
tseliotseb128: I'm not sure about that bug. Maybe I can reproduce it here. I think I've experienced something similar10:17
seb128they can be used in gnome-shell if upstream wants10:17
seb128tseliot, it crashes this way when trying to open a guest session there on lucid10:18
seb128intel i96510:18
jcristauseb128: the backtraces on that bug seem fairly useless10:18
jcristaulike there are symbols for libc but not the server10:19
sebnerseb128: I never said that the indicator stuff is bad but as you said it's really something like "pushing our stuff into upstream and hoping they'll accept"10:19
seb128sebner, you think doing work and suggesting people try it then is the wrong way?10:20
seb128what would be the right one?10:20
seb128coming with random ideas and waiting for others to do the work?10:20
sebnerseb128: nah, doing discussion with upstream beforehand10:20
seb128jcristau, do you know what debug package is required for VAuditF?10:21
sebnerand/or with other distributions so more can benefit10:21
sebnerIntercompatible ftw! ...10:21
seb128sebner, those were discussed with upstream at boston summit hackfest 2 years ago10:21
jcristauseb128: xserver-xorg-core-dbg10:21
seb128jcristau, thanks10:21
tseliotI think -dbg packages should interfere with apport10:22
sebnerseb128: so only a matter of time, you said "if upstream wants" though so I doesn't really seem that fix10:22
seb128sebner, sometime people disagree on the way10:23
seb128sebner, it doesn't we shouldn't experiment10:24
seb128to be honest I'm not convinced by gnome-shell yet10:24
sebnerseb128: me neither but I have the feeling that sooner or later it will be default so I was just wondering about the ubuntu self-development stuff :)10:25
seb128re11:40
seb128jcristau, 11:40
seb128#0  0x080ac9b4 in _XSERVTransClose (ciptr=0x8be8aa8)11:40
seb128    at /usr/include/X11/Xtrans/Xtrans.c:93011:40
seb128#1  0x080a6be4 in CloseWellKnownConnections () at ../../os/connection.c:49211:40
seb128#2  0x080b0caf in SigAbortServer (signo=11) at ../../os/log.c:42611:40
seb128#3  0x080b10d1 in FatalSignal (signo=11) at ../../os/log.c:56011:40
seb128#4  0x080a9aa2 in OsSigHandler (signo=11, sip=0xbfb0f21c, unused=0xbfb0f29c)11:40
seb128    at ../../os/osinit.c:15711:40
seb128#5  <signal handler called>11:40
seb128#6  0x00000000 in ?? ()11:41
seb128is that useful in some way?11:41
jcristausort of.  it starts at the sigsegv signal handler though, so it doesn't catch the first crash11:42
seb128#6  0x00000000 in ?? ()11:48
seb128#7  0x080b13de in FatalError (f=0x81c5206 "no screens found")11:48
seb128    at ../../os/log.c:58511:48
seb128#8  0x08067044 in main (argc=9, argv=0xbfb0f6c4, envp=0xbfb0f6ec)11:48
seb128    at ../../dix/main.c:20611:48
seb128jcristau, ups, forgot to copy that11:48
seb128the 0x00000000 is weird11:48
jcristauindeed11:49
jcristautseliot: so, hum, patch 191 is not in git11:50
jcristauseb128: os/log.c:585 seems to be AbortServer(); which is defined 100 lines earlier...11:52
tseliotjcristau: no, it's not. Shall I add it with -f? .patch files are in .gitignore12:04
jcristautseliot: yes12:05
tseliotok12:05
jcristau(or use .diff)12:05
jcristau(or kill .gitignore from the debian or ubuntu branch)12:06
tseliotok, pushed12:11
tseliotnext time I'll use .diff12:11
jcristauthanks12:16
seb128jcristau, any suggestion to get extra informations?12:20
seb128or to turn the bug about this crash about something useful12:20
jcristauseb128: is it reproducible?  can you try without the ubuntu patches?12:21
seb128I get it every time I try to open a guest session12:21
seb128but I guess it's nothing specific to the guest session12:21
seb128it's probably when trying to open a second xsession12:22
jcristausounds likely12:22
seb128ubuntu patches for which source?12:22
jcristauxorg-server12:22
seb128ok, will try12:22
seb128I guess that takes a bit to build12:22
jcristauthere's a couple that touch os/log.c12:22
jcristausomething like 15min on my laptop12:22
seb128I will do a noopt nostrip build12:22
seb128jcristau, which patches do you recommend to comment, all the patches directory or just the one added in ubuntu compared to debian?12:39
jcristauseb128: 100 and 160 in particular12:46
seb128ok, that's the one I turned off since they were touching log.c, it's building12:47
seb128also other issue12:49
seb128on the mini10v and lucid the screen flickers every few seconds after suspend12:49
seb128what is useful to debug such issues?12:49
jcristautry i915.powersave=0 on the kernel cmdline12:51
seb128ok, no luck getting a debug stacktrace13:43
seb128it's not crashing now13:43
seb128I got the xorg mouse pointer over all vts13:43
seb128and vt7 displaying text13:43
seb128instead of my session13:43
seb128but things are still running 13:43
seb128weird...13:43
tseliotseb128: as I said, -dbg packages shouldn't work particularly well with apport (if that's your problem)13:57
seb128no it's not13:58
tseliotok13:58
seb128I did use gdb to get the stacktrace I copied before13:58
tseliotyes, that should work13:58
seb128and did load the symbol by hand using "symbol file"13:58
seb128the automatic loading is broken for some reason13:58
seb128now I tried a build without the patch which touch log.c13:58
seb128but it doesn't crash without those, it just destroy the graphical view and display a vt13:59
seb128but desktop proccess are still running13:59
tseliotpatches 100 and 160?13:59
seb128yes14:02
tseliotseb128: what does the Xorg.0.log say if you ssh into that computer?14:03
seb128I don't need to ssh, vts are still working14:04
seb128there is nothing in the xorg logs14:04
seb128no error14:04
tseliotI'm a bit concerned about this: FatalError (f=0x81c5206 "no screens found")14:05
tseliotwhich, I must admit, is a bit vague14:06
tseliotseb128: does dmesg say anything interesting?14:07
seb128tseliot, no14:10
seb128does opening a guest session is working?14:10
seb128does opening a guest session is working for you?14:11
seb128does opening a guest session is working for you?14:11
seb128ups14:11
tseliotseb128: can you reproduce the problem if you disable KMS?14:15
seb128tseliot, I will try later14:15
seb128I'm using this computer now14:15
tseliotok14:16
tseliotapw: did you try my fix for X?14:18
seb128tseliot, did you try the guest session? 14:33
tseliotseb128: no, I didn't15:14
=== ripps|sleep is now known as ripps
=== yofel_ is now known as yofel
Sarvattdo these guest session bugs that we're getting hit with really look like xserver? it just looks like it might be a GDM problem to me, guest sessions trying to spawn a new xserver instance with the old one still active for some reason?18:09
jcristauSarvatt: if the server is segfaulting, that sounds like a server bug :)18:10
SarvattI get eerily similar gdm logs when I have to start in failsafeX, x segfaults and I get [    0.000000] (WW) xf86CloseConsole: KDSETMODE failed: Bad file descriptor18:12
Sarvatt[    0.000099] (WW) xf86CloseConsole: VT_GETMODE failed: Bad file descriptor18:12
Sarvatt[    0.000132] (WW) xf86OpenConsole: VT_GETSTATE failed: Bad file descriptor18:12
Sarvatt(when I dont have failsafeX set to use fbdev instead of vesa and am using KMS)18:12
jcristauso X can't get a fd to the console?18:12
SarvattFatal server error:18:13
SarvattServer is already active for display 018:13
Sarvatt        If this server is no longer running, remove /tmp/.X0-lock18:13
Sarvatt        and start again.18:13
jcristauhmm18:13
Sarvattand it tries to spawn the new one on tty2 when tty7 is still active18:13
jcristauwhy does it try to start the new server as :0?18:14
Sarvattjust looked over my logs again, that was happening when fsck on boot was kicking me to failsafe 100% of the time, and i think it was because gdm would start, crash because something (dbus?) wasn't ready and relaunch itself before the other cleanly ended? failsafe ended up on :1 and tty2 when it finally worked. i really can't follow this gdm startup maze18:44
Sarvattit's over my head and i'm just adding noise, it seems to be happening with guest sessions for everyone now though. seen bugs from people with ati intel and s3 getting it. lets see if it crashes here :)18:50
Sarvattyep it kicks me to a VT with the mouse still working when I do a guest session18:56
Sarvatt:0-slave.log says gdm-session-worker[3500]: pam_unix(gdm-autologin:session): session opened for user robert by (uid=0)18:56
Sarvatt/etc/gdm/PreSession/Default: 16: initctl: not found18:56
Sarvattseb128: I fixed guest sessions here by changing /etc/gdm/PreSession/Default to call /sbin/initctl -q emit desktop-session-start DISPLAY_MANAGER=gdm instead of just initctl -q emit desktop-session-start DISPLAY_MANAGER=gdm19:08
Sarvattthere was just a bunch of changes to the paths being exported around in the sessions, guessing it needed changing after that since /sbin isnt in the path for it anymore19:09
Sarvattof course I cant get back into my logged in session after trying to switch back though, its stuck with the gdm background on the screen. but switching to a guest session works :D19:13
Sarvattthis is the commit I was talking about messing with the paths http://git.gnome.org/browse/gdm/commit/?id=e33ee9d9b23c103ac25b6fdb53fe8c074de0de5319:26
bryycehi Sarvatt19:33
Sarvattheyo!19:33
bryyceSarvatt, think we need that patch added to gdm?19:33
bryycemaybe we should point it out to seb12819:34
Sarvattyeah pinged him there, doubt hardcoding /sbin/ is the right thing to do, just know it works here19:36
bryyceSarvatt, is there a launchpad bug open about this issue?19:38
Sarvattloooots, looks like some people are getting X crashes also thats probably a different issue (vish?)19:40
Sarvattbug #50651019:41
ubottuLaunchpad bug 506510 in xorg-server "Xorg crashed with SIGSEGV in FatalError()" [Medium,New] https://launchpad.net/bugs/50651019:41
Sarvattbug #50723919:41
ubottuLaunchpad bug 507239 in xorg-server "Xorg crashed with SIGSEGV in FatalError()" [Medium,New] https://launchpad.net/bugs/50723919:41
vishi tried with >   /sbin/initctl -q emit desktop-session-start DISPLAY_MANAGER=gdm  19:41
vishbut it kept kicking me out ;)19:41
Sarvattoh I changed the path to have /sbin in it too19:42
SarvattPATH="/usr/bin:/sbin:$PATH"19:42
* vish tires again ;)19:42
vishtries*19:43
Sarvatthttp://paste.ubuntu.com/357210/19:43
vish\o/19:44
vishSarvatt: works19:45
Sarvattsorry about that19:45
vishnp :)19:45
vishSarvatt: thanks ...19:45
bryyceSarvatt, so I guess the question for us is if X should not be crashing in this situation but should exit with a friendly error message?19:45
SarvattI have no idea what magic gdm is doing to segfault X like that, it was doing it sometimes before we waited for dbus to have started and when fsck's were going when it tried to start before too.. I only noticed the error in :0-slave.log because of the timestamp, thats not getting attached to the bugs19:48
bryycehrm19:49
bryyceyeah the stack traces on 506510 are fairly useless19:49
Sarvatttrying to respawn a second server with DISPLAY=:0 when its already taken I guess?19:49
bryyceSarvatt, you are able to reproduce this bug?19:50
Sarvattyeah you can too, just try to log in a guest session on lucid19:51
Sarvatt(an up to date one)19:51
Sarvattnot machine specific as far as I can see, seen it on ati intel nvidia and s319:51
Sarvattand SIS19:51
vishbryyce: thanks , was just about to comment on the bug about Sarvatt's fix :)19:52
Sarvattlaunchpad + tethered EDGE connection dont go well together :D19:53
bryyceSarvatt, ok sounds like the next step would be to get a gdb attached to xorg-server and figure out where it is crashing19:53
Sarvattseb128 did that earlier, have the scrollback handy?19:53
bryyceah19:53
* bryyce looks19:53
Sarvattaround 6:40AM EST19:54
bryycemm19:59
bryycejcristau suspects patch 100 or 16019:59
jcristaubryyce: it's the 2 patches that touched the code that seemed to crash.  might still be something else.20:03
jcristaubut i thought it was worth testing without that20:03
bryyce160 has been in for a while20:03
bryyce100 is a few weeks old, it might be worth a more detailed review20:04
Sarvattyeah would be nice to find why it happens because it seems easy to make gdm do it :D time to build a server with no ubuntu patches and see if it happens since its so easily reproducable right now20:04
seb128I don't get a crash on a debug build without those20:05
seb128debug build = noopt nostrip20:05
bryycehowever, the os/log.c stuff is normal to pass through on crashes so could be something higher up20:05
seb128it doesn't segfault but it still bugs20:05
seb128vt7 goes away and get the xorg mouse pointer over any vt20:05
Sarvattthats what happens here with the stock packages20:06
seb128vt7 goes away = it's in text mode with some fsck lines which seems to come from boot20:06
Sarvattputs me to vt8 and i have a mouse over my vt's20:06
seb128but ps still lists the GNOME processes running20:06
seb128I get no backtrace in Xorg.0.log though20:06
seb128nor error20:06
Sarvattweird that i dont get any errors outside of the segfault message in dmesg, nothing in Xorg.0.log or /var/log/gdm/:0.log (which is just a copy of /var/log/Xorg.0.log)20:07
bryyceSarvatt, is that with or without patch 100?20:08
Sarvattthe stock packages in lucid20:08
bryycepatch 100 changes how the crash is reported20:08
Sarvattwith20:08
bryycehm20:08
seb128without the patch I don't get apport triggering which is expected20:08
bryycewith patch 100 should give *more* info not less20:08
seb128but it's not written in Xorg.0.log either20:08
seb128with the patch I get apport triggering20:09
bryyceok, 20:09
seb128that's the stacktrace I copied on the chan earlier if you read backlog20:09
seb128earlier being some 9 hours ago20:09
bryycewell I do know there are situations where X can crash but no backtrace goes into Xorg.0.log so that's not too unusual20:09
bryyceseb128, yeah going through it now20:09
Sarvattmaybe its a second xserver trying to start thats actually crashing for me and the first stays up20:10
bryyceseb128, you're right that line #6 looks weird.  #6 0x00000000 in ?? ()20:10
bryyceline 585 in log.c is         AbortServer();20:11
bryycemy guess there is that AbortServer() calls AbortDDX(), which is a virtual function called differently for different hw's20:12
* bryyce ponders20:12
jcristauyeah so there was two issues.  first is why was it calling FatalError in the first place, and then why FatalError was calling a null function pointer20:12
bryycecould in the guest mode case, it's using some hw thingee with no defined AbortDDX()?20:13
jcristauit's all Xorg afaik..20:14
bryyceok well as to the question of why it's calling FatalError in the first place, it's this code:20:15
bryyce        if (screenInfo.numScreens < 1)20:15
bryyce            FatalError("no screens found");20:15
bryyceso that *should* be just our nice friendly neighborhood "No screens available" error20:15
bryycemaybe it's a miscommunication between X and gdm about what screen number to use20:16
bryycenext step for that probably would be to examine what args X is getting called with (ps should say)20:17
bryyceand compare with what gdm is actually doing20:17
jcristauone reason i'm sort of uncomfortable with the os/log.c patches is that this is touching signal handlers, and there's not a lot of stuff you can do safely in signal handlers20:17
bryyceSarvatt / seb128, in the case where you aren't using patch 100, can you confirm whether you are seeing "No screens found" listed in one of the gdm log files?20:18
Sarvattgoing to take about 30 minutes to build here, speedy atom cpu :)20:19
jcristaubut we wouldn't get in these signal handlers if stuff worked fine in the first place so that doesn't explain why it fails InitOutput..20:20
bryycejcristau, well I can certainly say I don't like how much code patch 100 touches.  The old patch was a lot more concise.  But due to the way signal handling stuff has changed it was not as easy to keep all the code in one place20:20
bryycejcristau, I know messing with the signal handlers is bad but we have to get apport hooked in somehow, and the X signal handling stuff eats the signals by default, which prevents apport from working20:21
jcristauack20:21
bryycejcristau, I would *love* it if we could render this into something worth upstreaming, I hate having to maintain this patch as ubuntu-only20:21
Sarvattwould the sudo DISPLAY=:0 xinit& output be useful at all? because thats what I see in gdm's log and what the s3 guy's gdm log showed20:23
bryyceSarvatt, couldn't hurt20:24
jcristaugdm is calling xinit?20:24
Sarvatthttp://paste.ubuntu.com/357228/20:24
Sarvatthttp://paste.ubuntu.com/357230/20:25
Sarvattsecond one was the GdmLog attached to the savage bug20:25
* bryyce ews lines 15-1720:26
bryyceSarvatt, ok what we need is what the X command line was, to see if it was indeed trying to start two X's on :020:26
bryyceSarvatt, and if so, instrument gdm to show its work20:27
bryycejcristau, hmm, looking at seb128's backtrace, it is crasing on this bit of code in CloseWellKnownConnections():20:35
bryyce    for (i = 0; i < ListenTransCount; i++)20:35
bryyce        _XSERVTransClose (ListenTransConns[i]);20:35
bryyceany ideas on what this does?20:36
Sarvatthmm cant get gdb to load the symbols for xserver-xorg-core-dbg or -dbgsym20:39
bryycehmm, looking at Xtrans.c, it doesn't have null pointer checking, so if ListenTransConns[i] is NULL, it'd crash.  Perhaps that bit of code needs to do some nullptr checking?20:39
seb128Sarvatt, symbol /usr/lib/debug/usr/bin/Xorg20:40
seb128(I'm not really around, just passing in front of computer)20:40
Sarvattthanks seb128, used to load that automatically20:40
seb128right, it's buggy for some reason20:41
Sarvattguess it just plain wont load the dbgsym symbols at all20:46
Sarvatt-dbg worked though20:46
bryyceSarvatt, ok I think I understand the crash20:47
Sarvattfix = disable xace/xselinux? :D20:50
* Sarvatt hides20:50
bryyceSarvatt, main() notices theres no screens so issues a FatalError(), which is fine20:52
bryycenow in FatalError(), it calls AbortServer() (which is just a pass-through to SigAbortServer(0))20:53
bryycehere's where it gets fun20:53
bryyceSigAbortServer calls CloseWellKnownConnections() to do some sort of cleanup I don't understand20:54
bryyceit continues on with cleanup work20:54
Sarvattending secure connections thanks to selinux support?20:54
bryycesomewhere this triggers signal 1120:55
jcristauSarvatt: xselinux is disabled in ubuntu..20:55
bryyceso now we find ourselves in OsSigHandler()20:55
bryyceso now it hits FatalSignal(11) (fine), and here we get to SigAbortServer(11)20:55
bryyceagain, it calls CloseWellKnownConnections()20:56
bryycebut this time through, the stuff has already been cleared, so a null pointer gets called20:56
bryyce*boom*20:56
bryyceso... we have three bugs20:56
bryycea) why is X getting called by gdm with :0 twice 20:56
bryyceb) why is the signal 11 being thrown during an ordinary FatalError() call?20:57
bryycec) why is CloseWellKnownConnections() brittle at being called twice?20:57
bryycec could be fixed by just adding a nullptr check in the Xtrans stuff.  dunno if that's warranted20:58
Sarvattjust starting /usr/bin/Xorg I get the same segfault, dont even have to specify :020:58
jcristauSarvatt: :0 is the default20:58
bryyceor actually we could probably handle the nullptr check in CloseWellKnownConnections()20:58
jcristauSarvatt: so if you don't give a display it uses that20:58
bryyceanyway, wife says I must eat, so bbiab20:58
Sarvattwhats with the trash after ddxSigGiveUp: Closing log? http://paste.ubuntu.com/357244/21:03
Sarvattyeah same here, wife's waiting for me to make dinner :D21:03
bryyceo_O21:36
bryyceSarvatt, aha.  Looks like there are some error messages logged after the log is closed.21:36
bryycethat can't be good21:36
bryyceSarvatt, boy I bet that's the explanation for the sig 11 throw21:37
bryycetotally21:39
bryyceok easy fix21:39
Sarvattwoohoo!21:40
bryycehmm, I wonder if this is the same root cause for bugs like lp #50803522:25
ubottuLaunchpad bug 508035 in xorg-server "Xorg crashed with SIGSEGV in <signal handler called>()" [Medium,New] https://launchpad.net/bugs/50803522:25
=== ubott2 is now known as ubottu

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!