[05:25] alkisg: I tested a https://www.asus.com/us/Graphics-Cards/A9250TD128M/overview/ and it worked out of the box, so I doubt ATi cards are affected [05:26] KitsuWhooa: nice... do you think those horizontal lines in vbox were related? I haven't tested if pageflip off fixes vbox as well.. [05:27] Those go away by switching ttys, the ones on nvidia hardware don't [05:38] And the one in tnt2 causes a segfault... those all might be bad codepaths, forced by a "central" bug somewhere... [05:38] E.g. uninitialized resolution variable => may cause either corruption or segfault later on... [05:39] The corruption can be fixed by xrandr -s other-resolution; xrandr -s original-resolution [05:39] which is why a gdb backtrace would be useful [05:40] Sure; I'll set up the tnt2 system within the week [05:40] Yeah, I was just mentioning it [05:40] Apologies if it sounded a bit harsh [05:41] No no I very much appreciate all your input and help in this [06:38] turns out I found a TNT2 [06:43] and also what identifies itself as "NV10GL Quadro" [07:02] 01:00.0 VGA compatible controller: NVIDIA Corporation NV5 [Riva TNT2 Model 64 / Model 64 Pro] (rev 15) [07:04] Yup, X crashes right after the login screen [07:11] KitsuWhooa: do you have epoptes working in your ltsp setup? Could you run this at the login screen, before login, and see if it's still crashing or not? xrandr -s 640x480; xrandr -s original-resolution [07:11] I don't have LTSP set up on epoptes, but I do have a shell [07:11] two of them [07:11] OK, you'd need to export DISPLAY and XAUTHORITY for xrandr to work [07:12] would I need to rebuild the squashfs image to add epoptes? [07:12] Yes [07:12] Any way to rebuild the image without keeping the old one around during build? [07:12] not enough disk space :p [07:13] ltsp-update-image --no-backup -c / [07:13] excellent, thank you [07:13] ...stop nbd-server first so that it's not in use [07:13] Also, I think I found a mate bug [07:13] https://tasossah.com/s/3d40462bc831.jpg [07:14] I don't even know what I did to make those two network indicators show up [07:14] It goes away if you kill/rerun nm-applet, or on next login [07:14] It's some race condition somewhere, I haven't pinpointed it [07:15] So, I need both epoptes and epoptes-client on the server? [07:15] Yes. apt install epoptes should be enough, it installs epoptes-client as well [07:15] ah [07:15] okay [08:04] Right, after being distracted for a bit [08:04] I can't seem to be able to find the nouveau debug symbol package [08:11] Ah, they're on a separate repo [08:43] alkisg: that did in fact make it not crash [08:45] I have to say though, the TNT2 is really slow at 1680x1050 [08:45] and I got epoptes set up too, really cool [09:05] KitsuWhooa: it sounds like it's something in xorg initialization that goes wrong, and gets correctly initialized with xrandr [09:22] Eeeeyup [09:22] I'll try with gdb now and see if I can find anything interesting [09:23] Also, wakeonlan works and it got me more excited than it should have :p [09:23] s/and/to/ [09:41] alkisg: I ran startx, I attached gdb, and then after a while epoptes said my client went offline and killed my remote terminal [09:41] any idea why? [09:41] (X was in a frozen state) [09:42] network never died either [09:51] KitsuWhooa: how much ram does that client have? [09:51] oom? [09:51] I checked dmesg, not oom [09:51] it has 3x256 DDR1 [09:52] Strange [09:52] When the client is logged off, do you see it in epoptes as a blue icon? [09:52] (fat client) [09:52] yeah [09:52] and I can even spawn a shell [09:53] Btw epoptes is supposed to reconnect after a minute of inactivity, did it? [09:54] The remote terminal was "root", not "user", right? [09:54] yup [09:54] it did not reconnect, no [09:54] (root epoptes-client runs from systemd as root, while user runs from /etc/xdg/autostart as the user) [09:54] and I checked that the network was still functioning [09:55] That's rather strange, if you have a vt on the client it would be interesting to rerun epoptes-client as root, make it crash again, and see the stderr there [09:55] I do yeah [09:55] let me try again [09:56] I have two vts and the remote shell [09:56] I ran startx on the vt and attached gdb through the epoptes root shell [09:57] (btw to open another vt if needed, it's openvt bash) [09:57] I have two [09:57] but I will, thanks [09:57] e.g. if you want to attach gdb from a vt, so that it is more "stable" [09:58] export $(/usr/share/epoptes-client/get-display) => gives you access to xorg btw [09:58] I wanted the gdb session remotely so that it's more comfortable for me to work as I will be using my desktop keyboard [09:58] (DISPLAY/XAUTHORITY) [09:58] Yeah, I know the feeling :D [09:58] I don't know why epoptes would do that though [09:58] Ah maybe systemctl status epoptes-client would show the stderr [09:58] service vboxadd stop [09:58] ...oops [09:59] Or journalctl something, for the previous crash [09:59] I'll check, yeah [09:59] is it python? [09:59] I'm used to seeing segfaults in dmesg, but if it's python it probably won't be a segfault [10:00] epoptes-client is mostly shell + socat [10:00] A couple of python scripts are ran externally, as child processes [10:01] https://tasossah.com/s/1ecf05d7e48b.jpg [10:01] might need to recompile nouveau without optimisations [10:02] but nouveau_present_flip_exec definitely has to do with double/triple buffering [10:40] alkisg: 13:24:22 ltsp162 epoptes-client[1561]: Could not detect or access the active display [10:40] 13:36:34 ltsp162 epoptes-client[1561]: 2018/09/16 13:36:34 socat[1561] E waitpid(): child 1629 exited on signal 3 [10:41] former makes sense because X is frozen, not sure why socat died 12 minutes later though [10:43] Not detecting the display shouldn't crash epoptes though [16:44] alkisg: I traced the segfault down a bit [16:44] It's an X issue [16:45] exaGetPixmapDriverPrivate() returns a 0 pointer [19:40] KitsuWhooa: is this enough info for a bug report? [19:41] Possibly, but I'm going to look more into it tomorrow. X 1.19 also has the same issue, so it happened some time before it. [19:42] Great! [19:42] I tried disabling acceleration and it still crashed, which is a bit odd [19:42] because the above gets called due to acceleration [19:42] NoAccel was the first thing I tried yeah [19:42] I also tried setting AccelMethod or whatever it is to none [19:42] and that didn't help either [19:43] * alkisg has studied vga internals back in 1990, but has no idea about xorg internals :( [19:46] I don't know much about it either to be honest. [19:47] I will however say that this whole thing helped me uncover two kernel bugs that affect my desktop, that I'll need to look into/report at some point