/srv/irclogs.ubuntu.com/2018/09/12/#ubuntu-x.txt

alkisgCould I `dpkg -i xserver-xorg-video-nouveau` from 16.04 to 18.04 to see if it fixes various showstopper issues that I see in ancient nvidia cards? Or would it break due to abi changes?06:33
alkisgI mean, install the 16.04 package to 18.0406:34
tjaaltonno, you'll find that it won't install06:34
alkisgThank you :/06:35
KitsuWhooaalkisg: which version of X is it?06:42
alkisgKitsuWhooa: stock 18.04,  xserver-xorg-video-nouveau 1:1.0.15-206:43
alkisgThe problems I've seen so far are xorg segfault/crashes, and scrambled lines in the physical output which do display fine over vnc though06:44
alkisgIn various old nvidia cards, e.g. tnt2, mx 400, mx 4000...06:44
KitsuWhooaHm, 18.04 has 1.19.606:46
KitsuWhooaAre the segfaults with acceleration enabled consistent?06:46
alkisgKitsuWhooa: yes, and they are also consistent with NoAccel defined as well06:46
alkisghttp://termbin.com/haem06:48
alkisgIt boots fine up to lightdm, and crashes on login (where I assume things like compositing and opengl are used)06:49
KitsuWhooaAh, it's probably different to what I encountered06:50
alkisgMaybe; I've seen tens of crashes and I assumed it was the same; maybe I've seen various different ones and haven't realized it yet06:50
KitsuWhooaI had a similar backtrace with the s3 savage driver with acceleration enabled, and resolved it by re-enabling sigio in X and recompiling it. I never found out why it happened, but it got disabled some time in 1.19 and I only found out by bisecting it06:50
alkisg(I'm getting reports from various schools with a lot of different clients)06:51
KitsuWhooaIf possible, maybe try enabling HWE on a 16.04 machine and see if that also breaks it06:51
alkisgThanks for the sigio pointer, I'll give it a try; I'll also see if I can reproduce it locally to try with 16.04 + hwe06:52
alkisgAlthough I think some schools already have that, and none reported the issue06:52
KitsuWhooaMaybe also install the debug symbols and get a gdb backtrace to see if there is anything interesting there06:52
KitsuWhooaMy (old) school's ubuntu lab still runs the version with gnome2 (pre 12.04). I guess no one bothered updating it :p06:54
alkisgSome of the schools that I upgraded from 12.04 to 18.04 report that they want to go back :D06:55
alkisgI'd love to see less features and more stability, but I guess many developers find stability programming = boring :D06:55
KitsuWhooaI think it's more like it's getting incrementally difficult to support old hardware06:56
alkisgOh I see it in programs that aren't related to hardware as well06:56
alkisgPanels that crash by just switching keyboard layouts "but they're so modern now,they support searching for programs while you type!"06:56
KitsuWhooaoh :p06:57
alkisgBtw the problem with the scrambled output was in nvidia 7200 as well; I don't know, is that considered too old/unsupported too?06:58
tjaaltonalkisg: so did you try the git version?06:59
alkisgtjaalton: I need to reproduce it locally first06:59
alkisgI'll try to get such a card in my office06:59
tjaaltonjust build it and push to clients, it's only five commits on top of 1.0.15 ;)07:00
alkisgty, will do07:02
KitsuWhooaalkisg: I can confirm I get corruption on 18.04.1 mate with nouveau on an MX40009:05
alkisgKitsuWhooa: thank you :) Btw, changing resolutions some times fixes it09:06
alkisgI tried with fx5200 locally, it worked on 1024x768, will try some other one now...09:06
KitsuWhooahttps://tasossah.com/CameraPics/P1110844.JPG09:07
KitsuWhooathis is the ubuntu mate installer screen :p09:07
alkisgYeah exactly like that09:07
KitsuWhooaI'll see what other old nvidia cards I have. I know there's an MX440 somewhere09:11
alkisgKitsuWhooa: I think it might also help if you click "try ubuntu" instead of "install ubuntu", as it might cause the second bug too, the segfault09:16
KitsuWhooaI can't see where to click :p09:16
alkisgHehe, true, I have vnc there, you don't09:16
KitsuWhooait's a live boot over usb1.1 and it's ridiculously slow09:16
KitsuWhooaso that's not helping either09:16
alkisgOuch,network boot would help09:16
KitsuWhooaI started lightdm manually and I can barely see the firefox icon pinned in mate09:19
KitsuWhooaso I'm going to say this didn't segfault09:19
KitsuWhooaAnd I think netboot would take too much time to set up. I'd also need to find a NIC for this board as it doesn't have an onboard one09:20
KitsuWhooaand even then, it'd be fast ethernet, so I'm not sure how much better it'd be09:20
KitsuWhooaalkisg: interesting thing. When I switch to a tty, the desktop renders fine for a bit before switching to a tty09:22
alkisgKitsuWhooa: if you want help with netbooting, I'm an expert, I could set it up for you in a few minutes, and, 100mbps is a hell of a lot faster than 1.1 usb09:23
alkisgNIC => ipxe boots almost all of them09:24
KitsuWhooaThat'd be appreciated09:24
KitsuWhooagive me a bit to see if I can find a PCI NIC to plug in to this board09:25
KitsuWhooaRight, I found one that looks to have a realtek chipset clone of sorts, and what looks like a boot rom09:26
alkisgThat way you won't even need ipxe then :)09:27
alkisgSo the last notes that I've made for netbooting without our "ltsp" project, are the "automation script" paragraph of this page: https://wiki.ubuntu.com/LiveCDNetboot09:28
alkisgI.e. you're supposed to mount the cd to the server /cdrom, and just run this command:09:28
alkisgwget 'http://alkisg.mysch.gr/steki/index.php?action=dlattach;topic=2525.0;attach=1421' -O /tmp/livecd-netboot && sudo sh /tmp/livecd-netboot09:28
KitsuWhooaone moment, trying to find out how to enable the boot rom in the bios09:29
alkisgIf the boot rom is in the nic itself, you might need to press ctrl+f11 or so when it displays that message09:29
alkisgI.e. it might not be in the bios09:29
alkisgThe easiest way would be to create a usb stick/floppy/cd with ipxe though09:30
KitsuWhooait's not loading the rom at all, and IIRC there's usually a toggle in the bios09:30
alkisgboot.ipxe.org => images to download09:30
alkisgThe bios usually is for onboard nics. It's the same bios for many boards, so it won't work for pci nics.09:30
alkisg(well, unless the bios was expecting a realtec onboard nic in other board versions)09:31
KitsuWhooayeah looks like I'll be going with ipxe09:34
KitsuWhooaΠροέκυψαν σφάλματα κατά την επεξεργασία του: nfs-kernel-server09:41
KitsuWhooaI have a feeling this isn't going to work09:41
KitsuWhooamount: unknown filesystem type 'rpc_pipefs'09:42
KitsuWhooalooks like I'm going to be recompiling my kernel09:42
alkisgEr, nfs-kernel-server can't be installed? Yeah that's not a good sign...09:50
alkisgYou can also boot another pc with the ubuntu mate usb stick, and then run that command, and it will allow you to netboot the older pc09:50
alkisgi.e. both live server and live client09:51
KitsuWhooaalkisg: your mysch site doesn't seem to be responding over IPv609:51
alkisg(in case your actual setup is strange and you can't install nfs temporarily...)09:51
alkisgYeah they're old school :D09:51
KitsuWhooaI'll do it in a stock 18.04 VM09:51
KitsuWhooait has an AAAA record though and it resolves to an address09:51
KitsuWhooaso it just causes wget to hang09:51
alkisgI've filed complaints a lot of times, but no solution yet. They also don't support https in their hosting. Lame :/09:52
KitsuWhooamight want to edit the wiki page to use wget -4 then09:52
KitsuWhooaso that it forces ipv409:52
alkisgOh I wrote that paragraph 10 years ago, I don't think anyone maintains it09:52
KitsuWhooaAh09:53
alkisgIt's full of obsolete information, but I think my script still works09:53
* alkisg loves code that works 10 years later :D09:53
alkisg01:00.0 VGA compatible controller [0300]: NVidia / SGS Thomson (Joint Venture) Riva128 [12d2:0018] (rev 10) ==> nah this one loaded vesa, too old, trying another...09:54
KitsuWhooaI love you10:02
KitsuWhooaer10:02
KitsuWhooalmao10:02
KitsuWhooanot sure how that got in my X clipboard10:02
alkisgHaha no worries it's always a good thing to say10:02
KitsuWhooaSorry for that. Anyway, I'm waiting for the unattended updates to finish10:02
KitsuWhooabecause it started running dpkg when I booted the VM so I can't use apt10:03
KitsuWhooathat is true :p10:03
alkisgI managed to reproduce the segfault with this one: 01:00.0 VGA compatible controller [0300]: NVIDIA Corporation NV5 [Riva TNT2 Model 64 / Model 64 Pro] [10de:002d] (rev 15)10:04
KitsuWhooathe netboot script seems to fail to detect my IP10:16
KitsuWhooaoh well10:16
KitsuWhooaalkisg: how do I get ipxe to boot?10:18
alkisgalkisg: do you mean how to put it in floppy/cd/usb?10:18
alkisgor, how does it detect the server ip?10:18
KitsuWhooathe latter10:18
KitsuWhooaI got ipxe running on the machine but it just says press a key to reboot10:18
alkisgThe normal netbooting setup there is "a dhcp server somewhere, e.g. in a router, and my script running dnsmasq in proxydhcp mode to only send the boot server ip/boot filename"10:19
alkisgIs this your use case? Btw, did you put the VM in bridged mode, so that it has all TCP/UDP ports open?10:19
KitsuWhooayour script is running dnsmasq, yeah10:19
KitsuWhooathe VM is in bridged mode10:20
KitsuWhooait has an IP in the lan like any other device10:20
KitsuWhooaand doesn't go through any nAT10:20
KitsuWhooa*NAT10:20
alkisgDoes ipxe get an ip?10:20
alkisgYou can also try ctrl+b there, and then run `autoboot`, and if it fails, run `config` and see what it got from dhcp/proxydhcp10:20
KitsuWhooaIt definitely shows the mac address, but I'm not sure about the IP. I am already in the command prompt so I'll try that10:21
KitsuWhooa"[...]/tftpboot/pxelinux... no such file or directory"10:21
KitsuWhooait does get an ip10:21
KitsuWhooaand it does talk to the serv er10:21
KitsuWhooa*server10:22
alkisg(01:21:51 μμ) KitsuWhooa: "[...]/tftpboot/pxelinux... no such file or directory" => that sounds like some error in the script? Did you see anything wrong when it ran?10:22
KitsuWhooahttps://tasossah.com/txt/netboot_script_log10:23
alkisgroute => not found => yeah that's an issue10:25
alkisgOld script, not relying on ip10:25
KitsuWhooaguess I'll install it10:25
alkisgBtw it should be trying to download pxelinux.0, not pxelinux10:26
KitsuWhooaI thought that was to detect the ip10:26
KitsuWhooaroute, I mean10:26
alkisgCan you try to symlink it just for a quick hack?10:26
alkisgYeah, probably10:26
KitsuWhooaSure, give me a bit to restart it10:26
KitsuWhooaI don't think I can10:27
KitsuWhooaI take that back10:27
KitsuWhooahowever, pxelinux.0 is a symlink itself that points to a file that does not exist10:27
KitsuWhooait points to /usr/lib/syslinux/pxelinux.010:28
KitsuWhooamy guess is it depends on the pxelinux package10:28
alkisgIt installs it, but it moved elsewhere, moment,10:28
alkisg/usr/lib/PXELINUX/pxelinux.010:28
KitsuWhooaah10:29
alkisgOK so I guess the 10 year old script needs to be updated with the new pxelinux path, sorry :D10:29
KitsuWhooaokay, adding those two symlinks got me further10:29
KitsuWhooanow it's complaining about .c32 files not being found10:30
KitsuWhooahttps://tasossah.com/txt/netboot_script_log_210:31
alkisgThe new location is in /usr/lib/syslinux/modules/bios/10:31
alkisgCopy them from there or symlink them or something10:31
KitsuWhooaYeah this is really broken10:33
KitsuWhooanow it fails loading capser/vmlinuz10:34
KitsuWhooa...IO error? what10:34
KitsuWhooa"attempt to access beyond end of device"10:34
alkisgI wonder if the vmlinuz/initrd symlinks also point to wrong paths10:35
alkisgDo an ls -lR in the tftp dir and check for broken symlinks10:36
KitsuWhooaI remounted the image and now it loads vmlinuz but it can't find initrd.lz inside the casper dir10:36
KitsuWhooaand no, no broken symlinks10:36
alkisgIt might be initrd.gz now or something10:36
KitsuWhooathere's only an initrd in the image10:36
KitsuWhooaand I can't modify anything under casper because it's a symlink to /cdrom10:36
alkisgAh you modify the kernel etc in pxelinux.cfg/default10:37
alkisgThat's somewhere under tftp, a file that the script generated10:37
KitsuWhooaI found it10:37
KitsuWhooalooks like it's booting10:38
* alkisg crosses fingers, took too long already...10:38
KitsuWhooa"nfs server not responding"10:39
alkisgMaybe that route part that failed, failed to export to local network only10:39
alkisgwhat's /etc/exports like?10:39
KitsuWhooaI installed route and restarted 10:39
KitsuWhooathere's /cdrom in there10:40
KitsuWhooa /cdrom *(ro,no_subtree_check,no_root_squash)10:40
KitsuWhooaoh in the script log there's a "job for nfs-server.service cancelled"10:40
alkisgSounds good. Try `exportfs -ra` in case it helps,10:40
alkisgah10:41
alkisgtry restarting it manually from another tab and see why it fails to start10:41
alkisgsystemctl stop nfs-kernel-server, then start again, journalctl -xe, etc10:41
KitsuWhooaactive (exited)10:42
KitsuWhooaand there are no errors whatsoever10:42
KitsuWhooathere are only two lines in the log, starting and started10:42
KitsuWhooabut it's not running10:42
alkisghttps://serverfault.com/questions/859934/ubuntu-16-04-nfs-kernel-server-wont-start10:43
alkisgs your Ubuntu server a linux container (lxc) ?  If yes, you need to set something like explained here:  mount fstype=rpc_pipefs, mount fstype=nfsd, 10:43
KitsuWhooaThat happened on my 16.04 desktop because I run a custom kernel that didn't have the filesystem needed10:44
alkisgI wonder if running in a VM is related there10:44
KitsuWhooathe 18.04 VM is virtualbox10:44
alkisgHmm10:44
KitsuWhooaso I very much doubt it10:44
KitsuWhooait doesn't fail to start it10:44
KitsuWhooaissuing the start command doesn't throw any errors, I mean10:44
KitsuWhooaI wonder if the unattended update installed a new kernel and broke things. Let me reboot the VM and redo all the symlinks :p10:45
alkisgAh, damn those unattended updates :)10:46
KitsuWhooaI ended up disabling them, since it's a VM anyway10:47
KitsuWhooayeah sch.gr hosting is terrible10:53
KitsuWhooaI can't even download the script over v4 now :p10:53
alkisgAh don't wget it again, one time is more than enough for one day :D10:54
alkisgHehe10:54
KitsuWhooaI rebooted, so it's gone from /tmp10:54
KitsuWhooaI made the mistake of not saving it to the disk10:54
KitsuWhooathere we go, it worked on the third attempt10:54
alkisgThis avoided the segfault, I'll try to limit down now: Option "HWCursor" "off" Option "PageFlip" "off" Option "WrappedFB" "on" Option "ShadowFB" "on"10:54
KitsuWhooawe're back to ipxe not detecting the server10:57
KitsuWhooaor the NIC10:59
KitsuWhooaone of the two10:59
alkisgWhat's the client output, does it get an ip? autoboot, config etc...10:59
alkisgctrl+b before that10:59
KitsuWhooaokay yeah that was my fault. Needed to reseat the NIC11:01
KitsuWhooasorry11:01
alkisgtjaalton: I didn't get to compiling git yet, but I found out that `Option "PageFlip" "off"` avoids the segfault in11:01
alkisg01:00.0 VGA compatible controller [0300]: NVIDIA Corporation NV5 [Riva TNT2 Model 64 / Model 64 Pro] [10de:002d] (rev 15)11:01
KitsuWhooaalkisg: after the reboot, nfs is working11:10
KitsuWhooahowever the live image fails11:10
KitsuWhooagoes into emergency mode11:11
KitsuWhooaa few units fail, and I can't really figure out why11:11
alkisgOuch, it sounds like it needs a lot of updating... I'm using ltsp everywhere now so I haven't updated it11:12
alkisgSorry about that11:12
KitsuWhooaOh well, it's fine11:12
alkisgI need to go afk for a couple of hours, be back later... :/11:12
KitsuWhooasure11:12
KitsuWhooathanks again11:12
alkisgnp, thank you too11:13
alkisgDo check that pageflip option if you get the chance11:13
KitsuWhooaI'll go through the sch.gr manual and see if it's worth/easy to set up LTSP11:13
alkisgNono ignore sch.gr, follow this one (mine again): http://wiki.ltsp.org/wiki/Installation/Ubuntu11:13
KitsuWhooaAh, thanks11:14
alkisgIf you have a mate installation, you can make it an ltsp server in about 5 commands and 10 minutes11:14
alkisgAnd it gives epoptes=vnc as a bonus11:14
KitsuWhooaI have a stock 18.04 with gnome in a vm11:14
alkisgThat works too11:15
KitsuWhooadoes that mean the client will try to boot gnome too11:15
KitsuWhooa?11:15
KitsuWhooaI doubt gnome3 will work11:15
alkisgIn the quick "chrootless" setup, yeah11:15
KitsuWhooaah11:15
alkisgBut you can choose an xterm session if you prefer11:15
alkisgOr install mate as well ...11:15
KitsuWhooaAh, yeah11:15
KitsuWhooathanks11:15
* alkisg really goes for now, bbl11:15
KitsuWhooasee ya11:16
KitsuWhooaalkisg: for when you get back, this is what happened when I tried booting the image in a VM to install it and then set up LTSP https://tasossah.com/s/dd9d97babad3.jpg11:42
alkisgKitsuWhooa: try alt+ctrl+f1, then alt+ctrl+f712:07
alkisg*right ctrl, since it's vbox12:07
KitsuWhooait's rshift for me, but wow that worked12:08
KitsuWhooa(I rebound it)12:08
* alkisg searches how to fetch/compile the git version...12:10
KitsuWhooaMy guess would be to git clone the repo and then either install it directly, or use the files from the ubuntu package to make a deb12:11
KitsuWhooapackages.ubuntu.com usually points to all the necessary resources12:12
alkisgThis one? https://cgit.freedesktop.org/nouveau/xf86-video-nouveau/12:12
alkisgI only see 3 commits there from 2018, that's a good sign that it will be somewhat easy to bisect it...12:12
KitsuWhooaI think that's the one, yeah12:13
alkisgThe test client ran glxgears for an hour with pageflip off, so I guess it makes things stable12:16
tjaaltonyou assume it's fixed in git..12:22
tjaaltonthat's not at all clear12:22
alkisgtjaalton: I don't assume that at all! I'm assuming it broke in git! :)12:22
tjaaltonah12:23
alkisgI.e. i want to bisect and find the commit that broke it, I don't expect to find a commit that fixed it...12:23
tjaaltonit's not necessarily the driver that broke it12:23
KitsuWhooait may be X server itself12:23
KitsuWhooaand a smaller chance the drm driver12:24
alkisgAh12:24
tjaalton1.0.15 was released in april '1712:24
tjaaltontest cosmic12:26
tjaaltonor just file a bug upstream12:26
tjaaltonagainst nouveau12:26
alkisgI'll do both tomorrow morning12:26
alkisgI haven't reproduced the "scrambled screen" issue locally though, only the segfault12:27
alkisgI can only see the scrambled screen in vbox12:27
KitsuWhooapretty sure that's a different issue12:27
KitsuWhooaI can't get the scrambled screen to go away by switching to a tty and back with my MX40012:28
alkisgGotcha. So, 3 different issues.12:28
KitsuWhooaand then I got distracted trying to set up netboot/ltsp12:28
alkisgHehe12:28
KitsuWhooaalkisg: I installed ltsp-server-standalone ltsp-client, ran ltsp-update-image, didn't install epoptes, and ran the first ltsp-config dnsmasq line in the wiki13:12
KitsuWhooais it supposed to be working now?13:12
alkisgKitsuWhooa: I believe so, do you get any errors while booting the client?13:19
KitsuWhooaipxe says nothing to boot13:20
KitsuWhooadnsmasq seems to be running13:21
KitsuWhooabut ipxe is only seeing my normal gateway13:21
alkisgtry ltsp-config dnsmasq --overwrite; systemctl restart dnsmasq13:42
KitsuWhooathat was the first thing I did, and I even rebooted13:46
KitsuWhooaOnly different thing I did from the wiki was to not install epoptes or make a user for it13:46
KitsuWhooaApparently I just needed to change the subnet in the dnsmasq conf for the proxy14:13
alkisgIt should be autodetected... unless you didn't have an ip when you run ltsp-config14:28
KitsuWhooathat might have been it14:30
KitsuWhooabut since it works now, I can also reproduce the bug with an MX44014:30
alkisgThe fuzzy lines or the segfault?14:31
KitsuWhooacorruption14:32
KitsuWhooano segfault14:32
KitsuWhooaI think the segfault might be exclusive to the TNT214:32
alkisgdoes the pageflip off option fix the corruption?14:33
KitsuWhooaI haven't tried yet14:33

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!