[04:37] i've just upgraded from 17.10 to 18.04, and now the computer won't boot. it fails to mount the root filesystem, and drops to initramfs [04:39] https://9unvsq.bn.files.1drv.com/y4m6YIxUZfXzd96cuCzcKJR22rzSeBCy-2VUrndZk5gLlOyk52M6w8oZgr-ybtLbnVvC1trcidcR8L1ndgSUzH9NSysNDl1jTUD5_wBIPQ_PwSMIrWNa69rqvVv_MgV3o2czkQebQUj27idfawPbDN5jaY40rOW5swWLpKXpiu3yMgcPKj_PP6GROCAvAJpnpn3maXS_5GPsrkocfuEEbbHQQ/residue_2019_01_24.PNG?psid=1 [04:39] ugh, sorry about the awful url [04:40] the upgrade process went without any issues or complaints at all [04:40] what can i do to troubleshoot this? [04:45] since i'd had a similar problem with another upgrade not going well, with non root filesystems that were not accessible, i tried vgchange -aay, on a whim. so far, it seems to just hang upon that command indefinitely, and not do anything [07:10] Good morning [07:11] lunaphyte: From that initramfs can you mount your rootfs? === cpaelzer__ is now known as cpaelzer [09:43] cpaelzer: looking at your dpdk/ovs work today [09:54] jamespage: ok, I started to test it as well [09:54] but that takes quite some time [09:54] as I said, I'll ping you once I have a reasonable result [09:59] cpaelzer: ack [11:03] good morning [14:06] ahasenack, cpaelzer, kstenerud: I can hit the review queue now, but it looks like everything has been looked at for the moment? [14:07] rbasak: agreed [14:07] I'm finishing up haproxy, then I'll go after apache2 which you reviewed already, and back to the net-snmp merge [14:07] OK [14:26] rbasak: can you merge that salsa haproxy mp, or are you just passing by? [14:29] ahasenack: I was just passing by, being curious about how haproxy tests operate as I know nothing about haproxy. [14:29] cool [14:29] first attempt failed, investigating [14:29] I am not a maintainer for haproxy in Debian so I shouldn't do anything but comment. [14:29] https://pastebin.ubuntu.com/p/CvVKbJC2XV/ [14:31] ah, I have to invert the check [14:31] Yeah, or swap the else block. [14:34] (I prefer the latter in general to avoid an extra inversion) [14:36] yeah, I also dislike "!" in ifs [14:40] rbasak: better, let me push: https://pastebin.ubuntu.com/p/WDtmWnZRCZ/ [15:03] i'm trying to set up xorg+lightdm+fluxbox on a server build [15:03] i've gotten a graphical login screen after reboot [15:04] but it just fails to start session [15:06] lol if I switch from fluxbox to openbox [15:07] it lets me log in, but to just black nothing [15:07] no right click menu even [15:07] >.< [15:10] MJCD, anything in logs? [15:10] its ok third reboot was the charm for whatever reason, fluxbox still doesn't work [15:10] but i'm fine with openbox for now [17:31] rbasak: can I consider our salsa exchange as another +1 for the haproxy dep8 tests, provided tests pass in ubuntu? For this MP: https://code.launchpad.net/~ahasenack/ubuntu/+source/haproxy/+git/haproxy/+merge/362217 [17:31] cpaelzer is eod now [17:31] I induced some errors to see how it would behave, and it looks good: [17:32] - diff content: https://pastebin.com/Jcn8TE4A [17:33] - diff size: https://pastebin.com/4y4crD39 [17:57] ahasenack: yes, and +1 added to the MP [17:58] rbasak: thanks === Tornevall is now known as Guest36728 [20:05] lordievader: thanks for responding. i can't, no. [20:07] if i try to mount the root filesystem manually at the initramfs prompt, it says the same thing - "mount: mounting /dev/mapper/vg1-root on /root failed: no such device" [20:07] all of the lvm pieces are active and functioning, i can see all of the physical volumes, all of the volume groups, all of the logical volumes [20:08] i can run btrfs check on /dev/mapper/vg1-root and it runs properly and successfully, and finds no errors [20:08] lunaphyte: broke it again?! [20:09] TJ-: :) - hah - no, this is a different one [20:10] https://9unvsq.bn.files.1drv.com/y4m6YIxUZfXzd96cuCzcKJR22rzSeBCy-2VUrndZk5gLlOyk52M6w8oZgr-ybtLbnVvC1trcidcR8L1ndgSUzH9NSysNDl1jTUD5_wBIPQ_PwSMIrWNa69rqvVv_MgV3o2czkQebQUj27idfawPbDN5jaY40rOW5swWLpKXpiu3yMgcPKj_PP6GROCAvAJpnpn3maXS_5GPsrkocfuEEbbHQQ/residue_2019_01_24.PNG?psid=1 [20:10] lunaphyte: compare the dm-X number shown by "ls -l /dev/mapper/vg1-root" with "ls -l /dev/dm-X" to ensure it exists [20:11] lunaphyte: oh, a VM image? is the underlying file corrupt. You shouldn't get the kernel stracktrace [20:12] TJ-: i don't think anything is corrupt. i can boot off the installer, in rescue mode, and successfully mount all filesystem and chroot into them as the root [20:12] yeah, it's a vmware guest [20:13] lunaphyte: is this always a VMware guest? [20:13] yes [20:13] it always has been, from its inception, and always will be [20:13] lunaphyte: any changes to the hardware profile ? [20:14] no changes to hardware [20:14] just an upgrade from 17.10 to 18.04, that's it [20:14] no errors at all during the upgrade [20:14] went perfectly typical [20:14] lunaphyte: that screenshot suggests /root/ doesn't exist [20:15] lunaphyte: all commands trying to set-up the real file-system on /root/ fail - either that is missing or the real rootfs really has a major problem [20:15] /root/ does exist [20:16] lunaphyte: so does the block device symlinked from /dev/mapper/vg1-root exist? [20:16] /dev/md-0 ? [20:16] yeah, that exists [20:16] i can run fsck successfully on the device [20:16] lunaphyte: I have no idea, that that is NOT an LVM node [20:16] it properly recognizes the filesystem [20:16] lunaphyte: as I said earlier: ---> compare the dm-X number shown by "ls -l /dev/mapper/vg1-root" with "ls -l /dev/dm-X" to ensure it exists [20:17] yeah, it does [20:18] bah, sorry [20:18] i missed that irc ignored what i wrote because of the leading slash [20:18] /dev/mapper/vg1-root points to /dev/dm-0, which exists [20:19] lunaphyte: the stacktrace shows that get_fs_type() fails, so what does "blkid /dev/dm-0" report? [20:19] https://pasteboard.co/HYJHXiJ.png [20:19] one moment [20:20] https://pasteboard.co/HYJIrFu.png [20:21] sorry i can't just use normal text. i'm stuck inside a virtual guest console [20:21] I know :) [20:22] So, this is really looking like a btrfs problem. is the module available? "lsmod | grep btrfs" [20:23] ugh. no lsmod in this busybox, it seems [20:24] /proc/modules ? [20:24] lunaphyte: "grep btrfs /pro/mocules" [20:24] it's ugly but still mostly legible [20:24] modules! [20:24] i did check the initrd earlier with lsinitramfs and it seems to allegedly include the btrfs module [20:24] ah, ok, one moment [20:24] lunaphyte: and more importantly: "find /lib/modules -name 'btrfs*' [20:26] aha, it seems the module may not be loaded [20:26] it does exist though [20:26] btrfs.ko [20:27] lunaphyte: "modprobe btrfs" [20:27] lunaphyte: that ought to be loaded by udev rules based on identifying the metadata, so there may be some issue with udev rules, or 'strange' metadata [20:28] hmm, ok [20:28] well, modprobe didn't complain, but it's still not appearing in /proc/modules [20:28] dmesg | tail ? [20:29] lunaphyte: corruption? :p [20:31] sarnold: the kernel stack-trace makes me suspicious; get_fs_type() should not be causing that [20:31] stack trace? I missed that part :) [20:32] ewwwwww [20:32] it's hard to tell where dmesg stops and starts between boot and modprobe [20:33] sarnold: https://9unvsq.bn.files.1drv.com/y4m6YIxUZfXzd96cuCzcKJR22rzSeBCy-2VUrndZk5gLlOyk52M6w8oZgr-ybtLbnVvC1trcidcR8L1ndgSUzH9NSysNDl1jTUD5_wBIPQ_PwSMIrWNa69rqvVv_MgV3o2czkQebQUj27idfawPbDN5jaY40rOW5swWLpKXpiu3yMgcPKj_PP6GROCAvAJpnpn3maXS_5GPsrkocfuEEbbHQQ/residue_2019_01_24.PNG?psid=1 [20:33] https://pasteboard.co/HYJNunc.png [20:34] "request-module fs-btrfs succeeded, but still no fs?" [20:38] funny, that message actually means "we tried to look up the fs, that failed, so we loaded the module, and that succeeded" [20:38] is there a way to try to force btrfs to load *earlier* ? [20:41] thing is, why isn't it loading manually [20:42] I wonder if there are missing depends? "depends: libcrc32c,zstd_compress,raid6_pq,xor" [20:49] where can i look? [20:49] kernel module dependencies? [20:50] lunaphyte: it needs the "modinfo" tool [20:50] busybox needs modinfo? [20:51] lunaphyte: no, checking the dependencies needs it [20:51] oh [20:52] lunaphyte: if the module doesn't show as loaded in /proc/modules then something is obviously going wrong [20:53] yeah, makes sense [20:53] lunaphyte: "grep btrfs /proc/modules" would show something like this: "btrfs 1155072 0 - Live 0x0000000000000000" [20:54] what if you modprobe all the dependencies manually? [20:54] They're not there else modprobe would add them, that's the weird part here [20:56] I see 5 lines with 'btrfs' in them with that grep (showing the modules btrfs depends on also loaded) [21:08] TJ-: looking at the server before it's broken, it looks like btrfs has two dependencies: xor and raid6_pq [21:08] i've loaded those two modules manually in initramfs, and that went ok [21:08] but btrfs still doesn't load [21:09] lunaphyte: check the hash/checksum of the module in case it is corrupt - compare against one known to load (from the same kernel version of course!) [21:09] ok [21:09] i just verified that nothing is appended to dmesg when doing modprobe btrfs [21:10] i see messages from the raid6 module, and nothing more after that [21:10] NIC with intel chip X550 are supported ? [21:11] lunaphyte: rtry "grep btrfs /proc/filesystems" [21:11] Blueking: probably, I see loads of X550 hits in the ixgbe bit of the source tree, and I've got an ixgbe.ko kernel module [21:12] sarnold: guess I am going with Intel NIC [21:12] TJ-: btrfs not present [21:12] lunaphyte: so something is causing it to fail, silently too. Is there a blacklist!? [21:12] but what does word 'converged' say something about NIC product ? [21:13] lunaphyte: theory: btrfs initramfs scripts aren't expecting btrfs atop of lvm? perhaps you could try force a /etc/initramfs-tools/scripts/local-top/ script that modprobes the modules? [21:13] lunaphyte: try "grep -rn btrfs /etc/modprobe.d" [21:16] lunaphyte: the udev rule /lib/udev/rules.d/64-btrfs-dm.rules should trigger "ENV{DM_NAME}=="?*", RUN{builtin}+="btrfs ready /dev/mapper/$env{DM_NAME}" " [21:16] lunaphyte: does the 'btrfs' tool exist ? [21:22] i'll check shortly - just indisposed for a moment [21:23] I've done some tests on a loopdev here and as soon as the btrfs image is connected the kernel loads the module [21:29] I've now done the same for your scenario /dev/loop5 -> PV > vg1 > root > btrfs and as soon as the image is attached to the loopdev udev causes the module to load and the FS is available [21:43] TJ-: !!!! [21:43] btrfs was blacklisted! [21:56] lunaphyte, wat? why? [21:56] pffft, sysadmins! :p [21:59] Sounds legit. ZFS for president! *runs* [23:12] yeah, it's a little bit odd, sort of [23:12] i'm the one who had blacklisted it [23:13] this is a template system, used to create other gusts from [23:13] *guests [23:14] one of the things i'd done was blacklist a bunch of various modules that served no purpose [23:14] a previous incarnation of this system used ext4, not btrfs and there was no need for btrfs, so it got blacklisted [23:15] then the new incarnation was built, and the blacklist file copied, not realizing that was still there [23:15] so that all makes sense [23:15] what i don't quite understand though is how the system was even functioning before the upgrade [23:16] if the btrfs module was blacklisted, how was the system booting, and how was the module being loaded? [23:17] anyway, that's very much for the help, as always :) [23:17] *thanks