/srv/irclogs.ubuntu.com/2018/01/11/#ubuntu-kernel.txt

rellisSo two things, first I just wanted to say thanks to all of you folks for so much kernel work over the holidays, you rock.03:17
rellisSecond I wanted to ask if anyone knows if the table on this page with versions is getting updated as new kernels become available? I'm just asking because my machines on 16.04 LTS linux-aws are getting 4.4.0-1048.57 which I don't see listed there. I wasn't sure if the page just hadn't been updated or I was misinterpreting what they're doing with that table.03:19
rellisWhen I said this page I meant: https://wiki.ubuntu.com/SecurityTeam/KnowledgeBase/SpectreAndMeltdown03:19
=== Elimin8r is now known as Elimin8er
apwrellis, i would expect the table to have the first version it was nominally addressed in to be in the table, so as long as you have a higher one than that08:33
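A rough sketch of the check apw describes: compare the installed kernel version against the first version listed as fixed. The `first_fixed` value below is a hypothetical placeholder and is not taken from the wiki table. The helper names are made up for illustration, and the integer-tuple comparison is a simplification of Debian version ordering that only suits plain numeric versions such as 4.4.0-1048.57.

```python
import re


def version_key(version):
    """Split a kernel package version into a tuple of integers for comparison."""
    return tuple(int(part) for part in re.findall(r"\d+", version))


def is_at_least(installed, first_fixed):
    """Return True when installed is the same as or newer than first_fixed."""
    return version_key(installed) >= version_key(first_fixed)


installed = "4.4.0-1048.57"    # version rellis reports for linux-aws
first_fixed = "4.4.0-1047.56"  # hypothetical placeholder, not from the wiki table
print(is_at_least(installed, first_fixed))
```

For anything beyond simple numeric versions, `dpkg --compare-versions` is the authoritative comparison on an Ubuntu system.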
klebersricotz, for which series?08:39
ricotzklebers, noticed it with xenial, but probably all08:42
alkisgPlease also make it Recommends instead of Depends, as the rpi firmware is often provided outside of the real installation (e.g. netbooting), and the linux-firmware-raspi2 postinst fails on chroots etc08:50
threshI wonder if the tools described in https://wiki.ubuntu.com/Kernel/CrashdumpRecipe will work for me to debug that crash?08:51
thresh(with 4.13.0-21.24~16.04.1 and later)08:51
klebersricotz, alkisg: thanks a lot for the input, I'll discuss it with the raspi2 guy09:01
alkisgThank you klebers09:01
ricotzklebers, yw09:04
threshI'm kinda curious right now - I've installed linux-crashdump;  I have GRUB_DEFAULT="1>2" to choose the working kernel on boot automatically (as it's a remote server);  what do I do now to force booting into the faulty kernel, and getting a dump out of it?09:06
thresh(I'm kinda stupid, too, which is why I need some guidance)09:06
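For context, a GRUB_DEFAULT value such as "1>2" is a path through the boot menu: entry 1 at the top level (a submenu), then entry 2 inside it, both counted from zero. A small sketch that decodes such a value follows. The function name is made up for illustration, and it handles only numeric indices, not menu entry titles.

```python
def parse_grub_default(value):
    """Turn a numeric GRUB_DEFAULT path like '1>2' into a list of menu indices."""
    return [int(step) for step in value.strip('"').split(">")]


# Index 1 at the top level, then index 2 inside that submenu.
print(parse_grub_default('"1>2"'))
```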
zioprotoIf you are upgrading because of SPECTRE/Meltdown, be aware that your hardware might not boot if you hit this bug https://bugs.launchpad.net/ubuntu/+source/linux/+bug/174263012:03
ubot5Launchpad bug 1742630 in linux (Ubuntu) "Booting from 4.13.0-21-generic leads to Oops: NULL pointer dereference - RIP: isci_task_abort_task+0x30/0x3e0 [isci]" [Undecided,Confirmed]12:03
zioprotoThis is a "server thing" for us12:03
threshhey, I have this kernel failing for me, too12:03
threshbut I wasn't lucky enough to get the trace12:03
zioprotothresh: read the bug description, it is already fixed in Debian12:03
zioprotothresh: do you have Quanta hardware as well ?12:04
threshI'm on Hetzner dedicated server, which runs AMD Ryzen and some Supermicro motherboard IIRC.12:04
zioprotothresh: it is a server thing as I mentioned12:04
zioprotoI hope the bug gets the right attention. I opened it this morning12:04
threshmany thanks!12:04
zioprotoI figured out the issue was already fixed in Debian but it was unknown in the Ubuntu community12:05
threshI was collecting data on the kernels I could boot before I could file a bug as well.12:05
zioprotowell, the issue is well described if you follow those links on Launchpad12:05
zioprotodiscussion on the kernel mailing list and so on12:05
threshyep, I'm reading it at the moment12:05
zioproto:)12:05
zioprotoSo, I hope the information reached the right people and the bug gets triaged :) thanks !12:06
threshwell, I'm definitely not the right person to fix it12:06
threshjust someone who was struggling with booting the hwe kernel since yesterday12:06
zioprotothresh: do you know who we can tag in this IRC channel so that the bug gets attention ?12:07
zioprotowe have to upgrade a ~100-server OpenStack cluster and we are blocked by this bug :(12:07
threshthe person who offered help yesterday is not here, but he asked me to file a bug on launchpad when I'm done - so I guess opening a bug is the way to get attention12:08
zioprotothresh: makes sense :)12:09
zioprotoif you remember the nickname we can assign the bug on Launchpad12:09
zioprotoif somebody was already working on this specific issue12:10
threshit was jsalisbury12:10
zioprotothresh: also it would be great if you can click in Launchpad "Yes This bug affects me"12:10
zioprotoso that we get a headcount of how many people have a problem with this kernel12:10
TJ-zioproto: have you tested the latest image? 4.13.0-23 ?12:10
threshI cannot really confirm since i'm not sure this is the exact bug12:11
threshI'm having12:11
zioprotoTJ-: no I did not test it. But now I will go first to look at the changelog to see differences between 4.13.0-21 and 4.13.0-23. Thank you12:12
threshI think the latest is -26, no?12:13
threshand 21 vs 26 is mostly kpti patches12:14
threshzioproto, and I have a very different SATA controller I think12:16
zioprotoI can't find the changelog, do you have the link ?12:16
threshno, I'm reading /usr/share/doc/linux-image-extra-4.13.0-26-generic/changelog.Debian.gz on the machine where I have the image installed12:17
zioprotoI have this http://changelogs.ubuntu.com/changelogs/pool/main/l/linux/12:17
zioprotobut the version 4.13.0-23 is not listed12:17
threshhttp://changelogs.ubuntu.com/changelogs/pool/main/l/linux-hwe/linux-hwe_4.13.0-26.29~16.04.2/12:18
TJ--26 is the latest now... keeps moving too quickly for me :D12:19
zioprotoI have tested this version linux-image-generic-hwe-16.04         4.13.0.26.4612:25
zioprotothis is not working for me12:25
TJ-zioproto: update the bug report so we know it's affecting the current image12:30
zioprotoTJ-: ok !12:30
TJ-zioproto: I'd suggest altering the title so instead of mentioning the version, it says "PTI kernels" and list the versions tested in the description12:31
zioprotoall done12:33
threshhmm, I was able to run the mainline kernel 4.14, linux-image-4.14.13-041413-generic14:46
threshwhich one is the linux-image-4.13.0-26-generic based on? 4.13.13?14:47
threshalright, linux-image-4.13.13-041313-generic_4.13.13-041313.201711150531_amd64.deb also fails to boot14:53
threshlinux-image-4.13.16-041316-generic_4.13.16-041316.201711240901_amd64.deb works15:00
thresh(huge thanks to whoever maintains mainline kernel builds over at http://kernel.ubuntu.com/~kernel-ppa/mainline/)15:01
threshlinux-image-4.13.14-041314.201711180632 fails, linux-image-4.13.15-041315-generic boots15:14
jsalisburythresh, do you have a bug open, so I can review all the data so far?15:18
threshjsalisbury, no, I'm going to open it right now, now that I have some information.  many thanks!15:18
jsalisburythresh, great, thanks.  I'll review everything and see if we can create a test kernel or dig deeper if needed.15:19
threshthere you go jsalisbury https://bugs.launchpad.net/ubuntu/+source/linux/+bug/174272115:38
ubot5Launchpad bug 1742721 in linux (Ubuntu) "linux-image-4.13.0-26-generic / linux-image-extra-4.13.0-26-generic fail to boot" [Undecided,New]15:38
jsalisburythresh, thanks!  I'll take a look at it shortly15:38
thresh\o/15:39
jsalisburythresh, Just to confirm, upstream 4.13.14 has the bug and 4.13.15 fixes the bug?  If that is the case, I can "reverse" bisect between those two versions.15:43
threshjsalisbury, yes, that's correct.15:43
threshI'm thinking about setting up make-kpkg or something to bisect, too.15:44
jsalisburythresh, perfect.  I'll start bisecting between those two versions and start building a test kernel.15:44
threshI've done that for debian, I wonder if that's the same for Ubuntu?15:44
jsalisburythresh, I can build test kernels fairly quickly if you're able to test them?  There will be about 10 or so to test.15:44
threshjsalisbury, yeah, I can do that.15:44
jsalisburythresh, great.  Give me about 20 minutes and I'll have the first test kernel ready.15:45
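The "reverse" bisect jsalisbury mentions searches for the first commit that fixes the problem rather than the first that breaks it. A minimal sketch of the idea, assuming an ordered commit list whose first entry is known bad (4.13.14) and whose last entry is known good (4.13.15). The commit names, the list length and the `boots` callback are hypothetical stand-ins for building and booting a real test kernel.

```python
def first_fixed(commits, boots):
    """Binary-search an ordered commit list for the first commit that boots.

    Assumes commits[0] is known bad, commits[-1] is known good, and that
    every commit after the fix also boots.
    """
    lo, hi = 0, len(commits) - 1  # lo: known bad, hi: known good
    tests = 0
    while hi - lo > 1:
        mid = (lo + hi) // 2
        tests += 1  # one test kernel built and booted per step
        if boots(commits[mid]):
            hi = mid  # fix is at mid or earlier
        else:
            lo = mid  # fix is after mid
    return commits[hi], tests


# Hypothetical data: 600 commits, with the fix landing at index 412.
commits = [f"c{i}" for i in range(600)]
fixed, tests = first_fixed(commits, lambda c: int(c[1:]) >= 412)
print(fixed, tests)
```

Each test halves the remaining range, which is why a range of several hundred commits needs only around ten test kernels.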
jsalisburythresh, I built the first test kernel and posted a link to it in the bug.17:08
threshjsalisbury, thanks, I'll be able to test it in an hour (commuting home atm)17:09
jsalisburythresh, sounds good, thanks!  Just post the results and I'll build the next test kernel based on that.17:09
rellisapw: Thank you for the reply.19:19
TJ-Had a report in #ubuntu earlier; user with ~1200 AWS instances, testing the PTI upgrade and despite grub default being  -109, -100 is booting. I've asked them to enable the GRUB serial console logging to capture anything useful, because it appears the fallback kernel is being booted.19:25
Odd_BlokeTJ-: That's a little confusing; -109 is the regular xenial kernel, not the custom AWS kernel (which is -1048, I think).19:30
Odd_BlokeTJ-: If you see them again, could you ask them to file a bug in the cloud-images LP project?19:30
TJ-Odd_Bloke: this user is installing their own kernels from what I could get from them19:30
TJ-Odd_Bloke: I've asked them to file a bug if they can collect some useful data :) Not heard anything since (about an hour)19:31
TJ-one thing I saw earlier - we've had so many PTI kernel upgrades that with package autoremove we've seen situations where all the installed kernels are PTI and leaving the system unable to boot without nopti19:36
threshjsalisbury, given linux-image-4.13.14-041314-generic_4.13.14-041314.201801111932_amd64.deb also fails to boot, and I have sm750fb loaded on a working kernel... https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-stable.git/commit/?h=v4.13.15&id=c52278a636018cb8fa39b2538c6da5d35e6515f7 is likely the commit.20:32
threshjsalisbury, but, let's test kernels a bit more ;-)20:32
=== rellis is now known as rkeiii
jsalisburythresh, I posted the next kernel in the bug report.23:24
jsalisburythresh, sorry for the delay, juggling a couple other bugs as well.23:24
