[02:05] apw: just got the 4.4.112 update :)
[02:05] of course, now i'm watching the 4.4.113-rc1 :( now with minty retpoline flavor
[08:38] how can I check the status of applied fixes (IBRS and IBPB)?
[13:28] apw, bjf (cc sbeattie, tyhicks): I booted a Lenovo IdeaPad U460 into the new 4.13 linux-generic-hwe-16.04 kernel and it immediately reboots when selecting it from grub
[13:29] apw, bjf (cc sbeattie, tyhicks): it can boot the 4.10 kernel fine
[13:29] apw, bjf: because it won't boot, I can't ubuntu-bug it. can you advise on how to report?
[13:31] jdstrand, +filebug is your only real option
[13:31] apw: what would be good to attach?
[13:31] jdstrand, and that is the -31 one yes ?
[13:31] jdstrand: the kernel team will know better but in the meantime, you may want to try booting with the "nopti noibrs noibpb" kernel command line options to see if there's any different behavior
[13:32] apw: meta is 4.13.0.31.51, kernel is 4.13.0-31.34~16.04.1
[13:32] jdstrand, if you have nothing other than a blank screen then there is little to add, trying tyhicks options is the next step
[13:32] jdstrand, is this the first 4.13 you have tried on this ?
[13:32] apw: it is the first 4.13, yes
[13:32] ok
[13:32] the 4.10 was ok
[13:32] yep, it has none of the new fun applied
[13:33] * jdstrand nods
[13:33] apw: is there a pre meltdown/spectre 4.13 kernel that would be useful to try?
[13:37] jdstrand: Ubuntu-4.13.0-21.24 is the pre meltdown/spectre 4.13 kernel
[13:37] ok, I'll try that to see if it is a 4.13 thing or a security patch thing
[13:51] apw, bjf (cc sbeattie and tyhicks): ok, I filed https://bugs.launchpad.net/ubuntu/+source/linux-hwe/+bug/1744942
[13:51] Ubuntu bug 1744942 in linux-hwe (Ubuntu) "Lenovo IdeaPad U460 fails to boot with 4.13.0-31.34~16.04.1" [Undecided,Confirmed]
[13:51] I'll now try different command lines and kernels
[13:52] I think it is a Westmere/Arrandale if that is helpful
[13:52] (though you might want to check the bug to be sure)
[13:53] I also marked the bug 'Confirmed' to avoid the bot
[13:54] tyhicks: I don't see any 4.13.0-21 kernels in https://launchpad.net/ubuntu/+source/linux-hwe/+publishinghistory
[13:55] tyhicks: are you saying just boot the artful kernel?
[13:55] jdstrand: there was a 4.13.0-21 artful kernel so I just assumed there was a 4.13.0-21 linux-hwe kernel
[13:55] seems so
[13:56] that vintage is still 4.10
[13:56] jdstrand, it would have been linux-hwe-edge back then
[13:56] ah
[13:56] oh, right... linux-hwe was bumped from 4.10 to 4.13
[14:00] also if it helps, it is an i386 install
[14:00] jdstrand, it makes it less worrying :) in the sense if it is broke you are in a smaller set
[14:09] jdstrand, also does the latest 4.4 boot for you
[14:09] apw: I'll test that too
[14:10] jdstrand, ta
[14:10] apw: honestly, I forgot I was using 4.10 on that machine so going back to 4.4 wouldn't be so bad
[14:10] we'll see how things go
[14:11] jdstrand, and you finding an issue is always good
[14:18] jdstrand: does that system have more than 4GB ram?
[14:20] i have bisected a 32 bit boot failure with > 4GB ram and got 92a0f81... x86/cpu_entry_area: Move it out of the fixmap as first bad commit
[14:20] albert23: no. 4G
[14:24] hmm, I thought the >4GB was important as it boots fine when pae is disabled in the kernel
[14:39] albert23, it might not matter how much ram you have just that you are or are not using pae
[14:54] apw, bjf (cc tyhicks and sbeattie): https://bugs.launchpad.net/ubuntu/+source/linux-hwe/+bug/1744942/comments/1
[14:54] Ubuntu bug 1744942 in linux-hwe (Ubuntu) "Lenovo IdeaPad U460 fails to boot with 4.13.0-31.34~16.04.1" [Undecided,Confirmed]
[14:54] apw, albert23: note that pae is listed in cpuinfo
[14:55] apw: right now I'm in an up to date 4.4 kernel, and it seems to be working so I'll stay there for now
[14:55] (this is not my machine. if you need me to test kernels, I can, but possibly not immediately)
[14:58] jdstrand: pae is also shown when pae is disabled in the kernel, and top shows 3547 MB ram instead of 8000
[14:59] it only has 4G, I promise
[15:00] :)
[15:00] jdstrand, sounds like a plan, we may have some h/w showing similar issues, so we can investigate
[15:45] bjf, apw: FYI, chrisccoulson hit an issue where he could not boot an artful VM with the new kernel that we determined to be related to the bad intel-microcode that was reverted yesterday
[15:46] here's the bad combination of versions:
[15:46] host kernel: 4.13.0-25
[15:46] host microcode: 3.20180108.0~ubuntu17.10.1
[15:46] guest kernel: 4.13.0-31
[15:48] bjf, apw: the guest kernel booted after updating the host kernel (4.13.0-31) OR applying the reverted microcode package (3.20180108.0+really20170707ubuntu17.10.1) in the host and rebooting
[15:48] I'm glad we reverted the microcode package yesterday
[15:49] I don't think there's anything for the kernel team to do here but I wanted to point out the combination of old host kernel, bad microcode, and new guest kernel could cause VMs to not boot
[15:50] the solution is to apply the updates in the host (either the kernel or microcode but both are ideal) and reboot
[16:06] tyhicks, thanks, that is a good data point
[16:12] * alkisg still hasn't had any reply for "kernel reboots on i5's", https://bugzilla.kernel.org/show_bug.cgi?id=198529 ...
[16:12] bugzilla.kernel.org bug 198529 in x86-64 "Reboot on kernel load due to 92a0f81d" [High,New]
[16:22] apw: btw, here's the stack trace from the guest kernel that chrisccoulson saw: https://www.dropbox.com/s/rt6v13rifm7qt8z/Screenshot%20from%202018-01-23%2014-25-00.png
[16:25] tyhicks, i wonder if chrisccoulson reported a bug with that, with what he was doing to trigger it
[16:26] apw: he didn't but he could if it would help you all
[16:26] tyhicks, all and any issues; we want to know as much as possible about them
[16:27] ack
[16:27] as any one may be impossible to fathom but that third one gives you insight into the set
[16:28] chrisccoulson: please file a bug to help out the kernel team track the issue that you hit (https://wiki.ubuntu.com/Kernel/Bugs)
[16:34] tyhicks, sure, will do
[16:35] thanks!
[16:51] alkisg: that's the same commit i found bad too, on i3-4010U (intel nuc)
[16:52] that commit is known bad, see https://lkml.org/lkml/2017/12/23/121, but the fix 2 commits later still doesn't work for me
[17:04] albert23: on 32bit installation, or 64bit?
[17:05] alkisg: 32 bit
[17:08] Thanks, I verified this on a number of i5 processors, but not on any i3's until you mentioned that one
[17:12] Happens on: i5-3470, i5-4460, i5-4440, i5-6200U, and doesn't happen on i3-4170, i3-7100, Q6600, Q8300, i5-4590
[17:13] albert23: do you think we need to do something more, except the bisection I did to pinpoint the commit, and the bugzilla report?
[17:13] I would have expected some reply until now...
[17:14] maybe mark it as a regression
[17:15] Could you? And also comment that it affects you too?
[17:16] I think it's better if you do it. At least Intel graphics never liked "me too's" in bug reports
[17:17] I thought that more than 1 user needs to confirm a bug in order to get in the "confirmed" state
[17:17] Otherwise it's just monologue...
[17:17] Anyways, I'll revert the schools to 4.4 kernel and leave it for them to solve it when they can
[17:28] albert23: btw, does it boot for you if you pass acpi=off? It worked on most of i5's, but a teacher reported it didn't work for him
[17:31] * alkisg also notes that there are no instructions in the ubuntu wiki on how to revert to the non-hwe kernel series... in the past, there were...
[17:32] alkisg: yes, it does boot for me too with acpi=off
[17:32] Nice
[17:43] hmm, apparently acpi=off also means no PTI
[17:50] and also no PTI in a kvm guest and no PTI when the kernel is built without PAE
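
Note: several test boots in this log depend on which options were passed on the kernel command line ("nopti noibrs noibpb", acpi=off). A minimal sketch for recording that when reporting results is below. The function name find_boot_options and the OPTIONS_OF_INTEREST tuple are hypothetical, introduced only for illustration. The option names themselves come from the discussion above. The check only reports what was passed at boot; it says nothing about whether a mitigation is actually active.

```python
# Sketch only: list which of the boot options mentioned in the log appear
# on a kernel command line string. It does not inspect mitigation state.

# Option names taken from the chat log; this tuple is an illustrative choice.
OPTIONS_OF_INTEREST = ("nopti", "noibrs", "noibpb", "acpi=off")


def find_boot_options(cmdline, wanted=OPTIONS_OF_INTEREST):
    """Return the wanted options that appear as whole tokens in cmdline."""
    # Kernel command line parameters are whitespace separated, so a token
    # comparison avoids matching substrings of unrelated parameters.
    tokens = cmdline.split()
    return [opt for opt in wanted if opt in tokens]
```

On a running Linux system the string can be read from /proc/cmdline, for example find_boot_options(open("/proc/cmdline").read()), and the result pasted into the bug report next to the kernel version.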