[00:00] <tyhicks> bug: https://bugs.launchpad.net/ubuntu/+source/linux-hwe-edge/+bug/1852581
[00:00] <ubot5> Ubuntu bug 1852581 in linux (Ubuntu) "hwe-edge kernel 5.3.0-23.25 kernel does not boot on Precision 5720 AIO" [Critical,In progress]
[00:00] <tyhicks> fix: https://lists.ubuntu.com/archives/kernel-team/2019-November/105544.html
[00:00] <tyhicks> sarnold, shibboleth: ^
[00:01] <sarnold> tyhicks: heh, bummer the simple simple fix can't actually be used by anyone but us
=== cjwatson_ is now known as cjwatson
[08:40] <m90s> Hi, I found a way to get a kernel oops by using tcpdump to write to a file on a USB stick and then unplugging the USB stick.
[08:40] <m90s> What is the best way to report this?
[08:41] <m90s> I was able to reproduce this on multiple platforms and kernel versions.
=== fling is now known as goffee
=== goffee is now known as fling
=== henrix_ is now known as henrix
[13:11] <sub526> I installed the crash dump tools (kexec/kdump) to acquire a crashed Linux kernel dump and enabled the “kernel.panic = 60”, “kernel.softlockup_panic = 1” and “kernel.hardlockup_panic = 1” variables. I triggered some kind of misbehavior in the kernel by writing to the /sys/kernel/debug/provoke-crash/DIRECT file. I see that my system just
[13:11] <sub526> reboots without copying the crash dump to /var/crash. Can someone help me collect the coredump?
[13:11] <sub526> *crash dump
[13:18] <sub526> My system has the linux-crashdump, kexec-tools and crash deb packages
[13:18] <sub526> and kdump-tools
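Before triggering a test crash, it can help to confirm that the three sysctl settings sub526 mentions are actually in effect on the running kernel. The following is a minimal Python sketch, assuming the settings are exposed under /proc/sys/kernel (some of these files may be absent depending on the kernel configuration). The helper names `read_panic_sysctls` and `mismatches` are hypothetical and only for illustration.

```python
from pathlib import Path

# The three settings named in the log, with the values sub526 reports using.
EXPECTED = {"panic": 60, "softlockup_panic": 1, "hardlockup_panic": 1}


def read_panic_sysctls(base="/proc/sys/kernel"):
    """Return {name: int or None}; None means the file is missing or unreadable."""
    values = {}
    for name in EXPECTED:
        try:
            values[name] = int((Path(base) / name).read_text().strip())
        except (OSError, ValueError):
            # Assumption: a missing file means the kernel does not expose it.
            values[name] = None
    return values


def mismatches(values, expected=EXPECTED):
    """Return {name: (actual, wanted)} for every setting that differs."""
    return {
        name: (values.get(name), wanted)
        for name, wanted in expected.items()
        if values.get(name) != wanted
    }


if __name__ == "__main__":
    current = read_panic_sysctls()
    for name, (actual, wanted) in mismatches(current).items():
        print(f"kernel.{name}: have {actual}, want {wanted}")
```

An empty result from `mismatches` only says the settings match; it does not show that a crash kernel is loaded.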
[14:13] <connor_k> m90s, the best place to report that is by filing a bug on Launchpad: https://bugs.launchpad.net/ubuntu/+source/linux
[14:21] <cascardo> connor_k: I guess that person is gone from IRC already
[14:22] <cascardo> sub526: do you have console access? a virtual terminal would suffice (tty1)
[14:22] <connor_k> cascardo: yeah I suppose so :-/ maybe all the joins/leaves aren’t present in the history from my bouncer to my phone. That’s okay, still good advice in general lol
[14:23] <sub526> cascardo: yes, I have a monitor connected
[14:23] <cascardo> sub526: and which version of kernel and kdump-tools do you have?
[14:24] <cascardo> sub526: okay, I mean a console as opposed to a graphical environment. at least, that could help find out whether the panic kernel is executed at all
[14:24] <sub526> cascardo: kdump-tools: 1:1.6.3-2~16.04.1 and a 5.0 kernel
[14:25] <cascardo> sub526: that seems to be xenial, what is uname -r ?
[14:26] <sub526> For debugging purposes, I compiled and installed the vanilla 5.0 kernel on an Ubuntu 16.04.4 machine
[14:27] <cascardo> well, that version of makedumpfile probably does not support 5.0. at best, you would get a very large dumpfile on /var/crash/
[14:27] <cascardo> but at worst, it wouldn't be able to dump the kernel at all
[14:27] <cascardo> sub526: by debugging purposes, do you mean debugging the kdump/crash situation or something else?
[14:30] <sub526> cascardo: no, I'm facing a system hang issue in my actual test case... So to debug this I enabled a few debug options like KASAN on the plain kernel and then rebuilt it.
[14:31] <sub526> cascardo: Before executing the actual test case, I'm trying to validate whether my system supports collecting the crash dump or not...
[14:31] <cascardo> sub526: well, I would suggest you get a more recent makedumpfile/kdump-tools too, then, if that's possible
[14:32] <cascardo> sub526: and which config did you use? the same one as Ubuntu's? and which kernel source?
[14:33] <sub526> cascardo: I downloaded the kernel source from https://mirrors.edge.kernel.org/pub/linux/kernel/v5.x/
[14:35] <cascardo> okay, so that doesn't contain Ubuntu patches
[14:35] <cascardo> what about the config?
[14:35] <sub526> Regarding .config, I added certain debug options under the 'kernel hacking' menu
[14:35] <cascardo> added with respect to what base?
[14:35] <sub526> I ran make defconfig and then make menuconfig
[14:37] <sub526> my .config has above stuff related to kexec
[14:39] <sub526> do you see any issues in this approach?
=== ben_r_ is now known as ben_r
[14:41] <cascardo> sub526: what about CONFIG_CRASH_CORE ?
[14:42] <sub526> cascardo: CONFIG_CRASH_CORE=y
[14:43] <cascardo> sub526: okay, so what do you get on your console after panic?
[14:43] <cascardo> sub526: and by the way, can you update kexec-tools and makedumpfile/kdump-tools ?
[14:43] <cascardo> yeah, kexec-tools could be an issue
[14:44] <cascardo> by update, I mean you should get a version from bionic, at least, but better if newer than that, like the one from eoan
[14:44] <sub526> cascardo: sure, I will update those tools. sudo apt-get update and then install, or some other method?
[14:46] <sub526> On the console I see the corresponding crash dump, and after the kernel.panic timeout it reboots, but /var/crash is empty. There is also no crash log in /var/log/kern.log
[14:48] <cascardo> sub526: what do you mean by corresponding crash dump ?
[14:48] <sub526> Dumb question: console log means whatever is displayed on the monitor, right? What exactly does console mean?
[14:48] <cascardo> sub526: yeah, if you see reboot logs and crash dump logs on the monitor, that's sufficient
[14:49] <sub526> cascardo: I triggered the crash via a SysRq key press and I can see the corresponding log on the monitor
[14:49] <cascardo> sub526: what is the version of kexec-tools?
[14:50] <sub526> let me check
[14:50] <cascardo> I checked that xenial-updates has a reasonably recent version
[14:50] <cascardo> yeah, 2.0.16, not too old
[14:53] <sub526> cascardo: As per https://help.ubuntu.com/lts/serverguide/kernel-crash-dump.html, I understood that manual intervention is required in order to capture the memory for 'machine exceptions'. What exactly does ‘manual intervention’ mean?
[14:54] <cascardo> sub526: I haven't seen that guide before, I am not certain what that means. but you are doing the right thing in testing that it's working, because kdump is not guaranteed to succeed, as you can see for yourself
[14:55] <cascardo> sub526: I need to attend a meeting and head out after it, can you open a bug? I can't promise I can work on it, unless you can reproduce it with the Ubuntu kernel. you should try linux-hwe, you will get a 4.15 kernel on xenial
[14:56] <sub526> cascardo: Sure, thanks for your support. Bye for now.
[14:56] <cascardo> sub526: and do you see makedumpfile being called at all? is that what you mean by corresponding log? or only the panic stack trace?
[14:57] <sub526> I did not check whether makedumpfile is being called, but I see the panic stack trace.
[14:58] <sub526> what exactly needs to be checked to tell whether makedumpfile is called or not?
[14:59] <cascardo> sub526: well, there should be two reboots
[15:01] <sub526> cascardo: /sbin/reboot is part of systemd-sysv, does this get called? what is the other reboot?
[15:29] <cascardo> sub526: so, what I mean is that after the panic, the system should start executing the kdump kernel, which will trigger the capture of the dump followed by a cold reboot
[15:30] <cascardo> if you don't see the logs for the kdump kernel after the panic, but a cold reboot, then there will never be an opportunity for makedumpfile to execute
[15:31] <cascardo> if you see those logs, but that kdump kernel panics itself, or there is an OOM on that kernel, then those logs would be useful to understand what is happening. in the case of an OOM, increasing the memory for the crashkernel allocation should fix it
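The flow cascardo describes depends on two preconditions that can be checked before the panic: memory reserved through a crashkernel= boot parameter, and a crash kernel loaded by kexec. The following Python sketch checks both, assuming the running kernel exposes /proc/cmdline, /sys/kernel/kexec_crash_loaded and /sys/kernel/kexec_crash_size. The function names are hypothetical and this is an illustration, not part of kdump-tools.

```python
import re
from pathlib import Path


def crashkernel_args(cmdline):
    """Return every crashkernel= value found in a kernel command line string."""
    return re.findall(r"(?:^|\s)crashkernel=(\S+)", cmdline)


def read_int(path):
    """Read an integer from a small pseudo-file; None if missing or malformed."""
    try:
        return int(Path(path).read_text().strip())
    except (OSError, ValueError):
        return None


def kdump_preflight(cmdline_path="/proc/cmdline", sys_kernel="/sys/kernel"):
    """Collect the facts that decide whether a kdump kernel can start at all."""
    try:
        cmdline = Path(cmdline_path).read_text()
    except OSError:
        cmdline = ""
    return {
        # Memory reservation requested at boot, e.g. "384M-:128M".
        "crashkernel_args": crashkernel_args(cmdline),
        # 1 when a crash kernel is loaded, 0 when it is not.
        "crash_kernel_loaded": read_int(Path(sys_kernel) / "kexec_crash_loaded"),
        # Bytes actually reserved; increasing this is the fix suggested for an OOM.
        "reserved_bytes": read_int(Path(sys_kernel) / "kexec_crash_size"),
    }


if __name__ == "__main__":
    for key, value in kdump_preflight().items():
        print(f"{key}: {value}")
```

If `crash_kernel_loaded` is 0 or `reserved_bytes` is 0, the panic has no kdump kernel to jump to. That would match the symptom of a single reboot with nothing written to /var/crash.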
[15:32] <sub526> cascardo: what should kernel.panic be set to?
[15:34] <cascardo> sub526: do you see the "Rebooting in 60 seconds..." message? if you do, then the crash kernel is not being executed
[15:34] <cascardo> what is the result of kdump-config status ?
[15:34] <cascardo> and kdump-config show ?
[15:36] <sub526> cascardo: current state   : ready to kdump
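The status line sub526 pastes can also be checked from a script. The following Python sketch parses the output of `kdump-config status`. It assumes the output keeps the "current state   : ..." shape quoted in the log, and the helper names are hypothetical.

```python
import subprocess


def parse_kdump_state(status_output):
    """Extract the text after 'current state :' from kdump-config status output."""
    for line in status_output.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() == "current state":
            return value.strip()
    return None


def kdump_state():
    """Run kdump-config status; None if the tool is missing or prints no state."""
    try:
        result = subprocess.run(
            ["kdump-config", "status"], capture_output=True, text=True
        )
    except FileNotFoundError:
        return None
    return parse_kdump_state(result.stdout)


def is_ready(state):
    # Assumption: the exact string reported in the log means kdump is armed.
    return state == "ready to kdump"
```

As the rest of the conversation shows, a "ready to kdump" state does not guarantee that the dump is captured. It only shows that kdump-tools considers the crash kernel loaded.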
[15:38] <sub526> cascardo: https://pastebin.com/Lb0tADcZ
[15:46] <sub526> cascardo: kdump-config show, looks ok?
[15:53] <cascardo> sub526: that looks ok
[15:53] <cascardo> sub526: I need to go out now, just let me know if you see the "Rebooting in 60 seconds..." or not
[15:54] <sub526> cascardo: I did not see that log
[15:54] <sub526> What does it mean?
[16:07] <sub526> cascardo: Thanks a lot for your support. I need to leave now... will catch you next week.
=== himcesjf_ is now known as him-cesjf
