/srv/irclogs.ubuntu.com/2024/01/15/#ubuntu-kernel.txt

=== arraybolt3 is now known as arraybolt3_tl
=== arraybolt3_wc is now known as arraybolt3
[06:55] <joseogando> petn-randall, it is for NVIDIA systems, nothing to do with graphics, desktop PCs, or anything of the sort. Most likely, unless you use NVIDIA servers, you won't need it.
[11:07] <lis> hi.  i'm working in the cockpit team at redhat.  our CI just caught a pretty bad regression in the kernel version in jammy-proposed (5.15.0.94.91) which wasn't present in the previous version
[11:07] <lis> i'd normally report that as a bug but i haven't logged into launchpad in half a decade and i lost my 2FA.  i wanted to report it here to give the best chance possible of the update not making it to stable
[11:09] <lis> analysis and a simple reproducer are in https://github.com/cockpit-project/bots/pull/5793 with a link to the patch that landed in the ubuntu kernel tree on Jan 5.  it comes down to the BLKPG_DEL_PARTITION ioctl() returning EINVAL on missing partitions when it used to return ENXIO, which breaks partprobe
[11:09] -ubottu:#ubuntu-kernel- Pull 5793 in cockpit-project/bots "Image refresh for ubuntu-2204" [Open]
[11:11] <lis> the last version our CI ran against (known-good) was 5.15.0.91.88
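The errno change lis describes can be probed with a short script. What follows is only a minimal sketch, not the reproducer from the linked pull request: it assumes a Linux host, root privileges, and a block device path plus a partition number that does not exist on it (both supplied by the caller). The constant values and struct layouts are taken from the kernel's `linux/fs.h` and `linux/blkpg.h` headers; the helper names `build_del_partition_arg`, `delete_partition_errno`, and `describe_errno` are made up for illustration. Do not point it at a partition that exists, since a successful call removes that partition from the kernel's view of the disk.

```python
import ctypes
import errno
import fcntl
import os
import sys

BLKPG = 0x1269            # _IO(0x12, 105), from <linux/fs.h>
BLKPG_DEL_PARTITION = 2   # from <linux/blkpg.h>


class BlkpgPartition(ctypes.Structure):
    """Mirrors struct blkpg_partition from <linux/blkpg.h>."""
    _fields_ = [
        ("start", ctypes.c_longlong),
        ("length", ctypes.c_longlong),
        ("pno", ctypes.c_int),
        ("devname", ctypes.c_char * 64),
        ("volname", ctypes.c_char * 64),
    ]


class BlkpgIoctlArg(ctypes.Structure):
    """Mirrors struct blkpg_ioctl_arg from <linux/blkpg.h>."""
    _fields_ = [
        ("op", ctypes.c_int),
        ("flags", ctypes.c_int),
        ("datalen", ctypes.c_int),
        ("data", ctypes.c_void_p),
    ]


def build_del_partition_arg(partition_number):
    """Build the ioctl argument; the caller must keep `part` alive."""
    part = BlkpgPartition(pno=partition_number)
    arg = BlkpgIoctlArg(
        op=BLKPG_DEL_PARTITION,
        flags=0,
        datalen=ctypes.sizeof(part),
        data=ctypes.addressof(part),
    )
    return arg, part


def delete_partition_errno(device_path, partition_number):
    """Issue BLKPG_DEL_PARTITION and return 0 on success or the errno."""
    arg, _part = build_del_partition_arg(partition_number)
    fd = os.open(device_path, os.O_RDONLY)
    try:
        fcntl.ioctl(fd, BLKPG, arg)
        return 0
    except OSError as exc:
        return exc.errno
    finally:
        os.close(fd)


def describe_errno(err):
    """Map the result onto the two behaviours discussed in the log."""
    if err == errno.ENXIO:
        return "ENXIO (reported for the known-good kernel)"
    if err == errno.EINVAL:
        return "EINVAL (reported for the jammy-proposed kernel)"
    return "success" if err == 0 else os.strerror(err)


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage (hypothetical): sudo python3 probe.py <device> <missing-partno>
    print(describe_errno(delete_partition_errno(sys.argv[1], int(sys.argv[2]))))
```

Which errno a given device produces depends on the device and kernel in question; the two labels above only restate what the log reports for 5.15.0.91.88 and 5.15.0.94.91.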
[11:16] <tjaalton> lis: fixed by 6f64f866aa1ae69 upstream?
[11:17] <lis> i'd guess not.  i think the problem is the GENHD_FL_NO_PART → EINVAL check on entrance to blkpg_do_ioctl() and this patch doesn't seem like it would change anything about that
[11:19] <lis> it doesn't look like there's much else that could return EINVAL around.  bdev_del_partition() gets called fairly directly and it can only return a few possible errnos, none of which are EINVAL
[11:19] <lis> there's one other check that could return EINVAL but it's unchanged from the known-good version of the kernel, so i guess it's not that one
[11:25] <tjaalton> well, file it upstream then?
[11:26] <tjaalton> the patch came from upstream stable, and is applied to basically every stable tree
[11:31] <lis> that makes sense, but i still thing ubuntu should stop this update from reaching stable
[11:31] <lis> *think
[11:37] <tjaalton> it's been there since v6.6
[11:37] <tjaalton> -rc1
[11:37] <lis> there's this upstream: https://marc.info/?l=linux-kernel&m=169753467305218&w=2
[11:38] <lis> someone already reported the issue, with a fix, but it was rejected since EINVAL is 'appropriate' (even though it's a behaviour change vs the previous version)
[11:58] <tjaalton> it should still be resolved upstream
[11:59] <tjaalton> and if it won't change, then I don't see why we should change things against upstream
[11:59] <tjaalton> but that's just my 2c
[12:21] <lis> https://lkml.org/lkml/2024/1/15/147 ← upstream report.  let's see what they say
[12:23] <lis> it's lunch time here.  take care!
[12:48] <DiogoConstantino> hi
[14:40] <petn-randall> joseogando: We're running NVIDIA DGX servers. Does it apply to those? What's the difference from the regular kernel?
[14:45] <joseogando> petn-randall, there is some newer hardware enabled there, although AFAIK it is mostly used with NVIDIA's custom distribution. Whether it makes sense or not likely depends on the hardware you've got. If all of the hardware that your server has is already enabled, then it likely makes no sense. Let me think for a moment....
[14:47] <petn-randall> I guess I could diff the /boot/config-* files, and also look at the difference in patches for both kernels, but I believe that info should really be in the package description.
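The config comparison petn-randall mentions could be sketched roughly as follows. This is an illustration only: it assumes the usual kernel config file syntax (`CONFIG_X=value` lines and `# CONFIG_X is not set` comments), and the function names `parse_kernel_config` and `diff_kernel_configs` are made up for this example. The two file paths are supplied by the caller, for example two `/boot/config-*` files from the kernels being compared.

```python
import re
import sys

_SET = re.compile(r"^(CONFIG_\w+)=(.*)$")
_UNSET = re.compile(r"^# (CONFIG_\w+) is not set$")


def parse_kernel_config(text):
    """Return {option: value}; options marked 'is not set' map to 'n'."""
    options = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        match = _SET.match(line)
        if match:
            options[match.group(1)] = match.group(2)
            continue
        match = _UNSET.match(line)
        if match:
            options[match.group(1)] = "n"
    return options


def diff_kernel_configs(old_text, new_text):
    """Return {option: (old_value, new_value)} for every differing option.

    A value of None means the option is absent from that config.
    """
    old = parse_kernel_config(old_text)
    new = parse_kernel_config(new_text)
    changes = {}
    for name in sorted(old.keys() | new.keys()):
        before, after = old.get(name), new.get(name)
        if before != after:
            changes[name] = (before, after)
    return changes


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage: python3 configdiff.py <config-a> <config-b>
    with open(sys.argv[1]) as file_a, open(sys.argv[2]) as file_b:
        result = diff_kernel_configs(file_a.read(), file_b.read())
    for name, (before, after) in result.items():
        print(f"{name}: {before} -> {after}")
```

A config diff shows only build-option differences; it does not show differences in the patches applied to each kernel, which is the second comparison mentioned above.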
=== cascardo_ is now known as cascardo
[14:50] <joseogando> That is definitely one way to do it. I was looking for more info but can't think of anything right now.
[14:50] <joseogando> I'll pass along your suggestion - I just did apt-cache show linux-nvidia and I see the description.
[15:08] <petn-randall> Yeah, given that NVIDIA produces a wide range of hardware for many use cases, it might make sense to make the description clearer about what the kernel does and does not do.
=== mhcerri_ is now known as mhcerri
