=== shadeslayer_ is now known as shadeslayer === ghostcube_ is now known as ghostcube [15:50] Hello! [15:50] Are there any known issues with btrfs on 3.13 on trusty? [15:51] I've received a report from someone who's seeing memory allocation failures on a system with 24 GB of RAM with dpkg and related operations. [15:51] http://pastebin.com/raw.php?i=GXaj1iHp [15:52] Basically, they're not actually running apps which require a lot of memory, but they seem to be running into some kind of problem. [15:59] backjlack, are there any special mount options being used? [16:00] i've seen issues with 3.13 btrfs on large memory systems with compression being used [16:00] cking: I'll provide /proc/mounts as soon as I get it, shouldn't take long. [16:02] sysctl config: http://pastebin.com/raw.php?i=3Guv4KaQ === ghostcube_ is now known as ghostcube [16:04] cking: http://pastebin.com/raw.php?i=zNr62crU That's /proc/mounts from that system. [16:05] Please let me know if there's anything else you'd need or if you happen to have any recommendations. [16:07] backjlack, those mount options don't look like the one's I've seen issues with [16:11] Yeah, I know what you mean. I've had issues with compression on an older 3.10. [16:13] backjlack, it does seem that there are a lot of slabs being used for inodes, I make it ~4.65GB [16:13] but still, that's still not all of memory [16:14] There's btrfs as well. [16:14] backjlack, what's the kernel reporting in dmesg? [16:15] 3.13.0-43 [16:15] backjlack, I mean, is the kernel reporting any out of memory failures in the kernel log? [16:16] backjlack, it does seem heavily loaded on memory, the conntrack messages possibly indicate lots of open connections etc.. [16:16] This were logs I've received from the same system before the upgrade to 3.13.0-43: http://pastebin.com/raw.php?i=rSpQU9dk [16:18] http://pastebin.com/raw.php?i=rSpQU9dk [16:22] http://pastebin.com/raw.php?i=X8pggEsp [16:23] backjlack, I believe there used to be vm.zone_reclaim_mode sysctl that could be set 1 to force cached memory to be reclaimed [16:24] not checked on that lately, but it's worth a punt [16:25] cking: Thanks! I've passed that on. [16:25] Also, btrfs seems to have some kind of problem. [16:26] backjlack, in what way? [16:26] (mind you, btrfs is experimental, so it does not surprise me) [16:26] http://pastebin.com/raw.php?i=X8pggEsp [16:27] There's a btrfs related stack trace in there. [16:28] backjlack, i also suggest that vm.min_free_kbytes = 242000 is enabled (or see what it is currently set at), this may help [16:29] From what I've been told, these errors were still being encountered, even when that was enabled. [16:34] backjlack, it would be interesting to see how the slabinfo changes in time to see if there is any leaking or it perhaps is just running low on memory because you are running a memory intensive system config [16:34] cking: It's not my system, it's from a user. However, I'm interested in making sure the trusty kernel is stable and doesn't have problems. [16:35] It's being used by a lot of people and making sure it doesn't have such problems would be great. [16:35] What would you need? Periodic snapshots of slabinfo? [16:35] backjlack, periodic snapshots of slabinfo would be useful, eg. every minute or every 5 mins [16:36] as it stands, btrfs being used in production environments when it is "experimental" is a tad worrysome [16:38] It's a dev environment, but this is important. [16:38] Such usage catches bugs which are rather difficult to catch with automated fs testing. [16:38] sure, I agree, I think a bug should be opened, and we can work through this [16:39] I've requested snapshotting for slabinfo and will open an issue after I get the slabinfo snapshots over a period of 24 hours. [16:39] backjlack, well, I'm working on a thorough thrashing of btrfs and I will be backporting fixes once I've identified the kitten killer issues [16:39] There are issues even in newer kernels, like 3.14, 3.15 and so on. [16:39] 3.19-rc2? [16:39] I've got stacktraces, but couldn't reproduce so far. [16:40] I haven't tried that myself or in any dev environment yet. Reproducing the issues is rather difficult. [16:41] Getting a hard btrfs crash which takes down the whole system or just crashes btrfs reproduced can be very difficult. [16:41] backjlack, not with my tests, I crash it daily ;-) [16:41] Is there a repository for these tests? [16:42] backjlack, xfs tests, generic and xfs specific ones [16:42] I mean "btrfs specific ones" [16:43] Ah, I see. Ok, I'll try those as well. [16:43] and I'm testing against a wide mix of mount options, I've been hammering test configs for ~3+ weeks non-stop and I'm building a matrix of failure points [16:43] then I start identifying fixes and backporting them [16:44] I'm on the case [16:50] ## [16:50] ## Kernel team meeting in 10 minutes in #ubuntu-meeting [16:50] ## [16:54] apw, LP: #1400289 ... can you verify ? [16:55] Launchpad bug 1400289 in linux (Ubuntu Vivid) "Hyper-V: drivers:scsi:storvsc: Fix a bug in handling ring buffer failures that may result in I/O freeze" [High,Fix committed] https://launchpad.net/bugs/1400289 [16:57] dannf, LP: #1381084 ... can you verify? [16:57] Launchpad bug 1381084 in linux (Ubuntu Trusty) "xgene-enet: add 10GbE support" [Medium,Fix committed] https://launchpad.net/bugs/1381084 [16:57] bjf, ack [16:59] cmagina: ^ can you verify LP: #1381084 for bjf? [16:59] Launchpad bug 1381084 in linux (Ubuntu Trusty) "xgene-enet: add 10GbE support" [Medium,Fix committed] https://launchpad.net/bugs/1381084 [16:59] arges, there are two for you: LP: #1396235 and LP: #1401150 [16:59] Launchpad bug 1396235 in linux (Ubuntu Utopic) "Ubuntu - unable to use XMON debugger (running ppc64le on PowerVM)" [Medium,Fix committed] https://launchpad.net/bugs/1396235 [16:59] Launchpad bug 1401150 in linux (Ubuntu Utopic) "Endianness issue in the VPHN topology update code" [Medium,Fix committed] https://launchpad.net/bugs/1401150 [16:59] bjf: was just looking at those : ) [16:59] arges, would you be able to verify those if i let you have modoc for a bit? [17:00] ## [17:00] ## Meeting starting now [17:00] ## [17:00] dannf, bjf: sure [17:00] bjf: i'd be able to verify 1396235, but 1401150 is a perf fix i could only verify it doens't break things [17:01] arges, ok, i'll see about freeing up modoc for a bit ... lets shoot for tomorrow if that's ok [17:01] bjf: works for me [17:03] bjf, utlemming is on the case [17:04] cool [17:07] sforshee, you think we can mark LP: #1275879 as verified? [17:07] Launchpad bug 1275879 in linux (Ubuntu Vivid) "Kernel panic " [High,Fix committed] https://launchpad.net/bugs/1275879 [17:08] bjf: looking [17:09] bjf: I think so, we'll be getting that commit via upstream stable either way === jsalisbury changed the topic of #ubuntu-kernel to: Home: https://wiki.ubuntu.com/Kernel/ || Ubuntu Kernel Team Meeting - Tues January 13th, 2015 - 17:00 UTC || If you have a question just ask, and do wait around for an answer! If the question is should I file a bug for something, likely you can assume yes. === broder_ is now known as broder