/srv/irclogs.ubuntu.com/2013/03/27/#ubuntu-kernel.txt

maxbI have a weird issue where my Ubuntu machine has decided that it doesn't want to honour ICMP fragmentation needed for PMTU discovery, whilst other Linux machines on the same subnet connecting to the same destination are fine, can anyone think of any useful avenues to investigate?05:42
maxb(I'm assuming something this deep into the networking stack must be a kernel issue)05:42
=== MaJian is now known as BruceMa
ppisatimoin07:28
diwicI got a question from upstream about how Ubuntu deals with the power_save parameter for HDA codecs. I don't think we do anything (i e, just follow upstream), but is there a way to verify?07:52
RAOFdiwic: I guess check the module parameters in /etc/modprobe.d?07:54
diwicRAOF, nothing there, I was more thinking of kernel patches07:55
RAOFYou'd find them in the kernel git tree, then.07:56
RAOFThe kernel tree is based on whatever upstream commit is most recent, with all our patches on top of that, so ‘git log 3.8-rc3..’ should get you all the patches we apply.07:57
diwicRAOF, ok, thanks07:58
apwdiwic, i don't recall anything specific any more for hda, but worth looking indeed08:09
diwicapw, ack. As a side note, I think we once had a larger buffer size for hda, but that patch must have been removed again08:10
apwmaxb, pmtu discovery does not require fragmentation, indeed it requires you request no fragmentation08:12
apwmaxb, on the expectation you get a 'fragmentation required' icmp in response to anything too large.  you should be able to see those in your network traces if things are right08:13
maxbapw: Yes indeed. My problem is that I can see ICMP fragmentation needed packets arriving, but Linux doesn't seem to be taking them into account. It continues trying to use a too-large MTU with DF set08:13
apwmaxb, i wonder if they are being firewalled into the bit bucket08:14
maxbI can see them in tcpdump on the client host08:15
apwyou would expect to see them in tcpdump even if they get dropped i think, as the copies for tracing are taken very early08:16
apwdo you have iptables rules on this box08:16
maxbonly one, a nat/POSTROUTING/MASQUERADE rule, and all the chain default policies are ACCEPT08:16
apwmaxb, not that than08:18
maxbIndeed :-/08:18
apwmaxb, i assume /proc/sys/net/ipv4/ip_no_pmtu_disc is 008:18
maxbyes. Of course, disabling pmtu is a workaround for the original communications problems that set me to investigating this, but it's not ideal08:19
maxbIn theory this is a fairly average ubuntu workstation install running raring :-/08:21
maxbPerhaps I should boot a live USB and see if the problem persists08:21
apwmaxb, well now we need to try and acertain if this has always been this way or has regressed, so i would probabally grab a live CD and see if that is affected, adn then if so, grab a quantal kerenl and boot that against the raring user space08:22
apwmaxb, is it possible to test without the MASQ rule you mentioned too, as there was a case in 2.6.11 where loading rules there broke this08:23
apwmaxb, finally can you tell me which host you are having the issue with server side (privatly is fine) so perhaps i can test here and see with my raring system08:24
maxbI deleted that rule, no change. But I'll try without it from a clean reboot too08:24
maxbThe problem host is on the other side of a private IPsec tunnel08:24
apwyeah it is 'having ipt_MASQUERADE loaded' which was the trigger, though it should be fixed in theory08:24
apwmaxb, fair enough not going to be doing that then08:25
apwmaxb, a quick survey of places i visit often i do not get any icmp-fragneeded packets, sigh08:28
=== smb` is now known as smb
smbmorning08:30
apwsmb, moin08:30
smbapw, insomnia?08:30
apwsmb, sunny day and no curtains08:30
smbah08:31
smbunexpected but seems there is something bright outside here too08:31
* smb has curtains, though08:31
maxbWell, not loading the MASQUERADE rule doesn't seem to have changed matters08:39
maxbHmm, but quantal kernel raring userspace works08:45
apwmaxb, ok that implies a regression in v3.808:46
apwmaxb, so ... next we would normally ask you to try the v3.8 mainline, v3.7 mainline and v3.6 mainline kernels08:46
apwmaxb, https://wiki.ubuntu.com/Kernel/MainlineBuilds08:46
maxbwill do08:47
apwmaxb, and ... get a bug filed and let jsalisbury know so he can help us get the bisect done08:47
apw(jo and me of course the bug number)08:49
maxbI'll file a bug later today once I have some kernel-ppa tests done08:53
apwmaxb, ack09:10
brendandhenrix, do you know why the certification-testing task is marked Invalid in the lucid tracking bug? https://bugs.launchpad.net/ubuntu/+source/linux/+bug/115893909:26
ubot2Launchpad bug 1158939 in kernel-sru-workflow/verification-testing "linux: 2.6.32-46.107 -proposed tracker" [Medium,In progress]09:26
henrixbrendand: hmm... i'm not aware of any specific reason, so most likely a bot bug09:27
henrixbrendand: you can go ahead and just change the state to 'New' i believe09:28
henrixbrendand: i'm ping bjf later about that09:28
ppisatibrb10:11
maxbMy pmtud-related bisection has established the interval of v3.5.7.8-quantal .. v3.6-rc1-quantal :-/11:16
apwmaxb, ok ... what is v3.5-foo like, as that is on the same mainline11:22
maxbOh, as in determine whether a fix landed during 3.5.x ?11:23
apwmaxb, and if not it is easier to bisect v3.5->v3.6-rc1 than from .811:33
maxbJust doing a quick side trip into 3.9-rc4 to see if anything changes there, then I'll try out 3.511:34
* ppisati rush out to get some food before the conf call11:50
maxbA colleague has just observed that 3.6 saw the removal of the IPv4 routing cache12:07
maxbWhich would kind of be a good reason for this to have broken12:29
maxbExcept, I've also discovered an additional wrinkle.12:29
maxbI'm connecting to several different sites via the same IPsec gateway. And some behave differently to others12:31
maxbAccessing some, my local machine just magically decides to operate an IP MTU of 1420, and I can't see any evidence why12:32
rtg_ppisati, I assume you want those 2 patches mentioned in your response to robher applied ?12:33
rtg_if so, please submit them on the public k-t list.12:34
rtg_apw, the kbuild test robot email re: 'lib/dynamic_debug.c:1059:6: warning: passing argument 7 of 'parse_args' from incompatible pointer type' looks legitimately broken. can you have a look ?12:42
apwrtg_, sure12:44
apwmaxb, it is not making your life easy is it12:44
maxbI've just figured out that the ones when it works, is because a router in the remote site is doing MSS clamping12:45
maxbSo I think as far as Ubuntu in general is concerned, the question is "did 3.6 break pmtud?"12:47
apwmaxb, yep12:57
maxbThis commit message is quite scary - https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=3c4cfadef6a1665d9cd02a543782d03d3e6740c612:59
=== timchen`` is now known as timchen119
apwheh yep, it is :)13:25
apwrtg_, ok ... fixed and pushed13:29
rtg_apw, just going through the rest of them to see which are legit13:30
apwrtg_, great, any you want me to poke send 'em over13:31
rtg_ack13:31
=== kentb-out is now known as kentb
maswanhenrix: anything more we should do for 1111416 or is it all in hand now?14:25
henrixmaswan: nop, everythings good. the next kernel to go to updates will now have CONFIG_NFS_V4_1 ;)14:26
ppisatirtg_: i'm building another kernel to grab a stack trace, then ill send the email14:27
rtg_ppisati, ack14:27
apwppisati, your "MUSB annotation can be dropped" comment does that apply to CONFIG_USB_MUSB_OMAP2PLUS CONFIG_TWL4030_USB and CONFIG_TWL6030_USB14:29
apwrtg_, did i see you say you had the 8250_DMA thing in hand ?14:30
maswanhenrix: excellent14:30
rtg_apw, yes, I think so14:30
ppisatiapw: no, we need TWL4030 for booting14:30
ppisatiapw: does it depend on MUSB?14:30
henrixmaswan: enjoy ;)14:30
apwppisati, those are both M in armhf-generic but marked as 'y' in our annotations14:46
apwppisati, i suspect from your bootting comment 'y' is correct14:46
ppisatiapw: that's what i recalled, but if you tell me they are 'm' now (and everything works) we should just keep them as is14:47
ppisatiapw: bu let me check14:47
apwppisati, ack14:47
apw(to letting you check)14:47
maxbSo I *think* I've figured out what's going on now, it looks like in the refactorings in 3.6, support for fragmentation needed packets which don't supply next hop mtu information (i.e. set the field to zero) got dropped14:49
maxbOff to file an actual bug at this point14:49
apwmaxb, great14:50
maxbThe bug that I have been talking about for most of today is now bug 116096615:27
ubot2Launchpad bug 1160966 in linux (Ubuntu) "PMTU discovery no longer works in Linux 3.6+ with routers that do not send next hop MTU information" [Undecided,New] https://launchpad.net/bugs/116096615:27
jsalisburyrtg_, apw should we be building the ddeb packages for Precise?  bug 116067415:39
ubot2Launchpad bug 1160674 in linux (Ubuntu) "ddeb package missing for 3.2.0-31-generic kernel (and 3.2.0-30 too)" [Medium,Confirmed] https://launchpad.net/bugs/116067415:39
rtg_jsalisbury, they prolly _are_ getting built, but perhaps they aren't getting copied. bjf ?15:40
ppisatiapw: ok so, booting from mmc doesn't work15:40
ppisatiapw: let me check if making them =y fixes it15:40
ppisatiapw: i recall it was mmc related15:40
apwmaxb, if i am reading this commit correctly the issue is that there is a 6 year old router which does not support pmtu correctly in the channel?15:48
rtg_jjohansen, 'UBUNTU: SAUCE: apparmor: Add the ability to mediate mount' is broken in raring. the prototype for struct security_operations.sb_mount has changed. apparmor_sb_mount() needs to be changed accordingly. I wonder how this even works.15:48
rtg_jjohansen, oh , never mind. I was looking at the wrong function.15:49
apwrtg_, we may only use that support for lxc ?15:50
rtg_apw, its just the addition of 'const' to a couple of the parameters.15:50
maxbapw: It supports PMTU, it just uses the original RFC792 definition of what an ICMP fragmentation needed packet should look like15:54
apwmaxb, i assume this means you can work round it by mss clamping at the source end16:08
apwmaxb, ie at your linux box16:08
apwmaxb, 792 isn't exactly helpful in defining the PMTU form clearly is it16:11
apwmaxb, ok this is better described in rfc1191 which says16:20
apw"Hosts MUST be able to deal with Datagram Too Big messages that do not16:20
apw   include the next-hop MTU, since it is not feasible to upgrade all the16:20
apw   routers in the Internet in any finite time. "16:20
apwmaxb, so you might want to add that to the two bugs, indicating we stopped being compliant there16:20
apwmaxb, but i would not be too hopeful of upstream ever putting this back, they seem to think your router is too old to care about, is there a reason it is not upgraded to a later version of openbsd there seems to be a bunch of later versions16:24
apwmaxb, either way i would be interested if a simple mss clamp would sort you out16:24
maxbReason for not upgrading is merely round tuits.16:25
maxbA MSS clamp should work.16:25
apwlets see what they say on your bug, but i am expecting ... "heee, that would hurt" or something helpful16:26
apwjsalisbury, i don't think reverting that patch will fly on its own, i am expecting you will find it is part of a larger series you'll never unpick16:26
jsalisburyapw, ack.  16:29
apwmaxb, that said it is not clear we could not just pass the 0 down and handle it as a 'mtu -= 16' or something until it works16:36
maxbThe prior behaviour was to pick from a descending list of common MTU sizes16:37
apwwell ... based on some random info in the packet16:38
rtg_ppisati, I pushed your 2 highbank patches on raring master-next. please check that they are correct.16:38
apwthe issue is that changing the mtu there is not allowed16:38
apwbut ... it must be changed lower down16:39
ppisatiapw: ok, i don't have any mmc-only installation anymore16:43
ppisatiapw: drop the annotation and leave these as modules16:43
apwppisati, thanks16:49
apwmaxb, this is a raring box yes ?16:49
maxbyes, that is right16:49
ppisatirtg_: i think you lost part of robher cfg16:49
ppisatirtg_: let me do that16:50
rtg_ppisati, ack16:50
apwmaxb, ok ..  i have had a go at just reinstating the most basic 'step down until it works'16:57
apwmaxb, down in the bit where we normally update mtu anyhow, i'll get you a kernel to test16:58
rtg_ppisati, I'm off to grab a bite. ping me when you have those config options done. I need to upload this stuff today.17:04
* rtg_ -> lunch17:04
ppisatirtg_: yep, i'm building another kernel17:04
ppisatirtg_: houston, we have a problem17:29
ppisatirtg_: i'm checking if it's our stuff or robher but17:29
ppisatirtg_: http://paste.ubuntu.com/5652876/17:29
ppisatirtg_: on highbank17:29
robherppisati: you need the cpuidle disable by default patch.17:32
ppisatirobher: was it part of your pull?17:35
ppisatirobher: ok, saw what's missing17:38
dobeyanyone around knowledgeable enough about intel/ivybridge to look at https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1021924 perhaps?17:38
ubot2Launchpad bug 1021924 in linux "Multiple Displays not working on Core i7 3770S + Intel DQ77MK motherboard" [Medium,Confirmed]17:38
apwdobey, did you try any if the 3.9-rc kernels as yet ?17:57
ppisatirtg_: ok, all sent18:07
* apw calls it a day18:11
=== psivaa is now known as psivaa_afk
rtg_ppisati, both batches of patches ?18:13
rtg_ppisati, pushed18:19
ppisatirtg_: k18:20
* ppisati -> dinner18:20
=== samson_ is now known as fragmede
fragmedeHi all; I'm not seeing a tag for Ubuntu-3.8.0-14.24 in git://kernel.ubuntu.com/ubuntu/ubuntu-raring.git.18:35
rtg_fragmede, oops, just re-pushed18:35
fragmedeGreat, thanks!18:36
rtg_ogasawara, ok, looks like I'll be able to upload raring pretty quick. I was beginning to wonder if this highbank stuff was gonna come together in time for the Beta freeze.18:44
ogasawarartg_: ack18:44
dobeyapw: i haven't. given the lack of comments on the upstream bug though, i doubt it will fix it if i do try one18:46
* rtg_ -> EOD19:32
FUF Greetings.. I was just wondering how the linux-image-virtual kernel packages differs from the -generic images - what advantages do they offer for my VMs compared to generic?19:53
FUFshould the differences I see when I diff their /boot/config* be enough to answer my question?19:55
dobeyprobably19:57
* ogasawara lunch19:58
apwdobey, i'll get you some test kernels with this patch for tommorrow, and put a pointer in the bug23:59

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!