/srv/irclogs.ubuntu.com/2015/01/26/#ubuntu-kernel.txt

=== ara is now known as Guest5294
=== smb` is now known as smb
=== txspud|away is now known as txspud
=== lool- is now known as lool
shadeslayerrbasak: hi there, it seems like I'm hitting the same issue as you were here http://irclogs.ubuntu.com/2013/08/01/%23ubuntu-kernel.html14:55
shadeslayerrbasak: was there any fix for it?14:55
shadeslayerI have a very reliable way of reproducing it in my schroot, when unpacking the firefox tar14:55
rbasakshadeslayer: only fix is to not use overlayfs15:03
shadeslayer:(15:03
shadeslayerbut I need it :(15:03
rbasakYou can configure schroot to clone the entire tree instead I think?15:03
rbasakOr something like that.15:03
shadeslayerthat sounds expensive15:04
rbasakIndeed.15:04
shadeslayerWell, the workaround I added seems to work, which is basically, keep retrying the tar comand15:05
shadeslayer*command15:05
rbasakIt's a race, so that might work eventually for you. It's not guaranteed to, though.15:05
shadeslayeraha 15:05
rbasakIIRC, the code in tar makes an assumption that doesn't hold true on overlayfs.15:05
rbasakI can't remember the details. Something about fstat after the file has been written or something.15:06
rbasakProbably inode number related.15:06
shadeslayerwell, the exact combination is eatmydata + overlayfs + tar, so I can imagine things going wrong15:08
=== jdstrand_ is now known as jdstrand
DizietHi.  I'm having a problem with     HOME=/ strace -ttfot git-clone -q git://kernel.ubuntu.com/ubuntu/linux.git16:04
DizietIt falls over after almost exactly 2 minutes.16:05
DizietWithout -q it works.16:05
DizietThe strace shows the server sending EOF while the client is still waiting.16:05
DizietWhere should I report this ?  It's quite inconvenient as it's making something in our CI system not work...16:05
apwDiziet, which version of git is that ?16:10
DizietDebian wheezy16:10
Diziet 1.7.10.416:10
apwwhy does that ring a bell16:10
Diziet(Obviously the failure happens without the strace.)16:11
DizietThe fact that it works without -q suggests to me that something is deciding that this process or this connection is "stuck", using a timer which gets reset by output (which is fairly copious with -v)16:11
Diziet(I mean, copious without -v)16:11
apwyep, that is something like the bug, and it is tickling my "this is familiar" noddle16:12
DizietAs a workaround, perhaps the timeout could be increased to an amount sufficient to make "git clone -q linux" work ?16:13
henrixapw: a grep in my irc logs (because it *does* ring a bell) shows a ref to bug #122814816:13
ubot5bug 1228148 in git (Ubuntu Precise) "git clone -q ends with "early EOF" error on large repositories" [Undecided,Confirmed] https://launchpad.net/bugs/122814816:13
apwhenrix, thanks ... i knew i'd seen something16:14
DizietI am experiencing this problem with our git caching proxy.  Naturally I can't really sensibly tell the proxy not to use -q in its own git invocations.16:14
apwhenrix, oh yeah, i think we shelved this till zinc got upgraded to P16:14
apwwhich i assume it already has16:14
henrixyeah, it looks like it has been upgraded16:15
DizietThat LP bug has a workaround (sending KA packets) but I'm pretty sure the timeout isn't in git itself.16:16
DizietIt is probably in some proxy or wrapper you have.16:16
DizietSo I don't think it is necessary for you to patch your git (although that would help people with broken^W NAT).16:16
DizietFor me it would suffice to increase your timeout.  (I know it is at your end because I have repro'd the problem from my colo.)16:17
apwDiziet, what is the config for the timeout16:20
apwDiziet, as i can likely get that changed quickly16:20
DizietIDK.  I'm not sure it is actually in git.16:20
DizietThere's a --timeout option to git-daemon which might be relevant.16:21
DizietAnd also a --timeout option to git-upload-pack.16:21
DizietI think the latter may be the one.16:21
DizietAre you running vanilla git-daemon out of inetd, or what ?16:22
DizietI'm going to try adding a --timeout to my own git-daemon here to see if I can repro the bug.16:23
apwDiziet, i am pretty sure it is going to be timeing out, so 1) i am gc'ing the repo you are cloning, and 2) looking to change the timeout16:26
Dizietapw: Thanks.  If you bear with me 10 mins or so I can probably confirm what to do to git to increase the timeout.16:27
DizietAre you running git from inetd, or from upstart ?  Is it git-daemon or something else ?16:28
apwDiziet, great, pretty sure it is from xinetd16:28
DizietCan you check the command line you're giving it ?  I think it's probably   git-daemon blah blah blah --timeout=120 blah blah16:28
DizietBut before you change that 120 I am going to try to repro the fault here.16:29
apwDiziet, yeah looks to be ... bah ... i'll wait on your test to confirm, and then can set those wheels in motion16:29
apwDiziet, though the bigger wheels would be to add these patches16:30
DizietGreat, will get back to you.16:30
DizietHeh.16:30
Dizietcol tells me that it's fixed (as in, the KA patch is included) in trusty.16:30
apwDiziet, yeah and i am sure that box is on P rig16:35
apwDiziet, yeah and i am sure that box is on P right now16:35
DizietI can confirm that --timeout=30 makes my own git server produce the same problem.16:36
DizietSo I think adjusting your --timeout=120 to (say) --timeout=2000 will probably help.16:36
DizietActually-stuck processes consume very little resource so I think you should be fine with a big timeout.16:36
apwDiziet, yeah, i'll see what i can get fixed16:37
DizietThanks.  Should I expect a change soon (eg today) ?16:38
DizietI have sent an email to the bug suggesting increasing --timeout as a workaround.16:48
Diziet(Of course that won't help people behind a NAT with a short timeout.)16:48
apwDiziet, i'd hope, but not expect it to change quite that quick16:52
DizietOK, thanks.16:59
=== pgraner-afk is now known as pgraner-food
=== pgraner-food is now known as pgraner
apwDiziet, ok i've reuqested the timeout change, and am working on the fixes, we shall see what occurs19:41
smoserhey21:34
smoserwonder if someone has an idea... not necessarily kernel, but somewhat related.21:34
smoserfrom trusty: http://paste.ubuntu.com/9887959/21:35
smoserfrom utopic: http://paste.ubuntu.com/9887962/21:35
argessmoser: its like one of those 'spot the differences' pictures. So you're wondering why the estimated minimum size is less in 3.16?21:37
smoseryes. i had more info coming. but good job spotting the diff :)21:37
smoserI'm not terribly concerned about the difference, but its causing a failure on a21:38
smoserbuild of maas images21:38
argessmoser: so without looking at the code, I wonder if some meta-data structures changed size and thus the minimal size changed. Whats the failure?21:39
smoserafter doing a of apt-get installs and such to a loop mounted image, on trusty I try to 'resize2fs' to 1400M (arbitrary historic size) and it fits fine.21:40
=== pgraner is now known as pgraner-gym
argessmoser: one thing to try is use the older e2fsprogs with the newer kernel to see if there is any calulcation differences there21:47
smoserdo you think that resize2fs is able to use anything from the kernel ?21:47
smoseri would not haave thought it would.21:47
smoserbut your suggestion would give more info to that21:48
argessmoser: there is a patch that adjusts the inode struct which means it could be padded differently and have a different size too.. haven't exhaustively searched21:48
argesthe difference is 8114 bytes, but we have 145152 inodes... so maybe its a superblock change? 21:49
argessmoser: another thing to look at is config differences between 3.13/3.16 to see if something was turned on that could affect structure sizes21:51
smoserarges, but the file isn't mounted.21:51
smoserso i'm confuse das to how kernel would be involved.21:52
smoserisn't resize2fs just opening a file an dlooking around at it ?21:52
argessmoser: not sure. strace and find out21:52
argesdownloading the image here and looking at it21:53
smoserstrace output: http://paste.ubuntu.com/9888176/21:54
argessmoser: so you see a few ioctls before 'Minimum' gets printed21:56
argesi'm not sure extactly how minimum size is calculated though.21:56
smoserwhat i'm doing is in lp:maas-images.  something like this ends up getting run:22:01
smoser  time maas-cloudimg2eph2 -vv --kernel=linux-generic --arch=arm64 \22:01
smoser     $url root-image.gz \22:01
smoser    --krd-pack=linux-generic,boot-kernel,boot-initrd 2>&1 | tee out.log22:01
smoserand on utopic, it doesn't fit into 1400M and on trusty it does.22:01
smoserand i think (doing a more controlled test now) that its significantly different.22:01
argessmoser: do you know what size it ends up being overall?22:02
smoseri'll have that later tonirhg. just started a run on utopic and one on trusty.22:03
smoseri'll keep the images aroudn too so i can grab more debug info on them.22:04
smoserbut i have to go afk for a while. thanks for your thoughts.22:04
argessmoser: hmm running this in vivid makes me run 'e2fsck -f *' first, then I get 290304 (this is on 3.19)22:04
argessmoser: sure. this might be worth filing a bug at some point. feel free to ping me again22:05
smoserhm..22:05
smoserthe downloaded image is dirty ?22:05
argesthat's what resize2fs 1.42.12 says22:05
smoserhm..22:06
argesmd5 sum matches22:06
smoserthat could be relavant.22:06
smoserwell, you see the output that i got, it doesn't complain. 22:06
smosermaybe i'll try more liberally sprinking 'e2fsck -fy the-image' 22:06
arges1.42.12 vs 1.42.9 wonder if there is some fix that exposes this... ugh22:07
* arges tries a trusty image for the heck of it22:09
argestrusty image on vivid doesn't complain that i need to run e2fsck. size on 3.19/vivid 236103, 3.13/trusty 230329 so still a difference22:12
argessmoser: if you want i can do a bisect and figured out which commit made it blow up in size.22:20
smoserarges, 23:34
smosertrusty: http://paste.ubuntu.com/9889226/23:34
smoserutopic: http://paste.ubuntu.com/9889235/23:34
smoserbasically, note that 'df' reports a difference used of '980200' to '980216'23:35
smoserbut resize2fs on trusty reports min size of of ~1073M versus ~1582M on utopic.23:36
smoserresize2fs does claim "estimated" size, but you'd hope it could estimate a bit closer than that.23:36
Dizietapw: (git-deamon timeout) Thanks.23:38

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!