/srv/irclogs.ubuntu.com/2013/07/24/#ubuntu-server.txt

jsonperl_cwill_at_work... does a system with high run q and low load indicate that it's I/O bound to you?00:00
jsonperl_cwillu_at_work, sorry00:01
Patrickdkiowait% is i/o bound00:01
Patrickdkyou have 0% iowait00:01
jsonperl_That post was of a healthy system... I was just trying to figure out how to interpret00:02
jsonperl_I logged some netstat numbers during the issue... I see some high send and recv Q, this may be somethin here00:04
Patrickdkthat would likely be a nic driver issue, I would think00:06
Patrickdkdo all your machines have the same driver/nic?00:06
Patrickdkbeen a long time since I attempted to diagnose or work on something at that level00:07
jsonperl_Yep every machine is identical00:08
jsonperl_Most are in LAST_ACK00:08
Patrickdke1000e?00:08
jsonperl_but some ESTABLISHED00:09
jsonperl_like almost 200k in some queues00:09
Patrickdkit might just be the result of the issue, but it might be a cause00:09
jsonperl_the card?00:09
jsonperl_it's a broadcom hangom00:09
Patrickdknot sure what to tell you to figure it out00:09
jsonperl_Broadcom 572000:10
jsonperl_That queue size seems pretty unhealthy right?00:10
Patrickdkwhat do you get for ethtool -k eth000:11
jsonperl_I don't have a currently borked system00:11
Patrickdkdoesn't matter00:11
jsonperl_http://pastebin.com/ARwR2W6K00:12
Patrickdkbut atleast till someone has an idea how to diagnose this somemore, I can atleast throw you some things to see if they have any effect00:12
Patrickdkif they do, it's likely the cause, of not, just an effect00:12
jsonperl_Yea totally. You've been reallllly helpful00:13
jsonperl_This stuff is all fairly new to me00:13
Patrickdkgive a try: ethtool -K rx off tx off sg off tso off gso off rso off rxvlan off txvlan off eth000:13
Patrickdkmaybe again for eth1 if you use it00:13
Patrickdkopps00:13
Patrickdkethtool -K eth0 rx off tx off sg off tso off gso off rso off rxvlan off txvlan off00:14
PatrickdkI am not sure about the broadcoms, but I know the intel driver has gone back and forth on it working and not working00:14
Patrickdkmy older intel ones, I had to disable a few of those, to make it work correctly00:14
Patrickdkthis will cause higher cpu usage00:15
PatrickdkI doubt it will be enough for you to notice though00:15
jsonperl_so basically turning everything off00:15
Patrickdkyep00:15
jsonperl_any potential for badness here?00:15
jsonperl_besides cpu usage00:15
Patrickdkno00:15
Patrickdkthe chcksums just lower cpu usage00:15
jsonperl_But in your experience they gunk up the works sometimes?00:16
Patrickdkthe rest mainly cause the nic and linux to move around 64k of data at a time, instead of one packet at a time00:16
Patrickdkgro sometimes, rxvlan on one of mine here at home00:16
Patrickdktso I think I had an issue with on some too00:16
Patrickdkthis system I am using now needs: ethtool -K eth0 rxvlan off tx off00:17
Patrickdkthat leaves only gso turned on00:17
Patrickdkforget about the tx on it, but it doesn't support rxvlan, but the driver thinks it does00:18
jsonperl_Its a gigabit card, it's speed is set at 100Mb... likely the network its on00:18
Patrickdkoh, at 100mbit you will never see the increased cpu usage :)00:19
jsonperl_I'm wondering if I need to upgrade the network... I've never seen us go beyond maybe 15Mb though00:19
jsonperl_Seems like a lot of data in queue in LAST_ACK state indicates a problem with our code no?00:27
jsonperl_Basically the connection has been severed on their end, but we haven't gotten rid of it00:27
jsonperl_TCP:   458 (estab 50, closed 96, orphaned 63, synrecv 0, timewait 1/0), ports 000:28
jsonperl_Lotta orphans00:28
Patrickdkdunno what a LAST_ACK is00:29
jsonperl_Right before the tcp connection closes00:30
Patrickdkoh, that is actually a state00:30
PatrickdkI never see those00:30
jsonperl_I've got a bunch... perhaps thats an issue00:30
Patrickdkna, normally that for me is TIME_WAIT, where the connection was closed, but not properly00:30
sarnold"The remote end has shut down, and the socket is closed. Waiting for acknowledgement."00:31
Patrickdkya, sounds like your sending it data, but it's not responding00:31
Patrickdkoh00:31
sarnoldis there a funny firewall in the way preventing those packets?00:31
jsonperl_just iptalbes00:32
Patrickdkhmm, odd though, never seen them, just the FIN_WAIT TIME_WAIT mainly00:32
jsonperl_Here so you have some idea what I'm looking at: http://pastebin.com/9RNzEbb900:34
jsonperl_ips jiggled to protect the innocent :)00:34
jsonperl_I wonder if we're just shipping data to a "almost closed" socket, and filling up the tcp queue00:36
sarnoldjsonperl_: the 'slabtop' utility ought to be able to show you if TCP is eating too much of your memory00:42
jsonperl_I'll check it out00:43
jsonperl_though memory utilization is quite good now00:43
jsonperl_(with a little help from my buddy PatrickDK)00:43
jsonperl_gotta head home... thanks folks, back later00:49
sarnoldhave fun :)00:49
jsonperlback for more!01:11
jsonperlPatrickDK01:31
jsonperlI just had a system flake out… I hit those networking settings live, and it seems to have fixed it?01:32
jsonperl(super super anecdotally)01:32
Patrickdkdunno :)01:32
jsonperlso your theory there is that there is a driver issue with the card?01:32
Patrickdkpersonally, I would put those on like 3 or so, and see01:32
Patrickdkwell, driver or firmware01:33
jsonperlyea the whole "it didn't explode" thing is a really frustrating way to prove stuff :)01:33
Patrickdkmore likely driver, but firmware could affect the drivers actions01:33
jsonperlso by turning all of that off, we reduce the load on the card essentially?01:33
jsonperland let the os take care of stuff01:33
Patrickdkwell, it puts the card into normal dumb mode basically01:33
Patrickdkinstead of attempting to limit interrupts, and queue up requests and stuff01:34
Patrickdkand offloading some of the work01:34
Patrickdkit might be there is some kind of buffer overrun happening on the nic, causing the issue01:34
Patrickdkbut I'm totally random guessing01:34
jsonperlme201:35
jsonperl:D01:35
Patrickdkbut now since that is off, nothing is really getting buffered01:35
jsonperloh man, if this fixes the problem01:35
PatrickdkI have had issues with broadcom drivers before, but not on linux01:35
Patrickdkbut then, I really have not used broadcom on linux so :)01:35
jsonperlI use what they rent me :)01:35
jsonperl(peer1 / serverbeach)01:35
sarnoldjsonperl: was that an ethtool command that seems to be fixing it?01:37
jsonperlyep01:37
jsonperlethtool -K eth0 rx off tx off sg off tso off gso off rxvlan off txvlan off01:38
jsonperl'seems' being the operative word01:38
Patrickdkif your really interested, start knocking one off at a time, till it acts up again :)01:40
jsonperlhahahaha01:41
jsonperloh man, the fact that thats a reasonable thing to do kinda of makes me ill :)01:41
sarnold:)01:42
* Patrickdk bets on the tso or gso01:43
jsonperlim gonna turn everything off on all machines01:44
Patrickdkcould be tx, but normally not01:44
jsonperlthen i'll pull those on one of them01:44
jsonperlso do tso, gso, and tx in that order huh :)01:44
Patrickdkor, pull a different one per machine? :)01:44
sarnoldyeah, I'm also suspicious of tso and gso01:44
jsonperlahahha01:44
sarnoldand it feels like 'sg' would be nice to have back01:44
PatrickdkI have no idea what sg is, never bothered by it before :)01:44
sarnold(at least I assume it means Scatter/Gather)01:45
Patrickdkit does01:45
jsonperloh man, im excited01:46
* Patrickdk locates a bed01:46
jsonperlI MAY BE ABLE TO SLEEP01:46
sarnoldg'night :)01:46
jsonperlcya Patrick, thanks again01:46
jsonperlallright, all machines updated01:52
jsonperlnow I wait :)01:52
jsonperlsarnold/Patrickdk, it makes sense those settings kick in live right?01:52
jsonperlno networking restart or anything01:52
sarnoldjsonperl: right01:53
jsonperlgood… because if it didn't that would disprove that it fixed it ;)01:53
=== jtv2 is now known as jtv
=== smb` is now known as smb
jamespageyolanda, https://code.launchpad.net/~james-page/glance/sqlalchemy-bump/+merge/176613 if you are around :-)07:27
jamespagezul, ^^07:27
yolandamorning07:27
jamespageI'm gonna review all packages today07:27
yolandagreat07:27
jamespageyolanda, morning!07:28
yolandajamespage, bad news, since this branch is on ubuntu-server-dev, i don't have permissions07:28
jamespageyolanda, just need a review07:30
jamespagenot a merge07:30
jamespageI'll do that myself07:30
yolandajamespage, assign me as a reviewer07:30
yolandaotherwise i can't07:30
yolandai don't have the permissions to "Request review"07:31
jamespageyolanda, dog07:31
jamespagedoh rather07:31
jamespageyolanda, done07:31
yolandaok, reviewed, i cannot change the main status anyway07:33
jamespageack07:34
jamespagethanks07:34
=== Ursinha-afk is now known as Ursinha
=== racedo` is now known as racedo
=== tim___ is now known as vorpalbunny
=== LordOfTime is now known as LordOfTime|EC2
=== tedski- is now known as tedski
=== vorpalbunny is now known as thumper
=== Tribaal_ is now known as Tribaal
=== thumper is now known as thumper-afk
=== thumper-afk is now known as thumper
=== psivaa_ is now known as psivaa
ThothCastelhow to check if ssh server is running?10:25
mardraumThothCastel: service ssh status10:26
greppyps -ef | grep ssh10:26
ThothCastelgreppy: mardraum: thanks, it's running, however I am unable to connect to it via ssh :S10:30
mardraumwhat exactly happens? use pastebin if you must10:31
jamespagezul, when you start review needed please - https://code.launchpad.net/~james-page/neutron/fixup-h2/+merge/17665010:36
cwillu_at_workgreppy, "sshd"10:43
jamespagezul, you might wanna take a look at the python-greenlet upload you did yesterday11:05
jamespageit blasted all of the python3 work that you did in the previous two ubuntu versions11:05
jamespage(which is why its block in proposed right now)11:06
zuljamespage:  fuuuuuu11:35
ikoniazul: ?11:38
streulmahello, I can upgrade my kernel on Ubuntu Server 12.04, but when I reboot, the server don't boot and hangs, it's KVM virtualisation12:14
jamespagezul, hey - I also uploaded trivial fixes for keystone and glance autopkgtest failures12:18
jamespageI'm stuffing them into havana staging as well12:19
zuljamespage:  ack12:19
jamespagestreulma, anything on the console?12:21
streulmathere is on the moment a problem with console, the isp upgraded to new version of OnApp12:21
streulmabut before I had the problem12:22
streulmait boots the kernel12:22
streulmaand then hangs after keyboard...12:22
streulmabefore the services loads12:22
=== smb` is now known as smb
zuljamespage:  http://people.canonical.com/~chucks/ca/13:40
jamespagezul, ceilometer?13:42
zuljamespage:  yep13:43
jamespagezul, why does simplejson need "     - Build for python 3.2 as well."13:56
jamespageI know precise has python 3.213:56
jamespagebut can't a generic fix be applied in saucy which makes it a no-change backport again?13:57
zuljamespage:  because it explicity dependeon on python 3.313:58
jamespagezul, +1 for msgpack-python13:58
zuljamespage:   python3-all-dev (>= 3.3.0-3) in the debian/control13:59
jamespagezul, ack14:01
jamespagereviewing now14:01
zuljamespage:  ill fix the saucy version14:01
jamespagezul, does it work with python3.214:01
zuljamespage:  yeah14:01
jamespagejust wondering if that why the min-versions are specced14:01
zulnothing in the changelog14:02
jamespagezul, nope14:03
jamespageand it looks OK - maybe poke piotr in #debian-python on OFTC and see if there are any gotchas14:04
zuljamespage:  nope im not uploading it, i just noticed a bug14:04
jamespagezul, do we really need the new webtest?14:04
jamespageis 1.3.3 -> 1.3.414:04
jamespageits rather14:04
zuljamespage:  im not sure, nack it please14:04
zuljamespage:  chuck@homer:~/pbuilder/precise_result$ dpkg -c python3-simplejson_3.3.0-2ubuntu1~cloud0_amd64.deb14:05
zuldrwxr-xr-x root/root         0 2013-07-24 09:14 ./14:05
zuldrwxr-xr-x root/root         0 2013-07-24 09:14 ./usr/14:05
zuldrwxr-xr-x root/root         0 2013-07-24 09:14 ./usr/share/14:05
zuldrwxr-xr-x root/root         0 2013-07-24 09:14 ./usr/share/doc/14:05
zuldrwxr-xr-x root/root         0 2013-07-24 09:14 ./usr/share/doc/python3-simplejson/14:05
zul-rw-r--r-- root/root      3160 2013-07-24 09:06 ./usr/share/doc/python3-simplejson/changelog.Debian.gz14:05
jamespagezul, -114:05
zul-rw-r--r-- root/root      1645 2011-02-15 15:56 ./usr/share/doc/python3-simplejson/copyright14:05
zulchuck@homer:~/pbuilder/precise_result$ dpkg -c python-simplejson_3.3.0-2ubuntu1~cloud0_amd64.deb14:05
zuldrwxr-xr-x root/root         0 2013-07-24 09:14 ./14:05
zuldrwxr-xr-x root/root         0 2013-07-24 09:14 ./usr/14:05
zuldrwxr-xr-x root/root         0 2013-07-24 09:14 ./usr/share/14:05
jamespage\o/14:05
zuldrwxr-xr-x root/root         0 2013-07-24 09:14 ./usr/share/doc/14:05
zuldrwxr-xr-x root/root         0 2013-07-24 09:14 ./usr/share/doc/python-simplejson/14:05
zul-rw-r--r-- root/root      7062 2013-05-01 16:01 ./usr/share/doc/python-simplejson/index.rst.gz14:05
zul-rw-r--r-- root/root      3160 2013-07-24 09:06 ./usr/share/doc/python-simplejson/changelog.Debian.gz14:05
Picinice14:05
zul-rw-r--r-- root/root      1645 2011-02-15 15:56 ./usr/share/doc/python-simplejson/copyright14:05
zulshit!14:05
Piciits zul, so I'll let it slide... this time ;)14:05
* jamespage drowns in irc14:06
zuljamespage:  tests are not enabled in that package either14:06
=== mahmoh1 is now known as mahmoh
dranixhi everyone14:19
dranixi need some help with ldap integration with packetfence14:19
dranixanyone has any idea how to go about doing this?14:19
Monotokoif I connect to an openvpn server in the office... should it not tunnel all my internet connection through it?14:40
MonotokoI have the same IP as before...14:40
rbasakDepends on how you have it configured.14:42
Monotokorbasak, it was configured by my predecessor - where can I check?14:43
rbasakI don't recall, sorry. Check the docs for mentions of your default gateway. I think it's a client-side setting, but you can also configure the client to accept the server's settings and then configure it on the server (IIRC).14:44
rbasakOr may default route, rather than default gateway.14:45
rbasakmaybe14:45
Monotokolots of mention of bridging...14:46
raubTrivial question: how do I upgrade a kernel module that is in use? By in use it is module for raid controller but I am booting using a live CD14:47
oozboozMonotoko: check your routing, is default route via VPN or your ISP?14:47
oozboozcommand "ip  r sh"14:48
oozboozusually, server pushed routes to the client, but client can overwrite it or do some other tricks w/out getting server involved14:49
oozboozpushed=pushes14:49
Monotokooozbooz, http://pastebin.com/65vcRqk714:50
MonotokoI tried to remove the comment in the config here: ;push "redirect-gateway def1 bypass-dhcp"14:50
Monotokohowever then the client wouldn't load anything14:51
oozboozI assume 5.10.152.225 is your ISP GW14:51
oozboozthen your internet traffic should go over it14:51
Monotokoyeah, we have a /29 I believe14:51
Monotokowhen I'm connected from outside the office14:51
MonotokoI want it to still use the office IP14:51
oozboozuse office IP for ... ?14:52
oozboozyou mean send your ALL traffic via the tunnel?14:52
Monotokoyeah - it's static - a lot of people who work here work from homes etc, with dynamic IP's14:52
MonotokoI'd rather they all used our network to make it easier to firewall the servers and not keep punching random holes in the FW14:53
oozboozI don't get your last statement ...14:55
oozboozusually, you want to only relevant traffic to send to your office via the tunnel,14:55
oozboozrest of the stuff, they should use their ISP14:55
oozboozwhy would you want them to download youtube videos using office bandwidth14:56
rbasakI'd say it depends. Road warriors might prefer everything to go via the office if they don't trust the connections they're using (coffee shops, hotels, etc)14:56
Monotokooozbooz, we have a "cloud" provider off site that I need to give developers access to, and certain things that they can log into through the web browser but only from this IP14:56
oozboozaha14:57
oozbooz3rd party mess..14:57
Monotokoaye - obviously I need a static IP I can trust for that, so I'd rather tunnel everyone through our office network14:58
oozboozwell... you can create a new route that only traffic for cloud provider goes via the tunnel14:58
Monotokohmm, what route would I be adding for that? route add 1.2.3.4 gw 5.10.152.227 eth0 ?15:00
oozboozbut, if you decide to divert all traffic, you will have to change routing rules on the server, that will be pushed to the client15:01
oozboozwhich VPN server do you use15:01
Monotokoopenvpn15:01
zuljamespage:  simplejson fixed locally ill upload to the regular archive and get it for the cloud archive as well15:01
oozboozopenvpn or openvpn-AS?15:02
Monotokoregular openvpn AFAIK15:02
Monotokoyeah15:02
Monotokojust checked with dpkg15:03
oozboozok, first my advice to upgrade to openvpn-AS - much easier to manage15:03
oozboozthere is IRC channel "openvpn", you should confirm with them... but it should be not difficult15:04
Monotokocheers oozbooz15:05
oozboozhave fun15:06
zuljamespage:  http://people.canonical.com/~chucks/ca/15:07
=== pleia2_ is now known as pleia2
zulsmb: ping i was wondering if you could offer some insight on it https://launchpadlibrarian.net/145685953/buildlog_ubuntu-precise-amd64.xen_4.2.2-1ubuntu1~cloud0_FAILEDTOBUILD.txt.gz15:14
smbzul, maybe, let me read15:15
zulsmb: this is on precise15:16
smbzul, Looks like the known problem of passing LDFLAGS in gcc format -Wl but don't we work around that15:17
zulsmb: yeah seems to ignore that for some reason15:18
smbAnd why do you compile xen 4.2.2 on Precise?15:18
smb:-P15:18
smbStill have not cleared theat MRE15:18
smbActually I would not aim 4.2.2 immediately but 4.1.5... or .6 but anyway15:19
jamespagezul, +!15:20
jamespage+!15:20
jamespage+1 rather15:20
zuljamespage:  cool thanks15:20
smbzul, "LDFLAGS = $(shell dpkg-buildflags --get LDFLAGS|sed -e 's/-Wl,//g')" in debian/rules?15:21
jamespagezul, https://code.launchpad.net/~james-page/neutron/fixup-rootwrap-conf/+merge/17670815:21
soahcccI'm more the nginx kind of guy so what did I missed here? Installed apache, changed port (so that it won't conflict with nginx), getting this nestat "tcp    0     0 0.0.0.0:8080     0.0.0.0:*       LISTEN      0      185527477   18552/apache2" but it just reacts to local requests. There is no iptable rule... Any ideas, I'm desperate :(15:21
smbzul, Just out of curiosity is that the 4.2.2 version from current Saucy?15:26
zulyeah15:26
smbzul, Hm, so it has that line... but for some reason I vaguely remember something going wrong with something like this (but I believe that was another package)15:27
smbzul, Oh wait maybe because in P LDFLAGS is exported by the build system...15:29
zulhmm...interesting ill try it out15:29
smbzul, Is that LDFLAGS := instead of LDFLAGS =15:30
=== JonnyNomad_ is now known as JonnyNomad
smbzul, Oh I think I can imagine what is going on: we do not set LDFLAGS at all by default in newer releases. So when compiling in S I did not notice none of them being used and setting LDFLAGS in debian/rules being useless15:39
smbBut in P when they are set by default it fails...15:39
zulsmb:  so disable it?15:39
smbzul, I'd probably try either an export in debian/rules or move the definition into debian/rules.real for a moment15:40
=== Catbuntu is now known as LexieGrey
zulsmb:  ok ill try that15:40
smbzul, And I need to make sure I really use those flags in the Xen 4.3 I am preparing15:41
smbfor S that is15:41
zulsmb:  when are you doing 4.3?15:43
smbzul, I am just about to think I got all pieces together. Testing it on my boxes15:43
zulsmb: ok cool15:44
med_zul, a user asked me if there will be any quantum-> neutron renaming in raring or earlier (and similarly, anything before havana)15:45
med_my answer was "NO, but I'll check with zul"15:45
zulmed: no quantum in raring was quantum15:46
med_nod.15:46
* med_ was pretty sure it was only a cease and desist not a "go undo the world"15:47
roaksoaxMadkiss: howdy! have you looked into packaging dlm?15:59
zulsmb:  nope neither worked16:00
smbzul, Hm, ok need to figure out how to modify it correctly for the actual compile. Seems the more recent releases just don't use any16:01
smbI mean it does not get passed in and fails because where we change it somehow does not replace the default of the system16:02
smbzul, Doing the export did break the build in the same way on S though... So maybe := is the second missing piece16:04
smbzul, having LDFLAGS= and export LDFLAGS both in rules.real seems to make the compile run longer (not finished yet)16:13
zulsmb:  can i see a snippet your rules.real please?16:14
Davieyrbasak: BTW, merges.py won't work right now - until egress firewall is more relaxed.  Have raised RT16:17
rbasakDaviey: OK, thanks.16:20
Davieyroaksoax: Hey, does Openstack / Kombu support Rabbit Active/Active in Havana?16:20
rbasakI'll try and keep people.canonical.com/~rbasak/delta.py updated in the mean time, though note that I'm doing it manually.16:20
roaksoaxDaviey: I haven't check yet, sorry! I'm doing the whole upgrade process of the clustering tools, whcih is not as easy as syncing packages from debian16:22
adam_gDaviey, the issue wasn't active/active its the lack of any type of heartbeating support, so that the rpc layer (quickly) detects failure and migrates to a new server16:24
jsonperlPatrick, I still got the issue, but I think I'm getting closer16:27
jsonperlPatrickdk that is16:28
jsonperlWould a BUNCH of connections in CLOSE_WAIT stop up the tcp pipeline at some point?16:28
zuljamespage:  still around?16:40
jamespagezul, yes16:44
zuljamespage:  one more for you today http://people.canonical.com/~chucks/ca/16:44
jamespagezul, does that one build against the havana-staging PPA?16:45
zuljamespage:  just finished building16:45
jamespagezul, +1 then16:45
zuljamespage:  thanks16:45
patdk-wkjsonperl, if that is the case, a couple of issues could be the case16:47
patdk-wkopen file handles?16:47
patdk-wkor just exaustion of resources16:48
patdk-wkmaybe look here, it seems to have an ok description of the sysctl's involved16:48
patdk-wkhttp://www.ufirsttech.com/content/linux-kernel-settings-related-tcp-connections-6816:48
jsonperlAwesome thanks16:48
patdk-wknormally there are several sysctls that need to be adjusted for any kind of high performance server16:49
patdk-wkexpecially when handling lots of connections16:49
jsonperlIn this case it's actually a library i use to hit amazon s316:49
patdk-wkdon't think any of this would cause that single cpu usage issue though16:49
jsonperlwhich is the least often used connection i got16:49
jsonperlI think all of what we were seeing is a RESULT of connectivity issues16:49
jsonperlno players = no processing16:50
patdk-wkoh, that page uses proc, I normally do it via sysctl instead16:50
jsonperlI think the ethtool command to change stuff maybe reset the stuck connections?16:50
patdk-wkjsonperl, still :)16:50
jsonperlmaking it look fixed16:50
patdk-wksetup a ping16:50
patdk-wksee if you start missing, or get delayed pings16:50
patdk-wkif your running tcpdump on the server at the time too, watching just for icmp16:51
jsonperlok, we use pingdom… that sufficient you think?16:51
patdk-wkyou should be able to easily tell16:51
jsonperlI actually try tcp to the server every minute16:51
patdk-wkisn't that like once a minute?16:51
jsonperlyea16:51
jsonperlYou're thinking more often?16:51
patdk-wkya, I would go second, and watch delays16:51
patdk-wkyou want to know how long it takes, you know it gets there ,and responds16:51
patdk-wkyou want to know if it gets lost, or delayed16:52
patdk-wkwell, tcp would get lost and retried16:52
patdk-wkbut ping would just get lost16:52
jsonperlAny service you can recommend? or you just do it from another box16:52
patdk-wkI normally just do it from my home box16:52
jsonperlgotcha16:52
patdk-wkor a work computer16:52
patdk-wknot like ping uses much traffic16:52
jsonperlDoesn't feel very enterprisey :D16:53
patdk-wknow if you want to take it a step more, use mtr :)16:53
patdk-wkso you can see where the issue actually happens, if it's network related16:53
jsonperlIt's not16:53
jsonperlthis is my boxes16:53
jsonperlI wish it were somebody elses fault!16:54
patdk-wkno, if you think the issue was you aren't receiving the players traffic16:54
patdk-wkthat would be network issue :)16:54
patdk-wkping would easily show that16:54
jsonperlBut I see the same issue cross machines, cross facilities16:54
jsonperldifferent parts of the US16:54
jsonperlsame issue16:54
patdk-wknot likely then16:54
patdk-wkI really don't know where to go16:55
patdk-wkunless I actually get on it and dig around and maybe setup my own stuff to monitor it16:55
jsonperlI feel like i need to get rid of those orphaned connections16:55
patdk-wkbut not even sure how good I could do that16:55
jsonperlWant a consulting job? :D16:55
patdk-wkI have enough of those :)16:55
jsonperlhaha16:55
jsonperlBut we're a super entertaining indie game company16:56
jsonperllike on the tv :D16:56
jsonperlSo real quickly...16:56
jsonperlDo you believe it's possible that piling up of CLOSE_WAIT connections eventually can lead to connectivity issues in the tcp stack?16:57
jsonperlor am I going up the wrong road here16:57
patdk-wkit can, I doubt your anywhere near that though16:57
patdk-wkI doubt your even >5% of the limit16:57
jsonperlDoes the OS limit per process?16:58
patdk-wkcheck ulimit for that16:58
jsonperlk16:58
patdk-wkremember, tcp connections are file handles, and count with open files16:58
jsonperlSo what seems like a clue to me is16:59
jsonperlTurning everything off with ethtools fixed "the glitch"16:59
jsonperlTemporarily16:59
jsonperlNo question… went from "very borked" to normal the moment I changed the settings17:00
patdk-wkwhat kernel you running on these?17:00
jsonperl3.2.0-38-generic-pae #61-Ubuntu SMP Tue Feb 19 12:39:51 UTC 2013 i686 i686 i386 GNU/Linux17:01
patdk-wkhmm, 32bit17:01
patdk-wkwhy not 64?17:01
jsonperlactually wait… that box is an oddball17:02
jsonperlthe rest are 6417:02
patdk-wk:)17:02
patdk-wkusing any dkms modules?17:02
patdk-wkI doubt you are17:02
jsonperl32 was to save memory17:02
jsonperlthese are the rest 3.2.0-49-generic #75-Ubuntu SMP Tue Jun 18 17:39:32 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux17:02
jsonperldkms? I donno what that is17:03
patdk-wkaddon modules for the kernels17:03
jsonperlah… can I list em?17:03
jsonperlit's pretty stock 12.0417:03
patdk-wklike, vmware drivers, xtables, ....17:03
patdk-wknvidia17:03
jsonperlah, i doubt17:03
jsonperlno video17:03
patdk-wknormally should show up in dpkg -l | grep dkms17:03
jsonperlno virtualization17:03
jsonperlnothin17:03
patdk-wksince I still believe it's a kernel issue17:04
patdk-wkmight be worth giving a 3.8 kernel test on it17:04
patdk-wkthough, on all my servers I haven't hit this issue, but then, I likely wouldn't have noticed either17:04
patdk-wkI'm using 3.8 on my firewall machines for the newer firewall stuff in it17:06
patdk-wkto install it, apt-get install linux-generic-lts-raring linux-tools-lts-raring17:07
patdk-wkthen reboot17:07
patdk-wkyou can always uninstall it too17:07
jsonperlWas just looking into that?17:07
jsonperlHow do you downgrade?17:08
patdk-wkit's just a new grub kernel option17:08
patdk-wkjust select a different one17:08
patdk-wkthen once it's booted apt-get remove those two17:08
patdk-wkI had all kinds of dkms issues with it17:08
patdk-wkcause I needed both vmware and xtables dkms modules17:08
jsonperlMakes sense… That's why i like to stay 2 steps behind bleeeeeding edge17:09
patdk-wkI really wanted the bufferbloat stuff in 3.8 though :)17:09
patdk-wkfor the firewall, and firewall needs xtables :)17:09
patdk-wkall my other machines are normal 64bit 12.04 though17:09
patdk-wkbut I wonder if the issue your having got fixed in the kernel already17:10
patdk-wkand there is a LOT of changelogs to read to find out easily17:10
patdk-wkwithout just testing it17:10
jsonperlOr testing that it doesn't happen to explode17:11
jsonperlover a period of days :)17:11
patdk-wkI guess we could always setup an ice, and test it there :)17:12
jsonperlice?17:12
patdk-wkhttp://en.wikipedia.org/wiki/In-circuit_emulator17:12
patdk-wkwhen you go there, it's not pretty17:13
patdk-wkI guess these days people would just use a vm17:14
patdk-wkbut oldschool it was using an ice17:15
=== Ursinha is now known as Ursinha-afk
jsonperlgotcha… yep that's before my time!17:17
jsonperlmtr is cool17:17
jsonperlcept allll my packets are lost on the way to my server17:17
jsonperlMust be clipping all but the first17:18
jsonperl1whoops17:21
=== jsonperl1 is now known as jsonperl
jsonperlbtw I would be HAPPY to give you access to the box :)17:33
jpdsjsonperl: Sounds like a dreadful idea from a security point-of-view.17:43
jsonperlhaha17:44
jsonperlTruth17:44
rizzuhHey guys. I want to install Redis on a 12.04 Azure Extra Small VM. It has only 768MB of RAM available. How can I find the RAM usage and what steps should I follow to minimize memory usage, so Redis can have the lion's share?18:42
sarnoldrizzuh: measuring memory use is a bit complicated; 'free' will give you a very quick overview of free memory on the system, the -/+ buffers/cache line is probably most important summary of the summary..18:43
sarnoldrizzuh: ps auxw or top (sorted with M), look for the highest RSS numbers, that's what's actually resident in RAM for those programs..18:43
sarnoldrizzuh: ut sometimes shared libraries take a pile, the 'smem' tool can help you find out wihch processes have which shared libraries loaded, and apportions to each of them a certain amout of the fault for the memory used by those shared libraries18:44
rizzuhsarnold, well ATM top shows 554478k free - if that isn't woefully inaccurate it's pretty good18:47
sarnoldrizzuh: well, "free" is a funny thing. the kernel keeps some memory around, free, to handle spikes of allocations. but it tries to minimize the amount of free memory because free memory is wasted memory. :)18:48
rizzuhsarnold, ahh, sure, free as in not reserved by an app. If it's full of cache that ain't an issue.18:48
sarnoldrizzuh: that's where the -/+ buffers/cache line comes in -- that includes memory that is currently being used for storing in ram copies of files but _could_ be thrown away under pressure18:48
sarnoldrizzuh: *nod* *nod*18:49
=== wxl_ is now known as wxl
rizzuhsarnold, that said, 500 MB RAM to use is good, but damn this thing is slow. Good that Redis doesn't need much processing power. It's taking a while to update a few apt packages.18:55
sarnoldrizzuh: at least the amazon micro instances are very heavily penalized in much the same way.. not bad for slight spikes in a mostly-idle environment, but installing a few hundred packages is -painful-18:56
jsonperl1yea those micros18:57
jsonperl1i'm fairly sure they arbitrarily throttle you...18:57
rizzuhsarnold, yeah these are pretty much the same as AWS micro. 5 Mbit network as well, not great.18:57
sarnoldif the azure storage can be moved among instances, it might even make sense to turn it off, attach to a good instance, upgrade, and move back to cheap again.. heh.18:57
=== jsonperl1 is now known as jsonperl
sarnoldrizzuh: 5MBit? wow!18:57
rizzuhThe next one is small at $50 a month, with 1.5GB RAM and a dedicated core. Oh and 100 Mbit network or something like that.18:58
rizzuhBut then through BizSpark we pay 33% less. "Pay", as we have $150 credit / dev, with production usage rights, so it's pretty good for the money :P18:59
=== Ursinha-afk is now known as Ursinha
jsonperlPatrickdk, so running simulators at a box… I'm able to REALLLLLY pile up on LAST_ACK state connections19:50
jsonperlOver about 20 minutes, I'm able to get to a count of 450 or so19:50
patdk-wknice19:51
jsonperlSeems odd right?19:51
patdk-wksomething isn't closing the connection correctly19:52
patdk-wkmight just be normal for ios, no idea though19:52
jsonperlOur server was trying to "close a connection after writing remaining data"20:02
jsonperlI changed it to just close the connection, seems to fix that at least20:02
jsonperlsarnold: Ive dumped some dmesg output from blocked processes, but still unclear how to read it20:34
hallynjdstrand: would adding AUDIT_WRITE to libvirtd apparmor policy be acceptable?20:35
jdstrandhallyn: usr.sbin.libvirtd?20:41
hallynyes20:46
jdstrandhallyn: that's fine, libvirtd is not really confined anyway (the VMs it launches are)20:55
jdstrandhallyn: let me point you at a bug though20:55
jdstrandhallyn: actually, nm, you should be ok20:55
hallynjdstrand: ok, thanks.  (i consider this ultra-low priority)20:59
hallynzul: ^ if you happen to be merging libvirt soon-ish, we should toss that in i guess (there is an open bug requesting it)20:59
jsonperlnetstat -s output… does anything here look overly concerning? http://pastebin.com/bnzEFRPh21:05
thumperhi hallyn21:05
thumperhallyn: thanks for the comprehensive email21:05
thumperit has me thinking...21:05
thumperhallyn: also, lxc-device isn't available in the precise lxc that we are limited to21:06
sarnoldjsonperl: 10878 invalid SYN cookies received21:08
sarnoldjsonperl: that seems steep.21:08
jsonperltake the system down steep21:08
jsonperl?21:08
sarnoldmaybe it's normal on the internet now, but .. it'd be worth asking your host if you're under attack..21:08
sarnoldjsonperl: what's this machine -do-?21:08
jsonperlserves a game via a persistent tcp connection to a bunch of users21:09
jsonperlat this time only about 50-100 concurrent on that machine21:09
jsonperldistributed amongst 14 servers on that machine21:09
hallynthumper: are you actually limited to the stock precise lxc, or could you use lxc from the ubuntu-lxc ppa for precise?  AFAIUI you're using ppas anyway....  but in any case lxc-device is just a nicety, you do NOT need it :)21:10
thumperhallyn: possibly not necessarily limited to stock lxc21:11
thumperbut I've not considered extra ppas21:11
thumpermanaged to not really need it at this stage21:11
thumperhallyn: this would be on every machine, and I don't think we install ppas on every machine21:12
hallynthumper: well lxc-device itself isn't enough of a reason to switch to ppa i don't think21:12
* thumper nods21:12
thumperI need to find someone who knows maas21:12
thumperto work out how to do the "gimmie a nic" thing21:12
hallynthumper: is it acceptable to simply start up the container after getting the nic from <whatever hands it to you> ?21:12
thumperyes, I think we can do that21:12
hallyncool, that'll be easiest21:13
thumperas long as the getting a nic doesn't take too long21:13
thumper< 10s would be ok I think21:13
thumperlonger than that and we might need to work out something else21:13
thumperby something else21:13
thumperjust a better work flow21:13
jsonperlsarnold: Any ideas for further investigation into the invalid syn cookies?21:13
thumperhallyn: I wish I knew about the "no network conf" bit to use the host21:14
thumperthat would have been a good enough setting by default I think21:14
jsonperlan attack certainly could explain the very random connectivity issues we've seen21:14
thumperI need to consider the implications for the local provider21:14
hallynthumper: i don't follow.  you mean lxc.network.empty ?21:14
thumperno, the number 221:14
thumperno network entry21:15
sarnoldjsonperl: syn packets tie up kernel memory; syn cookies are one way to tyr to avoid the worst of the kernel memory use. for some good backgroud information, see http://lwn.net/Articles/277146/21:15
sarnoldjsonperl: /etc/sysctl.conf has a configuration you can set to turn on syn cookies21:15
thumperalso I need to work out how to have a nice api to our internal providers, and how to handle that config with the containers21:15
jsonperlok, thanks for the read21:16
thumperthe brain is busy handling this with a background process :)21:16
thumperI think I almost have it :)21:16
jsonperlsarnold: if netstat is reporting invalid syn cookies, doesn't that mean they're on?21:16
sarnoldjsonperl: maybe? :)21:17
jsonperlsarnold is that the only thing of concern that popped out at ya?21:26
sarnoldjsonperl: the high connection counts made me wonder, but the use makes sense, hehe21:28
jsonperlKids jumping in and out of the game21:28
sarnoldsorry nothing just stands out to me ;(21:28
jsonperlworlds exist on one server on one machine, and they can "teleport" between them21:29
jsonperlhaha ok :)21:29
jsonperlsarnold: good reading on syncookies thanks21:31
=== Ursinha is now known as Ursinha-afk
thumperhallyn: still around?21:50
hallynthumper: yup21:50
thumperhallyn: thinking about number four, where we create a veth pair21:51
thumperhallyn: if the container hasn't been started, there is no network namespace right?21:51
thumperor is there?21:51
hallynnope.21:52
thumperalso, this "sudo lxc-unshare -s NETWORK -- /bin/bash" seems like it does something intersting I don't quite grok21:52
hallynthumper: that's just doing the same thing as creating a container.21:52
hallynit starts a task inside a new, private network ns21:52
hallynas for veth - if MAAS/openstack/ec2 will hand you a nic, then ignore veths21:52
hallynlxc.network.type = veth will always create a new veth pair and attach the one end to lxc.network.link.21:53
thumperwell openstack won't21:53
thumperah, I was going to ask what the link bit was21:53
thumperhallyn: can I run my idea past you?21:54
hallynso if you *were* going to use veth, which my feeling is you won't, then you would bridge whatever you get <handwaving> from openstack to br0, then say lxc.network.type = veth lxc.network.link=br021:54
hallynsure21:54
thumperhallyn: although #juju-dev might be better21:54

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!