/srv/irclogs.ubuntu.com/2017/04/28/#juju-dev.txt

wpkanastasiamac: thanks00:15
anastasiamacwpk: anytime \o/00:16
=== bdx_ is now known as bdx
menn0babbageclunk: I guess the lazy lookup was in case something happened to import the package without actually running mongod using tests?00:45
menn0babbageclunk: for that reason I guess it makes sense to keep it lazy00:45
thumperwallyworld: I have things to do in town this afternoon, so let's cancel our call00:49
thumperwallyworld: we will be seeing each other in two days anyway00:49
wallyworldthumper: sgtm00:51
=== mup_ is now known as mup
axwmenn0: I've noticed that we're not passing "--storageEngine wiredTiger" to mongo 3.2 in the tests. have you tested with that added?01:05
babbageclunkmenn0: yeah, that'd be the reason not to do it at import time.01:20
menn0axw: isn't the default WT?01:52
axwmenn0: yeah, just the help text is a bit confusing "defaults to wiredTiger if no data files present"01:53
axwmenn0: anyway, I tested and it made no difference01:53
menn0axw: ok good to know01:53
axwmenn0: forcing it to mmapv1 makes it faster though, in the limited testing I did01:54
axwmenn0: I don't think we want to do that though, since we use WT in prod01:54
axwthough maybe just having WT in CI would be good enough01:54
menn0axw: we think the problem is that we're still deleting DBs instead of clearing the collections in a lot of places01:56
menn0WT is much slower at deletings DBs01:56
axwmenn0: could be, but I did notice that at least some of the test code (i.e. between SetUpTest completion and before TearDownTest start is slower with WT01:59
menn0ok, so there's issues there as well02:01
menn0axw, thumper, babbageclunk: there's quite a significant performance increase with wiredtiger if you disable transparent huge pages02:32
menn0echo never > /sys/kernel/mm/transparent_hugepage/enabled02:32
menn0without changing anything else this takes the agent/agentbootstrap tests from 2.8s to 1.8s on my machine02:33
menn0consistently02:33
menn0interestingly, Juju is supposed to be setting that on controller machines but I don't see that02:35
babbageclunkmenn0: right, but do you know what other impact that would have on our systems?02:36
menn0babbageclunk: not sure yet02:36
axwmenn0: hmm interesting. doesn't improve allWatcherStateSuite.TestChangeApplications in state for me02:36
menn0axw: what's the setting set to on your machine?02:36
axwmenn0: it was set to always02:36
menn0ok02:37
babbageclunkmenn0: It makes sense to do it on dedicated machines (although I guess it wouldn't help in lxd?)02:37
menn0duh02:37
axwmenn0: with mongo 2.4 (juju-mongodb), that test takes just under 0.4s. with mongo 3.2 (juju-mongodb3.2) it takes 1.3s02:37
menn0that sucks02:38
menn0maybe we shouldn't be using wiredtiger at all02:38
menn0there's a lot of horror stories online about WT02:38
menn0lots of people seem to have switched back02:38
menn0babbageclunk: https://github.com/juju/testing/pull/123/files02:43
thumperhmm...02:43
thumperwow02:43
thumpermenn0: initial timing tests... 2.6 mongod 23m17s, api 32s, apiserver 138s, state 671s02:45
menn0babbageclunk, axw, thumper: seems like we get most of the performance back by passing --storageEngine mmapv102:45
menn0but then it's not like production02:45
babbageclunkmenn0: I think I floated that at the time.02:45
thumpermenn0: initial timing tests... 3.2 mongod 55m36s, api 366s, apiserver 1515s, state 1455s02:46
thumperapi and apiserver packages are 10x slower02:46
menn0thumper: can you try with huge transparent pages turned off?02:46
babbageclunkmenn0: I guess the main risk/annoyance would be if there's some behaviour difference and we only find out when the CI tests fail.02:46
thumperlet me address the clear databases02:46
thumperthen I'll try with the transparent pages off02:47
menn0thumper: yep ok.02:47
axwmenn0: yeah that's what I found too. I think I'm OK with that as long as our CI still uses 3.2 (which it would, since there's no way to override that?)02:47
babbageclunkmenn0: less of a problem now that we have check builds though02:47
axwbabbageclunk: do you have time for a small review? https://github.com/juju/testing/pull/12402:51
* thumper has kicked off the new test run02:52
menn0babbageclunk, thumper: I'm going to return to other work for now but will be happy to discuss ideas or test stuff02:53
thumpermenn0: ack02:53
thumperbabbageclunk: using clear databases rather than reset actually looks like it is taking longer...02:57
thumperhmm03:00
thumperapi package was 378s03:00
thumpermenn0, babbageclunk: I'll wait for the apiserver package timings, but the clear databases call makes it even slower03:01
thumperI'll try the transparent pages turned off03:01
babbageclunkaxw: sure, looking at that now (and menn0's too).03:03
babbageclunkaxw: LGTM'd03:10
axwbabbageclunk: thanks03:19
axwbabbageclunk: did you forget about this one? https://github.com/juju/description/pull/803:26
axwthere's no bot on that repo, I can merge if you like03:26
babbageclunkaxw: oh, yes please - I think I decided to merge it when thumper was away and then forgot to follow it up.03:27
babbageclunkta03:27
axwbabbageclunk: done03:27
menn0babbageclunk: thanks for the review03:31
babbageclunkmenn0: <thumbsup emoji>03:31
thumpermenn0, babbageclunk: changing the huge transparent pages is making very little difference03:32
thumperI don't know what to do next03:32
menn0thumper: weird. it seemed to make a big difference for me.03:33
babbageclunkthumper: :( you said clearDatabases is slower?03:33
thumperyeah03:33
babbageclunkthumper: well bums03:35
thumperyeah03:35
thumpereven with the huge pages set to never, the api package is 10x slower03:37
thumperso 300s instead of 30s03:37
thumperNFI what to do next03:38
babbageclunkconvienient timing too03:39
menn0thumper: --storageEngine mmapv1 :)03:41
thumpermenn0: where?03:41
menn0in juju/testing/mgo.go, in the args we pass to mongod03:41
menn0have to be careful that mongod is 3.x though as --storageEngine doesn't exist in 2.x03:42
menn0thumper: also, what about dropping DBs in MgoSuite.TearDownTest instead of clearing?03:43
thumpermenn0: did you do something about caching the mgo version?03:54
menn0thumper: yes03:54
thumpermenn0: where?03:55
menn0https://github.com/juju/testing/commit/3ccb7d0a3f3412b41f8c1bf000ff7078d28c0af903:55
thumperk03:59
menn0thumper: that doesn't deal with the slowness. it just closes a potential problem.04:00
thumperhmm with mmapv1 storage engine it is only 30% slower04:02
thumperare we prepared to accept a 30% decrease in test speed?04:09
thumperand what other choice do we have?04:09
thumperand should this info drive production choices?04:10
babbageclunkthumper: production usage patterns are very different from the tests though - we very rarely drop databases. :)04:13
thumperyeah...04:13
* thumper needs to organise some trip stuff04:13
babbageclunkjam: ping?05:19
axwwallyworld: oracle bootstraps for me on my trial account FWIW06:16
wallyworldaxw: it is failing because it is starting a yakkety instance (and looking for yakkety tools in /var/lib/juju) even though juju thinks it has asked for a xenial instance. so there's an issue with image selection :-(06:17
wallyworldi will try and remove the yakkety image from my account, that should fix it06:18
axwokey dokey. I think I only have xenial in mine06:18
wallyworldyeah, i'll let gabriel know, just testing to make sure06:19
wpkaxw: re: proxy - this can break things, as it really sets the proxy globally07:22
=== frankban|afk is now known as frankban
wpkaxw: I forwarded you an e-mail from jam describing changes, I don't want to land it without a proper test run, preferably by someone who uses proxying07:45
wpkaxw: there is a 'workaround' (remove systemd code), but that has yet to be decided07:45
jamwpk: how about if we leave the ability to do systemd, but have it disabled, and land the rest ?07:59
jamI think the rest is a clear improvement07:59
jamand we can discuss whether systemd will actually break things08:00
mupBug #1686938 opened: During a destroy-model the units first update/upgrade which delays the destroy process <apt> <delay> <destroy-model> <upgrade> <juju-core:New> <https://launchpad.net/bugs/1686938>09:14
wpkIs anyone willing to review #7204? I know it's big but it's been waiting for almost a month now..09:59
=== fnordahl_ is now known as fnordahl
jamwpk: did you go through it as though you were doing a review as well?11:12
wpkjam: yes, although it is ahuge one so I might have missed some things...11:26
jamwpk: so it seems there is one piece that needs fixing wrt bridge_ports and inet and inet6 stanzas11:27
wpkjam: we should only put one?11:28
jambug #165030411:28
mupBug #1650304: Juju2: 'Creating container: failed to ensure LXD image: image not imported!' <oil> <oil-2.0> <regression> <juju:Incomplete> <juju 2.1:Incomplete> <https://launchpad.net/bugs/1650304>11:28
jamfrom axw: https://bugs.launchpad.net/juju/+bug/1650304/comments/711:29
wpkjam: Reading through bridge-utils, IMHO it's OK to have it but I'll check11:37
jamwpk: it wasn't ok from experience, given the original bug11:38
wpkjam: as I understand that wasn't the problem, and looking at the code it's simply doing for (bridge in bridge_ports) if (bridge not already in the bridge) brctl addif....11:41
jamhopefully thats if "port not already in the bridge" :)11:42
jambut I get your point11:42
jamwpk: are we sure that is true across trusty+xenial+yaketty, etc?11:43
wpktrusty, xenial, zesty11:46
wpk    if [ "$MODE" = "start" ] && [ ! -d /sys/class/net/$IFACE/brif/$port ]; then11:46
wpkI don't have any Yakkety but I don't think they'd change that line just for this one :)11:46
=== tinwood is now known as tinwood_lunch
axwwpk: if you're changing it back (I'm happy to be told I was wrong - again, I'm no expert), can you please run the QA steps from that PR that jam linked to?12:35
axwI mean, the PR linked from the bug that jam linked to...12:35
wpkaxw: willdo, I'm checking how the 'old version' worked and bridge_ports wasn't the only problem (repeated auto, etc.)12:44
wpkaxw: was the problem only with rackspace or with other providers too?12:44
axwwpk: yep, I may well have conflated the other issues. that's the only place I observed and reproduced the issue. I think it was also seen on MAAS by others, but could never confirm myself12:45
rogpeppehere's a tiny little PR for review, controller-specific cookie jars: https://github.com/juju/juju/pull/729412:48
rogpeppeaxw, wpk, jam, wallyworld: if someone manages to review it, i'll owe them a beer or two :)12:49
axwrogpeppe: heh. I've already got a beer and it's 9pm on a a friday night ;)  I'll take a look on monday if nobody does it sooner12:50
rogpeppeaxw: thanks12:50
rogpeppeaxw: this work was the subject of this tweet: https://twitter.com/rogpeppe/status/849963422032723968 :)12:51
axwrogpeppe: I figured :)12:52
=== tinwood_lunch is now known as tinwood
=== freyes__ is now known as freyes
=== frankban is now known as frankban|afk

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!