/srv/irclogs.ubuntu.com/2011/10/19/#juju.txt

SpamapShazmat: does that mean that some degree of polling is necessary?00:06
hazmatSpamapS, actually it was a bad test, watches are sent reliably in the event disconnection, when the client reconnects00:06
hazmatSpamapS, i'm trying to tackle our zk error handling with eye to solving disconnect problems00:07
hazmati think i've got a good game plan, just writing some tests to verify behavior00:07
hazmati ended up having to create a tcp proxy, else the whole thing is way too timing dependent00:08
hazmatMITM FTW ;-)00:08
SpamapSnice00:08
SpamapSsad reality.. my laptop is slower than EC2 for deploying00:38
SpamapSiterating on a 2 node ceph cluster over and over takes about 3 minutes per deploy/add-unit ..00:39
SpamapSEC2 can do it in 200:39
SpamapS(assuming my units go from pending to running in the usual 30 seconds)00:39
jimbakerbcsaller, if we ever have a sprint in boulder: http://www.yelp.com/biz/zudaka-healthy-latin-food-boulder-2 - all vegetarian (and vegan friendly), and it's south american cuisine, which is the unusual part01:48
bcsallerjimbaker: nice01:50
_mup_txzookeeper/four-letter-client r44 committed by kapil.foss@gmail.com02:12
_mup_four letter command admin client02:12
=== _mup__ is now known as _mup_
_mup_Bug #878114 was filed: launch multiple bootstrap nodes <juju:New> < https://launchpad.net/bugs/878114 >11:43
uksysadminhi guys12:09
uksysadminanyone used juju with OpenStack - is it as simple as configuring the environments.yaml and away you go or is there more to it?12:10
fwereadeuksysadmin: I haven't used it myself, so I'm afraid I don't know the details, but I'm pretty sure people have12:13
fwereadeuksysadmin: if hazmat's around he may be able to tell you more details12:14
uksysadmincheers - I'm about to have a go, but nice to know any gotchas up front12:14
fwereadeuksysadmin: we've certainly fixed bugs related to openstack compatibility12:15
fwereadeuksysadmin: please do let us know if you find any more :)12:15
uksysadminwill do12:25
uksysadminbefore I start - I'm using 11.10 - do I need to add the ppas as per the documentation or is it ok to use the ones provided in universe?12:47
uksysadmin(ignore - all is going ok so far... :))12:54
fwereadeuksysadmin, sorry I missed you, the ones in universe should be just fine13:04
uksysadmingreat cheers13:15
uksysadminone things (and I'll cross post) - to do a bootstrap of a server under openstack, do I need 11.04/11.10 and if so where can I get suitable images from?13:16
HarryPandauksysadmin: look at Orchestra13:16
uksysadminCheers HarryPanda, I am. just getting the basics going first by being able to do the juju bits13:17
uksysadminalthough... that comes with images...13:17
uksysadminI see your thinking13:17
uksysadminno dice (if that was what you were implying)13:21
uksysadminseeing if these work instead: http://uec-images.ubuntu.com/server/oneiric/current/13:35
hazmatuksysadmin, its pretty much as simple as configuring the environment13:56
hazmatuksysadmin, use euca-describe-images.. to find an image that you can specify in environments.yaml13:57
hazmatuksysadmin, course that has to be set up in openstack glance14:08
uksysadminsure, cheers hazmat14:12
uksysadminkim0, (or anyone) I'm trying to get the juju charms as per https://www.linux.com/learn/tutorials/495738-conducting-clouds-the-ubuntu-way-an-introduction-to-juju and I get bzr errors15:18
uksysadminnot only under 11.10 am I getting a warning about bzrlib being different to bzr (fresh installs still get this) it says15:18
uksysadminbzr: ERROR: Not a branch: "http://bazaar.launchpad.net/~charmers/charm/oneiric/mysql/trunk/sql/".15:19
uksysadminif I forget the bzr repos and just do a juju deployment it says it can't find store.juju.ubuntu.com15:20
kim0crap .. principia is now gone, this article has been waiting at linux.com for too long15:21
uksysadminif it makes anyone feel better my juju bootstrap worked a charm ;-)15:21
uksysadmind'oh15:21
uksysadmingood article though kim0 - is there a slightly modified version you've done elsewhere?15:22
kim0uksysadmin: prolly the most updated would be docs: https://juju.ubuntu.com/docs/15:22
uksysadminI followed that too/similar stuff -but failed at: juju deploy --repository=examples mysql15:23
SpamapSoh wow it used principia?15:23
SpamapSyeah thats been gone for like 6 weeks15:23
uksysadminDNS lookup failed: address 'store.juju.ubuntu.com' not found: [Errno -5] No address associated with hostname.15:23
uksysadmin2011-10-19 16:20:08,676 ERROR DNS lookup failed: address 'store.juju.ubuntu.com' not found: [Errno -5] No address associated with hostname.15:23
SpamapSkim0: will they let us edit it?15:23
SpamapSuksysadmin: the docs haven't been updated in a while.. its supposed to be   juju deploy --repository=examples local:mysql15:24
* SpamapS is getting rather annoyed that the website still says the wrong thing15:25
uksysadminok I'll see if that makes a difference... though strange if it fixes a DNS error...15:25
SpamapSuksysadmin: its not a "DNS error" .. the store isn't live yet.15:25
SpamapSuksysadmin: though we are pondering changing it so local: is always assumed15:26
SpamapSuksysadmin: bug 872164 if you'd like to comment/track it15:27
_mup_Bug #872164: [Oneiric] Cannot deply services - store.juju.ubuntu.com not found <juju:In Progress by hazmat> < https://launchpad.net/bugs/872164 >15:27
uksysadminta - will do15:28
uksysadminin the meantime - how do I get those lucky charms?15:28
_mup_Bug #873907 was filed: Security group on EC2 does not open proper port <juju:New> < https://launchpad.net/bugs/873907 >15:28
hazmatuksysadmin, easiest way is to apt-get install bzr,  bzr branch lp:charm-tools, cd charm-tools.. ./charm get-all ../path/to/place-to-store charms15:29
hazmater.. s/bzr/mr15:29
hazmatthat will fetch all the charms of lp15:30
hazmats/of/off15:30
uksysadmincheers - no docs mention that (or ones that I'm being referred to)15:30
hazmatjuju.ubuntu.com/Charms should mention it15:30
SpamapShazmat: we're about to get re-flooded.. linux.com article went out with the same problems.15:31
uksysadminwould be good if the tutorial at /docs/ mentioned it then...15:31
hazmatuksysadmin, we've been working on a charm store, that's directly integrated with the client, its not expected that folks will normally have to go down the charm-tools road15:31
hazmatSpamapS, bummer15:31
uksysadminsorry for the bearer of bad news - was good up until the point about principia15:31
hazmatuksysadmin, we've got a technical problem getting our docs updated that we're waiting on a sysadmin to fix..15:31
uksysadminoh dear - I'll not pester you guys.  I've probably got enough to get me going, thanks.15:32
hazmatuksysadmin, its very good to know where the bad docs are ;-)15:32
hazmatbut the rest should be good15:32
uksysadmin:)15:32
SpamapSuksysadmin: for 'lp:principia' you can replace it with 'lp:charm' and that should solve those problems15:40
SpamapSI think somebody already said that15:40
uksysadminok, I'll give that a shot whilst the charms are checking out using the tool15:41
SpamapSkim0: btw, juju.ubuntu.com/docs is actually quite out of date at the moment. We're still trying to figure out why its not updating from the bzr tree.15:42
uksysadminon my 11.10 machine I did the tools getall, and in the directory I specified I needed to do a quick symlink to the current dir: (ln - s . oneiric) as it was failing with 2011-10-19 16:43:37,007 ERROR Charm 'local:oneiric/mysql' not found in repository ...15:45
hazmatuksysadmin, hmm.. yeah. the charms need to prefix with their release series15:46
uksysadminafter that it seems to be working15:46
uksysadminone could say its working like a charm15:47
uksysadmin(are there openinings for a PR person? ;-))15:47
SpamapSuksysadmin: too easy. ;)15:48
robbiewin terms of the docs needed to be updated from bzr....if someone has hosting space and time to setup the docs, we can fix this fast than apparently IS can15:50
robbiewat least until they are able to respond15:50
hazmatrobbiew, sure i can do it15:53
hazmatrobbiew, i'm tired of waiting15:53
* hazmat setups a dns entry15:53
robbiewhazmat: cool15:53
SpamapShave we tried #is yet?15:53
hazmatSpamapS, i tried them last week, someone (?) took a look around didn't see the cron job15:54
hazmator where to update, feel to try again15:54
SpamapSon it15:54
SpamapSticket#?15:54
hazmatSpamapS, 4845615:55
SpamapSty15:55
uksysadminthanks all - apart from environmental restrictions (you try sanely running virtual under virtual) all seems to be good!16:03
SpamapSLXC isn't virtual. :)16:03
SpamapSits just contained16:03
SpamapSbut yeah, my box hits a load of 12 quite often16:03
uksysadminnot running under lxc - vbox16:04
uksysadmintemporary issue fortunately16:06
uksysadminhometime here in the good ol' uk. thanks for your help.16:06
hazmatugh.. mongodb randomly restarted on me16:07
SpamapShazmat: chaos monkey support.. disable with --no-chaos-monkey16:19
hazmatSpamapS, lol16:20
fwereade_I was thinking we should have a chaos-monkey built into juju16:20
SpamapSWe do, his name is Ben and he's a vegan16:21
jimbaker:)16:22
jimbakeri'm increasingly vegan. except for honey. or fish. also, i couldn't give up yogurt. and i have a soft spot for cheese, especially parmigiano-reggiano. but perhaps a little more vegan than not ;)16:25
robbiewSpamapS: hazmat:  docs fixed with hazmat's workaround until IS can respond16:27
robbiewhttps://juju.ubuntu.com/Documentation  should have the right stuff16:27
SpamapSjimbaker: maybe you just hate dairy cows and bees16:28
hazmatjimbaker, thats awesome ;-)16:28
jimbakerSpamapS, to the contrary, i love them too much ;)16:28
SpamapSrobbiew: at least we have somewhere to send them now. :)16:29
SpamapSrobbiew: we should really fix that frame to be an iframe so there aren't two scroll bars.16:29
robbieweh..whatever16:29
robbiewSpamapS: now you16:31
robbieware just trying to get cute16:31
SpamapSno.. the embedded window is way too small to hold the whole page16:35
robbiewSpamapS: I can change that...one sec16:44
robbiewSpamapS: reload16:46
robbiew;)16:46
SpamapSrobbiew: better than before :)16:46
SpamapSI wonder if there's a way to say "embed this and make it as big as it wants to be"16:47
* SpamapS *hates* html16:47
robbiewprobably16:47
robbiewone sec...let me try16:47
hazmatthe doc html should scale down pretty well16:47
robbiewbah! I haven't written html in forever....Spamaps, it's a wiki, so feel free to try :D16:52
* SpamapS will just use the direct link16:54
robbiewlol16:54
SpamapSSo, my new favorite use for the local provider is to spin up a giant EC2 instance and use it there16:54
SpamapSbecause it just *kills* my laptop16:55
SpamapS$0.50/hour for an m2.xlarge is better than 40 $0.08/hour m1.small's per hour ;)16:55
robbiewlol16:56
* SpamapS wonders if lenovo makes a thinkpad w/ 12GB16:56
SpamapSjust need 8GB of RAM for charms and the rest for compiz16:56
SpamapSheh.. piping 'debug-log' through 'ccze' makes the day a lot more fun I have to say17:09
SpamapSoooooooo17:10
SpamapSjuju status | ccze -m ansi17:11
SpamapS*pretty*17:11
hazmatinteresting17:15
SpamapSSo, with peer relations.. there needs to be something analogous to 'remove-relation ; add-relation'17:47
SpamapSI have all these idempotent hooks, I want to run them again17:47
hazmatSpamapS, i don't follow17:59
hazmatSpamapS, you can remove a peer relation, and add it17:59
hazmatSpamapS, juju does auto activate peer relation is all17:59
SpamapSOH18:03
SpamapSvery useful. :)18:03
* SpamapS did not actually *try* removing it18:03
SpamapSso thats how I can get refreshes after upgrading charms18:03
SpamapSTho that also calls broken.. which will likely take services down18:04
hazmatSpamapS, m_3 made a nice suggestion on how to add relation iteration capabilities18:04
hazmatbefore we sort of blocked on the anonymous relations from a server provides.. but the easy solution is to just qualifies though during iteration with the end point service name in addition to the local relation name18:04
SpamapSEPARSE18:06
SpamapShuh?18:06
hazmatSpamapS, just thinking of making relations addressable from upgrade hooks18:07
SpamapS*that* is 100% necessary18:07
SpamapS(you may recall, I suggested very early that upgrade would require re-running every hook)18:07
hazmatSpamapS, one of the issues is that for something that provides an interface, it could have multiple relations from things that require to effectively the same named relation interface18:08
hazmatie. provides creates what are effectively anonymous/non-addressable relations18:08
SpamapSbug 873116 and bug 767195 may be duplicates.. both I think opened by me. ;)18:08
hazmatan easy way to qualify those would be to just suffix the endpoint service name18:09
hazmatso they can be iterated and addressed in non ambigious fashion18:09
SpamapSrelation-get variable_name [ unit_name ] [ relation_name ]  ?18:09
hazmatmysql charm ... list-relations -> db-wordpress, db-drupal  .. to pass to relation-get18:10
SpamapSOh the active relations18:10
SpamapSyeah thats bug 767195 .. you opened it.. I recall discussing this a while back18:11
hazmatyeah...18:11
SpamapSVery high order stuff.. important, but can be worked around for now.18:13
SpamapSIt would be quite useful in the ceph charm. What I have to do there is just store relation data locally and keep regenerating stuff from that.18:13
hazmatfwereade_, any interest in looking at that? or do you want to roll with ha stuff, i can comment on your proposal on the bug18:13
SpamapSfwereade_: btw, great discussion on the SSH key management stuff. I feel like that is going to be really cool when we get to it.18:14
hazmati just sent out a large mail on regarding conn failure and session expiration analysis and what my plan is18:14
SpamapShazmat: oh good. :) I just got hit by it18:14
hazmatSpamapS, yeah.. making local provider work through hibernate is a great test scenario ;-)18:14
jimbakerthat would definitely be nice18:15
SpamapSfor me it just stops working after lunch18:16
_mup_juju/ssh-passthrough r408 committed by jim.baker@canonical.com18:33
_mup_PEP8, PyFlakes, docstrings18:33
fwereade_SpamapS, thanks :)18:36
fwereade_hazmat, sorry, need to catch up on context, was putting laura to bed18:36
hazmatfwereade_, no worries18:36
fwereade_ah, I just saw that email, and suspected it would be relevant to my interests :)18:36
hazmatfwereade_, the context for the relation stuff is in the bug links, just responding to your other email re ha, and then going to try and do some reviews18:37
hazmatthe review queue is overflowing18:37
hazmatfwereade_, bcsaller, jimbaker if you have some time, we really should get some more reviews in18:37
fwereade_hazmat, duly noted :)18:38
jimbakerhazmat, will do, just want to get this ssh passthrough stuff done first18:38
bcsallerhazmat: I'll try for a couple today18:38
hazmatthanks guys18:38
jimbakertaking too long but almost there (too many things that should be easy in argparse, aren't ;) )18:39
fwereade_and, hazmat, I can happily work on whatever seems most sensible to you18:39
fwereade_hazmat, the HA stuff is interesting, but whether or not I should be working on it can probably be determined by how much I appear to have been on crack while writing that comment ;)18:40
hazmatfwereade_, we should do a round discussion on it, the bug is probably a decent place for it, we can decide after that.. there's another low hanging but critical task, which is upstartifying all the agents18:41
hazmatcurrently only local provider uses upstart for the unit agent, but creating an upstart module that can be used for all the agents would be a huge win on the way to ha18:41
SpamapS+100 for that18:42
jimbakerit would seem that the two features are hugely related18:42
* hazmat grabs some coffee18:43
jimbakeri still think just relying on the fact that the provisioning agent can be restarted + a leader election would suffice for provisioning agent HA, for now. not the scalable solution of course18:43
SpamapSThe're related in that they both will help with the resiliency of the system18:43
SpamapSjimbaker: ZK is mroe important18:43
SpamapSmore rather18:43
jimbakerSpamapS, in terms of upstart of ZK?18:44
SpamapSAnd as long as you're going to run two ZK's, why not run two provisioning agents? ;)18:44
SpamapSjimbaker: in terms of HA for bootstrap18:44
jimbakerSpamapS, exactly18:44
SpamapSfor upstart.. thats more about being able to reboot nodes18:44
jimbakerevery ZK should have a corresponding provisioning agent, just makes sense18:44
jimbakerin terms of layout18:45
SpamapSthere's a second task for rebooting, which is making sure that agents that are disconnected can recover from a long absence gracefully.18:45
SpamapSbut really.. if you can just reboot them and block zk changes while something is gone.. thats a huge step forward.18:46
bcsallermaybe something like: is_bootstrap = fib(num_active_machines +2) while num_active_machines < 4, on all those machines we run a PA and ZK and the lowest machine id is leader18:46
jimbakeri think reboot is fine, certainly that works for the provisioning agent18:46
hazmatfor multi pa, i think fine grain locks are preferrable18:46
hazmattheir is parallel work to be done18:46
hazmatfor zk, its unesc. zk does its own leader election18:46
jimbakerhazmat, exactly, that would support better scalability18:46
hazmatand the clients can connect to all of them and route appropriately18:47
SpamapSbcsaller: simple and elegant.. I would have no problem with that solution. :)18:47
jimbakerhazmat, leader is only for determining whether a given provisioning agent is active, other than the too simple solution i have in mind :)18:47
hazmatupstart is a good first step though18:47
SpamapStrue, no need for a leader PA18:47
jimbakersorry, under the too simple solution18:47
hazmatthe zk service and pa just become another service managed by juju is the end goal i'd like get to18:48
hazmatmake it just another service managed through juju18:48
bcsallerhazmat: +118:48
jimbakerit just removes the SPOF. but i agree with the end goal18:48
bcsallerthought like we've mentioned namespaces for things like status seem even more important then18:49
bcsallerbecause you don't want to see juju internal services by default18:49
SpamapSI don't know if its that clear18:49
SpamapSsome would say they want to see *everything* they are responsible for.18:50
SpamapSI mean, every machine is effectively related to bootstrap18:50
hazmatbcsaller, yeah.. one step at a time though... namespacing might be a nice alternative to do strange internal service name checks for protection... with an option to show all namespaces18:50
hazmatit also helps when we start to exploring hierarchies of services18:50
SpamapSCars existed a long time w/o seatbelts. :)18:51
SpamapSI don't know if you need safety up front. Just sanity.18:51
bcsallerdid rockets ever not have seat belts though?18:51
SpamapSOh dear.. please don't tell me I've wandered into the rocket science lab? ;)18:52
jimbakeragain, wouldn't it make more sense to follow this plan: remove the SPOF by just having one active provisioning agent + some number of standbys. the PA is extensively tested to follow its design, that it can always be restarted18:55
jimbakerthen implement a better provisioning agent, which in fact does parallelize work18:55
jimbakerthis can also get to the more desirable quality that the PA is just another service18:56
SpamapSHonestly18:57
SpamapSeven the most active site with 1000's of nodes18:57
SpamapSI doubt can overwhelm a single PA18:57
SpamapSZK will be the choke point there18:58
jimbakerSpamapS, agreed. i think we might have some issues in how we iterate the topology, etc. but not in instructing the cloud provider on what to do next18:58
jimbakerSpamapS, for instance, i think we could be smarter about the watch mgmt in the expose logic in the PA. too much duplicate work when it's just operating on the the toplogy node. but that's just a matter of having a better watch setup. (or alternatively, moving expose to the machine agent!)19:01
SpamapSI kind of like how we're taking advantage of the provider firewall19:01
jimbakerSpamapS, sure, and it is transparent in the usage, so point well taken19:03
SpamapS2011-10-19 19:03:20,032: hook.output@ERROR: + relation-get hostname19:04
SpamapS2011-10-19 19:03:20,222: hook.output@ERROR: Traceback (most recent call last):19:04
SpamapSFailure: juju.hooks.protocol.NoSuchUnit: The relation 'mon' has no unit state for 'ceph/11'19:04
SpamapSSo...19:04
SpamapSin a departed hook..19:04
SpamapSI can't get the relation data?19:04
hazmatSpamapS, the provider firewall is just an optimization for ec2 at this point, it impl as the sole network security  prevents security for other providers19:07
SpamapSYeah seems like both are useful19:07
hazmatin the future with a machine level firewalls, the ec2 firewall can be maintained just as a provider specific optimization19:07
hazmati've got another long email in the works on that topic19:08
hazmattime for a doctor's appt, bbiab19:08
evandevdoes anyone have any experience with installing hbase on ec2? I am trying to find any documentation. I currently have Juju installed with hadoop-master / slave charms and would like to get Hbase running19:50
SpamapSevandev: IIRC, the hadoop master and slave charms only give you HDFS19:51
evandevCorrect, I was just wondering if anyone had taken it a step further and installed hbase on top of that19:51
SpamapSHave not.. but if you want to take a crack at it, I'm sure m_3 would be interested in helping. :)19:52
SpamapSm_3: ^^19:52
_mup_Bug #878462 was filed: resolved --retry does not retry the hook <juju:New> < https://launchpad.net/bugs/878462 >19:55
evandevcool thanks19:57
SpamapSevandev: can HBase and HDFS share the same set of namenode/slaves ?19:58
SpamapSevandev: if so it might be best implemented as a config option on top of the existing hadoop charms.19:58
SpamapSugh.. departed hook executes in parallel with unit removal...19:59
SpamapSno way to gracefully remove a ceph monitor node from another one then. HRM19:59
SpamapSI suppose the stop hook would work19:59
evandevYea that was my next question20:00
evandevI thought about modify the charms20:00
SpamapSevandev: if you have pulled them down with 'charm getall', simplest thing to do is to unbind the charm, commit your changes, and then push to a bzr branch.20:00
evandevmodifying*20:00
* SpamapS realizes thats not in a wiki page and remedies the situation20:00
robbiewSpamapS: thnx20:01
evandevahh I think ill try that20:01
evandevthanks SpamapS20:02
hazmatSpamapS, you mean in parallel across different units i assume, yeah..21:00
hazmatSpamapS, stop hooks are not executed atm, nother topic for discussion21:01
SpamapShazmat: so, yeah, we need to provide graceful shutdown for clusters21:06
SpamapShazmat: I'm thinking that you should not actually destroy the unit until its departed hooks have finished on all related nodes21:07
SpamapSthis is already well documented in bugs tho..21:07
_mup_juju/ssh-passthrough r409 committed by jim.baker@canonical.com21:40
_mup_Test for parse errors21:40
jimbakerbcsaller, looks like you meant to do an approve on https://code.launchpad.net/~hazmat/juju/unlocalize-network/+merge/7947622:02
bcsallerjimbaker: I thought the second person did the approve22:03
jimbakersure, and i can do that, but your comment was just a normal comment, not an approve comment22:04
jimbakerbcsaller, ^^^22:04
bcsalleryeah, it should have been an approve then22:04
jimbakerok, i've just approved it, since it's pretty clear in the merge proposal the intent22:05
jimbakerbcsaller, do you want to propose your branch for bug 873643? it looks good to me22:11
_mup_Bug #873643: config values are re-set to their default values when only one is changed <juju:In Progress by clint-fewbar> < https://launchpad.net/bugs/873643 >22:11
bcsallerjimbaker: I never got a reply from SpamapS about what he wanted done, I can propose my extension of his branch or he can merge and repush.22:12
jimbakerSpamapS, maybe you can delete your merge proposal for that bug? i don't know what the process should be, but i'm ready to approve bcsaller's work, between your trivial change and the reasonable test, it looks good with just the caveat that there's a grammatical error22:12
jimbakerin a comment22:12
hazmatbcsaller, you can just unlink the old branch and link yours22:13
hazmatits been pending for a while22:13
bcsallerhazmat: that might be best then22:13
jimbakerand it's high22:13
hazmaton life22:13
jimbakerit's really impacting actual usage, so we should get it in22:13
SpamapSYeah if bcsaller's is complete please do move forward22:13
SpamapSthre22:15
SpamapShandled22:15
SpamapShttps://code.launchpad.net/~bcsaller/juju/config-do-not-overwrite/+merge/7989022:15
SpamapShazmat: great email about the timeouts. I think I hit that just when my system load gets high because some things take 3+ seconds23:22
SpamapShazmat: I think this may be another "production" bug.. when this hits.. the agents basically are dead in the water.23:40
hazmatSpamapS, ?23:41
SpamapShazmat: The weirdness I reported last week seems to be a timeout23:41
hazmatSpamapS, what's the defect?23:41
SpamapSbug 87590323:42
_mup_Bug #875903: Zookeeper errors in local provider cause strange status view and possibly broken topology <juju:New> < https://launchpad.net/bugs/875903 >23:42
hazmatSpamapS its two different issues23:43
hazmatSpamapS, one the session expired, so the units are dead23:43
hazmatSpamapS, two status was reporting based on the recorded state instead of taking into account the presence of the connected agent23:43
hazmatthe second issue has been addressed by a branch fwereade_ has in the review queue23:43
hazmatthe first by the timeout email23:44
SpamapSOk23:44
SpamapSI have been running into a lot of the weird status..23:44
SpamapSnot suspending/hibernating/anything23:44
SpamapSjust using it through the day23:44
hazmatSpamapS, hmm.. with high load / swapping?23:45
SpamapSsome load, no swap23:45
SpamapSdisk is definitely *slammed*23:45
hazmatSpamapS, i'd probably attribute it to the same23:45
SpamapSI have no doubt that occasionally some things block for 3 seconds23:45
SpamapSwhich is why I moved my testing to an m2.xlarge for a while today23:46
SpamapSwith a giant tmpfs volume23:46
SpamapSno such issues over there. :)23:46
SpamapSat $0.50/hour, its a bargain compared to dealing with my silly laptop23:47
SpamapS2011-10-19 23:48:45,173:480(0x7fa28ae7f700):ZOO_ERROR@handle_socket_error_msg@1621: Socket [192.168.122.1:48263] zk retcode=-112, errno=116(Stale NFS file handle): sessionId=0x1331e917b100004 has expired.23:48
SpamapS 16:49:25 up 3 days, 17:12,  2 users,  load average: 9.03, 5.04, 3.2623:49
SpamapShazmat: so .. yeah.. this is frustrating.23:51
* SpamapS realizes he's late to pick up the little one and signs off for the day23:52
hazmatSpamapS, yeah.. the fix is actually pretty small and straightforward, just needs some good tests23:52
hazmati'm in progress on it, but trying to take some time today to reviews23:52
hazmatSpamapS,  is suspect on ec2 its the vagaries of virtual and the load from the multiple units23:54
hazmats/is/i23:54

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!