/srv/irclogs.ubuntu.com/2011/04/22/#ubuntu-ensemble.txt

jimbakerhazmat, this looks provocative. the last line in the formula.log for wordpress/0 - 2011-04-21 23:30:08,060: twisted@ERROR: TypeError: 'NoneType' object is not callable00:39
hazmatjimbaker, hmm00:40
jimbakerso maybe the relation workflows are running, but they are going into a bad state?00:40
hazmatjimbaker, possibly00:40
hazmatsurprised we don't have anything nicer in the traceback. that's unfortunate00:41
hazmatjimbaker, the status looks right00:41
jimbakerhazmat, no, it's missing the relation status info00:41
hazmatjimbaker, one option is to  have a look at the zkshell.. ensemble ssh 0 && /usr/share/zookeeper/bin/zkCli.sh00:42
hazmatjimbaker, hmmm..00:42
hazmatjimbaker, i'll have a look in the morning00:42
jimbakercompare it against this output: http://pastebin.ubuntu.com/597192/00:43
jimbaker(from an earlier run)00:43
jimbakerhazmat, sounds good00:44
jimbakerhazmat, for later consumption - shouldn't this be more than two? zk: localhost:2181(CONNECTED) 26] ls /units --- [unit-0000000000, unit-0000000001]00:54
jimbakerhmmm... maybe not that part after all - i was looking at zk_workflow_identity01:00
jimbakerit looks like it uses the same path for both ServiceUnitState and UnitRelationState01:00
jimbakerbut those are not the same paths01:01
jimbakerbased on looking at those specific classes01:01
jimbakereither i'm confused or ensemble is confused ;)01:01
hazmatjimbaker, they are at the same path01:46
hazmatall the workflows for a unit-agent are managed on a single node01:46
jimbakerhazmat, thanks for the clarification02:16
jimbakerthis would have caused more issues if it were not the case, i guess02:16
hazmatjimbaker np.. sorry i had to run out02:16
hazmatjimbaker, yeah.. it seems strange the workflows on the unit are initialized and showing them as running the but the units weren't up or in an error state which seems strange02:17
hazmati can't think of any reason why that would be the case.02:17
jimbakeranyway, just curious it's happening in us-west now - one more thing to try is in eu-west (if that's the other region completely set up)02:18
hazmati'll take a look at it tomorrow02:18
hazmatjimbaker, it is02:18
jimbakerhazmat, have a good night, ttyl02:18
_mup_ensemble/refactor-to-yamlstate r197 committed by bcsaller@gmail.com06:24
_mup_set not taking any random data, but insisting on a dict in tests (for YAMLState)06:24
kim0Morning everyone08:52
kim0anyone around o/13:06
hazmatkim0, g'morning 15:02
hazmator hello is probably more appropriate15:02
kim0hazmat: hey o/15:30
kim0team on vacation huh :)15:33
_mup_ensemble/merged-alt-region-logging r210 committed by kapil.thangavelu@canonical.com15:59
_mup_merge ensemble-log-level15:59
_mup_ensemble/merged-alt-region-logging r211 committed by kapil.thangavelu@canonical.com16:00
_mup_merge ensemble-log-crash16:00
jimbakerkim0, hi16:07
jimbakerkim0, we have been trying out the alternative region branch. it's not worked for me with deploying our example formulas. you want to give it a try too?16:08
jimbakerit did work w/ kapil however16:08
=== deryck is now known as deryck[lunch]
kim0ec2 has recovered it seems .. I'm writing a small user level tutorial now, but if you need someone else to test that branch, sure16:16
jimbakerkim0, i think it would be useful for sure16:17
jimbakerit's also a good way to play w/ the environments.yaml file16:17
kim0jimbaker: cool, any instructions on using that branch ? I'll try it in a few hours though, now right now16:18
kim0let me know how do I use it, thanks16:20
_mup_Bug #769030 was filed: Enable one control bucket to be used for multiple regions <Ensemble:New> < https://launchpad.net/bugs/769030 >16:27
jimbakeryou just need to configure two things in your environments.yaml file: region - us-east-1, us-west-1, eu-west-1; and ensemble-branch - https://code.launchpad.net/~hazmat/ensemble/ensemble-alternate-regions 16:29
kim0got it16:30
hazmatkim0, i also have one with the logging stuff merged in.. lp:~hazmat/ensemble/merged-alt-region-logging16:30
jimbakerhazmat, sounds good, then we don't use an earlier formula set16:30
jimbakerhave to use16:30
hazmatjimbaker, it didn't work entirely16:30
hazmatjimbaker, i'm trying out trunk in east atm to verify the delta16:31
jimbakerhazmat, sounds good, less mystifying then16:31
jimbakerwe prefer our failures to be consistent ;)16:31
hazmatjimbaker, what's odd is that the unit relation state in zk (in west) is good, but status isn't reporting it, and wordpress isn't running16:31
hazmatalso ensemble-log seems to return 0 even when it fails/errors.16:32
jimbakerhazmat, i just got this weird output in a log16:33
jimbaker2011-04-22 09:20:52,519 unit:mysql/0: twisted ERROR: TypeError: 'Port' object is not callable16:33
hazmatjimbaker, can you paste the full log to pastebin16:34
jimbakerhazmat, will do16:34
hazmatthanks16:34
jimbakerhttp://pastebin.ubuntu.com/597493/16:35
hazmatjimbaker, is this on the open-port/close-port branches?16:35
jimbakerhazmat, no, the alternative region branch, running with trunk r200 formulas16:36
jimbakeri'm going to try the new alt region branch w/ logging merge in now16:36
hazmatjimbaker, hmm.. there are no port classes in ensemble, only in twisted.16:36
jimbakerhazmat, indeed16:36
jimbakerare we sure we got good versions of python, twisted, etc built in these amis?16:37
jimbakerit's as if we got some version skew going on here16:37
_mup_Bug #769035 was filed: Need a top level decorataor on all independent callbacks to do nice error printing <Ensemble:New> < https://launchpad.net/bugs/769035 >16:38
hazmatjimbaker, just stock natty16:39
_mup_Bug #769036 was filed: Ensemble hook cli api needs to do correct exit codes  <Ensemble:New> < https://launchpad.net/bugs/769036 >16:41
hazmatjimbaker, so trunk works for me16:44
jimbakerhazmat, trunk formulas, or using trunk with us-east-1?16:45
hazmattrunk and us-east-116:45
hazmattrying alt-region with us-east-116:45
hazmatthere's nothing in the branch remotely related to units or hooks..16:46
jimbakerhazmat, exactly, that's what is so strange here. some other unexpected dependency seemingly16:46
=== deryck[lunch] is now known as deryck
hazmatjimbaker, the merged-alt-region-logging branch seems to work okay17:37
hazmatin us-east-117:37
jimbakerhazmat, trying it out now17:43
hazmatjimbaker, cool, i'm trying it out in a different region and then i do see a problem17:44
jimbakerhazmat, doesn't work for me this try. speaking of round trip overhead, now doing "watch ensemble status" ;)17:49
jimbakeri had proposed building that in for ensemble status with actual watches, but repeatedly polling like this is the poor man's approach for sure17:50
jimbakerand the interesting thing is seeing the relation service state disappear... crazy17:51
bcsaller jimbaker: the plan with status is to have a mode where it blocks on a topo watch and then reissues the status in a loop17:57
bcsallerunlike watch it knows when things change17:57
jimbakerbcsaller, yes, intelligent watches :)17:57
jimbakerbcsaller, good to know it is in the works17:57
jimbakerbcsaller, doing "watch ensemble status" is still useful right now17:58
bcsallergood17:58
jimbakeri think once we have the relation settings added to ensemble status, that's going to be pretty awesome17:59
hazmataha18:42
hazmatunit agents are dead18:42
jimbakerhazmat, that would make sense18:43
hazmatjimbaker, the fact there is nothing in the log is rather frightening18:43
jimbakerhazmat, indeed. i'm just about to testing us-east-1 w/ trunk at r200, which is the last good one i observed18:44
jimbakertry testing18:44
hazmatjimbaker, i was able to get trunk latest from merge-alt-region-logging working on us-east-118:45
jimbakerhazmat, i was unable to get that - i was just getting "it works" plus empty relation service states18:46
jimbakerprobably because of dead unit agents18:46
hazmatjimbaker, no.. i'm actually got populated relation states with dead units agents18:46
jimbakerhazmat, crazy18:46
hazmatjimbaker, the variations i'm using are trunk, ensemble-alternate-region, merge-alt-region-logging18:46
hazmatall with formulas from that are the equivalent of the trunk versions18:47
hazmati've seen trunk and merge-alt-region-logging working on us-east-118:47
jimbakerhazmat, i have tried all of those, both with us-east-1 and us-west-118:47
jimbakernothing is working end-to-end for me today18:47
jimbakereverything starts off fine... then it just mysteriously fails18:48
jimbakerhazmat, maybe i should rebuild my buckets, don't know if that's an issue based on bug 76903018:49
_mup_Bug #769030: Enable one control bucket to be used for multiple regions <Ensemble:New> < https://launchpad.net/bugs/769030 >18:49
hazmatjimbaker, yeah.. i switch my buckets when changing regions atm18:49
hazmatthey should recover fine18:50
hazmatie. detect dead instance stale file, and create new one18:50
jimbakeralthough given how the control bucket works, i wouldn't expect it to impact18:50
hazmatwhich is what they normally do, or we'd be cleaning it all the time18:50
jimbakerbcsaller, hazmat - standup?19:00
hazmatjimbaker, sounds good19:00
bcsallerI have little to report, but sure19:00
jimbakerthen it will go fast :)19:01
_mup_Bug #769120 was filed: Ensemble status shouldn't report dead units based soley on state, but also on presence. <Ensemble:New> < https://launchpad.net/bugs/769120 >19:09
hazmathttp://dtrace.org/blogs/bmc/2010/08/30/dtrace-node-js-and-the-robinson-projection/19:19
hazmathttp://wiki.joyent.com/display/node/Using+Cloud+Analytics19:20
hazmathttp://dtrace.org/blogs/dap/2011/03/01/welcome-to-cloud-analytics/19:20
hazmatallergies miserable.19:30
jimbakerhazmat, you really should try hot yoga. i found it really helps clear sinuses and it would seem prevent allergic symptoms too19:39
hazmatjimbaker, sadly hot yoga isn't my thing20:15
hazmatdoes anyone understand apport handling of core files?20:16
hazmatjimbaker, can you give a hook at this look trivial patch for trunk.. https://pastebin.canonical.com/46611/20:28
hazmatnot sure if argparse version changed, but i currently have these two tests failing for me on trunk20:28
hazmatbcsaller, ^ if you have a moment and could look at the trivial.. i'm waiting on that before doing some merges.20:32
bcsallerhazmat: the change to generation happens outside the patch?20:33
hazmatbcsaller, yeah.. the error output change cause is not clear, i just matched to what the current production is20:33
bcsallerseems like it should have been caught when the change happend and the tests would have broken. 20:34
bcsallerI'm fine with the change, but want to understand how it happeded20:34
bcsallerhappened20:34
hazmatbcsaller, yeah.. i'm bisecting the last 5revs now to double check20:37
hazmatbcsaller, just went back a month history, still getting the errors, i'd have to guess its an argparse change and rev increment20:40
bcsallermaybe, yeah20:42
bcsallerthanks for checking20:43
hazmatbcsaller, seems to be a change between pypi version of argparse and the builtin 2.7 version20:43
bcsallerahh, natty on 2.720:43
bcsallermakes sense now20:43
hazmatnot sure if that's it.. also happens with python 2.6 using the distro argparse20:45
hazmatbut it does work with the pypi argparse 1.2.120:45
hazmatwhere as the distro version (for 2.6) is 1.1-120:46
hazmatno.. actually it didn't work 1.2.120:47
hazmati had used a patched trunk to test that one20:47
_mup_ensemble/trunk r205 committed by kapil.thangavelu@canonical.com20:48
_mup_argparse error output seems to have changed, match tests to match current output [trivial][r=bcsaller]20:48
_mup_ensemble/trunk r206 committed by kapil.thangavelu@canonical.com20:54
_mup_merge ensemble-log-level [a=niemeyer][r=niemeyer][f=767364]20:54
_mup_This fixes a problem with the ensemble-log hook CLI API,20:54
_mup_not correctly taking a -lLOG_LEVEL option.20:54
_mup_ensemble/trunk r207 committed by kapil.thangavelu@canonical.com20:57
_mup_merge ensemble-log-crash [a=niemeyer][r=kapil][f=767391]20:57
_mup_This fixes a traceback when attempting to use the ensemble-log20:57
_mup_hook CLI API from hooks.20:57
hazmatjimbaker, the principia trunk seems to work with trunk, but the trunk formulas don't on natty.21:36
hazmatstill seeing unit agents die though21:44
hazmathmm21:48
hazmatjimbaker, txzookeeper unit tests are segfaulting with default natty it appears.23:18
hazmatjimbaker, bcsaller do you run with the package zk or a local build?23:19
bcsallerlocal23:19
hazmatyeah.. we've been getting away with not having our own packages23:20
hazmateven though there were known issues with the lucid one, it still worked for our uses.23:20
hazmatdoesn't appear to be the case with natty, we're going to need package trunk (3.4) or backport perhaps23:20
jimbakerhazmat, curiously i'm reinstalling stuff now23:31
jimbakeris everyone running on python 2.7 at this point?23:32
hazmatjimbaker, i am23:32
hazmatjimbaker, i test with 2.6 occasionally as well23:32
jimbakeri'm now getting some test errors on trunk, just building a new virtualenv to try 2.7 out23:32
hazmatjimbaker, what does ./test need in a ZOOKEEPER_PATH ? just the zkServer.sh  script?23:36
jimbakerhazmat, iirc, it doesn't use zkServer.sh23:36
hazmathmm23:37
hazmatjimbaker, looks like i just need a directory with the jar23:37
jimbakerhazmat, sounds about right23:38
jimbakerit looks for both dev and prod installs23:38
jimbakerwhich are laid out differently23:38
hazmathmm.. yeah.. it doesn't work just pointing to a directory of jars.. which is what the deb does for install into /usr/share/java23:39
jimbakermakes sense23:39
jimbakerfortunately easy enough to change in ensemble.tests.common.ManagedZooKeeper23:40
jimbakerbasically a variant of zkServer.sh23:40
jimbakerwas that adjusted in the deb package?23:40
hazmatjimbaker, just had to fix the test/common get class path to not hardcode src release stuff23:43
hazmatjimbaker, debian uses /usr/share/java for java libs23:44
jimbakerwhich was the whole point of that classpath property, so that's cool23:44

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!