/srv/irclogs.ubuntu.com/2011/05/31/#ubuntu-ensemble.txt

=== koolhead17 is now known as koolhead17|home
poolieanyone here?06:33
TeTeTno developers I fear06:49
kim0Morning everyone09:11
hazmatgood morning12:43
kim0hazmat: morning :)12:46
niemeyerhazmat, kim0: Yos!14:39
niemeyerhazmat: Good extended weekend?14:40
hazmatniemeyer, indeed.. lots of play time14:45
kim0hehe14:49
kim0Love the Ensemble audition program14:50
kim0wonder if we could have a miniature public version of it (like for a week, to write a formula)14:50
kim0The aws trial thing was a big success, I suppose we could do something similar14:51
niemeyerkim0: Maybe.. let's see how the program goes14:59
=== crazed_ is now known as crazed
=== crazed is now known as Guest21517
=== Guest21517 is now known as crazed
=== crazed is now known as crazed_
=== crazed_ is now known as crazed
SpamapSgoood morning ensembladeros16:47
kim0SpamapS: morning o/16:50
kim0ensembladeros .. mm can we pick something easier to say :)16:51
SpamapSEnsemblanators?16:52
SpamapSEnsemblasters16:53
jimbakervisions of the toy story ride at disney in my head right now16:54
SpamapSEnsemble Space Rangers17:10
robbiewbcsaller: thanks for stepping up for the demo at structure...I wasn't thrilled about flying to SF for a day :D17:17
bcsaller robbiew: no problem17:17
bcsallerI like talking with people about the project17:17
robbiew:D17:18
robbiewbcsaller: how'd the cloudcamp go last week?17:18
robbiew(or whatever it was called, heh)17:18
bcsallerat first I was worried that it was too high level with lots of exec types running around, but by the end I think I made some good contacts and generated some interest 17:19
bcsallerno one there had a good answer for orchestration, some didn't even understand they'd hit this problem but it became very clear in talking with people that a sysadmin named Tim sitting in the back room wasn't a viable orchestration solution17:20
kim0SpamapS: 17:23
kim0SpamapS: Ensemblasters actually sounds great :)17:23
robbiewbcsaller: :)...sweet17:23
SpamapSbcsaller: dude, so harsh.. Tim's just tryin to feed his family. ;)17:24
robbiewlol17:24
bcsallertim shouldn't have to work so hard, we offer better tools17:24
SpamapSTrue, though without work as a distraction Tim might head back to the bottle.. ;)17:26
bcsallerTim should go back to enchanting 17:26
SpamapSbcsaller: I'm going to a similar event down in SD .. how would you suggest approaching it?17:27
jimbakerat pycon, one reaction i had is that "tim" might want to ssh into an indiv machine and fix it manually. wrong, wrong :)17:27
jimbakerstill an attitude that needs to be addressed17:27
koolhead17beep beep :)17:29
kim0koolhead17: hey17:29
SpamapSActually to that point jim, the current way to do it "the right way" is to modify the formula and upgrade the service..17:29
SpamapSjimbaker: I think there's a need for a parallel ad-hoc command interface17:30
koolhead17kim0: am heading for niemeyer docs as you suggested.17:30
kim0koolhead17: cool!17:30
bcsallerSpamapS: I really try to vary it to the audience. I went in, listened to the talks and the questions people had about other things and adjusted what I had to say. This time I was able to focus a lot of talk around the idea "Elasticity is one of the key properties of cloud. As you become elastic the way you need to think about and model services changes..." and so on. It was buzzword compliant enough that I think even the non-t17:30
SpamapSjimbaker: also sometimes it is as simple as un-sticking a stuck service with a gentle kill -9 ;)17:30
* niemeyer waves17:31
kim0koolhead17: It's Ensemble docs :) you can write there too17:31
koolhead17hello niemeyer :)17:31
SpamapSbcsaller: what was the format like? "un-conference" where people suggest what they'd like to discuss?17:31
jimbakerSpamapS, agreed for development all of these things are good (so ensemble ssh and other tools we will write). clearly we would hope that ad hoc needs are not necessary outside of dev17:31
koolhead17kim0: are we waiting for oneiric then we will put a community documentation page for ensemble?17:32
kim0koolhead17: not really .. docs are here https://ensemble.ubuntu.com/docs/17:32
bcsallerSpamapS: in this case yes, for example audience might collect a list of questions and then people that think they can answer one or more of those questions might join a panel to field questions from the audience 17:32
kim0koolhead17: I'm just used to the older place 17:32
koolhead17http://people.canonical.com/~niemeyer/ensemble/user-tutorial.html :P17:32
jimbakerSpamapS, i think the real issue was that "tim" might need to not only do some transient changes, but there would be a need to capture from production some permanent fixes too17:33
koolhead17haha17:33
SpamapSjimbaker: for instance, sometimes you want to restart all of your app servers because you changed a DNS setting and they're using a long TTL..17:33
kim0koolhead17: both are almost identical17:33
* koolhead17 waves to jimbaker 17:33
koolhead17kim0: ok17:33
bcsallerI was a little nervous getting up in front of a room full of people when the people on either side tended to be a little more business focused, but it worked out fine17:33
SpamapSjimbaker: yeah, in that instance "Tim" needs to learn how to use his tools better. :)17:33
* kim0 hugs Tim17:33
kim0hehe17:33
jimbakerkoolhead17, hi17:33
SpamapSbcsaller: alright that sounds great. :)17:33
bcsallerSpamapS: the whole thing worked pretty well17:34
SpamapSbcsaller: interesting.. the SD one is co-located with an event called "The Business of Cloud Computing" that is free for end users, but costs $1895.00 for "service providers"17:40
bcsallerThis was co-located with another conference as well. I think it's the pattern for cloud camps17:41
bcsallerdifferent messaging between the two as well. MS, IBM and Salesforce are really pushing PaaS at the CTO level 17:42
bcsallerbut that wasn'17:42
bcsallerwasn't so much what people were there to talk about 17:42
robbiewbcsaller:  another one in Mountain View -> http://www.devopsdays.org/events/2011-mountainview/proposals/17:43
SpamapSThe attendees list of this business of cloud is quite heavy w/ C level17:43
robbiewif you're interested ;)17:43
SpamapSCTO, CIO, etc. etc.17:43
SpamapSDevops days is *awesome*17:43
robbiewJune 17-18th17:43
robbiewcall for proposals deadline is tomorrow17:43
SpamapSthe one last year only had ignite talks and panels17:44
bcsallerrobbiew: I'll put a proposal in and see what happens17:45
robbiewsweet17:45
robbiewspreading the gospel!!!17:45
bcsalleryes, but I really need to start scheduling time to follow up with people after these things. 17:46
SpamapSDefinitely should bring them in here too. :)17:46
robbiewSpamapS: +117:47
SpamapSooops.. left my 8 node mediawiki running for 24 hours17:47
bcsallersome of the people I talked with might be too high level for this type of chat though17:48
bcsallerSpamapS: Ensemble audition ftw :)17:48
robbiewanyone in the Boston area?...interested in showing off ensemble thursday at a CloudCamp? http://www.cloudcamp.org/boston/2011-06-0217:49
SpamapSany idea when 'ensemble set' will land?17:49
* robbiew quits soliciting for talks....for now17:50
robbiewuuuahahahahaaa17:50
bcsallerSpamapS: re: set, it's working, should be merging one half of it today and the rest will be in review, it's almost all through the process 17:51
bcsallerSpamapS: we were joking that it was hard to come up with a simple user visible example. I started with trying to change the blog title in wordpress and ouch, that was a can of worms, it really involves a mysql update and it spiraled out of control from there 17:52
SpamapSbcsaller: I'd think with a mysql update it should be *dead* simple17:54
SpamapSbcsaller: mediawiki has me writing PHP files all over the place. :-P17:54
bcsallerSpamapS: set wordpress blog-title="foo". Right, so now wordpress has to identify its relation to mysql but config option hooks are not built around relation context, I think if you follow that thread you'll see it gets silly fast17:55
bcsallerwriting a PHP file would be a single change with immediate results that don't cross relationship boundaries. might make more sense for this 17:56
* SpamapS exports the entire "OpenBSD" category from wikipedia to import into his mediawiki17:58
SpamapSbcsaller: right, this is where I think one thing missing from my current understanding of the model is ordering17:59
SpamapSbcsaller: I think we may need to be able to guarantee ordering of relations17:59
SpamapSthere's no sense relating website to a loadbalancer until all the required things are related18:00
bcsallerSpamapS: that's related but not quite where I was heading. 18:00
SpamapSand since add-relation doesn't block until the relation exists.. this can be a problem I think.18:00
niemeyerbcsaller: Glad to hear your talk went well18:01
SpamapSbcsaller: Right, I guess what I'm saying is, that setting requires mysql... so settings should error or block until relations are setup. At that point you'd at least know wordpress knows how to talk to mysql.18:01
niemeyerSpamapS: Ordering isn't entirely straightforward, as we discussed back in Cape Town18:01
bcsallerSpamapS: relation settings are not even readily available in the config-changed hook as that's not a relation hook18:01
SpamapSniemeyer: indeed.. the stacks may be the place to address that18:03
SpamapSI'm still trying to see if something solves my problem of not being able to relate to my slave database until after it has been related to the master18:04
SpamapSwhoa.. I just discovered the resolved command18:09
SpamapSsweeeet18:09
SpamapSheh.. now that I'm importing a massive amount of data into my mediawiki.. I find myself wishing I had a graphing system setup. :)18:13
* SpamapS looks into reconnoiter formula18:14
niemeyerWoot :)18:16
SpamapSniemeyer: what would be cool would be if the EAP program shared a single ensemble environment... so we could all relate all of our services to one another. ;)18:18
SpamapShave one EAP large instance for mysql for everybody's mysql needs. ;)18:18
niemeyerSpamapS: I'm not entirely sure other people would agree with that :-)18:20
SpamapSSo here's an interesting conundrum. I want all the machines that I spawn to direct their syslog service to one syslog machine... essentially a system issue, not a service issue directly...18:25
SpamapSi had one thought which is that formulas can have a "management" relation defined, which will run traditional config management type stuff like this.. and then a management formula that does all these tasks18:27
SpamapSthe other option is to just make it easy to drop in puppet/chef/cfengine to do these types of things18:29
bcsallerits not yet clear to me how that would fit into the lifecycle or why the current lifecycle can't perform those tasks18:30
niemeyerSpamapS: I'm not sure I get how that's any different from the other relations?18:31
niemeyerSpamapS: Why not a "syslog" relation?18:31
SpamapSniemeyer: because I'd have to define it for *every* formula in principia, and it would be the exact same program18:32
SpamapSergo, it is at a different level18:33
SpamapSIts basically a machine policy, not a service policy18:33
SpamapSso another option is to provide a machine analog for formulas18:34
niemeyerSpamapS: We have to think about it a bit more.. I think it is a service policy because syslog is a service, but I see your point regarding handling that comfortably.18:35
SpamapSyeah, the receiver is a service...18:35
bcsallerSpamapS: we did talk about something like that, and something else that was provider specific, but nothing is spec'd yet18:35
niemeyerSpamapS: There is a class of things which are related to the machine that need further thinking18:36
SpamapSI think making it as narrow and flexible as possible would be ideal. The thing I recall talking about was "ensemble deploy ... --policy-formula=X" which would deploy another formula on the machine as X18:36
SpamapSBut I recall that the implementation of that would be fairly disruptive18:37
niemeyerSpamapS: Yeah, that feels pretty close to ringing a bell18:38
SpamapSAnother one that might be a simpler stop-gap would be to allow specifying cloud-config data to add to the initial cloud-config18:39
niemeyerSpamapS: That sounds like something hard to get out of down the road18:42
SpamapSIt would at least be easier to migrate away from than AMI's, which is the other way I could see people solving it18:42
niemeyerSpamapS: We have to spend some time thinking about these issues, collecting a few use cases, and then sit down to design it properly and start implementing the barebones functionality18:43
niemeyerbcsaller, jimbaker, hazmat: Standup?19:31
bcsalleryeah19:31
hazmatsounds good19:32
jimbakerniemeyer, sure19:32
hazmathmm.. skype still segfaulting on me19:32
hazmathanging out in #ensemble on mumble19:33
niemeyerI'm happy to go with Mumble19:33
_mup_ensemble/expose-provision-service-hierarchy r252 committed by jim.baker@canonical.com19:33
_mup_Test corner case that service has been removed between watch and the watch function execution19:33
niemeyerNot sure for how long we'll be able to count with Skype either way19:33
* SpamapS deploys his munin formula..20:32
niemeyerWoah21:35
niemeyerSpamapS: How did it go?21:35
SpamapSniemeyer: working out kinks21:35
SpamapSniemeyer: its doing everything I told it to.. but munin isn't picking up my file in /etc/munin/munin-conf.d .. have to figure that out.21:36
SpamapSah! the dreaded type-o in the bash script problem :)21:38
niemeyer:-)21:41
niemeyerSpamapS: We need to compile bash!21:41
* SpamapS imagines niemeyer writing a Go parser for bash21:44
niemeyerSpamapS: Oh man.. I don't want to get anywhere near that21:45
robbiewlol21:52
_mup_ensemble/trunk r239 committed by gustavo@niemeyer.net22:06
_mup_Include examples as documentation.22:06
niemeyerhazmat: Packages churning22:09
hazmatniemeyer, awesome, thanks22:10
niemeyerhazmat: np22:10
SpamapShttp://ec2-50-17-114-201.compute-1.amazonaws.com/munin/22:12
niemeyer\o/22:19
SpamapSIt should gain mysql stats too.. qps, cache hits, etc.22:20
niemeyerhazmat: pkgs are up for all Ubuntu releases22:22
SpamapShrm I think I hit a weird bug in the agent22:28
SpamapShttp://paste.ubuntu.com/615517/22:28
SpamapSload got crazy high on the box, I think that actually may have been what caused the issue22:29
SpamapSit was streaming those stack traces22:29
_mup_ensemble/expose-provision-service-hierarchy r253 committed by jim.baker@canonical.com22:30
_mup_Don't ignore watch_exposed_flag problem, fix it22:30
niemeyerSpamapS: Yeah, that's a weird traceback22:31
niemeyerSpamapS: Do you have the top of it?22:32
_mup_ensemble/expose-provision-service-hierarchy r254 committed by jim.baker@canonical.com22:32
_mup_Merged trunk22:32
SpamapSniemeyer: should be in the agent log right?22:34
niemeyerSpamapS: Yeah22:35
niemeyerSpamapS: These two tracebacks are just a side effect of something else that happened earlier22:35
SpamapSniemeyer: I think the agent may have crashed22:36
niemeyerSpamapS: Yeah, the traceback certainly looks bad22:36
SpamapSthe new log is very small and from about 2 minutes before the traceback I pasted22:36
niemeyerSpamapS: In a weird way.. that's twisted complaining about something strange22:36
SpamapSahh I was running debug-log when it started...22:36
niemeyerSpamapS: Hmm22:36
SpamapShttp://paste.ubuntu.com/615520/22:38
SpamapSseems like it missed the top22:38
niemeyerSpamapS: There's actually some interesting info there that I missed earlier 22:39
niemeyerhazmat: Would you mind to have a look at this when you have a moment?22:39
niemeyerhazmat: The first traceback paste is within exists_and_watch, which is breaking due to the deferred being called twice22:40
hazmatniemeyer, looking22:40
niemeyerhazmat: I suspect it may have something to do with the recent refactorings 22:40
hazmatinteresting22:40
SpamapSif you guys want to login to the box or anything let me know22:40
SpamapSrelations don't seem to be working to the service anymore22:40
hazmatSpamapS, thanks22:40
niemeyerSpamapS: That's understandable, thanks22:40
niemeyerSpamapS: The watching within the agent is borked, so it'll not behave properly22:41
SpamapScan it be restarted or anything?22:42
hazmatniemeyer, not related to the refactoring, just a new event type i think that wasn't in the txzookeeper event mapping for pretty names, which got exercised by a log statement22:42
niemeyerSpamapS: Which agent was that22:42
niemeyerSpamapS: You can restart it either way, but may need some env setup22:42
hazmatSpamapS, it can be restarted by hand, but it's tedious, you have to set up the env from /proc/pid/environ and launch with the same cmdline22:43
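[Editor's note] The manual restart described above depends on recovering a process's environment and command line. A minimal sketch of that step, assuming a Linux /proc filesystem and that the old agent process still exists (a crashed process has no /proc entry left to read). The function names are hypothetical and are not part of Ensemble:

```python
from pathlib import Path


def split_nul(data: bytes) -> list[str]:
    """Split a NUL-separated /proc blob into strings.

    Empty fields are dropped, so a genuinely empty argument would be lost;
    that is acceptable for a sketch but not for a faithful relaunch.
    """
    return [part.decode("utf-8", "replace") for part in data.split(b"\0") if part]


def parse_environ(data: bytes) -> dict[str, str]:
    """Turn the contents of /proc/<pid>/environ into a dict of KEY -> value."""
    env = {}
    for entry in split_nul(data):
        key, sep, value = entry.partition("=")
        if sep:  # skip malformed entries that have no '='
            env[key] = value
    return env


def launch_info(pid: int) -> tuple[list[str], dict[str, str]]:
    """Read the argv and environment a running process was started with."""
    proc = Path("/proc") / str(pid)
    argv = split_nul((proc / "cmdline").read_bytes())
    env = parse_environ((proc / "environ").read_bytes())
    return argv, env
```

With the pair returned by launch_info, something like subprocess.Popen(argv, env=env) could relaunch the process with the same settings. Reading another user's /proc/<pid>/environ usually requires root.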
SpamapSah22:43
SpamapSreboot?22:43
niemeyerWe need to work on this22:43
niemeyerhazmat: new event type?22:43
hazmatniemeyer, its not a new type.. just one that wasn't previously mapped22:43
niemeyerhazmat: A deferred was called twice.. that sounds like wrong wired of deferred chaining22:43
niemeyers/wired/wiring22:43
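[Editor's note] Twisted raises AlreadyCalledError when a Deferred is fired a second time, which is the failure niemeyer describes. A stdlib-only sketch of the same fire-once contract, and of a watch callback that drops extra events rather than firing twice. OneShot and make_watch_callback are hypothetical stand-ins, not Twisted or txzookeeper code:

```python
class AlreadyFired(Exception):
    """Raised when a one-shot result is delivered a second time."""


class OneShot:
    """Fire-once result holder (a stand-in, not Twisted's Deferred)."""

    def __init__(self):
        self.fired = False
        self.result = None

    def fire(self, result):
        if self.fired:
            raise AlreadyFired("result already delivered: %r" % (self.result,))
        self.fired = True
        self.result = result


def make_watch_callback(one_shot, ignored_types=()):
    """Build a callback that delivers only the first non-ignored event."""

    def on_event(event_type, path):
        # Drop ignored event types and anything after the first delivery,
        # so the one-shot is never fired twice.
        if event_type in ignored_types or one_shot.fired:
            return False
        one_shot.fire((event_type, path))
        return True

    return on_event
```

Whether dropping the extra event is the right policy depends on the caller. The point of the sketch is only that a fire-once object needs a guard when more than one event can reach it.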
hazmatniemeyer, its caused by a SESSION_EVENT22:44
hazmatniemeyer, indeed that too22:44
niemeyerhazmat: How do you mean?22:44
niemeyerhazmat:   File "txzookeeper/client.py", line 393, in callback22:44
hazmatah.. actually no the logging is async22:44
hazmatFile "txzookeeper/client.py", line 79, in type_name22:44
hazmat    return self.type_name_map[self.type]22:44
hazmatexceptions.KeyError: -122:44
niemeyerhazmat: This is within exists_and_watch22:44
SpamapSOk well ec2-50-17-114-201.compute-1.amazonaws.com is the hostname, I added keys for hazmat and niemeyer (from launchpad)22:44
hazmatah.. there are two tracebacks posted22:44
SpamapSStill haven't grabbed that lunch.. ;)22:45
* SpamapS runs out for it22:45
niemeyerSpamapS: Enjoy, and thanks!22:45
hazmatniemeyer, i was referring to the second traceback22:45
hazmatjust looking at the first22:45
hazmathmmm.. the tailspin might have been a recursive error logging loop22:46
niemeyerhazmat: Thanks22:46
niemeyerhazmat: I'll leave that with you.. feeling very sleepy right now22:47
hazmatniemeyer, get some sleep.. and thanks for setting up the ppa, much nicer to demo now22:47
niemeyerhazmat: Will do, and np22:47
niemeyerLaters22:48
_mup_txzookeeper/fix-event-type-name-mapping r36 committed by kapil.foss@gmail.com22:55
_mup_add some missing events to the event type name mapping22:55
hazmathmm.. this is a bug in the ensemble watch usage.. http://zookeeper-user.578899.n2.nabble.com/watcher-semantics-for-session-events-in-the-C-client-td6206081.html23:00
hazmatnot problematic for this particular case, which is fixed by the above commit23:00
hazmathmm. i guess it's not really an issue since all of our watches refetch current state23:00
hazmatbut it is an additional event firing which they mostly don't account for as spurious to their watch intent23:01
hazmatniemeyer, jimbaker, bcsaller ^23:01
hazmatso given our usage its fine, given that we check state23:04
hazmathmm.. actually we don't do it correctly, since in this case we also reset the watch23:05
hazmathmmm23:08
hazmatit looks like we ran into this already.. http://comments.gmane.org/gmane.comp.java.hadoop.zookeeper.user/195123:15
hazmatbut we never addressed it in our usage afaics23:15
hazmathmmm.. this feels like something we should set on the client23:21
hazmatbcsaller, jimbaker could i get a +1 on this trivial.. http://paste.ubuntu.com/615542/23:25
bcsallerhazmat: that's just mapping constants?23:26
hazmatbcsaller, yup23:26
bcsaller+123:26
_mup_txzookeeper/trunk r38 committed by kapil.foss@gmail.com23:29
_mup_Add some missing constants to the client event type mapping. [trivial][r=bcsaller]23:29
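[Editor's note] The KeyError: -1 in the traceback above came from looking up an event type that had no entry in the name mapping. A minimal sketch of a complete mapping, using the event type values from the ZooKeeper C client header. The class, the mapping name, and the fallback for unknown types are illustrative assumptions, not the contents of the txzookeeper commit:

```python
# Event type values as defined by the ZooKeeper C client (zookeeper.h).
CREATED_EVENT = 1
DELETED_EVENT = 2
CHANGED_EVENT = 3
CHILD_EVENT = 4
SESSION_EVENT = -1
NOTWATCHING_EVENT = -2

# Hypothetical pretty-name table covering every type above.
TYPE_NAME_MAP = {
    CREATED_EVENT: "created",
    DELETED_EVENT: "deleted",
    CHANGED_EVENT: "changed",
    CHILD_EVENT: "child",
    SESSION_EVENT: "session",
    NOTWATCHING_EVENT: "notwatching",
}


class WatchEvent:
    """Hypothetical event record with a printable type name."""

    def __init__(self, type, path):
        self.type = type
        self.path = path

    @property
    def type_name(self):
        # .get() with a fallback keeps a log statement from raising
        # KeyError if an unmapped type ever shows up again.
        return TYPE_NAME_MAP.get(self.type, "unknown(%d)" % self.type)
```

The fallback is a defensive choice made for this sketch: a logging helper that raises can turn one unexpected event into a stream of tracebacks.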
SpamapShazmat: any progress on that problem? I am going to tear down the box if you don't need it23:31
hazmatSpamapS, i committed a fix for the immediate cause (that change to txzookeeper), i'm still trying to figure out if we're handling the event that was received properly or not (the error was from printing the event), it looks like that machine got disconnected from zk for a little bit23:33
hazmatSpamapS, feel free to tear down the machine23:33
SpamapShazmat: thanks :)23:33
