/srv/irclogs.ubuntu.com/2011/02/18/#ubuntu-cloud.txt

=== dendrobates is now known as dendro-afk
wolfricare there any videos around of a controller assigning images to nodes and making them work?00:31
wolfricAlso - any more elaborated documentation on the matter other than the community pages?00:31
RobertLaptopI have a question regarding https://help.ubuntu.com/community/UEC/Topologies it shows breaking systems out with some systems labeled just NC.  The question is does the system running other services not showing NC run VMs or is it just for management stuff?03:46
willkesslerhello, i'm having a cpu load problem under medium network traffic on 10.04LTS, large instances, can anybody help me?05:29
flaccidwhat is using the cpu?05:29
willkesslerapache05:29
flaccidthen that is your problem05:30
willkesslerit would make sense, except that traffic to the app does not significantly change05:30
flaccidhow does that conclude anything05:30
flaccidyou could have 0 traffic and apache do 100%05:30
willkesslertrue; however, the application responds pretty much the same way to everybody that uses it. it's a social game, and tens of 1000's of players play it all day05:31
willkesslerhowever, at certain times of the afternoon load will surge in apache without the app doing anything different per se05:31
flaccidsounds normal to me05:32
flaccidits your problem to put it simply05:32
willkesslernot serving more users, different game play, etc. the same at 10am as at 4pm. but between 3 and 6, the normal load for its traffic (3) spikes to 5005:32
flaccidyou can't expect the open source community to debug your app and shit05:32
willkesslerif it were a ton more players, i would think apache would become more allocated (looking at /server-status)05:32
flaccidyeah bad assumptions once again05:33
willkessleri don't expect that, certainly05:33
willkesslerand why is that bad assumptions?05:33
flaccidbecause you don't need a load of users to have a mem and/or cpu leak or other indirect issue05:33
flaccidit does not look like the problem with your app is anything to do with aws or ubuntu05:34
willkessleri am merely hoping to hear that either a) this release of ubuntu has issues with high network load and to use something different or b) load on ec2 instances can vary even without your instance doing any work at all, since i've done a ton of debugging on the app and do not see an issue except when the app is in the cloud05:35
flaccidkeen to hear any evidence to suggest otherwise05:35
willkesslerfor example, the app has been heavily loaded outside the cloud by browsermob, without issues05:35
flaccidyeah sorry, not a good hope.05:35
willkesslerflaccid: hmm, ok, i'll take your feedback as a valid single data point05:36
flaccidi do hope that other users here come out with other responses..05:36
willkesslera hope i share as well :-)05:36
willkesslerflaccid: as regards your suggestions, certainly there could be a ram leak, but i don't see evidence of it using the usual tools (top, iostat, free, etc)05:38
willkesslerflaccid: swap is never allocated, e.g.05:38
willkesslerflaccid: as far as CPU leak, also possible, i don't expect the community here to debug anything like that. however, again, if code doesn't change hour over hour yet system behavior does, it leads one to suspect OS or metal05:39
flaccidcan you show stats at least?05:42
willkesslersure05:43
willkesslerflaccid: which stats would you want to see05:44
willkesslerflaccid: i have some output from iostat, for example05:44
flaccidwell05:46
flaccidwhats the actual issue first05:46
flaccidcpu usage?05:46
willkessleryes...05:46
willkesslerwell, load overall, causes tasks that take say 300ms to return to take 1500ms, etc05:47
flaccidis top basically correlating to the load reported in w05:47
willkessleryes, it does exactly05:47
flaccidwell what can i say, apache is using that cpu05:48
flaccidit doesn't look like a load accounting bug05:48
flaccidhow about you try maverick?05:48
willkesslerit definitely is the app but what i don't understand is, why the app doing the same thing it does all day, should suddenly demand so much more cpu05:48
flaccidnobody here can answer that question05:48
willkessleris maverick production ready?05:48
flaccidsure05:49
flaccidno ubuntu is production ready05:49
flaccidthats the nature of it05:49
flaccidbut use it if you like05:49
willkesslerok, i will give it a tryout then, thank you05:49
flaccidi'd be curious to see if the same thing occurs on debian 605:53
willkessleri might just switch to centos after reading this posting: http://alestic.com/2010/10/ec2-ubuntu-maverick#comment-48405:56
flaccidwillkessler:  we offer alternate builds @ RightScale if you want to try them out05:57
flaccidthat post seems a little speculative05:57
flaccidwell it is. you don't report bugs on blog comments for a start.05:58
willkessleras was pointed out05:59
willkesslerwhere do those rightscale builds live?05:59
flaccidwillkessler: http://support.rightscale.com/18-Release_Notes/OS_and_Software_Package_Support06:00
flaccidim an engineer at rightscale, so please feel free to join us in #rightscale if you want06:00
willkesslerdo i need to be a customer?06:00
flaccidno06:00
willkesslercertainly, rightscale has gotten high reco's from my colleagues, and if we could afford it , i'd probably talk to you via the front door... but not quite yet06:01
flaccidrightscale is free unless you would like a subscription06:02
willkesslerwhere can i read more about rightscale/free? that same site?06:04
willkessleri'll just look at the links, nvm06:05
willkesslerthanks and good night06:05
flaccidwillkessler: http://support.rightscale.com/03-Tutorials/01-RightScale/1._Signing_Up_for_RightScale/Sign-up_for_a_Free_RightScale_Account06:06
flaccidgood night then.06:06
RobertLaptopQuestion I selected the wrong options during install is there a way to re-run the install script?06:07
flaccidwhat install?06:07
RobertLaptopI selected CLC, Walrus, on the wrong box.06:08
superxglhi all, i wanna ask a question, now in every node i use Wget to get the ip of the node, and i want these IPs to be sent to the cc, how should i do that?12:35
superxglin nodes, i use cron to execute a shell script to run Wget every minute12:35
superxgland i want it to report to the cc12:36
TeTeThow would I specify --addressing private for uec-run-instances? When I run 'uec-run-instances --attach-volume=$vol -- --addressing private $emi' I get an error. I use the TREllis modified cloud-utils packages for support of attaching volumes12:36
superxglif the cc does not receive the ip of a node for more than 3 mins, i will think the node died and start a new one...12:37
superxglTeTeT:  hmm.. i also use --addressing private12:38
TeTeTsuperxgl: do you use euca-run-instances or uec-run-instances? euca-run-instances works fine12:38
superxglTeTeT: i use euca-run-instances, hmm.. don't know then ..12:39
TREllisTeTeT: --run-args=RUNARGS    pass option through to run-instances12:40
TeTeTTREllis: doesn't seem to work either, http://pastebin.ubuntu.com/568724/12:42
TREllisTeTeT: hmm I see the same on trunk12:45
TeTeTTREllis: ok, this is the syntax: uec-run-instances -v --attach-volume=$vol $emi --run-args="--addressing private"12:46
TeTeTnote the quotes around the run-args, tried it without the -- and the quotes first12:46
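A small sketch of why the quotes matter, assuming a POSIX-style shell: with the quotes, `--run-args=--addressing private` reaches the tool as a single argument; without them, `private` is split off as a separate positional argument. The volume and image IDs below are hypothetical placeholders, and the snippet only tokenizes the command line with Python's `shlex`; it does not run `uec-run-instances`.

```python
import shlex

# Hypothetical placeholder IDs, for illustration only.
quoted = ('uec-run-instances -v --attach-volume=vol-EXAMPLE emi-EXAMPLE '
          '--run-args="--addressing private"')
unquoted = ('uec-run-instances -v --attach-volume=vol-EXAMPLE emi-EXAMPLE '
            '--run-args=--addressing private')

# shlex.split follows POSIX shell quoting rules by default.
quoted_argv = shlex.split(quoted)
unquoted_argv = shlex.split(unquoted)

print(quoted_argv[-1])    # the whole pass-through option as one argument
print(unquoted_argv[-2:]) # the option split into two separate arguments
```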
TREllisI normally put the emi at the end, but testing here that doesn't work either12:47
TeTeTsuperxgl: hmm, you try to detect if the nodes are still up and running?12:47
TeTeTTREllis: request for improvement: check if the volume is in an attachable state ;) if it's already been attached to a running instance, print an error or warning12:49
TREllisTeTeT: hmmm I tried with a different runargs and it worked -k mykey12:51
TeTeTTREllis: but otherwise, works very nicely! Thanks for providing that option, will introduce it in my next class :)12:51
superxglTeTeT : yes..i call it heartbeat, if the cc does not receive the heartbeat for more than 3 mins, i would think this node died...12:51
TeTeTsuperxgl: why don't you use an existing heartbeat system then?12:52
superxglTeTeT: Is there one?12:52
TREllisTeTeT: did you get the run-args working then?12:53
TREllisTeTeT: thanks for the suggestion, thats one on the list :)12:54
TeTeTTREllis: yes, got it working with the quotes and the --12:54
TeTeTsuperxgl: please have a look at 'apt-cache show pacemaker'12:54
TeTeTsuperxgl: there's also https://wiki.ubuntu.com/UbuntuHighAvailabilityTeam/PacemakerHeartbeat for more doc on how to get going12:55
TREllisinfact, RoAkSoAx has been working on a resource agent for the CLC too12:56
superxglTeTeT: tnx:) go and checking now ..hmm.. i am really a newbie to this..12:59
TeTeTsuperxgl: no worries, we've all been newbies once12:59
superxglTeTeT : :)13:01
superxglTeTeT:hmm...PacemakerHeartbeat is very cool :)  but since it is my first time to do something like this, now i want to do it myself. my thought is the nodes do a Wget like http://cc.com/?thisismyip=192.168.1.2 , that is the IP is embedded into the GET parameter, so that the cc can get its ip address..13:51
superxglanyone here have any idea?13:51
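A minimal sketch of the bookkeeping superxgl describes, not a replacement for Pacemaker: the cc records the time of each heartbeat per node IP and treats any node that has been silent for more than 3 minutes as dead. The class and method names are hypothetical, and the HTTP side (reading the `thisismyip` GET parameter) is left out; whatever handles the request would simply call `record()` with that IP.

```python
import time


class HeartbeatTracker:
    """Remember when each node last reported in and flag the silent ones."""

    def __init__(self, timeout_seconds=180):
        self.timeout_seconds = timeout_seconds
        self.last_seen = {}  # node IP -> timestamp of its last heartbeat

    def record(self, ip, now=None):
        # Called whenever a heartbeat request arrives from a node.
        self.last_seen[ip] = time.time() if now is None else now

    def dead_nodes(self, now=None):
        # A node counts as dead once it has been silent longer than the timeout.
        now = time.time() if now is None else now
        return sorted(
            ip for ip, seen in self.last_seen.items()
            if now - seen > self.timeout_seconds
        )


# Timestamps are passed explicitly here so the example is deterministic.
tracker = HeartbeatTracker(timeout_seconds=180)
tracker.record("192.168.1.2", now=0)
tracker.record("192.168.1.3", now=100)
print(tracker.dead_nodes(now=200))
```

A periodic job on the cc could call `dead_nodes()` and start replacement instances for whatever it returns.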
RoAkSoAxTeTeT: that documentation is outdated13:58
RoAkSoAxsuperxgl: you might be interested in this: http://www.roaksoax.com/2010/10/high-availability-uec-clc-howto13:59
RoAkSoAxTREllis: and as a matter of fact, the RA for the CLC is ready. I'll publish it later today14:01
superxglRoAkSoAx : tnx very much :) go and checking now:)14:02
TREllisRoAkSoAx: cool :D nice work14:09
RoAkSoAxTREllis: the hardest thing was to test it lol!14:10
TREllisRoAkSoAx: heh14:11
TeTeTRoAkSoAx: great, thanks for the update!14:29
RoAkSoAxTeTeT: wiki.ubuntu.com/ClusterStack/LucidTesting is the most updated, which still stands for natty14:31
TeTeTRoAkSoAx: ok, will bookmark it, superxgl might be interested in it as well14:36
RoAkSoAxTeTeT: you might also be interested in http://www.roaksoax.com/2010/10/high-availability-uec-clc-howto (I'll update it later today with a better way to monitor the CLC though)14:38
superxgli am really interested in part: Load Balancing with Pacemaker/ldirectord :)14:44
RoAkSoAxsuperxgl: ah!! I have one how-to for that I just haven't made it public yet14:45
superxglRoAkSoAx: i'll wait for it then :)14:55
smoserSpamapS, awake ?15:48
=== dendro-afk is now known as dendrobates
=== dendrobates is now known as dendro-afk
=== dendro-afk is now known as dendrobates
SpamapSsmoser: only have a couple minutes.. wassup?17:59
smoseri had some upstart questions.18:00
smoserhold on. let me get a pastebin18:00
smoserSpamapS, http://paste.ubuntu.com/568842/18:01
SpamapSsmoser: ok.. whats the ?18:02
smoserso i'm trying to force the order of those 3 things18:03
smosercloud-init-local -> cloud-init-nonet -> cloud-init18:03
SpamapSah ok18:04
SpamapSsince they're all tasks..18:05
SpamapSstopped and started are equivalent18:05
SpamapSstart on stopped cloud-init-local  ... start on stopped cloud-init-nonet ...18:05
SpamapSalso since they're tasks..18:07
SpamapSyou can do this (I think this is evil but it may be necessary for fine grained control)... have a task that just does18:07
SpamapSstart cloud-init-local ; start cloud-init-nonet ; start cloud-init18:07
SpamapSnote that && would maybe be better ;)18:07
smoserso do you not think what i have there would work ?18:08
SpamapSit should actually18:12
SpamapSthe mounted is tricky tho18:12
SpamapSI'd only do it on the first one18:13
SpamapSunless you MUST block mounting / on all 3?18:13
SpamapSor rather, mounting anything except mount18:13
SpamapSmounted blocks mountall18:13
smoserwell, we want to block.18:39
SpamapSsmoser: just make sure those tasks *always* finish or the system won't boot18:40
SpamapSmostly because filesystem won't fire .. once / is mounted a lot is already done :)18:41
smosernot really.18:41
smosernot much is done on MOUNTED /18:41
smosermost stuff requires filesystems18:41
SpamapStrue.. like rc-sysinit18:54
smoserand ssh, just about everything. and really you shouldn't start before that18:55
smoseryou're not guaranteed that tmp files in /tmp wont get deleted while you write them18:55
jmgallowaycan someone tell me where the uec website is located on the cloud controller?18:58
smoserSpamapS, shoot. my plan doesnt work19:11
smoseri think i know why19:11
SpamapSsmoser: ?19:12
smoserhttp://paste.ubuntu.com/568842/ doesnt work. i get19:12
smoser  cloud-init-nonet waiting for a network device.19:12
smosercloud-init-nonet gave up waiting for a network device.19:12
smosereven though there was a network device19:12
smoseri think the problem was that eth0 came up before cloud-init-nonet was running19:13
smoser(ie, while cloud-init-local was running)19:13
smoserSpamapS, make sense ?19:15
TeTeTjmgalloway: /etc/eucalyptus/cloud.d/www defines admin.xml, which refers to /var/lib/eucalyptus/webapps/root.war I think19:21
jmgallowayoh thanks, I'll take a look there.  I'm running apache and another website on my cloud controller..I just wanted to know if I could customize the web gui uec already has19:23
SpamapSsmoser: right so you also need to create another task that emits an event on net-device-up or a timeout19:27
smosergetting messy :-(19:27
SpamapSnetwork detection is messy19:27
SpamapSwait is nonet supposed to figure out if there is or isn't a network?19:27
smoseryeah.19:27
SpamapSso make a task that sends it a SIGUSR1 every time net-device-up fires19:28
SpamapSand handle that event by checking again19:28
SpamapSs/event/signal/19:28
SpamapSthats how mountall does it19:28
smoserthe problem is that network comes up before it runs.19:29
smoseri think19:29
SpamapSso its failing to detect that fact.. sounds like it should do/while rather than while .. ;)19:30
smoser?19:30
smoseri was trying to force order... cloud-init-local -> cloud-init-nonet -> cloud-init19:30
smoserbut (i think) the network comes up while cloud-init-local is running19:31
smoserso cloud-init-nonet does not get the 'net-device-up' to stop it19:32
smoserand so it waits until timeout19:32
SpamapSRight so you need to check if its up *first*19:32
SpamapSwhile (!netup) sleep();19:33
SpamapSwhile (!netup()) sleep();19:33
smoserhow do i check if netup?19:36
smoserSpamapS,19:37
smosersuggestions?19:37
SpamapSsmoser: ifconfig.. /sys .. /proc .. not sure actually ;)19:37
smoseryeah.19:37
SpamapSthere may be a single command that does it19:38
smoserSpamapS, looks like ifquery might hold some information ... and also that /var/run/network/ifstate might just be sufficient19:54
SpamapSsmoser: yeah that sounds like you're on the right path.19:55
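A rough sketch of the check-first idea discussed above: test whether the network is already up before waiting, so a device that came up earlier (for example while cloud-init-local was still running) is not missed. This is not how cloud-init-nonet is implemented; the function names are hypothetical, and reading `/sys/class/net/<iface>/operstate` is only one of the candidates mentioned (some drivers report `unknown` there, so treat it as an assumption).

```python
import time
from pathlib import Path


def interface_is_up(name, sys_net=Path("/sys/class/net")):
    """Return True if the kernel reports the interface's operstate as 'up'."""
    try:
        return (sys_net / name / "operstate").read_text().strip() == "up"
    except OSError:
        # Interface does not exist (or /sys is unavailable): treat as down.
        return False


def wait_for_network(is_up, timeout=30.0, interval=1.0,
                     sleep=time.sleep, clock=time.monotonic):
    """Check first, then poll until is_up() is true or the timeout expires."""
    deadline = clock() + timeout
    while True:
        if is_up():              # the check happens before any waiting
            return True
        if clock() >= deadline:  # give up after the timeout
            return False
        sleep(interval)


# Example call (not executed here):
#   wait_for_network(lambda: interface_is_up("eth0"), timeout=60)
```

Passing `sleep` and `clock` as parameters is only there to make the loop easy to test without real delays.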
SpamapSgotta go afk.. bbl19:55
=== dendrobates is now known as dendro-afk
=== dendro-afk is now known as dendrobates
=== dendrobates is now known as dendro-afk
RobertLaptopQuestion how do you define more than 1 DNS server?  VNET_DNS="192.168.100.213" do you add 2 lines or 1 line with a comma or a space?21:26
smoserhggdh, ping21:41
hggdhsmoser: pong21:57
smoseri think i don't need you now, hggdh21:58
hggdhsmoser: k21:59
smoserand i'm done with rig21:59
hggdhsmoser: thank you, I will reset it for Daviey22:02
=== dendro-afk is now known as dendrobates
=== dendrobates is now known as dendro-afk
=== dendro-afk is now known as dendrobates
=== dendrobates is now known as dendro-afk
