[03:58] hey guys, got a question - i had an AMI in one region and used cloudyscripts "copy AMI to another region" thing to copy it. but when I start the new instance, I can't connect - ssl times out. Is there some step i'm missing? I can start copies of it in the original region and connect no problem. [04:01] (and I am running a modified 10.10 server AMI) [06:46] ernop: my first question would be why are you using a modified AMI? (keep in mind, I'm a novice, but I only ever use cloud-config customized instances) [07:13] kind of lost of where to start to get an image dl'd from http://uec-images.ubuntu.com/releases/ to run [07:13] any ideas what I should check? [07:22] SpamapS - by modified I just mean that I have installed a bunch of my own stuff on it. Thing is, it worked in the original region. [07:40] Makere: meaning downloaded to run in UEC (as in, eucalyptus) ? [07:42] ernop: yeah, I think I'd do that by putting all of my stuff in a dir and tarring that dir up, or use a proper config management system like puppet/chef. [07:42] SpamapS: yes [07:43] and used the uec-publish-tarball [07:43] to push it there [07:53] fml [08:43] SpamapS ok, good idea. === nijaba is now known as nijaba_afk === nijaba_afk is now known as nijaba [11:47] So I tried creating a new uec image using this guide: http://cssoss.wordpress.com/2010/05/10/eucalyptus-beginner%E2%80%99s-guide-%E2%80%93-uec-edition-chapter-4-%E2%80%93-image%C2%A0management/ [11:48] the instances get stuck on pending, any troubleshooting tips? [13:23] why the store images run [13:23] but I can't get my own images to run [13:24] nor images from uec-images.ubuntu.com [13:43] ok [13:43] the problem I had was that it doesn't work if I bundle on cloud controller === zul is now known as ep === ep is now known as zul [14:16] Makere, i would doubt that its related to where you bundle [14:17] remember that when you do a run-instances after uploading, the first time it will have to copy from walrus to node, change format of image (adding a partition table and such), before running [14:17] that is a lot of IO and will be very slow for something large [14:19] you can debug a bit more by figuring out where your instance is going to run (euca_conf --list-nodes) and then looking at the node. top will show you some things. likely there is something in /var/log/eucalyptus or /var/lib/libvirt [14:52] smoser: well I waited for an half an hour, kvm doesn't run on the node [14:53] node logs don't have any errors [14:53] hm.. [14:54] I have nc log upped somewhere [14:55] if you want I cna put it to pastebin [14:55] but personally I think to myself as case closed, and will not compile on the cloud controller anymore [14:55] bundle* not compile [14:56] it worked charmed after bundling on cluster controller [14:56] like a charm* [14:56] sorry, a bit tired [14:57] I have walrus and cloud controller on same machine [14:57] and cluster controller + storage controller on other two [14:57] plus nodes [14:57] well, for what its worht, i have a cloud (10.04 right now, but i've used 10.10) with just 2 systems [14:57] one is CC CLC and SC, one is just NC [14:57] and i do everything from the CC/CLC/SC node. [14:58] I think it might have something to do on cloud/walrus controller not having CC on same machine [14:58] it won't make any difference, but if you find the bundle and upload process to be a bit obtuse, you can use 'uec-publish-tarball' or 'uec-publish-image' === dendrobates is now known as dendro-afk [14:59] I did my own image with the publish-image [14:59] same thing as with publish-tarball on official images [14:59] ah. ok. [14:59] just wanted to make you aware that is there. [14:59] yea thanks [14:59] once you've uploaded, i seriously can't see a difference in where the bundle was done === dendro-afk is now known as dendrobates [15:00] after bundle, its already loaded into walrus, and the NC will do the get from walrus. no path there would be affected by where the 'bundle' command was run. [15:00] http://open.eucalyptus.com/forum/i-still-can-only-run-instances-store-images same problem here [15:01] found the answer from there [15:02] although in the "answer" I got the guy seems to have run CLC CC SC and Walrus on same PC [15:02] ("Ok, so I really don't know" by seigie) in the middle [15:03] weird thing is that I/We reinstalled the cloud 4 times [15:03] and everytime same problem [15:03] with diff settings etc [15:04] ah well, just wanted too put a heads up if anyone gets same kind of problem [15:04] you get the errors like that in the logs ? [15:05] don't have the logs here now [15:06] well, the right thing to do is open an ubuntu bug [15:06] yea [15:06] so, go through the steps again [15:06] writing down what you do and your setup (CC/CLC and such) [15:06] see the failure [15:06] then run 'ubuntu-bug eucalyptus' on the system that has those ERROR messages in the logs [15:07] sorry to give youthe run-around, but in the end, that is what is most likely to get the problem solved for you. [15:31] hello there, anybody? [15:32] I am trying to setup a cloud test environment and I managed to get it to work, but it gives random network problems when multiple users run multiple instances. The Eucalyptus team told me that it is a UEC specific problem and I would need some help to troubleshoot. Is there anybody willing to help? [16:23] TritoLux_: can't promise anything, have you tried connecting via public or private ip? And are the instances really up and running? You can check that with euca-get-console-output [16:25] hi there, yes everything seems to work fine, I can administer the instances via hybridfox with no problems, apparently.. but when a second user starts his own instance, then the other users loose the attached volumes from the SC and they even have to reboot their instance in orger to regain connectivity [16:26] after rebooting the instances, then nobody can detach the previously attached volumes and I have to reboot the cloud to restore functionality, until the same problem occurs again [16:26] thanks for replying btw [16:30] I have been troubleshooting this problem with the Eucalyptus team, but after a couple of months they are still puzzled and they said that it is UEC specific [16:31] TritoLux_: sounds bad. I would recommend opening a bug about this in Launchpad against the eucalyptus package [16:32] the Eucalyptus team suspects AoE to be the problem [16:33] for instance, when I launch an instance, I get this error message in the livirt log: pci_add_option_rom: failed to find romfile "pxe-e1000.bin" [16:33] for instance, when I launch an instance, I get this error message in the libvirt log: pci_add_option_rom: failed to find romfile "pxe-e1000.bin" [16:33] I wonder if it is related [16:34] I also removed the AppArmor security layer in order to see if it was causing the problem, but pretty much nothing has changed [16:36] as a matter of fact, I was never able to have a stable cloud based on UEC [16:39] thanks for your responses TeTeT, I will eventually shoot a bug in Launchpad, but if anybody else has any advices in the meanwhile that would great. I can provide you with all logs and configs of course [16:41] TritoLux_: I think the error is unrelated. it goes away when you add some kvm-pxe package [16:41] TritoLux_: btw are you on 10.04 LTS or on 10.10? [16:41] I started with 10.04 LTS and I performed a dist-upgrade [16:42] it is currently running with all latest available packages [16:44] the environment is set with 2 machines, one running CLC, CC, SC, Walrus and the other one running the NC [16:45] both have dual NIC's, the CLC uses a public NIC and the CC is installed on a private NIC [16:46] eucalyptus.conf is set to run both priv and public traffic on the bridge configured on the non public interface [16:47] the nic's are configured for multiclustering, even if it is running only one at the moment [16:48] the eucalyptus guys had a look at the network config and they said that it is supposed to work fine [16:52] however, because both public NIC and private NIC are configured with a static address, then avahi autoregistration failed during UEC installation and it cost me some troubles to arrange a properly registered node. Now the node registration seems fine, but I wonder if the initial failure might have caused more problems [16:52] the network mode is MANAGED-NOVLAN [16:53] I have no error messages in the logs, until the above problem occurs [16:53] TritoLux_: I register nodes from hand all the time and have not discovered your problem. But then I just do training and not some pre-production tests [16:54] if you try to install a cloud where the CC is running on a private NIC with a static IP then you will reproduce above registration problem [16:55] so to have a main IF on the public NIC and the nodes to be discovered on a private NIC, both with static IP's [16:57] the UEC installation process allows you to choose a static IP on the main NIC, but the same question is not asked for the secondary NIC and it assumes DHCP on that one [16:57] then autoregistration fails [16:58] the above problems are preventing us to start our cloud business at the moment and switching to Debian is currently being taken into consideration to be honest [17:04] basically, it seems that I cannot run more than a customer onto an apparently working cloud, otherwise the entire network gets messed up, as in connectivity getting lost, attached volumes becoming unaccessible, etc. rebooting the cloud seems the only solution to restore funzionality again.. for a single user, until a second one starts another instance. [17:06] I have tried to run more instances from a single user and it seems to work fine. The problems occur mainly or only when a different user starts to interact with the cloud. [17:09] TritoLux_: not sure if the multi user scenario is well tested. I use it in the trainings, but this cloud only exists for a week [17:10] I think that multiuser is essential to a cloud, otherwise we miss the point of having a cloud. But I think that the multi-nic scenario is even less tested. [17:11] at least when you have both IF's with static IP addresses [17:13] my idea is to have just the CLC facing the public and the rest of the cloud interacting through a private network so that the public cannot even reach the other nodes [17:16] TritoLux_: sound idea [17:18] eheh, it might be a good idea, but I wasn't able to get it to work in practice so far, unfortunately [17:18] TritoLux_: I believe it's a scenario that should be supported [17:19] I am quite sure it is supported indeed, but it hasn't worked for me so far [17:19] and I dunno why [17:21] as we speak, two more boxes are being setup for a debian testing environment which will be configured in the same way, so to see if it is really an UEC problem or not.. thing is that Debian alred [17:22] good luck with that [17:22] as we speak, two more boxes are being setup for a debian testing environment which will be configured in the same way, so to see if it is really an UEC problem or not.. thing is that Debian already runs Eucalyptus 2.0 [17:22] as does maverick [17:22] and UEC is running 1.6.2 [17:22] Maverick is not the dist I have then [17:23] TritoLux_: hmm, you really should try it then, I believe [17:23] TritoLux_: it's not an LTS, so support will run out in 18 months time though [17:23] I am running the LTS distribution [17:24] that's 10.04 LTS [17:24] yes [17:25] I thought that the LTS was more stable though, that's why my boss set it as a requirement [17:25] or at least better tested [17:27] TritoLux_: it basically is, but with UEC it's still a pretty new product and big improvements are made with each release [17:27] that's understandable [17:27] TritoLux_: the biggest step for 10.04 LTS in regards to UEC was IMO the neat installation via the CD [17:28] TritoLux_: I know there have been made all kind of tests for stability, but I doubt the multi user and multi nic scenario is one of them [17:29] I think that the multiuser and multi nic scenario would be one of the most requested for production environments though [17:32] if I remember correctly, the initial tests we run when we approved UEC were run on a single NIC and everything went pretty much ok, but when we had to go in production environment, then things changed of course and more requirements were set.. but UEC didn't work well anymore [17:32] when do you think that Euca 2.0 will be available for LTS? [17:33] TritoLux_: not sure, unless there's a strong business case probably never. I don't see anyone putting resources into a backport [17:34] auch, sorry what's the advantage of running LTS then? [17:35] TritoLux_: security updates for 5 years on the server. It's not about getting the latest software [17:35] TritoLux_: it's just my 2 cent, but I'd evaluate UEC on Maverick, upgrade that on a six month base and then go with the next LTS for long term production deployment [17:36] TritoLux_: but not sure if that meets your project specs [17:36] mmh.. I thought it was both [17:36] that's a good suggestion indeed [17:37] thanks for your help, I will have to discuss this with my colleagues and better explain the scenario [17:38] there might have been misunderstandings on the LTS concepts [17:38] TritoLux_: good luck, couldn't provide much though [17:39] I appreciate your response anyway, thanks for your effort nevertheless [17:39] it's not a simple issue anyway [17:40] agreed [17:41] I'll go for a break now, speak you soon maybe [17:41] thanks again [17:41] will be off soon, bye [17:42] bye bye === zul_ is now known as zul === xfaf is now known as zul [19:26] erichammond, you still unable to find anyway to update ami pages, right? [19:27] it sure seems like they're just in flux, but i opened a ticket and reopened several times. iwas told that to edit a page i should "submit an ami" and mention that it is an update to an existing page in the body. [19:37] anyone had any luck using the EC2 PHP SDK with euca/UEC? [21:18] smoser: I guess I've given up on Amazon's AMI pages since I haven't updated them in forever. [21:18] i'm somewhat peeved at them over it. [21:18] smoser: I guess I should put in one more update that just says "this AMI has been superceded by the new recommended official Ubuntu AMIs..." [21:18] i recently made an effort on them, to move out of the canonical account (the one that owns our images) to another [21:18] so that people who updated those dont have to know credentials that could cause issues. [21:19] but now, i can't even update the ones we have [21:23] :( [21:24] i really think that they're probably just in transition. visually there are changes. [21:25] but there is no external announcemnt of such a thing, and its been going on for the better part of a month [21:32] humm, smoser think I just discovered a bug in the UEC API .. Might be normal tho! You know sec group names on describe instances come back as "group" while on run_instances come back as "_group" .. [21:33] that the way EC2 does it? (I havent used their API) [21:34] what is uec_username ? [21:34] the username you use to login to the CLC web UI [21:35] i think its probably a bug. [21:35] so desc instances might return "default" while run will return "admin_default" when using the admin account ... [21:35] crap .. time to go digging again ;) [21:36] some things are different with admin [21:36] Also .. the only way I've gotton euca to build is on a PPA with updates turned off .. is there some trick to built it after you do a "apt-get dist-upgrade"? [21:36] Well, im not using the admin user .. just thought it was a good example ;) [21:38] if it fails to build from source, then please open a bug [21:38] and you're saying the API lists the user name ? [21:39] yes - the group name returned (assuming group = "app" and username = "kiall") by describe is "app" while run instance returns "kiall_app" [21:53] kiall, i'd have to look at it more, but i've never noticed differing output in the utility [21:53] in euca2ools when working with ec2 verusus uec [21:53] so i'd expect that its not different from what comes back from the api [21:53] but i dont use security groups much [21:53] I'll dig in a little more and look at the raw output rather than what the PHP EC2 SDK is giving me .. possibly a bug elsewhere [21:59] Yup - The euca API is def handles group names differently .. [22:02] hello there, I would like to upgrade 10.04LTS to 10.10, which means also a major upgrade from Eucalyptus 1.6.2 to 2.0.. is there any particular step to be concerned of? any known problems? would the existing cloud keep all volumes, credentials and configurations? thanks in advance for your answers [22:02] kiall, thanks. please do open a bug against eucalyptus [22:03] just getting it together now [22:03] again, using 'ubuntu-bug eucalyptus' [22:03] humm? [22:03] if you use 'ubuntu-bug eucalyptus' on the system, then it will put some info in the bug about what versions you're using [22:03] it will also collect logs if you allow it [22:04] thats why i suggested you do that on the machine with the Errors earlier (regariding your fialure to leave 'pending') [22:04] i've got to run. [22:04] Ah that wasnt me ;) [22:04] i think it was [22:04] Nope - last thing we talked about was hairpin NAT [22:05] hm.. so it was. i confused you with Makere [22:05] anyway. [22:05] same deal goes for you :) [22:05] use ubuntu-bug eucalyptus , that collects some system info [22:06] I'll see if that works over SSH to the server .. dont have it installed locally.. [22:06] thanks ...