/srv/irclogs.ubuntu.com/2016/04/01/#ubuntu-s390x.txt

jfhgood morning05:54
andrewcjfh, good morning!07:49
andrewcjfh, sorry to hear that you're having difficulty getting online :-(07:49
andrewcjfh, as well as your mail, you could also try asking for assistance on the "canonical-sysadmin" channel on freenode...07:50
jfhgood morning andrewc - well, I hope I can figure that out soon ... seems to be an issue with the Sign-in through Canonical ... let's see ...07:51
jfhgood point - will try that, too07:51
zachman:D09:59
cpaelzerjfh: welcome to the dark side10:46
jamespageo/10:47
cpaelzerhi jamespage10:47
cpaelzermihajlov: borntraeger: hi I hope you have a good weekend soon, but we would have a question regarding libvirt/kvm/openstack on s39010:49
cpaelzerI hope it is one of the former two so we can get a quick solution without asking the OS guys :-)10:49
cpaelzerjamespage: here in the channel hit thie bug just a few minutes ago10:50
cpaelzerhttps://bugs.launchpad.net/ubuntu/+source/nova/+bug/156483110:50
ubottuLaunchpad bug 1564831 in nova (Ubuntu) "s390x: error booting instance" [Undecided,New]10:50
borntraegercpaelzer, ?10:50
cpaelzermihajlov: borntraeger: and I wondered what/why you don't hit that with z/KVM+OS10:50
cpaelzerthe title is rather misleading IMHO10:50
cpaelzerlibvirtd[21610]: this function is not supported by the connection driver:  cannot update guest CPU data for s390x architecture10:50
cpaelzerborntraeger: mihajlov: that is closer to where things might start to break10:51
borntraegercpaelzer, I would assume that this is about the "not yet available" cpu model support10:51
borntraegercpaelzer, mihajlov : but there was a workaround in libvirt for that10:52
jamespagehttp://libvirt.org/git/?p=libvirt.git;a=blobdiff;f=src/cpu/cpu_s390.c;h=23a7f9d8d38a00dc9c673d224f797cf8a17aa5d1;hp=f9d7e216aec847df321d7c7d3a050415ee8550fd;hb=59403018893cf2c4f9a6f5145e387cefbd44399a;hpb=b789db36ae1cb5a48986c3b9e3bfb6413136787210:52
jamespagelooks relevant but we appear to have that in the libvirt version in xenial - just double checking10:52
jamespageyah - confirmed in 1.3.110:54
cpaelzerjamespage: hmm - to be sure is that OS against libvirt/KVM ?10:55
cpaelzeror containers anywhere in between?10:55
borntraegercpaelzer, jamespage , mihajlov its certainly a message from libvirt10:55
borntraegercpaelzer, jamespage, mihajlov , but I have not seen it here10:56
cpaelzerjamespage: could you identify the exact (api) call it made to trigger that?10:56
jamespagecpaelzer, borntraeger: actually yes there is a container in the way here10:57
jamespageI think that's the cause of the problem...10:57
jamespageempty /proc/cpuinfo is not helping I suspect10:58
cpaelzerjamespage: do you want to give it a try without containers just with KVM ?11:04
xnoxjamespage, is missing /proc/cpuinfo an lxc/lxd bug, given that it needs to emulate/whitelist/synthesise it or some such?11:06
jamespagexnox, yes I think so11:06
jamespagecpaelzer, not just yet11:07
xnoxjamespage, i guess a manual provider can be mixed into the thing... ?11:08
jamespagexnox, figured out how to bind mount the hosts cpuinfo into the container...11:11
xnox^_^11:11
jamespagexnox, getting alot of "Failed to allocate directory watch: Too many open files"11:11
jamespagexnox, cpaelzer: lack of /proc/cpuinfo is a problem for LXD, but does not appear to be the cause of this...11:17
jamespagexnox, cpaelzer: trying a trick to add the host machine to the deployment, but just hit the wall with the 2G root disk size...11:20
xnoxjamespage, what's your host? you should be able to active e.g. additional drives and add them to the vgroup.11:21
xnoxjamespage, btw i can reboot s1lp7 and give it to you as well, as an additional resource it should have ~100GB large rootfs.11:21
jamespagexnox, my problem is that all of the control plan IP addresses are on the local bridge and not generally accessible...11:22
jamespage2016-04-01 11:21:49 INFO install E: Write error - write (28: No space left on device)11:22
jamespagenot unexpected...11:22
cpaelzerxnox: does d-i in guided partitioning try to create a swap disk as huge as memory?11:47
cpaelzerxnox: the disk of james on a 40G memory system had split the available ~41G into 38.x swap and 2G root11:47
cpaelzerxnox: s390 is the land of small disks and (sometimes) a lot of memory11:48
cpaelzerxnox: there should/could be a cap on the swap size11:48
xnoxcpaelzer, jamespage: this is a classic d-i/partman bug. there are no caps, just multiples.11:50
xnoxdeactivate swap, remove it, enlarge partition, enlarge rootfs....11:50
cpaelzerxnox: already done that11:50
cpaelzerI just wanted to avoid the next oen running into it11:50
cpaelzerxnox: classic means the bug exists and is open=11:51
cpaelzer?11:51
xnoxhttps://bugs.launchpad.net/ubuntu/+source/partman-auto/+bug/103232211:51
ubottuLaunchpad bug 1032322 in partman-auto (Ubuntu) "Swap space allocation for large memory systems needs improvement" [Medium,Confirmed]11:51
cpaelzergreat, thanks11:51
xnoxfirst opened in 2012-08-02 but it has been around since forever, and typically reported by installer testers in e.g. qemu vms with like11:52
xnox"i gave it 16GB of ram and 8GB rootfs disk"11:52
xnoxthinking about it.11:53
xnoxcpaelzer, does it even make sense to have swap on lpar / z/VM?11:53
cpaelzerxnox: don't get this started in a public channel11:53
cpaelzernooooo11:53
cpaelzeryou did it11:53
cpaelzerthis is like vim and emacs11:53
xnoxlpars should be big enough, and z/VM can over-commit x2 RAM11:53
cpaelzerit can overcommit up to whatever you can accept performance wise and often I've seen 2-3x11:54
xnoxbut on z/VM only, not on LPAR, right?11:54
cpaelzereven on kvm it works reasonably most of the time although there could be soom improvements11:54
cpaelzerLPAR is only partitioning, no overcommit for memory11:54
xnoxthinking about it, maybe there should be a safe guard that e.g. swap cannot be more than 10% of total disk space, regardless of the sizing relative to RAM11:55
cpaelzerIMHO the Host should swap not the guests11:55
xnoxor maybe the 200% should be from the smallest of (ram, disk) sizes11:55
cpaelzerbut there are quite a lot of cases where that alone is not the truth11:55
cpaelzerI think I have seen some logic that groups into three categories by ram size11:55
cpaelzerram <2G, try 2*ram11:56
cpaelzerelse swap = ram size11:56
cpaelzerbut11:56
xnoxcan one at all hybernate lpar & z/VM? cause on server hibernate on emergency power shut down is a poor mans choice for redundant power.11:56
cpaelzernever go over 64G11:56
cpaelzerand never go over x% of the disks11:56
xnoxwhy 64G? why not 65G? why not 63G?11:56
cpaelzerxnox: suspend and resume is implemented11:56
cpaelzerarbitrary choice, like the old 2x, why not 1.8x11:57
xnoxsuspend&resume is not hibernate&thaw. E.g. swap is not needed for suspend, as RAM remains powered/active.11:57
cpaelzerif science people are involved we could suggest a smooth scaling formular no one would understand :-)11:57
xnox2x -> is reasonable to have a good chance at hibernate, when things have overcommited ram.11:57
xnoxcause one needs to dump all of ram to swap, to hibernate, plus whatever got overcommited/spilled over to swap.11:58
cpaelzerah you mean to disk11:58
xnoxyes, hibernate.11:58
cpaelzernever cares too much about that, I'd have to check if that works as well11:58
cpaelzerhca: ^^ ?11:58
xnoxis there hibernate on lpar / z/vm -> if not, i'll just remove swap from default recipes full stop, and people can install swapfile package to add swap.11:58
cpaelzerwait for hca's answer11:59
cpaelzerbut then power failure is so boo low end11:59
cpaelzerI mean most cpu calculations are doen twice for quantum effects of random particles11:59
cpaelzerpower failure - pffff11:59
xnoxi think mainframe deployements have better power failure mode handling than other architectures.11:59
cpaelzerwell - eventually they have way better handling, but then this is (sadly) one of the things the business/finance people cut costs12:00
cpaelzerit works the same without that battery pack, well then ...12:00
xnox>_<12:01
xnoxcpaelzer, reading all the bug reports it's like "high memory system -> too large swap" and "swap not large enough to hibernate"12:12
xnoxthe most reasonable comment is from superm112:12
xnoxhttps://bugs.launchpad.net/ubuntu/+source/partman-auto/+bug/57679012:12
ubottuLaunchpad bug 576790 in partman-auto (Ubuntu) "Partman should support disabling swap in impractical scenarios" [Undecided,New]12:12
xnoxe.g. it should be possible to have a flag to essentially "skip swap" and calculate that for "impractical scenarios" e.g. RAM >> root disk (high memory system)12:13
xnoxwith a threshold as to what a high memory system is12:13
xnoxand be able to preseed that key.12:13
xnoximho "high-memory" is anything where RAM >> 10% of total disk space12:15
xnox(specifically 10% of the /usr partition)12:15
xnoxwell... no.12:15
* xnox needs to look at partman-auto to see if it has total disk size numbers available12:16
cpaelzerI'm ok with almost any limit, as the hard part is creating the infrastructure not defining the exact ratio/size of the limit12:35
jamespagecpaelzer, borntraeger: OK so after looping around and re-deploying with the compute node directly on an LPAR running Ubuntu Xenial, I still see the same problem13:59
cpaelzerjamespage: so you now run without Containers just KVM&Openstack14:24
cpaelzer?14:24
jamespagecpaelzer, well the control plane bits are still in containers but the hypervisor is not14:24
cpaelzerok14:24
cpaelzerdidn't xnox already say it worked for him, maybe he has the workaround you need14:25
xnoxnot with latest nova generated libvirt config for our cloud image14:27
jamespagexnox, yeah - I suspect this is a break in nova's used of libvirt but not 100% sure yet...14:27
xnoxso we will need to debug the generated libvirt config i guess.14:28
xnoxjamespage, is the one you pasted on the bug report accurate?14:28
xnoxmost recent14:28
jamespagexnox, yes14:28
xnoxcool, i'll give it a poke in a few.14:29
xnoxneed to finish a few things up, and have a call, and then will be able to look into it.14:29
jamespagexnox, having a punt at setting the cpu-mode flags for nova to host-passthrough15:46
jamespagexnox, we do the same for ppc64el15:46
jamespagexnox, have you hit this "too many open files" warning/error on s390x?  I think its actually impacting my deployment15:47
jamespageI see it on the host and in containers as well..15:47
mpavoneHi, I have updated a comment to https://bugs.launchpad.net/ubuntu/+source/nova/+bug/1564831 regarding instance not starting on s390x16:27
ubottuLaunchpad bug 1564831 in nova (Ubuntu) "s390x: error booting instance" [Undecided,New]16:27

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!