/srv/irclogs.ubuntu.com/2017/08/08/#ubuntu-server.txt

braziercustomshello. I'm back again :D I am still having problems with conjure-up on 16.04.  can anyone tell me why I keep getting this error? 2017-08-07 21:29:26,447 [DEBUG] conjure-up/openstack-novalxd - __init__.py:21 - Showing dialog for exception: cannot retrieve charm "cs:glance-258": cannot get archive: Get https://api.jujucharms.com/charmstore/v5/glance-258/archive: dial tcp: lookup api.jujucharms.com on 10.228.217.1:53: re01:45
sarnoldbraziercustoms: you're cut off at ":53: re"01:47
braziercustoms read udp 10.228.217.50:38257->10.228.217.1:53: i/o timeout01:48
braziercustoms@sarnold that is the last of it01:49
braziercustoms16 machines are running but cannot access horizon of course01:49
braziercustoms@sarnold I just noticed that one is different than the last one.01:51
sarnoldbraziercustoms: is that a go-implemented dns lookup that's failing?01:51
braziercustomsI'm not sure what you mean by "go-implemented"01:51
braziercustomssarnold: it is hitting a DNS forwarder but its never been a problem, you think thats the problem? it works if I put the URL in the web browser it tries to download a file.01:59
sarnoldbraziercustoms: I'm guessing here... try something like "for i in $(seq 1 100); do dig api.jujucharms.com @10.228.217.1; done" and see if you get any errors or all successes02:01
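A minimal sketch of the repeated-lookup test suggested above, assuming a POSIX shell. The helper name `probe` is hypothetical and not part of the original conversation; it runs any command N times and counts non-zero exits, so intermittent failures are summarised instead of scrolling past.

```shell
# probe N CMD...: run CMD N times and report how many runs exited non-zero.
# (Hypothetical helper for illustration only.)
probe() {
  n=$1
  shift
  fail=0
  i=0
  while [ "$i" -lt "$n" ]; do
    # Discard the command's output; only the exit status matters here.
    "$@" >/dev/null 2>&1 || fail=$((fail + 1))
    i=$((i + 1))
  done
  echo "$fail/$n lookups failed"
}

# Example against the forwarder from the log (needs network access, so it is
# left commented out). dig exits non-zero when it gets no reply, and the
# +time/+tries options keep each attempt short.
# probe 100 dig +time=2 +tries=1 api.jujucharms.com @10.228.217.1
```

Counting exit statuses only catches lookups that fail outright; a reply from an unexpected source address would need the dig output itself to be inspected.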
braziercustomssarnold: dont see a failure02:05
braziercustomsanswers each in 0ms02:06
sarnoldbraziercustoms: hrm. is the conjure-up error repeatable?02:06
braziercustomswell, last time seems like I got a different one but it was same type of failure02:07
braziercustomssarnold: I'm trying to find it. give me a sec02:08
braziercustomssarnold: here is a previous one https://pastebin.com/aHpjDAMP02:12
braziercustomssarnold:  so far every time I try to conjure-up, it fails at something "not retrievable"02:13
braziercustomssarnold: that is every time in the last week or so.02:14
sarnoldbraziercustoms: I -think- file a bug against juju with ubuntu-bug juju-core   .. unless you can track down if there is something wrong with the dns forwarder on 10.228.217.1 or the firewalls in between..02:15
braziercustomssarnold: I conjure-down, I will remove the controller (again) and start over see if there are any new errors.02:20
braziercustomssarnold:  :D you made me realize that my dns forwarder for my home network is replying on both interfaces.. its always been this way and didnt cause a problem before, but would it cause a delay?03:46
braziercustomsas I get ;; reply from unexpected source: 192.168.1.3#53, expected 192.168.1.2#5303:47
cpaelzerjamespage: the OVS change we had was applied to master, I asked for branch-2.8 as well05:52
cpaelzerjamespage: but that said we could also just carry the patch on 2.8 as we know it is accepted05:52
cpaelzerjamespage: were you able to complete your check on OVS 2.8 and if so what is the summary and expected timeline?05:53
cpaelzerjamespage: instead of writing more walls of text I'll wait until you are around05:56
cpaelzergood morning everybody btw06:03
dshapHey everyone -- all of a sudden today my production web server (running Ubuntu 14.04) started going down, and I'm trying to figure out why. By "going down" I mean I couldn't SSH into it, and when I rebooted the server (on EC2), I checked syslog and the first time I had this problem earlier today it showed memory issues, but then a few hours later it went down again and syslog didn't seem to show the same memory issues. What else should I look06:29
dshapinto to diagnose what could be causing this?06:29
ranjanhi all, I have a Linux box whose / partition is 100% utilised when using df command, but then when using du -shx /, show there is a lot of free space.06:48
cpaelzerdshap: what do you mean by memory issues - out of memory kills?07:48
dshapcpaelzer: a couple lines like this: "dhclient: fork: Cannot allocate memory"07:48
cpaelzerranjan: even though you are away, for the record: this is a known thing and not "wrong" https://unix.stackexchange.com/questions/45771/df-vs-du-why-so-much-difference07:49
cpaelzerdshap: you seem not to be the first one https://forums.aws.amazon.com/thread.jspa?messageID=67400007:51
cpaelzerdshap: but in general you'd need to track what is consuming your memory - assuming that really is the issue07:51
cpaelzerdshap: while usually next to useless/counterproductive on a cloud, you could set up some swap space so it doesn't die so immediately07:52
cpaelzerdshap: with that in place you could regularly track memory consumption07:52
dshapcpaelzer: That was my first thought when this happened the first time today. After it happened, I rebooted the server and my website came back online, but then 2 hours later it went down again -- that time, it didn't have any "Cannot allocate memory" messages in syslog07:52
cpaelzerdshap: either with basic "ps --sort -rss -eo rss,pid,command | head" or (my preference) with the tool smem07:52
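A minimal sketch of the regular memory tracking described above, built around the same `ps` invocation. The function name `log_top_mem` and the log path in the usage note are assumptions for illustration, and the `--sort` option assumes the procps `ps` that Ubuntu ships.

```shell
# log_top_mem FILE: append a timestamped snapshot of the largest processes
# by resident set size (RSS, in KiB) to FILE.
# (Hypothetical helper for illustration only.)
log_top_mem() {
  logfile=$1
  {
    echo "=== $(date '+%Y-%m-%d %H:%M:%S')"
    # Same command as in the log: header plus the nine largest processes.
    ps --sort -rss -eo rss,pid,command | head
  } >> "$logfile"
}
```

Calling something like `log_top_mem /var/log/top-mem.log` from cron every few minutes would leave a trail on disk that survives a reboot, which is the point when the machine becomes unreachable before anyone can log in.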
cpaelzerdshap: what did it have in that second case - it must have had "something" I hope07:53
cpaelzerit doesn't shut down but is losing its network connection, right?07:53
dshapcpaelzer: According to the log, it doesn't appear to be shutting down, because I just see the reboot logs at the time that I initiate the reboot (from the AWS management console)07:54
* kickinz1 back online07:54
dshapcpaelzer: the last 2 things I see from sys log are: "dhclient: bound to <some_IP> -- renewal in 1701 seconds." and "CRON[1505]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)"07:55
dshapcpaelzer: the last 2 things before it went down the 2nd time*07:56
cpaelzerdshap: hmm - I might not have the best hints so others feel free to chime in, but if you really have nothing more after one of these reboots you need to set up some things to log more to disk08:10
cpaelzerdshap: you could also have auto-restarts if the HW under the cloud has issues, like https://forums.aws.amazon.com/message.jspa?messageID=275616 but I'd consider this unlikely to happen two times08:10
cpaelzerI think you could find in the service health dashboard if such a case happened08:12
cpaelzerotherwise set up monitoring to close in on what issue your system in particular might run into08:13
dshapcpaelzer: got it. thank you for your suggestions!08:17
lordievaderGood morning08:22
cpaelzerhi lordievader, good morning to you as well08:29
lordievaderranjan: That is quite possible. Probably some application which still has a reference to a deleted file.08:35
lordievaderDu doesn't see those, df does.08:37
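A minimal sketch of how to find the deleted-but-still-open files mentioned above, assuming a Linux system with `/proc` mounted. The function name `list_deleted_open` is hypothetical. It relies on the kernel appending " (deleted)" to the symlink target of a file descriptor whose file has been unlinked.

```shell
# list_deleted_open: print every open file descriptor that still refers to
# an unlinked file. Run as root to see other users' processes.
# (Hypothetical helper for illustration only.)
list_deleted_open() {
  for fd in /proc/[0-9]*/fd/*; do
    # Descriptors can vanish or be unreadable; skip those quietly.
    target=$(readlink "$fd" 2>/dev/null) || continue
    case $target in
      *' (deleted)') echo "$fd -> $target" ;;
    esac
  done
}
```

The space held by such files is only released once the owning process closes them or exits, which is why `df` keeps counting it while `du` cannot see it.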
lordievaderHow are you doing cpaelzer?09:18
cpaelzerjamespage: I see the latest OVS upload still has the 15 errors in the self-tests09:19
cpaelzerlordievader: fighting the tide :-) how are you?09:19
jamespagecpaelzer: yes - looking at those now09:20
cpaelzerjamespage: I'll sync what we upload to Debian on Thursday afternoon - now that the patch with the pkg-config is accepted in OVS it will likely be the one with the multi-arch changes09:21
cpaelzerjamespage: but once I sync that from Debian it will fail to migrate until we have also uploaded a new OVS working with it09:21
cpaelzerjamespage: so ping me once your exploitation of OVS 2.8 makes any progress (or you are giving up for a reason)09:21
cpaelzerso we can coordinate those syncs/uploads as needed09:22
lordievaderDoing all right here :)09:22
=== clvx is now known as Guest40784
=== LaserAllan is now known as Guest21980
jamespagecpaelzer: picked your patch for ovs pkg-config/dpdk11:01
jamespagecpaelzer: having to skip some tests on the dpdk build which I've reported upstream11:01
jamespagecpaelzer: I've also been able to re-enable s390x testing which is nice11:01
cpaelzerwow, that is great jamespage11:03
cpaelzerdid you have to skip all 15 tests or were some fixable?11:04
cpaelzerjamespage: thanks for integrating my patch, as I said it is upstream now so it should be safe and not a maintenance burden11:04
cpaelzerjamespage: I sent you an optional invite for the DPDK meeting, you certainly have enough but if you ever want to attend you now know time&date11:05
cpaelzerjamespage: so with the pkg-config applied we can upload the 17.05.1 including this change to Debian and sync from there11:06
cpaelzerjamespage: what is the timing on the OVS 2.8 release - do you know?11:06
jamespagecpaelzer: this month11:06
jamespage~(timing of release)11:06
cpaelzerok, so I'd then try to coordinate the upload to Debian and sync from there11:07
jamespagecpaelzer: re the tests - I've just deferred that to upstream tbh11:07
cpaelzeronce the 17.05.1 is in Artful proposed I can ping you to upload the OVS 2.8 then11:07
jamespagecpaelzer: I'm a little time short this week as off from end of thursday for a couple of weeks :-)11:07
cpaelzera couple you say11:07
cpaelzerI hope for vacation11:07
cpaelzerjamespage: so what shall we do about OVS then, should I try to get it to Debian and sync to Ubuntu earlier?11:08
cpaelzerjamespage: would that help you to close out the OVS upload before you leave?11:08
jamespagecpaelzer: it would but I'm happy to upload prior to that actually arriving in Ubuntu; it will dep-wait until 17.05.1 appears :-)11:09
cpaelzerand then fail on me while you are unavailable :-P11:09
cpaelzerbut ok, sounds like a plan for now11:09
cpaelzerI can still whine about it when it happens11:10
cpaelzerjamespage: ok, I have asked my co-maintainer to upload to Debian whenever he is ready11:13
jamespageta11:14
cpaelzerjamespage: I'll sync from there into artful once available, feel free to make a dep-waiting one being around in a-p if you want11:14
jamespagecpaelzer: I'll hold for as long as possible11:16
cpaelzerok11:17
jamespagecpaelzer: good smell test at least - https://bileto.ubuntu.com/excuses/2889/artful.html13:45
cpaelzerjamespage: yeah doesn't seem too bad13:50
=== Guest13936 is now known as med_
=== med_ is now known as medberry
=== admcleod_ is now known as admcleod
=== medberry is now known as med_
ahasenackubuntu-server meeting in #ubuntu-meeting if anyone is interested16:06
madLyfethis was the iso i installed: ubuntu-16.04.2-server-amd64 but im still showing kernel 4.4.0-89-generic.17:46
sdezielmadLyfe: that seems OK to me. Unless you expected the HWE kernel?17:51
naccmadLyfe: can you show `apt policy linux-generic-hwe-16.04` ?17:51
naccmadLyfe: yeah, so I'm thinking now the server ISO *doesn't* use the HWE kernel by default (not seeing it in the seeds)17:51
naccmadLyfe: which is a distinction from the desktop ISO I didn't realize17:52
madLyfeso run this command? 'apt policy linux-generic-hwe-16.04`'17:53
naccmadLyfe: that will just say if the hwe kernel is installed17:55
naccmadLyfe: if it's not, you can install it17:55
nacc(if you want)17:55
naccmadLyfe: but as we were discussing on #ubuntu, why do you want the hwe stack on your server?17:55
madLyfethat command didnt do anything17:56
naccmadLyfe: it should output something17:56
madLyfemy servers are miners, just trying to squeeze the most performance out of the CPUs as possible.17:56
naccmadLyfe: ok, and you've measured that the newer kernel is more performant?18:00
naccmadLyfe: in any case, back to that command. It should either have output "Unable to locate package" or it would have output lines that show what versions are available/installed18:01
madLyfei have no idea. havent tested it. thats why i was seeing if i had it installed or not18:01
madLyfenacc: http://i.imgur.com/1RMDu1Z.png18:03
oerheksdrop the ' at the end18:04
oerheksapt policy linux-generic-hwe-16.04` <<18:04
madLyfei did on the next two lines18:04
madLyfewhere nothing happened18:04
oerheksthose next lines are within your previous command, try fresh18:05
madLyfemanually typed it in again and same18:05
madLyfeeven the clear command is doing it18:05
madLyfewth18:05
oerheksapt policy gives a search, nothing gets installed or changed18:06
madLyfeno i mean clear isnt doing anything either: http://i.imgur.com/ZVGr7MB.png18:07
oerheksagain, you are within your previous command, stop this with ctrl + c18:08
sdezielmadLyfe: type: ctrl-c18:08
madLyfeah yes. still new. sorry18:08
oerheks:-)18:08
oerheksyou will remember this next time, no worry18:08
madLyfehttps://gist.github.com/5f60456c093419cd708b6f08faf9852618:09
oerheksyour prompt should give a name @ machine + $ ( as user ) or + # as root18:10
oerheksgood, HWE not installed18:10
oerhekssudo apt install linux-generic-hwe-16.04 # if you *need* a newer kernel and such18:11
madLyfewell it says 4.10, but i dont have it?18:11
madLyfe"Installed: (none)" mean its not?18:12
madLyfesudo apt install --install-recommends linux-generic-hwe-16.04 ?18:14
naccmadLyfe: right (none) means not installed19:57
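A minimal sketch of reading the "Installed:" field discussed above from policy output, assuming a POSIX shell with awk. The function name `installed_version` is hypothetical; it reads the policy text on standard input so it can be tried without touching the package database.

```shell
# installed_version: read `apt-cache policy PKG` output on stdin and print
# the installed version, "not installed" for "(none)", or "unknown" when no
# "Installed:" line is present (for example, the package name was not found).
# (Hypothetical helper for illustration only.)
installed_version() {
  awk '
    $1 == "Installed:" {
      if ($2 == "(none)") print "not installed"; else print $2
      found = 1
      exit
    }
    END { if (!found) print "unknown" }
  '
}

# Example (left commented out; needs an apt-based system):
# apt-cache policy linux-generic-hwe-16.04 | installed_version
```

The "Candidate:" line in the same output is the version that would be installed, which is why a version number can appear even though nothing is installed yet.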
naccmadLyfe: i'm not sure if you need install-recommends or not, but yes, that's the package you want19:57
madLyfethanks. whats the worst that could happen by upgrading?19:59
naccmadLyfe: your system doesn't boot?19:59
naccmadLyfe: note, you're not upgrading, you're installing a different package19:59
nacc*a second package19:59
madLyfeits a pretty standard supermicro board. nothing proprietary on it. should be good.20:01
=== Epx998- is now known as Epx99o8
=== Epx99o8 is now known as Epx998
Epx998Is there a good command to get the status of a netapp share thats mounted?20:55
=== jathan_ is now known as jathan
Epx998hmm21:14
=== hehehe is now known as Guest2979
=== Guest2979 is now known as hehehe
naccmwhudson: there's a few golang packages that are showing up in the 'out of date' from Debian list for server. Am I right to assume you've got a handle on them all? :)22:27
mwhudsonnacc: er no but i can22:27
mwhudsonnacc: where's the list?22:28
naccmwhudson: http://reqorts.qa.ubuntu.com/reports/ubuntu-server/merges.html22:28
naccmwhudson: there are ~11 for golang22:28
naccmwhudson: it's ok if they are understood to be behind for a reason, I just want to make sure that we have a reason for each :)22:29
mwhudsonthe reason is that i haven't ripped out the shared library stuff yet i think :(22:32
naccteward: are you planning on updating nginx to 1.13.3 this cycle?22:56
hehehecould be good22:59
heheheteward: you going to do it22:59
hehehe-22:59
hehehe?23:00
nacchehehe: why would you ask the exact question I just did?23:00
