[05:13] good morning [06:27] Good morning === chmurifree is now known as chmuri === gogbog is now known as Guest72809 [15:19] Is there a known issue on some hardware where debian-installer partitions a disk as sdc vs sda? [16:07] went to update my server via update && upgrade and it was running fine till the end. here is the end of the log: https://paste.fedoraproject.org/paste/Ib96aSM5fiupeAbW5-gaqQ/raw [16:08] madLyfe: cp: error writing '/var/tmp/mkinitramfs_sr8t8I//lib/modules/4.13.0-41-generic/kernel/drivers/gpu/drm/i915/i915.ko': No space left on device [16:08] the errors that follow result from that (no space left) [16:09] is there a way to see via command the drive breakdown? [16:11] madLyfe: install ncdu [16:11] but space.. :P [16:12] madLyfe: then du -sh /* and drill down from there [16:12] i would suggest du -h --max-depth=1 [16:13] on / [16:13] well, you need -s(ummary) for each dir [16:13] no you do't [16:13] *don't [16:14] I see, max-depth does the summary [16:14] is the last number the total? [16:15] blackflow: right, it's an alternative (du doing the dirlist vs. the shell) [16:15] I like baobab for drilling down. It's a GUI but does support investigation remote machines over ssh. [16:15] investigating [16:16] i went to root to see the total of everything but only printed ./ .nano and . [16:17] madLyfe: you don't need to cd, you can just pass the argument [16:17] madLyfe: and . means the directory [16:17] so go back to home/user? [16:18] madLyfe: i mean it doesn't matter where you run `du` from, you can tell it where to check disk usage [16:18] so drop the '=1' to search the whole drive? rather than just the dir i was in? [16:19] madLyfe: which version of the command did you use? [16:20] du -h --max-depth=1 [16:20] madLyfe: run from what location? [16:20] user [16:20] madLyfe: that's not a location [16:20] madLyfe: what filesystem path [16:20] home/user [16:20] madLyfe: /home/ ? [16:21] sorry, yes. [16:21] madLyfe: that will just tell you the usage of that directory [16:21] madLyfe: that's why I said "on /" immediately after. `du -h --max-depth=1 /` [16:22] oh. wasnt sure that was supposed to be a part of the command [16:25] madLyfe: perhaps check the du(1) manpage. it's wise to check the manpage of commands you're given by random people on irc :) [16:26] https://paste.fedoraproject.org/paste/8K5JqNBszC2J5xtjevcwQA/raw [16:27] madLyfe: and how large is your disk (`df -h` can help) [16:28] 16gb [16:28] its a flashdrive [16:29] https://paste.fedoraproject.org/paste/Xz~tFEKAOKExJAym-vCjHg/raw [16:31] madLyfe: your / is full [16:32] do updates not overwrite? do they accumulate? [16:32] can that be pruned? [16:32] madLyfe: what do you mean? [16:33] like are old updated files kept around or are they overwritten? [16:33] madLyfe: 'updated' in what sense? [16:33] every time you run update && upgrade [16:34] madLyfe: update just updates the file lists locally [16:34] madLyfe: are you asking if it keeps around the old .debs? [16:34] ya [16:35] madLyfe: it keeps some amount in the cache [16:35] madLyfe: if you think that's a problem, use `sudo apt-get clean` [16:36] hmm i dont think that cleared anything up for me heh [16:37] madLyfe: usually it's not much space [16:37] madLyfe: right now your /opt and /var are relatively large [16:37] madLyfe: which might imply large logs [16:38] madLyfe: you can run the same du command with /var [16:38] ya its amdappsdk-3.0, amdgpu, amdgpu-pro, rocm. [16:38] in /opt [16:39] just not sure what i can get rid of [16:39] anything your __server__ doesn't need :) [16:40] lel i dont know what it doesnt need. [16:40] i am its plaything. [16:40] madLyfe: that doesn't make sense [16:40] here is /var breakdown: https://paste.fedoraproject.org/paste/tf6vLUjOZIpEC40USrgtww/raw [16:40] madLyfe: is it actually your server? [16:41] your /boot also looked bad, fwiw [16:41] madLyfe: have you run `sudo apt-get autoremove` recently ? [16:42] ya. its using amd drivers to rung GPUs but i dont remember how i set them up. someone helped me and it was kind of set it and forget it. months ago. update && upgrade never posed a space issue till now i guess. [16:42] nacc: nah [16:42] madLyfe: becuse you were ignoring the message from apt the whole time [16:42] it tells you to run autoremove every time, iirc [16:43] https://paste.fedoraproject.org/paste/-RIiGhNJboQs7-Edk7lH4Q/raw [16:43] i dont remember that msg [16:44] madLyfe: well, that would free up 3G of space if you run it [16:44] 4G of space! [16:45] yay! ty [16:46] what is that removing? are they old debs from previous udpates? [16:47] old debs, old kernels, packages which were installed as dependencies but are no longer needed since the dependent package or packages are not on the system or the deps changed, etc. [16:47] ah k. tyvm [16:48] to be clear, what it's removing is *only* the third thing teward listed [16:49] that happens to include the first two [16:49] i see. [16:50] when should this be ran? i dont recall seeing that msg anywhere. [16:50] madLyfe: probably worth a perusal of `man apt-get` if you don't know what it does [16:50] madLyfe: on my system(s) every time I run apt or apt-get it emits a message about X unused packages that can be removed [16:50] man is manual? [16:51] as in RTFM? [16:52] madLyfe: yeah :) [16:53] ok. i want to thank you guys for the hand hodling today! [16:54] blackflow: i agree about running commands from ppl on the internet but nacc has helped me plenty in the past so i trust the nacc. [16:54] madLyfe: it's not just about trust. it's about learning by yourself ;) [16:54] blindly running commands, you won't learn a thing. [16:55] unless hes been taken over by a malicious actor. very possible these days. :P [16:55] true. [16:55] madLyfe: heh [16:56] madLyfe: unattended-upgrades can be configured to run the autoremove periodically, iirc [16:56] nacc: yes, but it needs you to uncomment some things [16:56] in the default config *otherwise* it doesn't. [16:56] it also can barf if you try and let it upgrade kernels automatically, but that's a different issue. [16:57] (related to not having run autoremove in the past before enabling the option) [16:57] also, the updates that require restarts wont be applied because these servers are rarely restarted. [16:58] you can also have unattended-upgrades do that as well but if uptime is your concern then you shouldn't. And you should probably reboot every quarter or such because kernel security patches [17:00] ya thats kind of what i was doing today because power went out. good time to run the updates. [17:04] teward: good points [17:05] well crap it looks like it was removing amdgpu stuffs, hopefully it doesnt take out anything important. [17:05] getting amd drivers working wasnt fun. [17:13] also, if youve run updates and log into a fresh ssh session, it says **a restart is required*, why is the command 'reboot' and not 'restart'? or why does the statement not read '**a reboot is required**'? [17:14] sorry, its '*** System restart required ***' [18:03] technically you do not require a reboot; shutdown and power on will achieve the same restart goal [18:18] what should i do if one of my servers has just been hanging here for a long time during and update & upgrade? https://paste.fedoraproject.org/paste/J4gA9Z8HRYn5sBG9G0KGQw/raw [18:19] define "a long time" :P [18:20] close to an hour [18:21] previously it got stuck here: https://paste.fedoraproject.org/paste/QcOKX178hCZwEbDNr9MG2Q/raw at fetching. [18:22] had to pull the power on it and power it up again. [18:22] ok it just now lost the ssh connection. [18:22] im guessing it froze up or something [18:23] https://usercontent.irccloud-cdn.com/file/f8el4LyY/image.png [18:41] madLyfe: if you install updates remotely over a non LAN connection, be sure to always run apt in a GNU screen / tmux on the server. [18:42] well i am remote, but on the lan. [18:43] remote as in from another puter on the lan. [18:43] the L in LAN stands for local [18:44] maybe you need a KVM [18:44] or serial console [18:44] well these are SM servers w/o the IPMI. i only have one server with IPMI [18:45] regardless, i should just go pull the power again and try once more? [18:45] i was thinking they all had *some* kind of ipmi [18:46] not these old ones from what ive gathered. [18:46] still. if it killed the server, wouldnt it kill the server in a tmux session as well? [18:46] if it's frozen, you probably want to power cycle it, then probably take measures to ensure the hardware is in good shape, when continue working on it. [18:47] and if it keeps freezing then I'd start to assume something hardware-based may be at play, since it sounds like this is not hte first freeze you've had on this system. [18:47] sure, GNU screen + tmux dont survive power cycling or reboots in general. [18:47] well this is the second time on this server today after the power outage. other than that ive never had this issue before on any of the servers. [18:48] power outages cause all sorts of problems (this is why UPSes exist heh) [18:48] tomreyn: no i mean it freezing during update & upgrade. [18:49] if it freezes the server then being in a tmux session wouldnt really matter? [18:49] madLyfe: if the system freezes, the session is frozen, too, and it wont magically recover after power cycling. [18:50] what is your definition of a 'power cycle'? im just asking because on the IPMI there is actually a power cycle button.. [18:50] you need to make the hardwar enot freeze [18:50] didnt oyu say you had no ipmi there? [18:51] my definition of 'power cycling' is probably the same as everyone elses: cutting power for a short time, then restoring it. [18:52] Does the name I choose for a DKIM selector matter? [18:53] I have one ipmi. just not on this server. [18:54] holding the power button doesn't even shut it down lol [18:54] sounds like a hardware problem then [19:00] probably the flash drive [19:00] rbasak: anything you need from me re:git-ubuntu? [19:06] seems to be updating fine now: https://usercontent.irccloud-cdn.com/file/n5wlnmu8/image.png [19:44] welp looks like it froze again [19:59] now it gets stuck during boot. im guessing its because it froze during this upgrade process: https://usercontent.irccloud-cdn.com/file/PAlNMjhv/image.png [19:59] ill try and install the OS again later and see how that goes. [20:01] o_O [20:01] that's a funny place for it to hang [20:02] can you check dmesg output? journalctl output? [20:02] filesystem freespace? free inodes? [20:02] nah its a total freeze of the system [20:03] power down button didnt even work when it froze at that point in the apt update progress [20:04] ill just have to restart from scratch on that server. shouldnt take very long once i get rolling. [20:09] madLyfe: oh. ew. maybe try the .. [20:10] madLyfe: .. fwts package? [20:13] I'll come back if I can't get it situated after a fresh go. I'm not losing any important data so it's ok. [21:14] coreycb: around still? [21:14] jamespage: yep [21:15] coreycb: just wanted to check with you before I upload a whole load of neutron bits with py3 enabled [21:15] jamespage: that's fine with me [21:15] coreycb: I've been using a -common package for configuration files etc... [21:15] coreycb: which the python{3}- packages depend on - does that sound ok? [21:15] prior to that the python- package included the etc files [21:16] jamespage: that sounds different from python- depending on -common? [21:17] sorry. other way around. [21:17] yeah it is [21:17] jamespage: currently we have -common depending on python- [21:18] coreycb: I could not think of a different way todo it - we have to preserve etc files for upgraders who will only have install the python-module [21:18] jamespage: that's a good point [21:19] coreycb: these are generally python modules with config files, rather than more complex agent based packages like neutron for example [21:19] jamespage: that sounds like it makes sense. i missed that. i'm going to have to make a pass on the merged debian packages to do that. [21:20] coreycb: did you do it the other way around? [21:20] jamespage: yeah [21:20] coreycb: well I think its appropriate still for packages with agent binary packages - like neutron, nova cinder etc... [21:21] coreycb: http://launchpadlibrarian.net/375048921/python-oslo.versionedobjects_1.33.1-0ubuntu1_1.33.1-0ubuntu2.diff.gz is the diff for the generalized autopkgtest for oslo packages I'm doing [21:23] jamespage: i think autopkgtest-pkg-python should cause imports to be tested but maybe not dual stack? [21:23] coreycb: tbh it does not deal with the name of the module being oslo.* and the actual module import being oslo_* [21:24] jamespage: ok, i see [21:24] which is why most of them fail all of the time - I tried to get a fix into autopkgtest but it was rejected in favour of work not done todo better autodiscovery [21:24] which I don't have cycles for [21:24] the import test is basically autopkgtest-pkg-python and the -ostestr one is extra [21:25] basically we'll run unit tests on all reverse-depends changes which is nicer [21:25] jamespage: ok makes sense, cool [21:27] jamespage: thanks for the update. i'll check out your neutron uploads. === rev_strangehope_ is now known as revstrangehope [22:53] powersj: did you see the failure just now? (git-ubuntu-ci): error: flag provided but not defined: -t [22:53] failed: sudo lxd waitready -t 60 [22:53] nacc: I merged in rbasak's changes let me double check 'em in a bit [22:53] did that this morning [22:53] thanks [22:54] it seems weird, as lxd says it is an option (on my 18.04 using the deb) [22:57] I had thought I'd tested the deb version locally [22:57] I had to add waitready as on the snap version it takes a while for the socket to become ready (the snap doesn't appear to use systemd to manage the daemon, so no socket activation) [22:58] I will check again tomorrow. If necessary I can make the waitready only happen on the snap option. [22:58] Then the next change to land is the change of the Jenkins job itself to use --snapcraft-snap [22:58] sorry, i meant just that i've got the lxd deb locally and it says -t is an option and takes a parameter [22:58] whatver lxd the ci job is suing is complaing about the passing of -t [22:58] does it expect -t=60 ? [22:59] or -t60 [23:01] I can reproduce now. [23:01] Ah [23:01] Scrolling back I had tested it but not noticed that it failed :( [23:02] Looks like it doesn't have the -t shortcut and it's --timeout=60 [23:03] fun [23:07] https://github.com/CanonicalLtd/server-test-scripts/pull/3 [23:07] Tested and succeeds in both snap and deb modes now [23:07] Sorry for my test failure on the last PR [23:08] * powersj pulls [23:08] hi there [23:08] is there a way to find out why a package wants to install certain dependencies? [23:08] rbasak: cool, thanks! [23:08] gordonjcp: well, each package lists its dependencies [23:08] gordonjcp: do you mean what they use those dependencies for? [23:09] nacc: essentially yes [23:09] gordonjcp: no, i don't think that really makes sense [23:09] * rbasak attempts a retest [23:09] nacc: I want to install collectd on a server, which is for collecting up various stats into (among other things) an .rrd file [23:09] nacc: it wants to install a full X desktop [23:10] like, about 1GB of deps, including X.org, Wayland, all manner of associated libs [23:10] I know the Ubuntu packagers are generally a bit on crack with their dependencies, but that seems excessive [23:10] gordonjcp: ignoring the random snark, what version of ubuntu? [23:10] gordonjcp: try using --no-install-recommends [23:11] 16.04 [23:11] That cuts it down quite a bit. [23:11] a bit, but it's still trying to install libmir and friends, wayloand and friends and X.org and friends [23:12] I don't see that [23:12] Can you be specific about a package that you think shouldn't be included? [23:12] gordonjcp: i don't think that's correct and i just tested in a lxd 16.04 [23:12] libwayland-cursor0 libwayland-egl1-mesa libwayland-server0 libx11-xcb1 [23:12] ^ one line from th edeps [23:13] gordonjcp: with --no-install-recommends? [23:13] Do you have any third party repositories enabled? [23:13] https://paste.ubuntu.com/p/89dkDNG5Xs/ [23:13] is what you should get [23:14] I've already got the pango and cairo stuff in, because rrdtool uses that [23:14] although why collectd pulls it in when it doesn't draw graphs I don't know [23:14] I don't have any third-party repositories enabled [23:14] collectd links with librrd4 [23:14] librrd4 can render using GDK [23:14] gordonjcp: please use a pastebin and show the exact output [23:15] although now you mention it, this is a server on a VPS provider, maybe they've got some "funnies" in [23:15] (presumably to the screen, on request) [23:15] Try to reproduce on an official Ubuntu image please. [23:15] If you can't, then you'll need to figure out how it's different from the OS your provider gives you. [23:16] https://rangerovers.pub/static/out.txt [23:17] rbasak: okay, I'll grab a copy of 16.04 server and try that [23:17] rbasak: that won't be until probably wednesday though [23:18] so you might miss the fun :-D [23:32] this line works great ... until the server reboots; do i need to do something to make it permanent ? [23:34]  sudo iptables -t nat -A PREROUTING -p tcp --dport 500 -j REDIRECT --to-ports 5000 [23:37] powersj: am I right in thinking that https://github.com/CanonicalLtd/server-test-scripts/blob/master/git-ubuntu/integration_test is dead now as https://github.com/CanonicalLtd/server-jenkins-jobs/blob/master/git-ubuntu/jobs-ci.yaml no longer references it? [23:37] rbasak: I believe so [23:38] OK. I'll drop that file in my next PR then [23:38] ok [23:58] arooni: you can install the iptables-persistent package [23:58] arooni: it will take care of loading your iptables and ip6tables rulesets on boot