/srv/irclogs.ubuntu.com/2018/08/07/#ubuntu-server.txt

cryptodanTJ-: I found a stable version of Ubuntu Server that doesn't present the aacraid error02:14
TJ-cryptodan: a different release?02:15
cryptodanthe kernel is Linux capricorn 3.13.0-24-generic #47-Ubuntu SMP Fri May 2 23:31:42 UTC 2014 i686 i686 i686 GNU/Linux02:15
cryptodanLinux capricorn 3.13.0-24-generic #47-Ubuntu SMP Fri May 2 23:31:42 UTC 2014 i686 i686 i686 GNU/Linux02:16
cryptodanDISTRIB_DESCRIPTION="Ubuntu 14.04.5 LTS"02:17
cryptodanI also found that the bug goes all the way back to CentOS 5 on kernel 2.602:18
TJ-ouch!02:19
TJ-this is what happens when devs mess with code for newer devices and don't ensure older devices aren't upset02:20
TJ-There's been a lot of that in the kernel in recent years02:20
cryptodanalso found a validated version of Red Hat for the server doesn't boot from the CD02:20
TJ-DevOps contagion02:20
cryptodanit stalls at loading kernel02:20
cryptodanlet it sit over night and no boot02:21
TJ-"loading kernel" is a boot loader message, if that's what you mean02:21
TJ-so the kernel doesn't start executing?02:21
TJ-it should go "loading kernel" ... "loading initrd" then kernel starts and you see its messages02:22
cryptodanno dmesg02:22
TJ-right, so hand-over failed. that can happen if the firmware e820 memory map confuses the boot loader02:24
cryptodanid expect a validated and supported OS per Dell would boot up02:25
cryptodanbut I posted my system specs on that one bug report https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1777586 that I found a stable system setup for people to try02:28
ubottuLaunchpad bug 1777586 in linux (Ubuntu Bionic) "Ubuntu Server 18.04 LTS aacraid error" [High,Confirmed]02:28
=== neel is now known as Guest16758
neellHI03:30
neellneed some help on package installation that is failing on my ubuntu server 18.04.1. i suspect it's because of the repository03:31
neellErr:1 http://archive.ubuntu.com/ubuntu bionic/main amd64 cpp-7 amd64 7.3.0-16ubuntu303:31
neell  Connection failed [IP: 91.189.88.149 80]03:31
neellErr:2 http://archive.ubuntu.com/ubuntu bionic/main amd64 gcc-7 amd64 7.3.0-16ubuntu303:31
neell  Connection failed [IP: 91.189.88.152 80]03:31
neellE: Failed to fetch http://archive.ubuntu.com/ubuntu/pool/main/g/gcc-7/cpp-7_7.3.0-16ubuntu3_amd64.deb  Connection failed [IP: 91.189.88.149 80]03:31
neellE: Failed to fetch http://archive.ubuntu.com/ubuntu/pool/main/g/gcc-7/gcc-7_7.3.0-16ubuntu3_amd64.deb  Connection failed [IP: 91.189.88.152 80]03:31
neellE: Unable to fetch some archives, maybe run apt-get update or try with --fix-missing?03:31
neellany idea folks?03:31
TJ-neell: no route to host? firewall?03:32
TJ-neell: can you connect to those URLs with curl/wget/w3m/lynx ?03:32
neellsome packages were installed successfully, after 4/5 tries03:32
neellfor this one, its showing 63% completed and then failing03:33
TJ-Connection Failed means the TCP connection broke03:33
TJ-could be a MITM and/or proxy03:33
neellok, i am running the same my command again. its showing now 63% waiting for header03:34
neellgot this now:03:35
neellErr:1 http://archive.ubuntu.com/ubuntu bionic/main amd64 cpp-7 amd64 7.3.0-16ubuntu303:35
neell  Connection failed [IP: 91.189.88.161 80]03:35
neell63% [Waiting for headers]03:35
neelland the same error as above03:36
TJ-I can connect to those IP addresses fine03:36
TJ-check the route with "tracepath 91.189.88.149", try a ping , see if there's any packet loss or variable latency03:36
neellsomething wrong from my end. i am kind of newbee on Ubuntu commands03:37
neelli am giving u tracert result03:39
neellhttp://paste.ubuntu.com/p/q3GjsG28WG/03:39
TJ-You're on Guam? you ought to set a mirror in Hong Kong03:41
neellok03:41
neellso u want me to connect to hk mirror instead and install package from there?03:41
neellhow do i change repo to my closest one and install package from there?03:43
TJ-neell: well, it'd likely be more reliable, you'd avoid the level-3 HK>London link which causes a lot of latency03:43
TJ-The best mirror would be https://launchpad.net/ubuntu/+mirror/mirror.xtom.com.hk-archive03:44
neellok03:44
neellhow do i change it to that?03:44
TJ-You'd edit /etc/apt/sources.list and match the info given on that web page03:44
neellany command to do that?03:45
TJ-use the "Display sources.list entries for" choose Bionic and it shows you what should be in sources.list03:45
TJ-neell: any text editor, using sudo because the file is owned by root03:45
TJ-neell: if you use vim, it'd be "sudo vim /etc/apt/sources.list"03:45
neellyes, i can see the repo list03:47
neellso i need to create the hk mirror at the beginning of the file?03:47
TJ-replace every archive.ubuntu.com with mirror.xtom.com.hk03:48
neellok03:49
TJ-I noticed apt allows editing now, you can do "sudo apt edit-sources"03:50
neelldeb http://mirror.xtom.com.hk/ubuntu bionic main03:54
neelldeb http://mirror.xtom.com.hk/ubuntu bionic-security main03:54
neelldeb [arch=arm64,ppc64el,amd64] http://mariadb.mirror.digitalpacific.com.au/repo/10.3/ubuntu bionic main03:54
neelldeb [arch=ppc64el,arm64,amd64] http://sfo1.mirrors.digitalocean.com/mariadb/repo/10.3/ubuntu bionic main03:54
neelldeb http://mirror.xtom.com.hk/ubuntu bionic-updates main03:54
neellthis is now the list03:54
neellis it ok?03:54
TJ-neell: I don't think they mirror security, usually we set that to security.ubuntu.com so you get updates immediately03:58
neellso only the "bionic main" will be updated to HK mirror?03:59
TJ-neell: oh, they do mirror it... just the mirror is delayed a few hours compared to security.ubuntu.com03:59
TJ-neell: try the HK server out for those you've just shown03:59
neellok03:59
TJ-neell: you can always change back if there's no improvement03:59
TJ-neell: "sudo apt update"03:59
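The sources.list edit discussed above (replace every archive.ubuntu.com with the mirror host, then refresh) can be sketched as a small shell helper. This is a minimal sketch, not something from the channel: `switch_mirror` is a hypothetical function name, it assumes GNU sed (`sed -i`), and it only rewrites lines that use exactly `archive.ubuntu.com` as the host, so security.ubuntu.com and third-party entries such as the MariaDB mirrors are left alone.

```shell
#!/bin/sh
# switch_mirror FILE MIRROR_HOST
# Hypothetical helper: point an APT sources file at a different mirror.
# Assumes GNU sed; keeps a backup next to the original file.
switch_mirror() {
    file="$1"      # e.g. /etc/apt/sources.list
    mirror="$2"    # e.g. mirror.xtom.com.hk
    # Keep a copy so the change is easy to revert.
    cp -p "$file" "$file.bak" || return 1
    # Rewrite only the archive.ubuntu.com host; other hosts stay untouched.
    sed -i "s|//archive\.ubuntu\.com/|//$mirror/|g" "$file"
}
```

Run as root, something like `switch_mirror /etc/apt/sources.list mirror.xtom.com.hk` followed by `apt update` would apply the change; copying the `.bak` file back reverts it.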
neellGet:5 http://mirror.xtom.com.hk/ubuntu bionic/main amd64 Packages [1,019 kB]04:00
neellGet:5 http://mirror.xtom.com.hk/ubuntu bionic/main amd64 Packages [1,019 kB]04:00
neellGet:5 http://mirror.xtom.com.hk/ubuntu bionic/main amd64 Packages [1,019 kB]04:00
neellGet:5 http://mirror.xtom.com.hk/ubuntu bionic/main amd64 Packages [1,019 kB]04:00
neellGet:5 http://mirror.xtom.com.hk/ubuntu bionic/main amd64 Packages [1,019 kB]04:00
neellGet:5 http://mirror.xtom.com.hk/ubuntu bionic/main amd64 Packages [1,019 kB]04:00
neell30% [5 Packages 0 B/1,101 B 0%]04:00
neelland lots of these lines04:00
TJ-!paste | nell04:01
ubottunell: For posting multi-line texts into the channel, please use https://paste.ubuntu.com | To post !screenshots use https://imgur.com/ !pastebinit to paste directly from command line | Make sure you give us the URL for your paste - see also the channel topic.04:01
neellok, sure.04:01
cpaelzergood morning04:01
neellseems like its downloading...04:01
neellbut still 30%04:01
TJ-neell: it could point to a problem with your connection04:02
TJ-between you and HK at least04:02
neellok. any idea how can i solve it04:02
TJ-neell: unless it's on your network, complain to your ISP :)04:03
neellbefore, it would take few seconds to finish command sudo apt update04:03
TJ-neell: what have you changed on the server, or on the network, recently?04:03
neellno04:03
neelli was able to install maria-db from au mirror, it was i guess 183MB04:04
neellnow, while installing packages of 140MB, it was giving error04:04
TJ-neell: I wonder if your ISP has deployed a transparent proxy04:05
neellok, i got an error at last04:05
neellhttps://paste.ubuntu.com/p/xk43GW7YdR/04:05
TJ-neell: you might get a clue looking at the HTTP headers with "wget -S -O /dev/null http://mirror.xtom.com.hk/ubuntu"04:06
neellhttps://paste.ubuntu.com/p/7sKQ6ch4dB/04:10
TJ-neell: hahah "X-Custom-Job: If you see this header, please contact hello@xtom.com for a job"04:11
neellhmm04:12
neellok, so it looks good?04:12
TJ-neell: no tell-tale signs of an HTTP proxy, but there could be a transparent proxy04:12
neellcan I try to switch to an US mirror?04:13
TJ-neell: but we saw it only got about 1100 bytes when it should have been much more, so I think the connections are being broken on the link between you and HK04:13
neellok, got it. may be I can try some other mirrors?04:14
TJ-you could but I doubt it'll help, looks like all your traffic goes via HK04:15
neellwhere can I get mirror list?04:18
TJ-neell: https://launchpad.net/ubuntu/+archivemirrors04:21
neellok, thanks. i will check04:22
neelllooking for help here.... I am trying to install Webmin07:34
neellgetting below error07:34
neellon Ubuntu Server 18.04.107:35
neellhttps://paste.ubuntu.com/p/D873FM5BXJ/07:35
neellit gives me this07:36
neellThe following packages have unmet dependencies:07:36
neell webmin : Depends: libauthen-pam-perl but it is not installable07:36
neell          Depends: apt-show-versions but it is not installable07:36
neellE: Unable to correct problems, you have held broken packages.07:36
neelland when I am trying to install the package "sudo apt-get -f install libauthen-pam-perl07:38
neell"07:38
neellit gives me this:07:38
neellPackage libauthen-pam-perl is not available, but is referred to by another package.07:38
neellThis may mean that the package is missing, has been obsoleted, or07:38
neellis only available from another source07:38
neellE: Package 'libauthen-pam-perl' has no installation candidate07:38
neellanyone can help on this?07:39
tafa2So I'm getting started on my first production deployment - there's going to be around ~27 production servers and I was looking into some sort of server management tool like SaltStack10:21
tafa2Anyone using it and love it? Would anyone recommend anything else?10:21
RoyKI use ansible10:22
RoyKthere's quite a few different tools to choose from, though - chef, puppet, ansible, cfengine, ansible, saltstack etc etc10:22
RoyKansible twice in that list - oh well - I like ansible :D10:23
tafa2thanks RoyK I'm looking at them all - trying to see which one has the best free offering :)10:25
RoyK"free offering"?10:36
RoyKiirc they're all open source10:37
tafa2indeed - I mean which one has the best open source option/easiest to use10:42
RoyKthat mostly depends on your own preferences ;)10:47
thegoat_so i seem to have an issue with slow dns query response. when i try to ping something like www.google.com it'll take like 6 seconds for it to do the dns lookup, then it'll start pinging.  but if i do a dns query with nslookup it finishes quickly.  i had read some stuff about disabling tx and rx offloading or something like that, but disabling it as per the suggestions didn't work.  not sure where to look12:24
compdocits a vm?12:26
thegoat_yes12:26
compdockvm?12:27
thegoat_esxi 6.512:27
TJ-thegoat_: I've seen that when there's an IPv6 record being returned for the hostname too. Try "dig www.google.com" see what is returned12:31
thegoat_n/m fixed it. it was an id10t error12:35
tewardrbasak: powersj: either of you around so I can bounce a question off of you?14:27
tewardprobably rbasak moreso since he's been around throughout nginx becoming a main package :P14:27
tewardrbasak: i think this was discussed, but would there be a problem with the non-LTS interim releases tracking NGINX Mainline instead of NGINX Stable so the interim releases can work with newer features/etc. that won't be available until next LTS in the Stable NGINX branches?14:29
tewardI think we had said there wouldn't be, but I forget the original conversation and what came from it.  (It's also why I emailed the list, but apparently I'm still stuck in the moderation queue, so someone needs to poke the mailing list admins)14:29
tewardTJ-: thanks for the assist yesterday, I've done some revisions to the package I was working on, and I think I am almost at the point where this can be tested for that LP bug requesting a daemon-only package.  Just gotta wait for the PPA builders to finish uploading to run some tests myself...14:44
tewardwithout the assist and guidance I'm not sure I'd have gotten this all solved.14:44
TJ-teward: :) it often helps to have a pair of unrelated eyes14:48
=== jamespage is now known as JamesPage
=== JamesPage is now known as jamespage
axisysreplaced the bad disk with a new disk and smartctl does not recognize it15:46
axisyshttps://dpaste.de/iThm/raw15:46
axisysany suggestion?15:46
axisysthe corrupted disk that were replaced were visible by smartctl.. so backend is fine. so possibly the new disk is bad?15:50
tomreynthis disk was hot swapped?15:50
tomreyni assume it is not 600 peta bytes?15:51
tomreynis /dev/sdb a proper device node still?15:52
TJ-looks like the hot-swap... didn't :)15:59
tomreynyes, this looks like live transplant without sedation15:59
tomreynmaybe you can:  for host in /sys/class/scsi_host/host*; do echo "[ Rescanning ${host##*/} ]"; echo "- - -" | sudo tee -a $host/scan 1>/dev/null; sleep 1; echo; done16:00
tomreynbut a reboot seems a good idea.16:00
tomreynaxisys: ^ still with us?16:00
axisysTJ-: yes, hot swap16:01
axisysok .. let me scan16:01
TJ-axisys: are the disk in a chassis? the messages look like there is some intermediary hardware/firmware between PC and disk16:03
axisyscool.. scan did the trick16:03
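For readability, the rescan one-liner above can be written out as a function. This is a sketch under assumptions: `rescan_scsi_hosts` is a hypothetical name, and the optional directory argument exists only so the loop can be tried against a scratch directory instead of the real `/sys/class/scsi_host`. Writing to the real sysfs files needs root.

```shell
#!/bin/sh
# rescan_scsi_hosts [SYSFS_DIR]
# Hypothetical helper: ask every SCSI host adapter to rescan its bus.
# SYSFS_DIR defaults to /sys/class/scsi_host; override it only for dry runs.
rescan_scsi_hosts() {
    root="${1:-/sys/class/scsi_host}"
    for host in "$root"/host*; do
        # Skip if the glob matched nothing or the host has no scan file.
        [ -e "$host/scan" ] || continue
        echo "Rescanning ${host##*/}"
        # "- - -" is a wildcard for channel, target and LUN.
        echo "- - -" > "$host/scan"
    done
}
```

As in the channel, a rescan only makes the kernel notice the new disk; it does not undo an unclean removal, which is why a reboot was still suggested.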
tomreynbe sure to find out how to hot swap properly for the future. this was not a healthy operation.16:04
TJ-!cookie | tomreyn16:05
ubottutomreyn: Wow! You're such a great helper, you deserve a cookie!16:05
tomreynwow, now i get cookies for blaming people, sweet. ;-)16:05
tomreynaxisys: don't take me too seriously, good luck there. i'd still want to reboot it soon.16:06
axisystomreyn: why reboot? (learning)16:07
axisystomreyn: curious on hot swap properly ..16:07
tomreynaxisys: well you ripped this disk out while the controller was still accessing it, or thinking it was still there all the time. your dmesg will be full of errors, and a couple things may still be in an inconsistent state.16:08
axisyshow do I tell controller to stop accessing the disk? thanks for your help!16:09
tomreynaxisys: how to hot swap properly will be documented in your server operators manual. but i'd always announce removal via software before the fact.16:10
tomreyndon't do this now: echo 1 > /sys/block/sda/device/delete16:10
axisystomreyn: right.. documenting ..16:10
tomreynobviously you want to unmount everything from there beforehand16:11
axisysok /dev/sdb is not in use anywhere16:11
tomreyni'd "eject", too, just in case16:12
tomreynalso not now16:12
axisyseject would eject cdrom.. no?16:12
axisysunderstand...16:12
axisyssystem is running off of /dev/sda .. right now.. so no harm either way.. but understood16:13
axisystomreyn: so once I swap the disk just scan it, right?16:14
tomreynaxisys: yes, that's usually enough afterwards. the important thing is to prepare for removal properly16:15
tomreynaxisys: and most of all not all hardware has hot swap capability16:15
axisysright16:15
tomreynaxisys: you need the controller, firmware, and OS to support it. and whatever else might sit between controller and storage.16:16
tomreynso start by reading your controller / mainboard / server manual16:17
axisysok.. I tested on another system exact same hardware sun fire x2250 .. and it worked.. first stop accessing, swap out and then scan.. awesome! saving the steps in my wiki.. thank you!16:17
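The "stop accessing, swap out, then scan" order can be sketched as a pre-removal check. This is only an illustration with hypothetical helper names (`disk_in_use`, `prepare_removal`); it looks at mounts and md membership only, so swap, LVM or anything else stacked on the disk is not detected, and whether the hardware supports hot swap at all still has to come from the server manual.

```shell
#!/bin/sh
# disk_in_use DISK [MOUNTS_FILE] [MDSTAT_FILE]
# Hypothetical helper: succeeds if DISK (e.g. sdb) is mounted or an md member.
# The file arguments default to /proc/mounts and /proc/mdstat.
disk_in_use() {
    disk="$1"
    mounts="${2:-/proc/mounts}"
    mdstat="${3:-/proc/mdstat}"
    # Mounted partitions appear as /dev/sdb1, /dev/sdb2, ... in column one.
    grep -Eq "^/dev/${disk}[0-9]* " "$mounts" && return 0
    # md members appear as sdb1[1], sdb2[2](S), ... in /proc/mdstat.
    grep -Eq "(^| )${disk}[0-9]*\[" "$mdstat" 2>/dev/null && return 0
    return 1
}

# prepare_removal DISK
# Refuses to continue while the disk is in use, then tells the kernel to
# drop the device (the sysfs "delete" write mentioned in the channel).
prepare_removal() {
    disk="$1"
    if disk_in_use "$disk"; then
        echo "$disk is still mounted or part of an md array; not removing" >&2
        return 1
    fi
    sync
    echo 1 > "/sys/block/$disk/device/delete"
}
```

After `prepare_removal sdb` succeeds (as root), the disk can be pulled and the replacement picked up with a SCSI host rescan.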
tomreynanother computer museum?16:18
tomreyn:)16:18
tomreynyou'll pay a lot of power for those. might be worth replacing them by half as many current systems some day.16:19
tomreynor actually a third16:19
axisysyes.. those are from part.. recently most of our servers hp dl360 or dl380s16:21
axisysfrom past*16:21
tomreyncurrent gen hp is fine, as long as you have a support contract.16:22
tomreynand dont need to stack up fast...16:22
tomreyn^ personal opinion / experience, i'm not affiliated with canonical16:24
rbasakteward: what would happen if we tracked mainline before LTS-1 and we didn't get a stable release by LTS?17:05
tewardrbasak: NGINX stable releases are always cut from mainline around the time we release17:06
tewardfor 16.04 it was cut same-day as release and we did a version-string-only SRU with the Release team's approval post release17:06
tewardfor 18.04 it came out same week as FinalFreeze but I was able to get that in right before the freeze went into effect (same-day)17:06
tewardrbasak: if we track Mainline up to release date, then the delta between Mainline and Stable when it's cut is extremely minimal, and the past several cases we've run into this we really didn't have to do any feature changes, etc. just the version string revisions17:07
tewardby the time of NGINX Stable cut which is about when we release LTS, it would be most likely a trivial post-release version-string-only change SRU with no new 'features' by the release date17:08
tewardrbasak: basically it'd mirror what we had for 16.04, or this past time for 18.04.17:08
tewardthe other problem we're going to face though rbasak...17:09
tewardif we don't give the 'newer versions' people are going to become 100% dependent on the PPA for the "new features"17:09
tewardat which point the question is "why do we bother updating nginx in the repos then?" (to the non-informed user, that is)17:09
rbasakteward: what I mean is: we should bump to mainline unless the following stable is already scheduled to be release before freeze for Ubuntu's following LTS.17:09
rbasakwe *shouldn't*17:09
tewardrbasak: and that's the 'problem'17:09
tewardrbasak: it's always just before or just after our release date17:09
tewardconsistently falls around the same week or two, and they don't give firm date releases17:10
tewardrbasak: i have no issues keeping it at 'stable'17:10
tewardbut people are going to complain heavily, I guarantee it.17:10
rbasakIt's generally OK to bump to final stable, even in an SRU, if the changes are minimal (eg. just a version string bump, or a few bugfixes), since those changes qualify for SRU anyway.17:11
rbasakHowever, it risks pain.17:11
rbasakSo it depends on you I think. By Ubuntu policies we can do it.17:11
tewardit's actually less pain to bump to latest Mainline and switch it to nginx stable, because a large portion of the 'fixes' and changes to spec of HTTP/2 and such happen in Mainline17:11
rbasakBut, as I don't particularly want to commit Canonical's time to back that up, I'd prefer to stick to nginx stable consistently.17:11
tewardthis is why i posted to the ML17:12
tewardbut unless you can release that, it's stuck in limbo for eternity17:12
teward(read: mod queue)17:12
rbasakOh17:12
* rbasak looks at the mod queue17:12
rbasakAccepted17:13
tewardrbasak: To be fair, I tell people to use the PPA if they want the "latest and greatest" anyways, but the reason I'd like more feedback is because MaaS people or other departments might want to see whether their stuff works in the newer releases, etc.17:13
tewardrbasak: as for *my* workload it doesn't change17:13
tewardi have to keep both NGINX Stable and NGINX Mainline uptodate and working in two PPAs anyways, so17:13
tewardthe other problem is Debian17:13
tewardbecause they track Mainline usually most of the time17:13
teward... though they are far behind at this point, last thing they did was in april17:14
rbasakIt's the risk of work, I think. If nginx stable releases late and with feature changes, leaving us in a pickle if we've released our LTS pre-stable-release.17:14
teward(E:UnmaintainedInDebian?)17:14
rbasakSince we need to decide on that in advance, I think it depends on our relationship with upstream.17:15
rbasak(and on how much we need it)17:15
tewardwhere 'upstream' means nginx in this context?17:16
tewardand not Debian17:16
rbasakYes17:17
tewardrbasak: the only reason I am hesitant to track only Stable is because Stable is only supported officially upstream for a year17:18
tewardthat is, until the next Stable cut from Mainline17:18
tewardand we're going to have that problem with Mainline either way, because that's only good for a year before they drop official support for it17:18
tewardthe remaining 'bug fixes' are either nitpicked or microreleased as needed for substantial ones17:18
tewardand security patches need backported either way (but Security Team takes care of that for the most part)17:19
rbasakStable is a better fit for stable distribution releases I think?17:19
tewarddo we consider the interim releases "stable distribution releases" though as we only support them for 9 months?17:19
rbasakI don't see how mainline would be better to have in the distribution from a length of support perspective.17:19
tewardrbasak: wait until TLS1.3 is a thing?17:19
rbasakYes. The SRU policy applies equally on non-LTS releases.17:20
rbasakAnd our stability promise is roughly the same.17:20
rbasakIf anything, LTS is less stable, because we do HWE and occasionally feature enhancements in them.17:20
rbasak(because they have an extended life it's more necessary to do that)17:20
tewardy'know it sucks I can't search the mailing list archives easily17:23
tewardrbasak: i found a thread in the list about this, back from 2015...17:26
tewardsarnold: you were the last to reply to it heh17:26
tewardand it establishes the precedent that was used for 14.10 through 15.10 and then established the 16.04 changes.  https://lists.ubuntu.com/archives/ubuntu-server/2015-June/007075.html https://lists.ubuntu.com/archives/ubuntu-server/2015-June/007076.html17:28
tewardnot sure if that opinion still stands17:28
tewardnot sure this has to be determined today, we could wait for replies to my message to the list you just released, rbasak17:28
tewardI ultimately don't care either way, but you still have to realize that every x.04 release is going to run into the same problem with the current 'release schedule' that NGINX has.17:29
tewardeven if we stick to stable.17:29
=== kallesbar_ is now known as kallesbar
axisysbefore rsync completes .. sda disappeared ..18:55
axisyssdb is the new disk as part of md0 (sda1,sdb1) and md1(sda2,sdb2)18:55
axisys# ls -al fstab18:56
axisys-rw-r--r-- 1 root root 1113 Jul  2  2012 fstab18:56
axisys# cat fstab18:56
axisyscat: fstab: Input/output error18:56
axisysyikes!18:56
ahasenackwas that a striped raid?19:00
axisysahasenack: raid119:00
ahasenackso why did the raid fail if sdb was still there?19:01
axisyssdb is the new disk to replace bad sdb19:01
axisyssmartctl was saying FAILING and replace it now ..19:02
ahasenackyou had a failure during the raid rebuild?19:02
rangergordHiya. What would happen if I create a Virtualbox VM with Ubuntu Server on it, including the ubuntu-desktop package for GUI, then copy this VM to a headless server? Would Ubuntu still start and run the background services?19:02
axisysahasenack: yes :-(19:02
rangergordI don't yet have a headless server to try it out for myself19:03
ahasenackaxisys: yep, I heard that can happen19:03
ahasenackrangergord: it will still have a video card, right?19:03
rangergordahasenack: condolences19:03
rangergordahasenack: I'm not sure. Let's say it doesn't. What happens then?19:03
ahasenackrangergord: I don't think a PC boots without a video card19:03
ahasenackbut could be wrong19:04
rangergordpretend it's a typical 1U server. if it needs a gpu to boot, then sure, there's one.19:04
ahasenackrangergord: anyway, I would expect it to try to start X as usual and present the login greeter. If X failed (no driver), that would be ok. What wouldn't be running is services that run after a desktop user logins19:04
ahasenackall that UI stuff19:05
rangergordahasenack: good enough for me! thanks.19:05
ahasenackaxisys: that's why I hear that two disk redundancy is advised. If possible ($$), of course19:05
ahasenackthe rebuild stress can make the last disk fail19:06
ahasenacks/last/remaining/19:06
rangergordalso never forget backup :)19:06
* ahasenack ponders about adding a 3rd disk to his 2-disk mirror19:07
ahasenackI actually have one, but was considering it a spare19:07
rangergordcould make it a differential backup of the mirror19:07
rangergord*mirrored disks19:07
rangergordI got myself a Synology recently for home server use. Didn't want to invest the time to learn everything.19:08
rangergordSynology is just Linux with a GUI slapped on it19:09
ahasenackI have a synology19:10
ahasenackwhat I disliked about it is that they have their own patches on top of btrfs19:10
ahasenackI can't btrfs send/recv to/from it19:10
ahasenackfrom a linux box19:10
ahasenackso now I have an old desktop with zfs to backup that nas and other stuff19:11
ahasenackalmost headless :)19:11
axisysit is possible raid build completed before sda giving up.. but I am not 100%19:16
axisysahasenack: not sure how to confirm19:16
ahasenackis it still rebuilding?19:16
ahasenackcheck /proc/mdstat19:16
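As a rough aid for reading /proc/mdstat, the status brackets at the end of each array's "blocks" line show one character per member slot: `U` for a member that is up and `_` for a missing one. The sketch below lists arrays with at least one `_`; `degraded_arrays` is a hypothetical helper name, and the optional file argument is only there so it can be tried on a saved copy of mdstat.

```shell
#!/bin/sh
# degraded_arrays [MDSTAT_FILE]
# Hypothetical helper: print the names of md arrays that have a missing
# member, judged by "_" in the [UU_]-style status field.
degraded_arrays() {
    awk '
        # Array header line, e.g. "md1 : active raid1 sdb2[2](S) sda2[0]"
        /^md[0-9]+ :/ { name = $1 }
        # Status line, e.g. "243801976 blocks super 1.2 [3/1] [U__]"
        /\[[U_]+\]/ && name != "" {
            if ($NF ~ /_/) print name
            name = ""
        }
    ' "${1:-/proc/mdstat}"
}
```

This says nothing about whether a rebuild is still running; the recovery progress line in mdstat would still need to be read by eye.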
axisysahasenack: no19:16
axisysahasenack: yep19:17
ahasenackand you still have the error?19:17
axisyssda is missing19:17
ahasenackanything useful in the last lines of dmesg?19:17
=== mark-otaris is now known as Guest43597
axisysahasenack: https://dpaste.de/Ayvu/raw19:17
ahasenackand, is sda really dead?19:17
axisysahasenack: yes19:17
axisysfile-system is readonly19:17
axisysreadonly now*19:18
ahasenackwhat fs is on top of that?19:18
axisysext419:18
axisysso md0 completed .. but md1 is the large disk and not sure if completed19:19
ahasenackI don't know19:21
ironpillowhi all, I have a ubuntu 18.04 LTS server box (quad core, 8GB memory) as a router and have two access points. I am running dnsmasq as dhcp and dns server. I have around 30 wifi devices connecting to the network. For some reason, the ubuntu box randomly loses internet. I can't ping 8.8.8.8 or google.com. But as soon as I reboot the ubuntu box, it works. I don't change anything. Any advice? really appreciate it19:31
ahasenackironpillow: that's very generic, sorry. It could be a million things19:33
ironpillowahasenack: yeah. I am not able to figure it out. syslog is not showing anything in particular. dmesg only shows one error perf: EDAC pnd2: Failed to register device with error -22.19:35
ironpillowahasenack: do you know if /etc/resolv.conf is automatically rewritten. I ask because, there is a bug in 18.04 and I have to re-write resolv.conf manually every time system is rebooted.19:35
ahasenackit's a generated file, yes. Changes will be lost19:35
ahasenacknetworks coming and going could trigger an update to resolv.conf19:36
axisysadded a disk on same slot where sda was.. it came up as sdc .. server still up ..19:41
axisyssdc1 is added to md0 fine, no complain.. but failing to add sdc219:44
axisys# mdadm /dev/md1 --add /dev/sdc219:44
axisysmdadm: cannot load array metadata from /dev/md119:44
tomreynwell md1 thinks its active device is sda2. but you removed this, apparently uncleanly.19:45
ironpillowahasenack: so it might be re-written on a running system?19:46
tomreynyou'll need to mdadm --fail /dev/sda2, just telling mdadm about the facts, i guess. and probably delete the sda scsi device, too19:46
tomreyn(or sata)19:47
axisystomreyn: no .. it was failing and removed itself while  rebuilding19:47
ahasenackironpillow: nowadays, actually, I think it will stay put, with just the entry to 127.0.0.53. It's the resolver at 127.0.0.53 that gets reconfigured19:47
tomreynaxisys: so those dmesg records are old?19:47
axisys mdadm --fail /dev/sda219:47
axisysmdadm: error opening /dev/sda2: No such file or directory19:47
tomreynsorry, wrong usage19:48
axisystomreyn: that triggered I think during raid1 rebuild with sdb19:48
axisyslet me paste current /proc/mdstat19:49
ironpillowahasenack: I have to change the entry to 127.0.0.1 for internet to work.19:49
ahasenackfrom 127.0.0.53?19:49
ironpillowyes19:49
mike-zal2have someone already updated 16.04 to 18.04?19:49
axisyshttps://dpaste.de/OHfb/raw19:50
tomreynaxisys: actually it was the correct usage19:50
tomreynsorry about my confusion19:50
axisysyes /dev/sda seems disappeared19:51
tomreynaxisys: well it's still in mdstat19:52
axisysis there a way I can force in sdc2 into md119:52
ironpillowahasenack: https://bugs.launchpad.net/ubuntu/+source/systemd/+bug/162432019:54
ubottuLaunchpad bug 1624320 in systemd (Ubuntu) "systemd-resolved appends 127.0.0.53 to resolv.conf alongside existing entries" [Low,Confirmed]19:54
tomreynyou can "mdadm --remove /dev/sda2" now19:54
ironpillowso does systemd-resolved append 127.0.0.53 at random times on a running system?19:54
ahasenackno19:55
ahasenackI don't know what happens in an upgrade from < 18.0419:55
tomreynaxisys: and you can "mdadm --add /dev/md1 /dev/sdc2"19:55
ahasenackbut a fresh 18.04 will use just 127.0.0.5319:56
axisystomreyn: # mdadm /dev/md1 --add /dev/sdc219:56
axisysmdadm: cannot load array metadata from /dev/md119:56
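For reference, mdadm's manage-mode commands take the array first and the member second, and the mdadm(8) man page documents the keyword `detached` for members whose device node has disappeared. The sketch below strings those together; `replace_md_member` is a hypothetical helper name, and setting `MDADM=echo` (an assumption of this sketch, not an mdadm feature) prints the arguments instead of running anything. Whether this would have helped in this case is unclear, since mdadm could not load the array metadata at all.

```shell
#!/bin/sh
# replace_md_member ARRAY NEW_MEMBER
# Hypothetical helper: drop vanished members from an md array, then add a
# replacement. Set MDADM=echo to print the arguments instead of running mdadm.
replace_md_member() {
    array="$1"       # e.g. /dev/md1
    new="$2"         # e.g. /dev/sdc2
    mdadm="${MDADM:-mdadm}"
    # "detached" refers to members whose device is no longer present,
    # so the missing /dev/sda2 does not have to be named.
    $mdadm "$array" --fail detached &&
    $mdadm "$array" --remove detached &&
    $mdadm "$array" --add "$new"
}
```

A dry run with `MDADM=echo replace_md_member /dev/md1 /dev/sdc2` shows the three argument lists without touching the array.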
ironpillowahasenack: yes, I am not able to ping or access internet. I have dnsmasq installed.19:59
ironpillowand my dns server is at 192.168.2.220:00
ironpillowsorry: my dns server dnsmasq is listening on 192.168.2.220:01
ironpillowbut forgetting about dns, ping 8.8.8.8 also is not working when resolv.conf has 127.0.0.53. it only works after changing it to 127.0.0.120:02
teward`ping 8.8.8.8` doesn't use DNS at all20:03
tomreynaxisys: hmm that's quite unfortunate. the array metadata explains how the data is aligned on the raid devices. if this is missing... there's no way to interpret it.20:03
tewardso if `ping 8.8.8.8` just doesn't work that's probably a different issue20:03
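To separate the resolver question from the connectivity question, it can help to look at which nameservers resolv.conf actually lists, since a `ping` to a literal IP address never consults them. A minimal sketch with hypothetical helper names (`list_nameservers`, `uses_resolved_stub`); the file argument defaults to /etc/resolv.conf and is overridable only for testing on a copy.

```shell
#!/bin/sh
# list_nameservers [RESOLV_CONF]
# Hypothetical helper: print the nameserver addresses from a resolv.conf file.
list_nameservers() {
    awk '$1 == "nameserver" { print $2 }' "${1:-/etc/resolv.conf}"
}

# uses_resolved_stub [RESOLV_CONF]
# Succeeds if lookups are sent to the systemd-resolved stub at 127.0.0.53.
uses_resolved_stub() {
    list_nameservers "$1" | grep -qx '127\.0\.0\.53'
}
```

If `uses_resolved_stub` succeeds, name lookups go through systemd-resolved rather than straight to dnsmasq; a hanging `ping 8.8.8.8` is a routing or link problem either way.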
ironpillowteward: yeah, that's what confusing. ubuntu just stops working. I have to reboot in order for ping 8.8.8.8 to work20:04
ahasenackironpillow: does ping get stuck?20:08
ahasenackor what does it complain about?20:08
tomreynaxisys: you can try to --grow --raid-devices=2 /dev/md1   (but i assume it will fail with the same error)20:08
ironpillowahasenack: it gets stuck20:09
ironpillowdoesn't complain or anything.20:10
axisyslet me check20:12
axisys# mdadm --grow --raid-devices=2 /dev/md1 --add /dev/sdc220:13
axisysmdadm: /dev/md1: no change requested20:13
axisystomreyn: ^20:13
tomreynaxisys: can you show a current paste of the same info as before?20:14
axisystomreyn: https://dpaste.de/6L5P/raw20:17
tomreynaxisys: so do you still have file systems mounted on top of md1?20:20
axisyssystem is still up .. but readonly and sometimes even worse like here20:22
tomreynyou should not, so be sure to go to single user mode and unmount / disable anything that's on top of md120:22
axisys# ls -al fstab20:22
axisys-rw-r--r-- 1 root root 1113 Jul  2  2012 fstab20:22
axisys# cat fstab20:22
axisyscat: fstab: Input/output error20:22
tomreyni assume your OS is on md0?20:22
axisysmd0 is /boot20:23
tomreynso the Os is on md1?20:23
tomreynright md0 is too small to be /20:23
axisyshttps://dpaste.de/YrWm/raw20:23
tomreynlsblk would maybe answer my question20:24
axisyshaha ..20:24
tomreynbut i think this is a lost cause, rebuild system, restore backups20:24
axisys# lsblk20:24
axisysbash: /bin/lsblk: Input/output error20:24
tewardassuming there are backups, of course.20:25
tomreynwell, we always assume that ,right?20:25
tewardafter dealing with users on Ask Ubuntu for a couple years I lost hope that there're backups held by [Insert User Looking for Help Here]20:25
tewardbut you're not wrong20:25
tewardwe should always assume there's backups :P20:26
tomreynaxisys: so in the hopefully very unlikely case that there are NO backups: you could dd sdb2 to some other device, then boot to some recovery system and run mdadm against sdb2, creating a new RAID-1 array with a single active device20:31
tomreynand then see if there is data on there that you can recover.20:32
tomreynhow to recover this data will depend on which block device layers you had on top of md1, you would need to recreate those there as well20:34
axisysah.. so dd if=/dev/sdb2 of=/dev/sdc2; (no backup)20:34
tomreynyes, if sdc2 is not in use20:35
tomreyni thought we had added that to md120:35
axisysthis is one of the 6 servers to access the network.. so it is not an outage.. but I am taking this opportunity to learn to rebuild it graciously (if possible)20:35
axisystomreyn: failing to add sdc2 to md120:36
tomreynoh right there was no metadata, so it couldnt add it20:36
tomreynaxisys: also worth a try while you're still running: mdadm --grow --raid-devices=3 /dev/md120:38
tomreynbut this would also fail, i guess20:38
axisys# mdadm --grow --raid-devices=3 /dev/md120:39
axisysraid_disks for /dev/md1 set to 320:39
tomreynaxisys: any news on mdstat?20:39
axisysmd1 : active raid1 sdb2[2](S) sda2[0]20:39
axisys      243801976 blocks super 1.2 [3/1] [U__]20:39
axisys3/120:40
axisysI could try adding it again20:40
tomreynyes20:40
tomreynbut the metadata is still missing ;)20:40
axisysdoes not like it.. if I could just get metadata from where :-)20:40
axisys# mdadm /dev/md1 --add /dev/sdc220:40
axisysmdadm: cannot load array metadata from /dev/md120:40
axisysa good server with similar build20:41
axisysbuilt*20:41
tomreynwhat do you mean?20:42
axisysI wonder if I could take its metadata and place it in /boot dir since md0 is good and point to it20:42
axisysi have another good server with similar built20:42
tomreynhow far in was the resync from sda2 to sdb2 when it failed, do you know?20:42
axisysno ..20:43
tomreynthe other server wont help you20:43
axisysk20:43
tomreynyour only hope now is to try to carve off sdb2 what was copied there.20:45
tomreynor to have someone recover data off your failed disk drives20:45
axisysyeah.. if that is the case we will just rebuild.. but using this opportunity to indulge all ideas..20:47
axisysdifferent topic => is it possible to convert from RAID1 to RAID10 without data loss with HP raid controller?20:53
axisysyes I am working on another set of hp servers when one of the server, whoever intern built, did not use LVM .. so trying to find a way to expand sda .. application vendor says it has to be all in sda ..20:54
axisyss/when/where/20:54
axisysall the other servers has LVM.. so had no issue on expanding20:55
tomreynso you have hardware raid 1 (which controller?) and the capacity it provides is insufficient, but you have more spare disks?20:57
tomreyndowntime is an issue?20:57
axisystomreyn: downtime is not an issue21:00
axisystomreyn: raid1 (800G,800G) .. need to expand and bought 2 2TB ..21:00
axisysso thinking another raid1 and then strip the two raid1s21:00
axisysstripe*21:01
tomreynif downtime is not an issue i'd just backup and rebuild from scratch.21:01
tomreynif downtime is an issue, it seems to be possible to migrate https://serverfault.com/questions/545809/how-to-move-raid-1-to-raid-1021:02
tomreynhmm this is really old, better make sure it's still valid21:04
axisysk21:04
axisysthanks for the link tho..21:04
tomreynhttps://support.hpe.com/hpsc/doc/public/display?docId=emr_na-c0351025321:05
axisyscool.. online resizing .. nice21:05
tomreynbut then i dont know what hardware you have there21:05
axisysSmart Array P440ar21:08
tomreynyou apparently operate as a company or larger organization, and dont have backups for everything. i really recommend you take some time to make sure your processes are in good shape.21:09
tomreynif this can happen now, it will only get worse in the future unless you re-evaluate how you do what you do.21:09
tomreynP440ar - that's a common one.21:10
axisysthese HP servers have backup .. that old SUN FIRE was just bashtion server one of 6 which do not have backup21:10
tomreynit still semed important enough that you spent time on evaluating whether the data on raid can be restored, if partially.21:13
tomreyn*seemed21:13
tomreyni don't mean to criticise you, it's none of my business, i'm just trying to provide suggestions21:14
axisyshey.. taking all as suggestions.. appreciate the help21:15
tomreyn:)21:15
tomreynHPE SSA user guide https://support.hpe.com/hpsc/doc/public/display?docId=c0390933421:15
tomreynHPE P440ar controller quickspecs https://www.scalcom.de/ftp-import/datasheets/726736-B21.pdf21:16
tomreynactually this one https://h20195.www2.hpe.com/v2/getdocument.aspx?docname=c0434629921:17
