/srv/irclogs.ubuntu.com/2019/03/15/#ubuntu-server.txt

geigerCountertomreyn: ^00:03
mwhudsongeigerCounter: you can configure what systemd does to shut down a service00:04
geigerCounterHow do I do that beyond specifying it in ExecStop?00:04
mwhudsongeigerCounter: there is also KillMode00:05
geigerCountermwhudson: Tell me more/link me to docs?00:05
mwhudsongeigerCounter: man systemd.kill00:05
mwhudsongeigerCounter: but i think execstop is what you want here00:05
mwhudsongeigerCounter: you need your execstop script to wait for the process exit though00:06
mwhudsonhm sounds like you are trying that00:06
mwhudsonso what you are doing _should_ work, i don't know off hand why it would not be00:07
geigerCounterYeah.00:07
geigerCounterI don't either really. Today is my first day ever making a systemd unit.00:07
geigerCounterWhat happens when I use service stop is the java process just dies and then since it has no more child processes, screen exits normally. This isn't the expected behaviour.00:08
tomreyngeigerCounter: sorry, i lost track there.00:09
geigerCounterIt should push 'stop\n' to the screen and the server's shutdown sequence should begin.00:09
geigerCounterInstead of just dying.00:09
blackflowgeigerCounter: did you look at the KillMode option?00:09
geigerCounterLooking at it now on the suggestion of mwhudson00:10
blackflowalso, that's not really usual or normal way services behave. essentially you want systemd to start your screen as an oneshot "service" and forget about it. then you deal with your screen and minecraft manually.00:10
blackflowthis situation has been asked about before and the only answer is: open a bug report to minecraft devs and have them build proper service management controls into the daemon, namely respnding properly to TERM or QUIT signals.00:11
geigerCounterThere is no daemon. It was never intended to be run this way. It's intended to be run as an interactive console application using Java's console tools with stdin and stdout.00:12
geigerCounterI want to run it *as* a daemon, using screen.00:12
blackfloweven systemd, the greatest controversy since Linus began his hobby, expects services to understand these signals.00:12
geigerCounterAlthough yes, it should learn to respond to signals correctly.00:12
geigerCounter>_<00:12
blackflowgeigerCounter: perhaps you can whip up some stdin command injection from a shell script exec'd via ExecStop00:13
geigerCounterThat's basically what I'm doing using screen's "stuff" command00:13
blackflowuse KillMode=none so it doesn't care about killing unresponsive processess (which this essentially is)00:13
tomreynmaybe think of some other (more common, maybe in ubuntu) java processes which might have the same issue and see whether a better solution was found there.00:13
geigerCounterWhich allows you to "stuff" characters into a detached screen as though you were typing it in yourself.00:13
geigerCounterBut the problem is systemd isn't paying attention to what I asked ExecStop to do.00:14
geigerCounterWill try that and report back.00:14
blackflowsystemd will normall send a KillSignal (TERM by default) and if the process doesn't quit in TimeoutStopSec, it'll KILL It and all the kids. modulo KillSignal and ExecStop existing00:14
blackflowsorry, modulo KillMode00:14
tomreynsounds sane to me.00:14
geigerCounterOh I see. So that's probably part of the problem. I have my timeout set too low.00:15
blackflowgeigerCounter: read the systemd.kill manpage00:15
geigerCounterI... rookie mistake.00:15
geigerCounterI'm reading it.00:15
geigerCounterThank you.00:15
geigerCounter:)00:15
blackflowgeigerCounter: also the first paragraph for ExecStop= entry in systemd.service(5) manpage00:17
blackflowit explains what's going on exactly, and which value of KillMode you need (none)00:18
=== cpaelzer__ is now known as cpaelzer
lordievaderGood morning07:06
kstenerudcpaelzer I got a build failure for amd64 (but not the other archs), but it wasn't  a build error. The build process and tests completed, then08:14
kstenerudBuild killed with signal TERM after 150 minutes of inactivity08:14
kstenerudE: Build failure (dpkg-buildpackage died)08:14
cpaelzerkstenerud: well this could be a real issue (background processes hanging or such)08:16
cpaelzerkstenerud: but TBH most likely it is something awkward, you can ask the ops if soemthing known happened or just hit rebuild on your build08:17
kstenerudok08:17
=== ykarel_ is now known as ykarel|lunch
=== ykarel|lunch is now known as ykarel
ahasenackgood morning12:08
kstenerudahasenack: I'm just not sure what would cause this kind of issue... It's a bug where php was closing file descriptors before a call to curl finished12:47
kstenerudand if it's testing whether a crash occurs or not, why would the entire testing rig die?12:48
ahasenackkstenerud: I'd try to run that test in isolation12:48
ahasenackkstenerud: well, it dies due to a timeout12:48
ahasenackside "A" is talking to side "B", side "B" crashes, side "A" doesn't notice and keeps waiting12:49
kstenerudIt dies like this:12:49
kstenerudTEST 3443/14261 [ext/curl/tests/bug48203_multi.phpt]12:49
kstenerudE: Caught signal ‘Terminated’: terminating immediately12:49
kstenerudThen the calling process times out12:49
ahasenackcheck that test in isolation, is my suggestion12:49
kstenerudyeah I'm running an sbuild locally. If it crashes I'll shell in and see what's up12:50
fricklerjamespage: coreycb: would it be possible to get an updated erlang version into UCA for xenial? like possibly the one from bionic? see the latest comments in https://bugs.launchpad.net/charm-rabbitmq-server/+bug/178320313:33
ubottuLaunchpad bug 1783203 in rabbitmq-server (Ubuntu) "Upgrade to RabbitMQ 3.6.10 causes beam lockup in clustered deployment" [Undecided,Confirmed]13:33
kstenerudSo when I run the build locally, it works :/14:24
JuJUBeeHey all.  I want to setup internal dns in my classroom.  I have an LTSP server running and dnsmasq is installed.  I have a separate server acting as my gateway running isc-dhcp server.14:32
JuJUBeeI ultimately want to be able to access servers using fqdn internally like LTSP-Server.foo.local14:32
JuJUBeeI know very little about setting up dns.  Any help?14:32
lordcirthJuJUBee, you can either set up DNS on your gateway, or on your LTSP server. Then set isc-dhcp to advertise the DNS server's IP14:37
JuJUBeelordcirth, dhcp currently advertising 2 dns servers (open dns), can I just add the third IP of my dnsmasq machine?14:39
lordcirthJuJUBee, you could, but if you want them to always use your DNS server, you should probably set only that.14:40
JuJUBeelordcirth, but if I only want to get to public sites, don't I need external dns servers?14:44
lordcirthJuJUBee, your DNS server should be configured to use those external DNS servers itself, for anything it doesn't manage.14:44
JuJUBeeSo this seems far beyond my expertise.  I was hoping it would be a fairly simple configuration14:46
leftyfbJuJUBee: the link I gave you shows you how to setup bind914:48
lordcirthJuJUBee, it is fairly simple. DHCP points clients to your dnsmasq. dnsmasq answers for .local, and for everything else, does a lookup to the outside and caches it.14:48
JuJUBeeleftyfb, bind kind of intimidates me.14:55
JuJUBeelordcirth, I need to see how to do that in dnsmasq.  I will read on it.14:56
leftyfbJuJUBee: and it will continue to intimidate until you try it, then it won't :)14:56
JuJUBeeleftyfb, I do have that link open and have been reading it along with dnsmasq setup.  dnsmasq seems more appropriate for my simple needs.  I just don't want to misconfigure bind and let the nastys in or prevent appropriate requests from getting out...14:58
sdezielabusing .local isn't a good idea but maybe that was just a made up example14:58
leftyfbJuJUBee: this DNS server should NOT be accessible to the internet. Regardless of which solution you go with.14:58
leftyfbsdeziel: abusing?14:59
sdezielleftyfb: .local shouldn't be used for internal purposes other than mDNS14:59
leftyfbsdeziel: why is that?14:59
sdezielleftyfb: https://serverfault.com/questions/17255/top-level-domain-domain-suffix-for-private-network/937808#93780815:00
sdezielleftyfb: systemd-resolved chokes on .local names unless you configure it in a special way15:01
leftyfbsdeziel: it's been working on my local network for about a year now15:01
sdezielleftyfb: do you also provide something.local as your search domain?15:01
leftyfbyes15:02
sdezielleftyfb: that's why15:02
sdezielleftyfb: this enables the special handling of .local by systemd-resolved15:02
sdezielleftyfb: but normally .local is reserved for mDNS15:02
leftyfboh wait, sorry. No. I only append .local15:02
leftyfbso <hostname>.local15:03
leftyfbworks just fine on my network. I can resolve everything locally15:03
sdezielleftyfb: yeah, as long as local is in the search domain, systemd-resolved will try to accommodate for this abuse15:03
sdezielleftyfb: as an experiment, drop local from you search domain, restart systemd-resolved then try to resolve <hostname>.local, should fail15:04
leftyfbof course it will. Unless I have avahi/bonjour setup on every client15:04
sdezielin theory, resolvers shouldn't attempt mDNS resolution when there are 2 labels with the last one .local but that is not implemented everywhere15:06
sdezielthat's why it's best to leave .local and everything under it reserved to mDNS15:06
sdezielleftyfb: you said off course it would break without the search domain but that's only true for the .local domain... any other domain would have kept working15:07
leftyfbsdeziel: how would that work? If you have a local .home as your local domain and you try to ping hostname.home without having home as a search, how would it know to append that tld?15:09
sdezielleftyfb: when you "ping hostname.home" no search label is appended because you provided one already15:09
sdezielhttps://www.freedesktop.org/software/systemd/man/systemd-resolved.service.html then search for ".local" it explains it with all the details you'd want :)15:10
leftyfbsdeziel: and you said in the test, remove the search label. Obviously it would fail when you do that15:10
sdezielleftyfb: the search domain is used to tell systemd-resolved that you are abusing .local and to back down on mDNS15:11
leftyfboh sorry, my post above was wrong. %s/ping hostname.home/hostname/g :)15:11
sdeziel> Note that by default lookups for domains with the ".local" suffix are not  routed to DNS servers, unless the domain is specified explicitly as routing or search domain for the DNS server and interface15:11
sdezielfrom the above link15:11
sdezielleftyfb: well, if you refer to the shortname, then of course it requires a search domain ;)15:12
=== ykarel is now known as ykarel|away
sdeziel> Note that today it's generally recommended to avoid defining ".local" in a DNS server, as RFC6762 reserves this domain for exclusive MulticastDNS use.15:13
leftyfbI'll think about changing it to something else at home :)15:13
sdezielit's worth it if you can :)15:14
leftyfbafter I figure out why my Unifi USG likes to disallow outbound DNS traffic around 6/7am requiring a reboot a few times a week :)15:14
sdezielIf only I had that insight years ago, I wouldn't have to leave with that damn foo.local at a client site ;)15:14
=== ykarel_ is now known as ykarel|away
leftyfbA fresh install of Ubuntu server 18.04.2. sudo apt install mysql-server. I run sudo mysql_secure_installation. I select No for everything except reloading the privilege tables and set my password. I cannot run: "mysql -uroot -p <password>" to successfully login. What am I missing?16:35
leftyfbsorry, not space between p and the password16:36
sdezielleftyfb: are you able to connect with mysql --defaults-extra-file=/etc/mysql/debian.cnf ?16:40
leftyfbnegative16:41
leftyfbhttps://pastebin.ubuntu.com/p/Tr9Q8XdTGX/16:41
leftyfbalso fails if I specify credentials16:42
leftyfbalso of note, I can login just fine with sudo and no credentials and every time I run mysql_secure_installation it says it's running with a blank password. It seems that utility isn't actually setting our password?16:43
sdezielleftyfb: I just tested in a fresh container and cannot reproduce the root auth failure16:45
leftyfbI've been 'just testing" it all morning with fresh installs from the same flash drive I used to several install of other types of servers since yesterday.16:45
sdezielhttps://paste.ubuntu.com/p/NDMV5D3YmJ/ root doesn't have any password despite having been asked for one by mysql_secure_installation, weird16:47
leftyfbyeah, that's a problem16:49
rbasakI don't know about mysql_secure_installation16:50
rbasakBut you get socket based auth by default on the default install.16:50
leftyfbwhat does that mean for credential based auth?16:52
rbasakI'm not sure.16:52
rbasakI'd have to read the manual, etc.16:53
rbasakIt seems odd to be using mysql_secure_installation to me.16:53
sdezielleftyfb: by default root has access through the Unix socket, without providing any password16:53
rbasakI was under the impression the maintainer scripts did the right thing, and there was no need to run it. I could be wrong though.16:53
leftyfbsdeziel: as in, the root user. No supplying root as the user on the command line as a non-root user16:54
sdezielleftyfb: correct, the root user which is why it worked for you with sudo16:54
leftyfbok, and if I want to authenticate using a php script?16:55
rbasakFor the root user?16:55
leftyfblike I'm doing with the web app/db I'm trying to migrate from 16.04 to 18.0416:55
leftyfbrbasak: yes, for the mysql root user16:55
rbasakThat seems dangerous. But if you insist, you'll have to set a root password. I'm not sure about how that interacts with socket auth (check the docs)16:56
sdezielrbasak: it uses the auth_socket plugin16:56
sdezielI'm assuming it's checking the UID of the user opening the Unix socket16:57
leftyfbhttps://bugs.launchpad.net/ubuntu/+source/mysql-5.7/+bug/161057416:58
ubottuLaunchpad bug 1610574 in mysql-5.7 (Ubuntu) "mysql_secure_installation/mysqladmin cannot set password when auth_socket is enabled" [Low,Triaged]16:58
sdeziela stock install: https://paste.ubuntu.com/p/t4JRNffHjR/16:58
leftyfblooks like I have a workaround I can try16:59
rbasak"Also note that mysql_secure_installation is largely redundant for a fresh 5.7 installation"17:01
leftyfbredundant how? What's the replacement?17:01
rbasakWhat are you expecting it to do for you?17:02
leftyfbset the root password17:02
rbasakSo just set the root password then: sudo mysql -e "ALTER USER 'root'@'localhost' IDENTIFIED WITH 'mysql_native_password' BY 'mypass';"17:03
leftyfbthere's other things you can config as well, but that's my main use case17:03
rbasak(frmo the bug)17:03
leftyfbyeah, I'm going to try that now on another fresh install17:03
rbasakPerhaps we should stop shipping mysql_secure_installation to avoid misleading users into thinking it's useful.17:04
rbasakThough that might be overkill because there are users who use mysql without the maintainer scrip tmanagement.17:04
rbasakBTW, you can do that if you want. Just use mysql-server-core-5.7 and operate mysql yourself directly.17:05
leftyfbThat works17:06
DammitJimdoes ubuntu support php 7.0? I think 7.0 is eol17:21
sdezielDammitJim: on 16.04, yes17:22
DammitJimTY17:23
sdezielDammitJim: upstream declared it EOL but Canonical will keep backporting security fixes to it, like it does for 5.5 on previous Ubuntu versions17:23
DammitJimty17:23
DammitJimhow do I determine if canonical supports a piece of software17:25
DammitJimsay, I'm looking at the fact that I installed nginx17:25
DammitJimhow do I know if it came from main or universe or a different repo?17:26
lordcirthDammitJim, apt show nginx | grep Source17:27
DammitJimso, multiverse is NOT supported by Canonica, right?17:28
lordcirthcorrect. Also things in multiverse usually have licensing issues17:29
DammitJimactually, I just looked at say mongodb and we are using the mongodb repo17:30
DammitJimwhat a mess17:30
lordcirthNothing wrong with using upstream repos, as long as they are well supported.17:30
lordcirthI have HAProxy 1.7 from PPA in production.17:30
DammitJimbut 1.7 is not end of life17:31
DammitJimI'm trying to keep track of what software needs to be upgraded because it's going to be end of life17:32
lordcirthOh I see17:32
lotuspsychjeubuntu-support-status --help17:36
rbasaklordcirth: I disagree. Using third party apt repositories is fundamentally broken and can break your system, even if well maintained.17:39
rbasakBut I appreciate that many people use them anyway.17:39
lordcirthrbasak, so is not having the software you need.17:39
rbasakSure17:40
rbasakBut they should understand the risks.17:40
lordcirthIt's more stable to run an Ubuntu LTS + upstream-maintained stable release than any other option.17:40
lordcirthI am happy that HAProxy 1.8.8 is in 18.04, so I don't need a PPA anymore17:40
rbasakIn particular I'd strongly recommend against attempting a release upgrade if a third party package has been on the system.17:41
lordcirthI just don't do release upgrades. nuke and pave17:41
rbasakThe other thing that goes wrong is experimentation, so in production it's essential to have prepared an automated deployment17:41
DammitJimubuntu-support-status --help?17:44
DammitJimOMG17:44
DammitJimthat's huge!17:44
lordcirthYeah, pretty cool17:46
sdezielubuntu-support-status reports odd things here. libwoff1 is said to be supported by Canonical for 3 years yet it's in main17:50
lotuspsychjewich ubuntu version are you on sdeziel17:56
lotuspsychjesame on bionic here: libbrotli1 libwoff1 3y18:01
sdeziellotuspsychje: 18.0418:03
sdeziellotuspsychje: yeah, same 2 packages in main that are reported as support for 3y only (instead of 5y)18:03
sdezieloh I see what happens18:04
sdezielthose packages started in universe then were MRE after 18.04 release18:04
lotuspsychjeah18:04
sdezielstill, I don't see why they wouldn't get the full 5y support, looks like a bug in ubuntu-support-status18:05
lotuspsychjeif you report, ill affect :p18:05
sdezielhow nice of you :P18:07
lotuspsychjei just want a cookie18:08
sdeziellotuspsychje: https://bugs.launchpad.net/ubuntu/+source/update-manager/+bug/182032918:14
ubottuLaunchpad bug 1820329 in update-manager (Ubuntu) "ubuntu-support-status wrong support period for packages that were MRE during a release's life" [Undecided,New]18:14
lotuspsychjesdeziel: affected mate18:14
lotuspsychjesdeziel: you think it would differ with someone with the hwe kernel?18:18
sdeziellotuspsychje: I wouldn't expect it to be different for those 2 libs but maybe for the hwe packages themselves since they've never been into bionic/main, only bionic-updates/main18:20
lotuspsychjekk18:20
DammitJimOMG, the things one doesn't know18:21
DammitJimignorance is bliss!18:21
DammitJimI had no idea vim-nox was no longer supported!18:21
sdezielDammitJim: I too was using vim-nox but have since moved to plain vim that's in main18:22
lordcirthI use neovim18:23
DammitJimnew vim?18:23
DammitJimneovim?18:23
DammitJimhhhmmm18:23
lordcirthIt's good. Some features have actually been backported to main vim.18:24
DammitJimis eol of ubuntu 14 at the beginning or end of APril?18:58
sdezielDammitJim: April 25th, 201918:59
DammitJimis it always on the 25th?18:59
sdezielCanonical seems to have a thing for Thursdays but other than that, I think the EOL date can vary19:01
sdezielDammitJim: 14.04 will not be officially EOL but will transition to ESM (paid support)19:01
blackflowWatched Popey's talk on snaps @ SCALE 17x. I couldn't disagree more on reasoning not to allow custom snap repos. What if a company wants to package their in-house, not publishable, apps as snaps in order to have a homogenous toolset and not a quagmire of snaps and dockers. Isn't that enough of a reason to allow custom repos?19:02
lordcirthblackflow, well, what was his reasoning?19:03
popeyI only gave one reason19:03
JanCwho'd want to use snap on a (production) server anyway?19:03
blackflowlordcirth: that allowing so would then cause the same issues PPAs have|had -- undiscoverability of programs published in custom repos19:04
popeyIt's entirely possible to do the use case you describe19:04
popeyhttps://docs.ubuntu.com/snap-store-proxy/en/19:05
blackflowpopey: yes but not with a repo, it'd have to be uploaded manually and handled --dangerous'ly19:05
tomreynsdeziel: in bug 1820329 , do you actuall ymean MIR, not MRE (which was replaced by SRU, as far as I can tell)?19:05
ubottubug 1820329 in update-manager (Ubuntu) "ubuntu-support-status wrong support period for packages that were MRE during a release's life" [Undecided,Confirmed] https://launchpad.net/bugs/182032919:05
sdezieltomreyn: right, I wanted to say MIR19:06
blackflowpopey: btw the vid linked from LINUX Unplugged, has bad audio and is missing segments. Is there a better one of your talk?19:06
popeynot that I'm aware of19:06
popeyI didn't even know it was streamed / recorded until afterwards.19:06
lotuspsychjesdeziel: a member tested it on 19.04 also with this result: libbrotli1 libwoff1 is shown as supported for 9 months (Jan 2020)19:06
tomreynsdeziel: thanks for clarifying, i was just on a journey trying to understand what those different abbreviations mean (and got help there, so now i know)19:07
sdeziellotuspsychje: 19.04 as a whole has 9 mo of support19:07
lotuspsychjesdeziel: allright, so not relevant for this bug right?19:07
blackflowpopey: ah, k.  btw that link is about proxying the official snap store, or am I missing something in that?  I was thinking about a completely custom store a company could use in-house on their servers19:07
sdeziellotuspsychje: I don't think19:08
blackflowpopey: good talk btw (even though I disagree on that little point about snap store)19:08
popeyblackflow: that's right. the proxy along with a brand store effectively gives you that19:08
popeythanks19:08
popeyWe have customers who have an enterprise proxy as a frontend to their own brand store, which they control the content of19:09
DammitJimthis might not be an ubuntu question, but more of a server question19:09
DammitJimis there anything I should consider with regards to a max amount of services that a server runs?19:09
DammitJimI have RAM and CPU if needed19:09
DammitJimthese are java processes19:10
DammitJimany input from any of you?19:10
lordcirthDammitJim, resource contention, effect of a reboot, security19:10
blackflowpopey: is that available to non-advantage users? I can't seem to find anything about it in those docs19:10
sdezielDammitJim: depends on the -Xms/-Xmx params you want to give to those JVMs I guess19:10
JanCthat depends on how many resources those services need19:10
lordcirthDammitJim, are these diverse services, or just a bunch of instances of the same?19:10
DammitJimdiverse, but similar in business logic19:11
lordcirthDammitJim, Consider running them all in unprivileged containers with resource quotas19:11
lordcirthThen you can reboot them independently, they can't spin out of control and eat all resources, etc19:11
DammitJimthanks... those are all good points19:11
popeyblackflow: the proxy is installable - it's a snap ;). There are indeed projects which have brand stores, which don't have UA19:12
lordcirthWe use LXC containers on big IAAS hosts, all controlled by Salt. works pretty well.19:12
DammitJimwhen you say unprivileged containers, do you mean their own JVM?19:12
DammitJimI'm using salt, too, but not to that leevl19:12
lordcirthDammitJim, no, linux containers, ie LXC / docker19:12
DammitJimlevel19:12
JanCOS containers19:12
lordcirthunprivileged LXC containers means that uid 0 (root) in the container actually maps to uid 100231 on the host, who doesn't exist and has no permissions.19:13
JanChttps://en.wikipedia.org/wiki/Container_(virtualization)19:13
lordcirthprivileged containers have no such mapping, so any hostile or compromised process that is root in the container can trivially break out19:13
blackflowlordcirth: Salt control on a completely isolated vlan, or is it publicly accessible? We rent servers with several providers and thus the control channel would have to traverse public networks. Something I am totally not comfortable with.19:13
DammitJimoh, these don't run as root either19:14
lordcirthblackflow, in this case, an internal, though not immensely locked-down, network19:14
DammitJimblackflow, ours is isolated vlan19:14
blackflowyeah19:14
lordcirthDammitJim, yes, but if someone exploits your java app, then escalates to root, they will only be root in the container.19:14
lordcirthblackflow, the only weakness Salt potentially has is when a minion first connects to the master - it trusts the master's key on first connect. If you don't trust the network, preseed all your minions with the master key, and no more holes.19:15
blackflowlordcirth: it's no that that I'm worried much about, it's the fact that the master would we a publicly accessible server -- because we rent them. I mean that one machine would be the gateway to all the servers. One point to compromise and then everything is.19:17
blackflowwould *be19:17
lordcirthblackflow, ah, you are worried about the master being compromised. Well, one thing you could do is require a VPN to reach the master.19:18
lordcirthIt would take a bit more preseeded setup, but you could require new minions to establish an 2-way auth'd VPN, then connect to Salt over it.19:18
blackflowlordcirth: it's still one publicly accessible port. one vuln in the network stack and poof...19:18
blackflowbut then.... the same vuln could be used on all teh servers individually... moot point. I worry too much.19:19
lordcirthblackflow, that vuln in the network stack would affect all of your minions...19:19
blackflowyeah19:19
DammitJimwe worry too much19:19
lordcirthThere's only so much you can do while being connected to the internet :)19:19
DammitJimand you know what the worse part about that is? the day you get tired of worrying ('cause that happens to me), that day is when I make very poor decisions19:19
lordcirthBut props for being paranoid :)19:19
blackflowyah :)19:20
lotuspsychjehealthy paranoia is good19:20
DammitJimhealthy paranoia is good, but it's not easy to achieve... gotta make sure to have a good balance19:20
blackflowthat balance is hardest to achieve19:20
blackflowat the very least I manage to convince everyone in the company that public clouds and VPS'es are a big no-no, in the post Meltdown+Spectre world.19:22
supamantalking about being paranoid ... I am taking over a system (a few virtual ubuntu servers) where the prev sysadmin did a apt upgrade without waiting a while and checking the packages for bugs or doing any kind of test ... would you continue that practice? :-)19:24
DammitJimthat's a good question... how does one deal with that supaman ?19:25
supamandepends on how paranoid you are ;-)19:25
blackflowwith ZFS snapshots of course.19:25
DammitJimI'm not for that kinda stuff19:28
DammitJimwe do run the production environment in a virtual lab and test our applications19:28
DammitJimbut I don't go looking for bugs in the updates19:28
lordcirthI can't remember the last time that an apt upgrade broke something. Once or twice a full-upgrade kernel update did, but you need to reboot to notice that, and then you just reboot back to the old one19:32
Ussatyou dont need ZFS snaps if you have good backups19:58
lordcirthUssat, best thing is to take ZFS snapshots, and then backup the snapshots. Prevents smearing (I forget if there's a more common term)19:59
Ussatnaaa...we dont and wont use zfs20:00
lordcirthThe snapshots are instant and make sure your backup doesn't get different files at different times as it chugs along20:00
Ussatno need20:00
lordcirthUssat, so, how do you detect bitflips?20:00
Ussatin all the years I have been doing this, I have NEVER had an issue with that20:01
lordcirthUssat, that you know of20:02
UssatThat I am sure of20:02
lordcirthSo, if a bit flipped somewhere on one of your drives, you are quite sure you would get an email about it?20:02
lordcirthJust a few months ago I suddenly got 30 checksum errors on each of 3 drives in a raidz. No idea why, maybe a power surge. Didn't matter, ZFS fixed it all20:03
lordcirthGranted that wasn't on enterprise hardware, but that's just a difference of degree.20:03
UssatI dont use non-enterprise HW at work20:04
Ussat"just" a different of degree.....20:04
lordcirthso you are less likely to get bitflips. Not 100% assured.20:05
Ussatnothing is ever 100%...ever20:05
lordcirthZFS decreases your odds of a bitflip by several orders of magnitude, which is enough for me. trusting hard drives isn't good enough for me.20:05
Ussatall my data I am concerned about is stored o a SAN, if a VM image is corrupted, no big deal.....20:06
UssatWe use a real enterprise SAN, sot a bunch of commercial disks off the shelf with a fancy FS on it20:07
lordcirthI trust ZFS a lot more than a "real enterprise SAN" blackbox20:08
Ussatto each their own20:08
UssatI dont20:08
lordcirthNot to mention the cost and vendor lock-in20:08
blackflowlordcirth: hear hear20:09
blackflowguess some people are too lucky and never had a failure20:09
UssatOh I have had failures...sure......none of them crippling20:09
lordcirthObviously a good SAN is better than what a lot of people do...20:09
UssatMy systems are very critical.....20:09
lordcirthSome people run production data on commercial drives with just RAID.. ow.20:09
Ussata lot of people use a good SAN20:10
blackflowZFS snapshots (sent offsite) are godsent for backups. quick, atomic, with data integrity checks.20:10
UssatNot in my industry they dont20:10
blackflow"a good SAN" is not replacement for offsite backups. ZFS (or any other CoW) is orthogonal to SAN. Your SAN can be powered by ZFS. So I don't get your point.20:16
lordcirthA SAN could be powered by ZFS, but are generally powered by blackbox magic20:17
UssatWho said I dont have offsite backups ?20:19
UssatI never said that, I have a very robust backup solution w/offsite backups20:20
lordcirthYes, I assumed you did.20:21
blackflowwell you seem to have put ZFS in a opposition to SANs.20:21
lordcirthblackflow, he just said he doesn't need ZFS because he has a fancy SAN. Doesn't mean you couldn't somehow have both.20:22
blackflowwhich is wrong really, they're orthogonal. So is this: "20:58 < Ussat> you dont need ZFS snaps if you have good backups"  --- but ZFS snapshots _are_ good backups, when you `zfs send` them offsite20:22
blackflowdoesn't make sense to put ZFS snapshots and good backups in opposing relationship.   that's like saying "You don't need a filesystem if you have good backups"20:23
Ussatincorrect, but thats ok you do what you want, I do what we do20:23
JanCand you would likely want to use some sort of snapshot on that SAN too...20:23
UssatJanC, the SAN is mirrored between sites20:23
JanCto make backups20:23
UssatNO...not to make backups20:24
UssatI am not going to explain how our san enterprise works...it works quite well and is very efective20:24
JanCso how do you guarantee data consistency on you backups?20:24
UssatIts not just a bunch of off the shelf disks and consumer grade PCs ....20:24
blackflowUssat: not sure which part is incorrect. perhaps you don't know what ZFS is? a snapshot is a backup per se. it's a copy of data you can revert to. you can back your data from it. it becomes _good_ backup when you ship it offsite. you might not like it, that's okay. use something else, fine, but how is all this "incorrect"?20:27
Ussatper se20:27
JanCa snapshot is not a backup20:27
Ussat^^^20:27
JanCbut you need it to make consistent backups20:28
blackflowyes it is, it's just not good enough if it's local20:28
blackflowyou can ship it elsewhere and it's still a zfs snapshot. in a physically different location.20:28
blackflowmaybe y'all don't know about the `zfs send` feature. it's made to send snapshots to external pools. and `zfs receive` to bring them in from external pools. how's that not a backup.20:29
JanCI know what it is, and when you do that you have a proper backup indeed20:30
UssatI am quite aware of them, and we looked at them and rejected them20:30
lordcirtharguing over the semantics of what is and is not a 'backup' is irrelevant when you agree on what they are20:30
blackflowit's okay if you don't like them. but fact remains they _are_ useful backups.20:30
UssatNot in our case20:30
lordcirthWhat is special about your case that ZFS is not sufficient?20:31
UssatCorrect20:31
UssatWe evaluated it, and it doesnt meet pur needs20:31
lordcirthI am interested to know in what way ZFS fell short of your needs.20:32
blackflowI'm dying to know too20:32
UssatI dont need to justify my descisionsto you.......as soon as you start cutting my paycheck, then that is a different matter20:33
blackflowlol20:33
UssatIt did not meet our needs20:33
lordcirthI don't see any reason to suddenly turn so hostile...20:33
UssatThats hardly hostile20:33
blackflowwhy do you think that's "justifying"? this is a discussion forum. if you don't like it you can always /part20:33
blackflowbut I know what it is. you just have no idea what you're talking about. so lashing out is best defense to hide ignorance. fine by me.20:34
lordcirthblackflow, while that is possible, I don't think you have sufficient evidence to assume that.20:34
UssatYes, yes I do, we evaluated it and it did not meet our requirements20:34
blackflowoh I do. seen that type too often.20:34
blackflowarmchair "admins" who feel threatened when asked to explain their use case.20:34
Ussatarmchair admins...um sure20:35
blackflowyup.20:35
JanCUssat: it would be useful to know in (roughly) what way it didn't meet your requirements20:36
UssatOK, and you are obviousely the expert......you have no clue what industry or what the requirements are, but you can make that assumption. You seem to think that ZFS will meet all requirements20:36
lordcirthUssat, no, we asked what requirements it didn't meet, and you refused to answer20:36
lordcirthWhich usually means you don't know.20:36
shibbolethany word yet as to this supposed horribad vulnerability that made scroogle and fakebook down their networks for "disruptive, lengthy, unannounced but totally planned and routine maintenance"?20:36
lordcirthThere are some requirements ZFS can't fulfill.20:36
Ussatlordcirth, ir which means it may be none of your business20:36
lordcirthUssat, and yet you are still here arguing about it?20:37
lordcirthIf you have an NDA, say so.20:37
blackflowUssat: you don't know how to read either. I said several times it's fine if you don't like it or if it doesn't work for you. I never said ZFS _must_ meet your requirements.20:37
UssatI am in a highly regulated industry, and ZFS did not meet the requirements20:37
blackflowthat's fine. so we asked what does.20:37
UssatHow effective is ZFS on a streached cluster ?20:38
lordcirthYou mean as a distributed filesystem? Normally I would run Ceph on top of ZFS for that.20:39
UssatNo I dont mean distributed FS20:39
UssatI mean streached cluster20:39
Ussatbehind SVC's20:40
blackflowa stretched cluster (not "streached"), is when two or more virt hosts are part of the same logical domain but localed in physically different locations. that's not in ZFS domain at all20:40
lordcirthYeah, just looked that up. Seems like something you'd implement above ZFS.20:40
blackflowunder20:41
blackflowor, well, depends on your strategies I guess.20:41
lordcirthEither would probably work, yeah20:41
UssatOr maby, just maby we tested it and ZFS crapped out20:41
lordcirthUssat, combining ZFS with what?20:41
Ussatand IDGAF about the spelling, I am dyslexic, correct away20:42
blackflowI personally don't have need for it nor personal experience so I can't vouch. I do know of people who happily use ZFS with lustre and for htat purpose exactly. zvol based virtualization like in a stretched cluster.20:42
UssatOh please.....I dont need to continue to justify what we do20:42
blackflownot justifying anything, just discussing your use case. that's what these public places are for.20:43
blackflownobody has brought you to court, judged you and pressed you to defend yourself.20:43
Ussatthe other issue was encryption at rest, ZFS , while it has it imlimented, does not meet the standards we must meet20:45
Ussatimplimented20:45
blackflowZFS encryption is still highly experimental. We use LUKS under id20:46
blackflow*it20:46
UssatThe time to encrypt/decrypt was horrendus20:46
blackflowin our tests, with AESNI, the difference was ~2%, very much acceptable in our case20:47
Ussatnot ours20:47
lordcirthUssat, you were using ZFS 0.8 rc?20:47
UssatWe are not useing ZFS at all now20:48
lordcirthUssat, in you tests, I mean20:48
UssatI believe so, it was a while ago20:48
lordcirthyour*20:48
lordcirthBecause ZFS 0.8 rc hasn't been out for that long, and still isn't a stable release, so it seems odd to rely on it for your apparently critical use case.20:49
lordcirthOr consider doing so, I mean20:49
UssatI said we are not useing it, why do you think we are relying on it /20:49
Ussator why do you think we would ? I said we tested and decided it would not meet our needs20:50
lordcirthUssat, if you did the tests with ZFS encryption, then you must have been considering using it, no?20:50
UssatWe considered it, yes, and put it asside20:50
lordcirthIt seems odd to run the tests only on 0.8 when it wouldn't be ready for your use case anyway, instead of testing, say, 0.7 on LUKS which would be.20:50
lordcirthWe have a Ceph cluster in production which is backed by ZFS on dm-crypted drives.20:51
lordcirthLooking forward to 0.8 simplifying that, though.20:52
UssatWe dont use software based storeage20:52
Ussatno plan to ever20:52
lordcirthWell, I'm sure somebody will keep selling blackbox storage as long as big companies buy them.20:52
JanCyour SAN is software-based too20:53
UssatJanC, yes yes.....and all raid is software based also, even when it is on a raid card20:53
lordcirthWell yes, but raid cards suck20:53
UssatYes its software...20:53
Ussatlordcirth, big companies buy them because they are very reliable, depending on what you buy20:54
Ussateverything is software based, when it comes down to it.....20:56
lordcirthUssat, but what most people mean by 'software defined storage' is that it is commodity hardware assembled by good, flexible, tunable software into a good system.21:34
lordcirthA big box that says "no user serviceable parts inside" uses lots of software, but it's not SDS21:35
blackflowneedn't even be commodity hardware21:56
lordcirthIt needn't be, but it's generally one of the selling points22:02
blackflowlordcirth: yeah, the strength of ZFS (and BTRFS) is that you can use crappy commodity hardware with no risk to your data22:43
JanCI wouldn't say that about btrfs...23:21
blackflowJanC: well okay, but in theory at least :)23:37

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!