=== Monthrect is now known as Piper-Off [03:26] hi team, this is general question not specific to Ubuntu but this is happening on Ubunut. So I am asking it here.. [03:28] We have a script which monitors network connections, if there is no network connection(3G connection) it just tries to enable networking. If this is not working out, this script will reboot the system. And these reboots happens, untill we get network connectivity back. [03:28] The issue is after 5 bad pings, this script is not writing anything to log file. We tried to replicate this in lab, and everything working fine.. [03:29] bit strange and I am not able to conclude what is happening with the script. [03:29] any suggestions? [03:29] based on that info? none at all [03:29] it did not provide any useful info [03:29] the file system is rw mode, so read-only mode is eliminated.. [03:30] pastebin the script? [03:30] pastebin how you're running the script? [03:30] How are you Arnold? [03:30] Give me 2 mins..! [03:30] what the filesystem is and what is the reboot? [03:30] not bad, surprised how quickly the weekend went, but not bad :) you? [03:31] * patdk-lap is just upset he debugged for 2hours why a patch wasn't working [03:31] to come to find, the patch was not installed [03:32] and to find, myself from 2years ago, was smarter than I am today :) [03:32] http://pastebin.com/6ymadTvb [03:32] I am good Arnold, thanks for asking.. [03:33] patdk-lap: oh yeah, been there. It's especially fun to find my own answers on stack overflow. [03:33] add some syncs in there [03:34] not sure what kick_watchdog.sh does [03:34] but it doesn't sound filesystem friendly [03:34] sounds more like, yank power [03:34] poor dog [03:36] patdk-lap: It is just a watchdog script which is poweroff the machine completely and boots after 5 seconds. [03:36] yes, but exactly how? [03:36] Up to 104 line it is fine, but 105 line on words I dont see any logs.. [03:36] it matters if the kernel has time to write everything to disk or not [03:36] watchdogs normally don't [03:37] patdk-lap: We have a c code in that kick_watchdog.sh script which take care of kicking power off. [03:37] so it's not friendly [03:38] at the top of kick_watchdog.sh add sync command [03:38] should help [03:38] patdk-lap: what you mean by not friendly? [03:38] it doesn't give the filesystem a chance to sync, umount, etc [03:39] patdk-lap: We can not use sync as we deliberately delayed writting to disk by 30mins in kernel.. but when we do kick_Watchdog.sh run, it have sync included. [03:39] sarnlod: Can you explain a bit? [03:40] neonixcoder: the kernel will buffer writes until some convenient time in the future [03:40] writes to files don't happen immediately, they happen at only certain points [03:41] yes.. [03:41] even worse if mapper stack is used [03:41] so the usual thing to do is to run sync to make sure that those changes have been flushed to disk, and if you can umount the filesystem first, that's even better [03:41] sarnold: my script struck up there like 2 to 3 days with out writing anything to log file. just a note, we sync data to disk every 30mins. [03:42] btw, all these 'sudo' in here seem funny [03:42] does this run as root? or not? [03:42] :) [03:42] I know it sarnlod, I am going to modify it. [03:42] if it doesn't run as root, is sudo configured to allow it to run those commands without prompting and so no? [03:43] This is written while back, if I past you actual script, you will laugh at me ;) [03:43] note that the sudo foo >> bar may not work as you want. everyone gets tripped up with that.. [03:45] sarnold: Yup, I am going to remove sudo from it. [03:46] sarnold: will that helps? [03:48] neonixcoder: unlikely [03:48] yes.. this script was running for a while with out any issue. [03:49] We are seeing this issue recently on couple of servers.. [03:49] Our servers are connected to network through 3G, if there is no 3G network, our script tries to disable/enable for some time. If its unable to bring network, it will reboot machine. [03:51] If there is no network for 10 mins, the system will be rebooted. At the time of reboot, system disks are synced. so there is no point why my script is unable to write to disk. [03:51] Any suggetions? [03:52] are you sure the machines arent hung entirely? [03:53] They dont hung.. [03:55] Y'day we sent a technician to see if can login. And he is able to do that. I ran some simple tests like checking network connectivity(Which is not there at the time of testing), weather my script is running or not(ps -ef | grep scriptname shows that script is running), I tried to create empty files to see file system is not mounted as readonly and I am able to create empty file as well. [03:56] And the funny part is, I am unable to replicate in our lab. [03:56] Our script is doing what it is meant for. [03:56] you've got a shell ona system that's showing the issue? that's awesome; you can attach strace to it and find out what it's doing :) [03:57] strace -f -o /tmp/strace.log -p pid [03:57] you sure the pci_modem status program isn't hanging? [03:57] due the the 3g issue [03:58] great, sarnold. I forgot to do this. [03:58] I will do this when we see this issue again. [03:59] and see if I can find, what our script is doing. [04:00] good point, if any of the called programs hang, it might be hard to tell it apart.. [04:01] well, we where assuming it rebooted [04:01] but if it isn't :) [04:01] hehe [04:02] patdk-lap: I too have same doubt about pci_modem status program. I will try to check next time when it happens.. [04:05] add a timeout timer to it [04:05] in shell script, that could be annoying though [04:05] a watchdog for the watchdog script :) [04:05] * sarnold runms [04:06] well, the watchdog script doesn't have a proper gun [04:08] patdk-lap: The status command worked fine till 4 bad pings.. have a look at my log file at http://pastebin.com/eztiyvub [04:09] patdk-lap: From the 5th bad ping, it stopeed writing to log file, I see other programs are able write to same log file successfully. === cz2 is now known as networkmanager === networkmanager is now known as NetworkManager === Lcawte|Away is now known as Lcawte === Lcawte is now known as Lcawte|Away === NetworkManager is now known as cz2 [12:25] Having trouble setting up postfix on Ubuntu server, I have POP3 working properly, but SMTP without authentication won't allow me to send outgoing mail, and I am having troubles setting up authentication. [12:26] mainly I have the configuration set up fine, but I have no clue how to setup the auth key to make it work [12:47] Having trouble setting up postfix on Ubuntu server, I have POP3 working properly, but SMTP without authentication won't allow me to send outgoing mail, and I am having troubles setting up authentication. [12:47] mainly I have the configuration set up fine, but I have no clue how to setup the auth key to make it work [12:49] been using this guide to set up smtp auth, but failing at proving authentication http://postfix.state-of-mind.de/patrick.koetter/smtpauth/smtp_auth_mailclients.html === Lcawte|Away is now known as Lcawte [14:23] Is that right that Ubuntu 14.04 the last version with Upstart ? I'm studying Upstart to add some scripts at the boot time, but someone told me it is deprecated with systemd? [14:30] eahmedshendy: it's optional on 15.04, and not used on 15.10 [14:53] TJ-: Ok, now we are working on Ubuntu 14.04, and I am studying Upstart, Am I wrong? [14:53] I have this error:http://paste.ubuntu.com/13492759/ [15:12] Hi, anybody here? [15:32] rbasak: ping [15:33] teward, fyi rbasak is out on holiday all this week [15:34] ok [15:34] then nevermind :) [15:34] * teward yawns [15:34] (it's only the nginx merge that's blocked because testing is in failure state due to the sbuild schroots being busted by a bad bug in ubuntu) [16:23] cpaelzer, feel free to add yourself to the irc meeting rotation -> https://wiki.ubuntu.com/ServerTeam/Meeting === Lcawte is now known as Lcawte|Away [16:33] jgrimm: done [16:35] thanks! === Lcawte|Away is now known as Lcawte [18:34] when building an ubuntu server AMI, is there a known list of directories/files to clear/empty out in order to have a clean image? [18:34] I know stuff like things in /var/log, and maybe .bash_history, but I can't think of anything else [18:35] I also dont know if completely deleting the contents of /var/log is safe [18:35] probably /etc/ssh stuff too, to ensure you generate a unique host key on reboot .. [18:35] where does openssh-server log to? [18:35] i see it is set to use AUTH syslog, but i see nothing in syslog or auth.log [18:36] /var/log/auth.log on my 14.04 lts system [18:41] heh, /var/cache [19:49] where do VMs on amazon get their entropy from? Re: > generate a unique host key on reboot .. [19:51] jelly: in theory they could use somethinglike virtio-rng http://wiki.qemu.org/Features-Done/VirtIORNG [19:52] jelly: another option is touse an entropy-as-a-service tool such as pollinate to grab some entropy from a service such as entropy.ubuntu.com [19:53] jelly: another option is to use something in cloud-init to provide a seed, but I'm not sure if that's pre-made and ready to go or not... [19:56] thanks, I was not aware of half that stuff [21:15] Howdy, folks -- I'm trying to set up an stunnel connection between an Ubuntu 14.04 machine and a CentOS 7 machine -- I'm invoking stunnel4 as the client on the Ubuntu machine. The problem is that even though I've hard-coded the config file to use TLSv1, it's still giving me SSLv3 errors [21:15] As though it's not using TLSv1 at all. [21:15] Anyone run into this before? [21:17] I keep getting "SSL3_GET_RECORD:wrong version number" in /var/log/syslog [21:20] that alone wouldn't worry me too much, there's enough similarity between sslv3 and tls1 that there's function re-use, keeping same names etc.. [21:20] try confirming both endpoints with openssl s_client or .. wireshark? ettercap? to try to figure out which specific protocol is being used [21:23] Well, I've tried hard-coding tlsv1 and/or sslv3 on both ends and I keep getting that error. [21:23] I keep seeing disconnects as well. [21:30] hi all,.. i was thinking if there is a distro or tool that specifies on "booting a buggy computer and backing up all of the standard data to a networklocation" type of thing [21:32] Hrm... openssl s_client -connect ubuntu.machine:port -tls1 -CAfile /etc/stunnel/ca-chain.pem returns "no peer certificate available" [21:32] What the devil did I do ... this was working a minute ago. :( === Lcawte is now known as Lcawte|Away === Lcawte|Away is now known as Lcawte [23:03] denbeiren: knoppix is popular for that [23:03] denbeiren: ubuntu livecd, if that still works.. [23:08] * teward pokes sarnold [23:08] afternoon teward :) [23:08] sarnold: got a few moments for a PM? [23:09] sure