/srv/irclogs.ubuntu.com/2020/10/13/#juju.txt

=== _thumper_ is now known as thumper
blahdeblahec0, xavpaice: woutervb tells me one of you might be able to help with testing of my fix for https://github.com/paulgear/ntpmon/issues/509:34
blahdeblahI've also been talking to ganso from support about the same issue.09:34
manadartstickupkid, achilleasa: Still chasing a review of https://github.com/juju/juju/pull/1211809:48
achilleasamanadart: on it09:49
achilleasamanadart: did you see my comment in 12111?09:49
manadartachilleasa: Yep.09:49
manadartThanks.09:50
ec0[blahdeblah](https://matrix.to/#/@freenode_blahdeblah:matrix.org) sure thing, although I haven't seen that issue myself. I'm not sure about converting those conditions to warning though, or did you have something else in mind?11:00
stickupkidmanadart, here is the thing we just discussed https://github.com/juju/description/pull/9111:42
manadartstickupkid: OK, gimme a bit.11:44
Hybrid512Hi everyone12:02
Hybrid512Can I bother somebody regarding prometheus-ceph-exporter from the "-next" branch ?12:03
Hybrid512there are great improvements in this branch that we'd love to use but deployment always fail due to this bug : https://bugs.launchpad.net/charm-prometheus-ceph-exporter/+bug/189553112:04
mupBug #1895531: -next fails to deploy with TypeError: 'str' object is not callable in ceph_client.auth() <Prometheus Ceph Exporter Charm:New> <https://launchpad.net/bugs/1895531>12:04
xavpaiceblahdeblah, to be honest we've disabled that check on the clouds where it was a problem, because of the noise.  Re testing, we didn't have a reliable reproducer so confirming yay or nay is going to be tricky20:16
blahdeblahec0: Yeah - I am not planning to convert either problem from critical to warning; just planning to prevent the NaN from leaking through to the alert value.21:42
blahdeblahxavpaice: Understood re: the noise and the difficulty of reproducing.  ganso has a couple of clouds where he seems able to reproduce it fairly frequently, so I'll work with him on that.21:42
blahdeblahMostly just wanting someone to test the patches, and if possible, do a code review on an upcoming test suite addition.21:42
ec0@blahdeblah - that makes sense to me, if you get a patch together I'll review & test21:43
ec0great to see you still hilight on NTP in a round-about way :)21:44
blahdeblahec0: Actually jsing poked me about it a few weeks back. :-P22:12
blahdeblahec0: Also, drewn3ss submitted https://github.com/paulgear/ntpmon/issues/6 a while back, but the Nagios check is stateless.22:34
blahdeblahGiven that you're just muting the check at the moment, I'm reluctant to invest time on introducing state management.22:34
blahdeblahThe alternative is using telegraf -> prometheus and adding a minimum time period to the check in prometheus alerter.22:34
ec0well, we shouldn't be muting it, frankly22:34
blahdeblahI agree, but when it's hard to find time to make progress on actually fixing the reason for the sync failure, and it's intermittent, I can understand making that choice...22:35
blahdeblahI've also got limited time I can put into this, and I feel like it's probably better spent making better tests and helping ganso fix the underlying cause of the sync failure (at least in the 2 clouds he's working on at the moment).22:38
ec0totally understand22:45
ec0the other way to approach it is we could move it into a shared namespace and have some of the people reporting these issues help to contribute and review22:45
blahdeblahHappy to consider that - any suggestions as to where?22:50
ec0we could set something up on Launchpad maybe?22:56

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!