/srv/irclogs.ubuntu.com/2022/03/28/#ubuntu-server.txt

cpaelzergood morning05:23
cpaelzerthanks ahasenack for continuing on those PG-14 fixes I've found05:24
cpaelzerbryceh: ahasenack: yeah that (lp-test-ppa + ppa-dev-tools + wishes + some work => a great useful thing) is exactly what I have hoped for as well05:26
cpaelzerlet us see how the lightning talks resonate05:26
bryceh:-D05:30
athosgood morning!11:10
ahasenackcpaelzer: bryceh: btw, juliank asked to no longer trigger autopkgtests from ppas, the clusters are swamped12:25
ahasenack(not directly asked us, but mentioned it in general)12:26
cpaelzerahasenack: ok, I have none running or queued atm12:29
cpaelzerahasenack: does this also imply stopping ppa builds, or only autopkgtests?12:30
ahasenackI only heard about dep812:30
kanashirodo we have any FTBFS left to be investigated? Or everything is sorted out in some way?12:31
cpaelzerkanashiro: I think we have them all by now12:31
cpaelzersome still in -proposed, but we have them12:31
kanashirocpaelzer, is any of them unassigned?12:33
ahasenacksamba takes 5h30min to build on riscv64 :/12:33
ahasenackkanashiro: I'll have one for review shortly12:33
ahasenackbut it's a bit awkward12:33
ahasenackpython-tempita12:33
ahasenackit's still python2 :/12:33
cpaelzerahasenack: I have an unsubscription request up for that btw12:33
ahasenackand finally stopped building with python 3.1012:33
cpaelzerit is openstack12:33
ahasenackyeah12:33
ahasenackit's abandoned upstream12:33
ahasenacksame version out there for years12:34
ahasenackand did I mention it's py2 only? :)12:34
cpaelzerso you do not "have to" fix it, but I'm not stopping you if you have an idea how to handle it12:34
ahasenackit survived this long because it was using a setuptools trick to run 2to3 on-the-fly during build12:34
ahasenackcpaelzer: my "idea" (quite the compliment) was to build it with py3.9, where 2to3 still runs, and diff the result12:34
ahasenackso we would end up with exactly the same12:34
ahasenackin py3.1012:34
cpaelzerif it works it is fine for ubuntu overall, but if it explodes into something long and complex log your insight in a bug and assign the openstack team please12:35
ahasenackoh no, it's done12:36
ahasenackit's just not one of those smart patches you would put up in your CV12:36
cpaelzerwith a package of this kind, that is just the right kind of fix for now12:37
ahasenackok, I'll put it up12:37
ahasenackI even improved it a bit, the unit tests were not running12:37
ahasenacknow they are :)12:37
ahasenackall 3 of them :P12:37
* ahasenack can't stay away from getting tests to run12:39
sdezielahasenack: I wouldn't worry too much about stuff to actually put on your CV if I were you ;)12:39
ahasenackhah12:39
cpaelzerHmm, am I overreacting on bug 1893716 (found in housekeeping of long dormant cases) or is it as important as I think?12:51
ubottuBug 1893716 in update-motd (Ubuntu) "scripts in /etc/update-motd.d/ run even on login via non-interactive scp and sftp sessions" [High, Triaged] https://launchpad.net/bugs/189371612:51
cpaelzerahasenack: rbasak: (or anyone that wants) ^^ if I could have your sanity check before spending (potentially useless) time?12:52
ahasenackcpaelzer: kanashiro https://code.launchpad.net/~ahasenack/ubuntu/+source/python-tempita/+git/python-tempita/+merge/41774012:53
ahasenack(since you asked :P)12:53
ahasenackcpaelzer: taking a look at that other one12:54
ahasenackbtw, migrations are very slow12:55
cpaelzerahasenack: I know, I have a few I'm waiting on, but we are all behind the test queue delay12:55
ahasenackcpaelzer: about tempita, don't you/we need a pr to subscribe the openstack team to it?12:56
cpaelzerthey are12:56
cpaelzerit is double subscribed, hence no need12:56
ahasenackn/m then :)12:56
ahasenackbest outcome12:56
rbasakcpaelzer: I think your performance test is important. I was mentally dismissing the bug as "need metrics to convince me" until I saw that. Problem is, as you discovered, I don't think PAM makes any distinction on "interactivity".13:09
rbasakI wonder if instead we can figure out why pam_motd is slow and fix that.13:09
rbasakBecause it seems to me the (only) problem is speed. Extra stuff happening on non-interactive ssh is not an issue AFAICT _except_ for speed.13:10
rbasakAnd plenty of other extra stuff happens all the time, and we generally accept that as a trade-off for convenience. Provided that there isn't actually a measureable performance penalty.13:11
cpaelzerrbasak: thanks, yes as I've also written - a lot more is happening on login anyway13:46
cpaelzerrbasak: I'll have a short time-boxed look if it could detect interactivity - if not then as I posted already we should at least make each of them cache the results13:47
cpaelzerrbasak: that way e.g. if only once evey 5 seconds things are "slow" it will be much less overhead13:47
cpaelzerand we have prior examples for that13:47
ahasenacksounded like landscape-sysinfo was the biggest culprit13:48
ahasenacki remember it checks the load before running, but yes, it should cache13:48
rbasakMaybe we need to refactor things to not happen by blocking login. Like in the background on an independent systemd timer.13:56
rbasakAnd rather than have every motd service reinvent the wheel, maybe they should be integrated somehow.13:57
rbasakLike a .d/ directory where a script's output is automatically cached13:57
cpaelzerrbasak: this time (we had that one) it is not about slowness and therefore not about background execution14:17
cpaelzerrbasak: it really seems to be more about useless work consuming cpu time14:18
cpaelzerrbasak: the problem with a global caching is that each sub-element might have different contraints how often or under what condition they need to be refreshed14:18
rbasakcpaelzer: surely "useless working consuming cpu time" is exactly "slowness" from a user perspective?14:21
rbasakIf subelements have special needs then they don't have to use a general mechanism14:21
rbasakBut most will probably be able to fit into one.14:21
ahasenackI think a long time ago landscape-sysinfo was showing cached information14:22
ahasenackit had a timestamp in the output because of it14:22
ahasenackI'm thinking precise, trusty14:22
ahasenackmaybe lucid14:22
sdezielputting a file in a `.d/$JOB.$TTL` would make it possible to have different expiry14:22
ahasenackrbasak: hi, are you going to take a look at https://code.launchpad.net/~athos-ribeiro/usd-importer/+git/usd-importer/+merge/416951 ? 14:30
ahasenackthe snap isn't working on jammy14:30
rbasakAdded to my list - thanks14:34
lvoytekThis bug came up during my triage and I was wondering if anyone more familiar with smartmontools could see if anything can be done for it. From what I can tell it seems like a pretty rough race condition. https://bugs.launchpad.net/ubuntu/+source/smartmontools/+bug/196661015:56
ubottuLaunchpad bug 1966610 in smartmontools (Ubuntu) "smartctl assert failure: free(): invalid pointer" [Undecided, New]15:56
bryceh@lvoytek, have you run the core dump through gdb?15:57
lvoytekI haven't yet, no15:58
brycehthere's a page in UMH with some discussion on using gdb with .crash files, it's got some paint-by-numbers steps and tips for what to do once you have a core dump15:59
lvoytekAlright, I'll see what I can find with that16:00
brycehgenerally with these types of problems, the crash itself can be identified and papered over, but it'll usually just start crashing somewhere else in the code, so having reliable steps to reproduce ends up being pretty important16:01
brycehfor triaging, it may be enough to try running the suggested command 10 or 20 times (if the command doesn't take too long), and if it doesn't repro ask reporter for help figuring out how to make it more reproducible16:02
brycehbut anything more probably turns more into bugwork than triaging :-)16:02
brycehfrom gdb I suspect the best you can hope for is to identify what the variable with the invalid pointer is, and maybe what line of code is encountering it.  But the source of the invalidness can often be hard to tease out, esp. just from a backtrace alone16:06
brycehif I understand correctly, looks like it's in malloc_printerr() and presumably the invalid pointer is the str entry being passed in?16:06
bryceh__pthread_kill_implementation suggests threading is involved, which is notoriously hard to debug, but could explain why it's hard to reproduce16:07
brycehsergiodj might also have some tips, he's good with gdb stuff16:08
lvoytekThanks for the help! I tried running the command 20 times with no errors. I'll ask them to provide the crash file for futher debugging and possibly help with repro16:21
lotuspsychjelvoytek: if it can help anyhow, i tested your command on sda1 on ubuntu-desktop 22.04, seems to work well on my side16:29
lvoytekThanks for testing that, I tried with my nvme1on Jammy16:31
lotuspsychjeon -server ?16:31
lvoytekI used desktop and lxd. That would be a good thing to try though16:32
adacI try to apply netplan, but I get the following error: https://pastebin.com/2v4VPNtQ Any ideas what is causing this?18:48
adacWhen I check the systemd output, all seems to be ok: https://pastebin.com/uMEY0qVF18:49
adacHmm journal seems to tell us a bit more: https://pastebin.com/5mxZR1vx18:52
ahasenackadac: the journal seems to be about something else (openvpn?)19:02
ahasenacksee if you can find this file, netplan-ovs-cleanup.service'19:02
adacI removed openvpn for now. Now I get: Failed to set up mount namespacing: /run/systemd/unit-root/: Host is down19:02
ahasenackmaybe in /run/ or /etc/systemd/ somewhere19:03
adacsystemd-networkd.service: Failed at step NAMESPACE spawning /lib/systemd/systemd-networkd: Host is down19:03
adacahasenack, will check19:03
adacahasenack, https://pastebin.com/gScq0DCy19:05
ahasenackI don't know how ovs is normally configured...19:07
adacwhat is ovs even?19:07
adacwhat does it do?19:07
adacI'm not sure why it is even there19:07
ahasenackcheck the conf files in /etc/netplan/*, nothing mentions it?19:08
ahasenackoh, I have that too19:08
ahasenackinteresting19:08
adacactuall no19:08
ahasenackdo you have /usr/bin/ovs-vsctl? (I don't)19:08
adacno file mentions it19:08
adacnope I havent19:08
ahasenackdoes this work? systemctl start systemd-networkd.service netplan-ovs-cleanup.service19:09
ahasenackit's the command it was trying to run19:09
ahasenackor do you get that namespace error you pasted above19:09
adacahasenack, let me try. What I try currently is "netplan try -timeout 120"19:10
adacsince I'm online remote so it could kick me out otherwise19:10
adacthat is a bit the problem19:10
ahasenackthat's a bit of a problem indeed19:11
ahasenackmy other suggestions won't apply then19:11
adacMaybe I need to stop KVM machines19:11
adacto make this workl19:11
adacSince they are still using the network interface19:12
adacthat I want to have removed19:12
adacwith my new netplan file19:13
ahasenackcareful and good luck19:13
adacahasenack, lets try. thanks man!19:13
ahasenackdo you have network mounts?19:15
ahasenacklike samba/cifs/windows, nfs, etc?19:15
adacahasenack, i Have samba mount19:20
ahasenackis it working fine?19:20
adacahasenack, unmounting now solved the issue!!!19:20
adacthanks so much!!19:20
ahasenackgood to know, the "host is down" message was for the smb server then I guess19:36

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!