/srv/irclogs.ubuntu.com/2020/02/19/#snappy.txt

mupPR snapd#8152 opened: managers_test: add uc20 kernel snap update happy and panic tests <Test Robustness> <UC20> <Created by anonymouse64> <https://github.com/snapcore/snapd/pull/8152>02:06
mborzeckimorning06:15
mborzeckischool run, back in 3006:38
mborzeckire07:06
=== abeato_ is now known as abeato
mborzeckimvo: morning07:45
mvohey mborzecki ! good morning07:48
pstolowskimorning08:01
zygare08:14
zygagood morning08:14
zygatests are somewhat unhappy08:16
zygaI've seen snapd-failover fail a lot yesterdat08:16
zygaI know Ian was looking into that but perhaps it needs more eyes08:16
zygaand snap-run hangs, oh my08:18
pstolowskizyga: right, i saw snapd-failower failures as well08:18
zygaso08:22
zygaI need bionic08:22
zygaand I need tests that run as user08:22
* zyga gets to work08:22
zygaok, test written, let's run it08:40
zygamborzecki: do you expect https://bugs.launchpad.net/snapd/+bug/1863859 to be true given our environment generators?09:10
mupBug #1863859: Using su removes /snap/bin from PATH <snapd:New> <https://launchpad.net/bugs/1863859>09:10
zygamborzecki: I just noticed this in tests09:10
mborzeckizyga: su - ?09:13
zygasu test -c "snap run foo" vs su test -c "foo"09:13
mborzeckizyga: try su -l09:13
zygamborzecki: k09:13
mborzeckior . $TESTSLIB/user.sh and then as_user_simple ?09:14
mborzeckizyga: simple will not load the profile09:14
zygaI'll look at some tests that suffer from this09:16
* zyga loves calls with baby crying next door09:32
mupPR snapd#8153 opened: [RFC] "snap run --explain" with different formating <Created by mvo5> <https://github.com/snapcore/snapd/pull/8153>09:33
zygamvo: I'll review in a moment, need to check what the crying is all about09:33
mvozyga: no rush, thank you!09:34
zygare09:55
* zyga debugs all kinds of things through one failing test10:10
zygamborzecki: quick question10:46
zygahttps://www.irccloud.com/pastebin/B2mlykUV/10:46
zyga(this is test-snapd-sh without arguments10:46
zygaspawning endless loop of bad substitution10:46
zygawhy would our argv[0] cause this?10:46
zygaI'm patching snap-exec to show what's going on at that layer10:48
zygait is driving me crazy, I want to finally fix it10:48
zygamborzecki: it's a dash specific message10:53
zygathat's interesting10:53
zygaI think we do something incorrectly10:53
zygaoh well, let me grab coffee10:53
zygadesktop 18.04 install in progress10:55
zygaI think we have a problem with systemd there :/10:55
zygabut I think my test setup is not doing what is right10:55
zygabrb11:08
zygaha11:14
zygagot it11:14
zygafound a bug in spread11:14
zygaPS1 breaks dash11:17
zygabecuase SPREAD_PATH is unset11:17
zygais it just me or is this a quiet channel?11:18
sdhd-saschazyga: hey :-)11:30
zygahey11:30
zygahow are you?11:30
sdhd-saschazyga: i'm fine. thank you. Little bit tired. But fine :-) And you ?11:30
zygaa bit scared I missed something big that came up yesterday evening11:30
zygadebugging it now11:31
sdhd-saschaoh, good luck11:31
sdhd-saschai'm just started to fix the snap for the github actions.11:32
sdhd-saschazyga: on which point it's best to do chmod of a script ? I have a /run.sh and snapcraft says it's not executable. I use chmod at `override-build` currently.11:34
zygasdhd-sascha: are you building on windows?11:35
sdhd-sascha`Failed to generate snap metadata: The specified command 'run.sh' defined in the app 'runner' is not executable.`11:35
zygasdhd-sascha: it's best to keep it executable in the tree11:35
sdhd-saschaI wonder, why the generated tar isn't executable ... i will see11:35
sdhd-saschazyga: if i go into `multipass exec ` ... Then inside of `/root/parts/runner/build/run.sh` it's executable.11:38
sdhd-sascha```11:38
sdhd-saschaapps:11:38
sdhd-sascha  runner:11:38
sdhd-sascha    command: run.sh11:38
sdhd-sascha```11:38
sdhd-saschaBut i still get: `Failed to generate snap metadata: The specified command 'run.sh' defined in the app 'runner' is not executable.`11:38
sdhd-saschahmm11:39
zyga oh great11:39
zygakilling dash killed gnome-terminal11:39
sdhd-saschawow11:39
zygaanyway11:40
zygajust more bugs11:40
cmatsuokaijohnson: thanks for the PR11:43
cmatsuokaijohnson: it seems that tests failed in a strange way11:43
cmatsuokacommand systemctl is-active snapd.failure.service keeps failing after 30 attempts11:46
cmatsuokaand snapd.service: Failed to execute command: Exec format error11:47
cmatsuokazyga: does that sound familiar to you?11:47
zygacmatsuoka: hey11:47
zygayeah, there's something really broken11:47
zygait's worse recently perhaps the test is new or something related changed11:47
zygait fails left and right11:47
cmatsuokaEXEC spawning /snap/snapd/x2/usr/lib/snapd/snapd: Exec format error11:48
zygaI haven't looked into it yet11:48
zygacmatsuoka: yeah, the test is trying to install corrupt snapd11:48
zygato check failover11:48
zygabut failover never kicks in11:48
zygathe exec format error is expected and harmless noise (for the test)11:48
cmatsuokaah ok11:49
mupPR snapd#8154 opened: tests: reset PS1 before possibly interactive dash <Created by zyga> <https://github.com/snapcore/snapd/pull/8154>12:02
zygamborzecki: ^12:02
zygathis caused me some grief12:02
zygacmatsuoka: I can look after I investigate more systemd stuff12:02
ijohnsoncmatsuoka: zyga: yes I was looking into this a bit yesterday, I think that the patch mborzecki recommended to snap-failure is useful here, I had that small change proposed in my larger core spread PR, but perhaps I should break it out just to get that merged12:03
ijohnsonand morning folks btw12:03
mborzeckiijohnson: hey!12:03
ijohnsonhey mborzecki12:03
* ijohnson is here for real this morning12:03
mborzecki`The dash bug has been reported to the dash mailing list (there is no bug tracker).` hahah welp12:04
zygamborzecki: oh well12:08
zygamborzecki: it's very unixy12:09
zygamborzecki: mailing list archive, CCing people12:09
zygamborzecki: 3-character names12:09
zygathank you for the review guys12:09
mvoI see a lot of red in the most recent spread runs, has anyone investigated yet?12:09
zygamvo: snapd-failover12:09
zygait's failing ... over and ... I'll shut up now ;)12:09
cmatsuoka:D12:10
* zyga returns to debugging systemd 12:10
mupPR snapd#8155 opened: tests: mv ubuntu-core-snapd{,-failover} to core/ suite <Test Robustness> <Created by anonymouse64> <https://github.com/snapcore/snapd/pull/8155>12:14
ijohnsonmvo: I opened ^ which I think will help with the snapd-failover test12:14
cachioijohnson, zyga failover test started failing a time ago but with external backend12:15
cachionow is started failing on travis too12:15
ijohnsoncachio: interesting you mean on the lab machines ?12:15
ijohnsoncachio: do you have logs?12:15
cachioijohnson, http://paste.ubuntu.com/p/DXD9WN7CtJ/12:15
cachiothis is from the 2.43.312:16
cachioline v12:16
cachio133012:16
cachioI though it was related to our configuration for the external backend12:16
cachiobecause it was not possible to reproduce on travis12:16
cachiousing gce12:16
cachiobut seems to be something else12:17
ijohnsonyeah that seems to fail the same it's failing on travis now12:17
ijohnsonthe issue is that snapd.failure.service is not being started ever by systemd12:17
cachiothe weird part is that when using the external backend the test is not failing 100% of the times12:17
cachioright12:17
ijohnsonyes I don't think it is failing 100% of the time on travis either, indeed when I tried reproducing yesterday I didn't hit it12:18
cachioa time ago I shared more info about the error, I am trying to find it12:18
cachioI remember I sent the error to mvo12:19
cachiofailover was trying to find core and core snap was not installed12:20
cachioijohnson, but I cant find the logs12:20
ijohnsoncachio: hmm that sounds different12:20
cachiothe error is the same you see in the log I sent you12:21
cachioI'll reproduce it12:21
cachioand share the logs again, give me 15 minutes12:22
zygagoogle:ubuntu-18.04-64 .../tests/main/cgroup-tracking# test-snapd-sh.sh12:22
zyga$12:22
zygagoogle:ubuntu-18.04-64 .../tests/main/cgroup-tracking#12:22
zyga^ getting this feels like this day is already worth it12:22
ijohnsonthanks cachio12:24
zygamborzecki: but it was worth reporting, it's a known problem12:25
zygahttps://patchwork.kernel.org/patch/11343121/12:25
zygaand there's a patch12:25
mvonice12:28
mupPR snapd#8156 opened: [RFC] cmd/snap-bootstrap: subcommand to detect if we want a chooser to run <UC20> <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/8156>12:40
mborzeckimvo: ^^ the key-watching thing12:41
mborzeckipedronis: mvo: we should probably discuss the naming there12:41
zygaoh12:46
zygaI wanted to mention that yesterday12:46
zygaabout recovery and prompt and stuff12:46
zygasomething to keep an open mind12:46
zygasmart IOT bulbs12:46
zygado recovery12:46
zygaif ...12:46
zygayou turn them on and off five times in a row with specific time delay12:46
zygaour discussion about key detection feels incorrect12:47
zygait should not be something we mandate12:47
codingpanicHello all. I instaleld snapd on a raspbian based pi today. Unfortunately, none of the built-in interfaces/slots are available. What creates these?12:47
pedroniszyga: we don't mandate it, we need a reference experience12:47
codingpanicSince they dont exist, snaps are unable to connect to anything12:48
ijohnsoncodingpanic: did you install snapd as a deb pkg ?12:48
codingpanicijohnson: yes12:48
ograhrm12:48
ograubuntu@localhost:~$ snap version12:48
ograsnap    2.43.3+git1047.g6b52b3712:48
ograsnapd   unavailable12:48
ograseries  -12:48
ograubuntu@localhost:~$ snap list12:48
ograerror: cannot list snaps: cannot communicate with server: timeout exceeded while waiting for response12:48
ograubuntu@localhost:~$12:48
ijohnsoncodingpanic: what's `snap list` ?12:48
ograubuntu@localhost:~$ ps ax|grep snapd12:48
ogra 8932 ?        Ssl    0:00 /snap/snapd/6521/usr/lib/snapd/snapd12:48
ijohnsonogra: why did you break your snapd12:48
ograthis is how i found one of my qemu machines this morning12:49
ijohnsonwell don't leave it like that :-)12:49
ograseems it got some reboot-triggering update yesterday evening12:49
zygaogra: journalctl -u snapd.service12:49
ijohnsonogra: anything in the system journal for snapd?12:49
codingpanicpi@raspberrypi:/var/log $ snap list12:49
codingpanicName     Version   Rev   Tracking  Publisher     Notes12:49
codingpaniccore18   20200124  1671  stable    canonical✓    base12:49
codingpanicscummvm  2.1.1     3126  stable    snapcrafters  -12:49
codingpanicI've also tried retroarch12:50
zygacodingpanic: snap install core12:51
ograhttps://paste.ubuntu.com/p/xvRDVNPW4S/12:51
ograthe last part simply repeats over and over12:51
zygacodingpanic: it's a known bug12:51
codingpaniczyga: got it, thank you!12:51
zygaogra: Feb 19 08:11:22 localhost snapd[824]: panic: cannot checkpoint even after 5m0s of retries every 3s: write /var/12:51
zygaogra: did you run out of disk space12:52
zyga?12:52
codingpaniczyga: that fixed it12:52
ograzyga, bah, yeah12:52
* zyga hugs ogra 12:53
zygabeen there done that (too many times)12:53
zygaI need a coffee12:53
zygabrb12:53
ijohnsonpedronis: I need to run an errand right after SU (3:30 PM your time to be specific), can we put something on the calendar for 4:30 PM your time ?12:54
ijohnsonzyga: I thought we fixed that bug though ?12:54
zygaijohnson: no, that's a separate bug12:54
zygaijohnson: look at snap list output12:54
zygaijohnson: according to our design you have 0 interfaces in that case12:54
zygaijohnson: you must get snapd or core12:55
ijohnsonzyga: but I thought we fixed the bug where the core snap was not installed when you to go install only a snap with `base: core18` ?12:55
zygaijohnson: also this is raspbian so probably runs an older copy12:55
ijohnsonzyga: ah right yes probably older version of snapd with the bug still in place12:55
zygayep12:55
ijohnsonzyga: we should probably update snapd in raspbian, seems to happen somewhat frequently now I think12:56
zygaijohnson: I don't think we can12:56
zygaijohnson: we should update snapd in debian12:56
zygait very old12:56
ijohnsonzyga: who handles snapd update in debian?12:56
zyganobody12:56
zygaus12:56
zygasometimes12:56
zygawhen $fire12:56
ijohnsonalso doesn't raspbian have it's own archives based on, but separate from debian? perhaps I'm misremembering12:57
ograhrm ... freeing up 120MB and rebooting just gets me back to that state :/12:57
zygaijohnson: it does12:57
zygaijohnson: but we don't have any way to change it12:57
ograbah ... because it is secretly upgrading ... it just rebooted again12:57
ijohnsonhooray now ogra's ephemeral qemu VM is more secure13:00
jdstrandzyga: hey, fyi, https://github.com/snapcore/snapd/pull/7825#discussion_r38127706813:07
mupPR #7825: many: use transient scope for tracking apps and hooks <Security-High> <Created by zyga> <https://github.com/snapcore/snapd/pull/7825>13:07
zygahye13:07
zygahey*13:07
zygaack13:07
zygaI'm still looking into this13:07
zygaand especially what happened to your interactive blackbox testing13:07
jdstrandzyga: that's fine. I just wanted you to be aware of that weird thing I saw where I semi-started a transient unit and couldn't get it to go away in case it saves you some time13:10
zygathank you13:11
jdstrandzyga: otoh, can you also make sure that these tests aren't skipped on core? you mentioned trusty but I thought I remembered something about user sessions on sore.... that said, xenial is starting systemd --user so hopefully it is all ok13:12
zygajdstrand: I will13:13
zygajdstrand: I  think this approach was picked specifically to support 16.04 desktop13:13
jdstrandthat sounds familiar :)13:13
jdstrandthat was a long time ago :)13:13
jdstrandzyga: fwiw, I'm really bought into the technique and like the PR overall :)13:14
jdstrandjust need to make sure systemd does what we need everywhere13:14
zygajdstrand: yeah, I just hope I didn't miss anything13:14
jdstrandzyga: I did notice yesterday that runc was using StartTransientUnit for at least something... could look there to build confidence. but one of the things I like about this is we are just asking systemd to take care of things for us. so long as we use the api correctly (which it appears you are), we should be good (barring bugs (systemd or otherwise))13:17
zygajdstrand: the only thing I'm worried about is --user13:17
zygathe API is sold and supported for a long time13:17
zygabut user session is more complex13:17
zygasystemd --user is asked to stuff13:17
zygabut cannot13:17
zygaso asks system systemd to do it13:17
jdstrandzyga: kinda like how we ask it to manage starting and stopping services. we tell it a little bit and say "thanks for doing this for me!". same here13:17
zygaand that path I believe is somewhat new13:17
zygaI'm digging into this now13:18
* jdstrand nods13:18
jdstrandzyga: well, it isn't *terribly* new if it is in bionic (I didn't test xenial), but there is definitely some different behavior13:18
jdstrandwhich might be cherrypicks, etc13:19
zygaI think it's all the way back to xenial, but give me a moment to debug it13:19
jdstrandzyga: we always have in out back pocket runtime detection and gracefully degrading too (not saying we need it yet)13:19
jdstrands/in out/in our/13:19
zygayes, it's not essential in a way13:20
cachioijohnson, this is the output https://paste.ubuntu.com/p/qNnRgrHktg/13:22
mvomborzecki: thanks, looking13:22
cachiofrom a local run on core-1813:23
cachioijohnson, at the end you can see the issue that I raised a time ago snap-failure[9015]: cmd_snapd.go:124: restoring invoking snapd from: /snap/core/current/usr/lib/snapd/snapd13:23
ijohnsonthanks cachio, I'm looking now13:23
cachioalso the error snapd.service: Failed at step EXEC spawning /snap/snapd/x1/usr/lib/snapd/snapd: Exec format error13:24
ijohnsoncachio: the EXEC spawning issue is very odd, because it should be x2 that has the Exec format error13:25
jdstrandzyga: so it is crystal clear: the reproducer is: take a desktop vm install, install snapd/enable the feature/blah, in the graphical session, open a terminal, use hello-world.sh/snap run --shell/etc and it works (find /sys/fs/cgroup -name '*.scope'). ssh in/console login and try the same and it doesn't (but not errors in SNAPD_DEBUG=1 or anywhere else I could find13:26
jdstrand)13:26
* zyga nods 13:26
zygalet me check that quickly13:26
zygaI was going via spread and I hit another issue there13:26
* jdstrand nods13:26
cachioijohnson, I have a debug session opened13:27
cachioijohnson, if you need any output please tell me13:27
jdstrandI figured it might be easier to see what to recreate if you saw the problem13:27
ijohnsonI think there is a known issue in that snap-failure should be more robust in trying to restart the socket, and isn't13:27
ijohnsoncachio: ^13:27
* jdstrand leaves zyga to it13:27
ijohnsoncachio: sorry let me keep reading I'm not sure what I would like you to run in the debug session13:28
zygajdstrand: wwo13:32
zygajdstrand: so13:32
zygajdstrand: it works on a fresh install of bionic13:32
zygajdstrand: desktop session13:32
zygajdstrand: I recorded bustle (dbus traffic analyzer) dumps13:32
jdstrandzyga: right, in the desktop session, it works13:32
zygaand even confirmed it the simple way13:33
zygahttps://www.irccloud.com/pastebin/pENvRQqK/13:33
zygaoh? I missed that13:33
zygawhen didn't it work?13:33
jdstrandzyga: ssh in or do console login and it doesn't13:33
zygaah13:33
zygaI see13:33
zygaokay that's much clearer now13:33
zygathanks, I'll focus on that13:33
jdstrandzyga: that is what made it tricky for me to diagnose :)13:33
zygajdstrand: but I'm 99% sure it's because there's no session bus there13:33
zygaand this is a dbus interaction13:33
jdstrandzyga: I saw systemd --user started with just ssh13:34
zygajdstrand: ok, I'll debug both cases (console login and ssh)13:34
zygajdstrand: were you logging into the same user as the interactive session?13:34
zygajdstrand: or another user13:34
jdstrandzyga: but yeah. maybe there is an activation thing or something. idk13:34
jdstrandI sopped looking when I ran out of day13:34
jdstrandstopped13:34
zygaok13:34
zygaI'll explore those13:34
zygabecause previously it was a bit magic13:35
jdstrandzyga: I was13:35
zygaas to why it worked in places I checked13:35
jdstrand(same user)13:35
zygaok13:35
zygathat would explain the systemd --user aspect13:35
jdstrandzyga: well, no. I logged into core a few minutes ago before asking about adding uc tests and there was --user13:36
zygajdstrand: ok, please give me a day to get to the bottom of this13:36
zygajdstrand: if you could review any of the other PRs I mentioned I could perhaps make progress on those13:36
zygajdstrand: but for this one, I know what to do13:36
jdstrandthat doesn't mean there is a dbus session bus...13:36
zygajdstrand: right13:36
jdstrandyes, that was the plan13:36
zygajdstrand: perhaps 18.04 doesn't have enough activation13:36
zygatesting is paramount13:36
zygaI'll try to cover those13:37
* jdstrand nods13:37
sdhd-saschamvo: fyi. github runner tries to create config files in the binary directory. This needs to be fixed first. Seems to be hardcoded path's.13:42
sdhd-saschahttps://github.com/sd-hd/runner-snap/releases13:42
zygajdstrand, mvo: fyi https://github.com/snapcore/snapd/pull/7825#discussion_r38129677213:43
mupPR #7825: many: use transient scope for tracking apps and hooks <Security-High> <Created by zyga> <https://github.com/snapcore/snapd/pull/7825>13:43
mvozyga: 0.05s? that is not bad :)13:46
zygamvo: that's wall clock time13:48
zygamvo: the actual new code is 1ms13:48
ograyou have a wall clock that measures 0.05s ?!?13:57
roadmrogra: well if you look hard enough a lot of wristwatches measure 0.25s (seconds hand moves at 4hz)13:58
roadmrthat's mechanical wind-up wristwatches mind you13:58
* roadmr now looks really old and anachronistic ⌚13:58
ograhaha, yeah14:00
zygajdstrand: reproduced14:08
zygahttps://www.irccloud.com/pastebin/lg1cEV4Q/14:08
zygajdstrand: ^ that's bionic over ssh as user14:08
zygajdstrand: running systemd-run --user --scope ls14:08
zygajdstrand: on the up side, I now have systemd --user running as my user (remote ssh activated it)14:09
zygajdstrand: on the down side, no idea why it failed yet14:09
zygabut I can dig14:09
* jdstrand nods14:10
jdstrandI <3 reproducers14:10
zygajdstrand: but that's really weird, raspbian (older) works okay14:10
zygajdstrand: but I'll dig14:10
zygajdstrand: I'll check 20.04 and 16.04 as well14:12
zygajdstrand: _complexity_ :)14:12
jdstrandzyga: yes. hopefully it isn't too deep (I recall how the snap_daemon spun out cause of bugs, bugs, bugs. I hope very much that isn't the case here (I don't think it is))14:14
zygaI think it's just configuration (famous last words?)14:16
mborzeckijdstrand: zyga: your ssh shell should already be part of a user slice/session scope (?)14:18
zygait is14:18
mborzeckizyga: same when you log in on a tty, though no clue who arranges that, pam & logind maybe?14:19
mborzeckis/who/what/14:19
zygamborzecki: the problem is why remote cannot systemd-run14:20
zygasmells like polkit to me14:20
zygaI'll look after the standup14:20
mborzeckihmm hmm intersting14:23
roadmrjdstrand, pedronis : is there a non-system-files interface that would allow write access to /etc/ssh/ssh{,d}_config ?14:23
roadmrif not, is it acceptable to grant system-files in this case (it's a device-specific snap in a brand store, so mostly well-scoped)14:24
zygamborzecki: it's polkit14:30
zygamborzecki: give me a while to get the bits in place14:30
mborzeckizyga: polkit?14:30
zygayep14:30
zygait's always polkit ;)14:30
mborzeckiit's always /pluseaudio/polkit/snapd/14:30
mborzeckiheh forgot systemd14:30
zygaI need to take the dog out14:35
roadmrwho let the dogs out? woof wof...14:38
mborzeckizyga: doesn't look like polkit to me14:42
mborzeckiopenat(AT_FDCWD, "/sys/fs/cgroup/unified/user.slice/user-1000.slice/user@1000.service/run-rbcaa5f3010d646e38684382cacc4aa70.scope/cgroup.procs", O_WRONLY|O_NOCTTY|O_CLOEXEC)14:42
mborzecki= 3414:42
mborzeckifcntl(34, F_GETFL)                      = 0x8001 (flags O_WRONLY|O_LARGEFILE)14:42
mborzeckiwrite(34, "2821\n", 5)                  = -1 EACCES (Permission denied)14:42
mborzeckiuh, w8 that's on focal ;)14:42
mborzeckihm weird, there's this sequence14:46
mborzeckiopenat(AT_FDCWD, "/sys/fs/cgroup/unified/user.slice/user-1000.slice/user@1000.service/run-r51cceeed963d4c33ace8cf029e2657d5.scope/cgroup.procs", O_WRONLY|O_NOCTTY|O_CLOEXEC)14:46
mborzecki= 3014:46
mborzeckiwrite(30, "2504\n", 5)                  = -1 EACCES (Permission denied)14:46
mborzeckiclose(30)14:47
mborzeckifollowed by opening *.scope directory and getdents()14:47
mborzeckianyways, the message is Failed to add PIDs to scope's control group: Permission denied which kind of makes sense in this context14:48
zygare14:53
zygamborzecki: which systemd is that?14:53
zygaah, focal14:54
zygamborzecki: systemd --user should ask system to to that write14:54
zygamborzecki: that's what I read from sources14:54
zygamborzecki: anyway, I'll keep digging14:54
* zyga went outside for a moment and finally cooled the fever of his laptop while using google meet15:01
pedronisroadmr: I don't think there is one atm15:02
roadmrpedronis: :( any objections with using system-files for this then?15:03
pstolowskimvo:  thanks for the suggestion re version check!15:03
zygadns timeouts ugh15:05
zygaFeb 19 07:03:47 bionic systemd-resolved[618]: Grace period over, resuming full feature set (UDP+EDNS0) for DNS server 172.16.153.2.15:05
pedronisroadmr: no deep objections as long it's scoped, we might add something like sshd-control at some point but not immediately15:05
jdstrandroadmr (cc pedronis): there isn't one. if this is in a brand store, 'no', but they can certainly hurt themselves15:06
jdstrandpedronis: I might suggest 'snap set core' things instead of sshd-control15:06
pedronisjdstrand: yea, but we need clear use cases15:07
pedronisto get tehre15:07
pedronisget there15:07
jdstrandyes15:07
jdstrandpedronis: they can be added one by one to snap set. sshd-control is risky and security sensitive15:08
jdstrandimho15:08
pedronisjdstrand: I understand, the surface of the config is large though15:08
jdstrandroadmr: but yeah, no objections for system-files in a brand store so long as they know the risk15:08
pedronisanyway not something we'll do soon15:08
* jdstrand nods15:08
roadmrjdstrand: yep, like pedronis said this can be scoped to the specific store to limit risk. I can tell them there is indeed a risk so they are super aware15:08
roadmrthanks jdstrand pedronis15:09
pedronisroadmr: but if they can tell use what kind of config they are changing it could help us design something later15:09
pedroniss/use/us/15:09
roadmrpedronis: sure thing, I'll also ask15:09
pedronisroadmr: thx15:10
zygajdstrand: FYI https://github.com/snapcore/snapd/pull/7825#discussion_r38136033615:44
mupPR #7825: many: use transient scope for tracking apps and hooks <Security-High> <Created by zyga> <https://github.com/snapcore/snapd/pull/7825>15:44
zygagithub is kind of slow for me today15:44
zygabut new notifications are really cool, a big improvement over YOU HAVE NAN NOTIFICATIONS15:44
jdstrandgithub is both slow and broken for me today15:45
jdstrand500 on the above url15:45
zygaalso 500 often15:45
zygahaha15:45
zygayeah, just reload a few times15:45
zygait works eventually15:45
jdstrandnot able to expand resolved conversations...15:45
zygajdstrand: at some point we can say that github is slow because new core snap refreshes ;-)15:45
jdstrandheh15:46
zygajdstrand: the comment says that cgroup v2 doesn't have pids, it's just one big flat tree15:46
zygajdstrand: in v1 we _could_ scope it but current code doesn't require it15:46
zygajdstrand: scoping it mainly requires us to know more about systemd, we could do it at a low cost but I deem it non-essential now15:46
zygajdstrand: certainly something we can improve ahead of making this stable15:46
zygajdstrand: I don't know how you develop locally but I find a desktop (workstation) installation (vm) of fedora 31 is very useful for exploring a full complex system that uses pure v2 mode15:47
jdstrandzyga: I obviously need some practical v2 experience since I can never remember how the layout is. I occasionally have a fedora vm, but I find by the next time I need to look at fedora, the one I have is eol ;P15:52
jdstrandzyga: I replied15:52
zygajdstrand: :)15:52
ijohnsonokay I think that I have the snapd-failover test working again15:55
ijohnsonwill push my fix up15:55
zygaijohnson: great news15:55
pedronisgreat15:55
zygaijohnson: cannot wait to see what it was15:55
ijohnsonStartLimitInterval{,Sec}=0 doesn't play well with OnFailure, because the OnFailure only runs after the unit is in "totally really absolutely failed" state, which systemd only considers after it has exhausted the start limits15:56
ijohnsonso in our spread tests we specifically set StartLimitInterval=0, which breaks the OnFailure15:57
roadmrjdstrand: how can I combine allow-installation constraints? I have used "on-store" by itself, and "plug-attributes" by itself but for this one I need both to apply (ANDing them). I can provide both snap.yaml and my partial snap-decl plugs thingy15:57
ijohnsonIt's unclear why this broke now, I think it may have been a systemd bug that this test worked at all on spread before, and they just now fixed the bug that makes it resiliently fail15:57
jdstrandroadmr: please supply both. feel free in privmsg15:59
roadmrjdstrand: coming up!15:59
ijohnsoncachio: zyga: mvo: can y'all (re-)review #8155? I think the test is fixed now, but I want to be sure that there's not unintended consequences to setting StartLimitInterval=10016:01
mupPR #8155: tests: mv ubuntu-core-snapd{,-failover} to core/ suite <Test Robustness> <Created by anonymouse64> <https://github.com/snapcore/snapd/pull/8155>16:01
zygay16:01
zygaanother new github feature? https://usercontent.irccloud-cdn.com/file/m2oxIZH8/Zrzut%20ekranu%202020-02-19%20o%2017.02.13.png16:02
mborzeckizyga: the first bit was from focal, the sencod from bionic16:04
zygaijohnson: I cannot find StartLimitInterval documentation16:04
zygaijohnson: but there is StartLimitIntervalSec16:04
zygaijohnson: and also StartLimitIntervalBurst16:04
ijohnsonzyga: it was renamed to StartLimitIntervalSec in systemd 230 I think16:04
zygaijohnson: are both names supported?16:04
ijohnsonzyga: I'm pretty sure, but let me double check16:05
zygaijohnson: interesting, I would love to understand why it worked before16:08
zygaijohnson: perhaps there's more to it than that16:08
zygaijohnson: I think that the explanation on how the value 0 interacts with OnFailure is good but I need to cross check it with documentation16:08
ijohnsonleonard has some comments on bug reports explaining the interaction, does that count :-)16:09
ijohnsonzyga: see comment 7 https://bugs.freedesktop.org/show_bug.cgi?id=8779916:09
ijohnsonbut then of course see comment 8 right below it :-)16:10
zygaijohnson: indeed, we should reference in the code there16:12
zygahttps://bugs.freedesktop.org/show_bug.cgi?id=87799#c716:12
ijohnsonshould we add a comment in the code or the commit message ?16:12
cachioijohnson, nice, reviewing it16:12
zygain the code I think16:12
zygaeasier to keep track of16:12
ijohnsonhmm I guess if it was in the override I would have seen it much quicker in the running system16:12
ijohnsonyeah good point I'll add it to the code16:13
pedronispstolowski: I re-reviewed 813016:17
pstolowskipedronis: just saw it, thank you!16:18
ijohnsonokay I have it added to the override unit we put in the spread image, I will put it in a followup PR so that as soon as this one is green we can merge16:21
pstolowskiijohnson: can you take another look at #8003 when you have a moment?16:29
mupPR #8003: o/ifacestate, api: implementation of snap disconnect --forget <Needs Samuele review> <Created by stolowski> <https://github.com/snapcore/snapd/pull/8003>16:29
ijohnsonpstolowski: I'll try to look this afternoon or tomorrow morning16:29
pstolowskithx16:32
zygaijohnson: reviewed, please look16:32
zygaI'll be back shortly16:32
ijohnsonzyga: it looks like StartLimitInterval is only valid in [Service] in systemd 229 and below, and for systemd 230 and above, StartLimitInterval is supported in systemd 230+ in [Service], but StartLimitIntervalSec is only supported in [Unit], so we are fine to keep using StartLimitInterval in [Unit] like the test does16:38
ijohnsonsee https://lists.freedesktop.org/archives/systemd-devel/2017-July/039255.html, but this is poorly documented in systemd docs16:38
ijohnsonzyga: how would you like me to document that in the PR?16:38
ijohnsonzyga: see the corresponding commit too https://github.com/systemd/systemd/commit/f0367da7d1a61ad698a55d17b5c28ddce0dc265a16:40
zygaijohnson: a code comment will suffice17:01
zygaI just felt it is a bit too magic to leave as-is17:01
mupPR snapcraft#2948 opened: Regenerate the GDK pixbuf loaders cache file if for whatever reason it isn't there (LP: #1863801) <Created by oSoMoN> <https://github.com/snapcore/snapcraft/pull/2948>17:03
ijohnsonzyga: a follow-up okay for the comment then?17:03
zygaijohnson: is it green now?17:05
ijohnsonAh debian seems unhappy now17:06
roadmr😞  <- debian17:07
cwayneI love how roadmr is just like the dad of all IRC channels17:08
roadmrI am everywhere.17:08
roadmrIam inevitable 👌17:08
* zyga needs a nap17:08
zygaijohnson: feel free to discard my review and land with a comment17:09
zygaI will be back in 2 hours17:09
zygaSo sleepy now17:09
mupPR snapd#8157 opened: tests: using google storage when downloading ubuntu cloud images from gce <Created by sergiocazzolato> <https://github.com/snapcore/snapd/pull/8157>17:30
* cachio afk 17:43
cachiogoing to the doctor17:43
zygare19:22
zyga*ah, I needed that*19:22
zygaso now nfs-support fails19:23
zygaijohnson: do you think that is related to the change?19:24
zygathat's an 18.04 failure19:24
zygabut actually, not19:24
ijohnsonzyga: yes I noticed now that fails on the PR19:24
ijohnsonzyga: I have a fix for that19:24
zygaacross the board19:24
zygado you know what happened?19:24
ijohnsonI was setting StartLimitInterval wrong, that's just the, well, interval, not the number of times starts can happen19:24
ijohnsonStartLimitBurst is the number of times it can start19:24
zygaaha19:25
zygaok, let's try that19:25
ijohnsonSo I changed it to set both StartLimitBurst and StartLimitInterval and now it's good19:25
ijohnsonlet me push it up19:25
zygathanks!19:25
ijohnsonwas in the middle of UC20 debugging19:25
zygano worries, I just woke up19:25
mupPR snapcraft#2949 opened: Fix clean on Windows <Created by NickZ> <https://github.com/snapcore/snapcraft/pull/2949>19:39
mupPR snapd#8158 opened: tests: disable archlinux system <Created by sergiocazzolato> <https://github.com/snapcore/snapd/pull/8158>19:55
ijohnsoncachio: should we use the no-spread label on that PR?19:56
mupPR snapd#8158 closed: tests: disable archlinux system <Skip spread> <Created by sergiocazzolato> <Merged by sergiocazzolato> <https://github.com/snapcore/snapd/pull/8158>20:05
mupPR snapcraft#2933 closed: remote-build: introduce --launchpad-snapcraft-channel option <Created by cjp256> <Closed by cjp256> <https://github.com/snapcore/snapcraft/pull/2933>21:16
mupPR snapd#8159 opened: snap-bootstrap: remove created partitions on reinstall <UC20> <Created by cmatsuoka> <https://github.com/snapcore/snapd/pull/8159>23:36

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!