/srv/irclogs.ubuntu.com/2018/03/23/#snappy.txt

bashingboiand if so, how does it run performance wise?00:00
zygabashingboi: I haven't tried on debian yet00:04
zygaI can try tomorrow00:04
zygatoday I should really hit the bed :)00:04
bashingboiha, np. I will try for myself and check how it runs00:05
mwhudsonLaney: yeah i guess unpacking repacking the initramfs is even more steps02:17
mwhudsonLaney: it's at https://github.com/CanonicalLtd/subiquity/blob/master/scripts/inject-subiquity-snap.sh although bits of that are hardcoded for subiquity, the general bits could be extracted i think02:18
=== chihchun_afk is now known as chihchun
=== chihchun is now known as chihchun_afk
mborzeckimorning06:10
zygao/06:14
zygamborzecki: can you please review https://github.com/snapcore/snapd/pull/491206:16
mupPR #4912: overlord/configstate: change how ssh is stopped/started (2.32) <Critical> <Created by zyga> <https://github.com/snapcore/snapd/pull/4912>06:16
zygamborzecki: if you have a kvm or a pi at home it's best to try it for real too06:16
mborzeckizyga: looking06:17
mborzeckizyga: wasn't sshd.service -> ssh.service already addressed by a patch from mvo?06:18
mborzeckizyga: or maybe it was the other way around06:18
zygait was06:19
zygabut it broke badly06:19
zygaonce sshd is disabled06:19
zygait cannot be enabled06:19
zygabecause it is an alias06:19
mborzeckizyga: snapd alias or some sort of system alias?06:20
zygasystemd alias06:21
zygalook at systemctl cat ssh.service06:22
zygasshd is an alias there06:22
zygaif you disable ssh.service06:22
zygathe alias goes away06:22
zygaand cannot be enabled, started or stopped06:22
zygawe used sshd instead of ssh because of the issue with writable paths06:22
zygaif you just switch to ssh everything is fine06:22
zygabut then you reboot06:22
zygaand ssh is re-enabled06:22
zygabecause writable paths is terrible06:22
zygaand ssh is enabled by default so it bring the file back06:23
zygathis is why I used two things: sshd -> ssh so that enable/disable works06:23
zygaand the conditional file to actually control being off or on06:23
mborzeckii'm reading up the systemd.unit manpage06:24
zygaok06:24
mborzeckinice, > aliases specified through Alias= are only effective when the unit is enabled.06:24
zygaI may have missed something, hence the request for review06:24
zygait's an urgen issue affecting customer deployments06:24
mborzeckizyga: wow that service file is interesting06:32
mborzeckizyga: sshd_not_to_be_run is a debian thing right?06:32
zygaProbably06:34
mborzeckizyga: a similar attemp was there in configure hook in the core snap https://github.com/snapcore/core/commit/dc782a37a3aef3fb11f256260f5205f2ff256acd06:52
zygaoh, that's interesting06:53
zygawe we had this06:53
zygaand lost this?06:53
mborzeckiyeah, i was looking for the actual contents of writable paths and found this instead :P06:54
mupPR snapd#4903 opened: tests: [WIP] split prepare-restore.sh into prepare-restore.d modules <Created by zyga> <https://github.com/snapcore/snapd/pull/4903>07:09
mupPR snapd#4913 opened: interfaces,release: probe seccomp features lazily <Created by zyga> <https://github.com/snapcore/snapd/pull/4913>07:36
zygahmmm, my PR just got a massive TLS handshake failure with everything (google, linode)07:38
zygaall tasks aborted07:38
zygahope this is not a norm today07:38
mborzeckizyga: ok, played a bit with core image, if i understand it right since we disable the service, the symlink is removed from etc, but then it's still restored from core snap because /etc/systemd/system is synced in writable-paths?07:39
zygayes07:40
zygathat's the issue07:40
zygathat's the reason we switched to sshd service07:40
zygabut this had unintended consequences07:40
mborzeckifrom what i see disabling/masking sshd does not work either07:40
zygayep07:46
zygaI'm testing another mode now07:46
zygawhere sshd was disabled07:46
zygathus breaking people07:46
zygaand chceking if the new core with my fix  heals that07:47
mborzeckihah, now i'm locked out of the ssh, and serial console only shows the prompt with ssh login :P, how can i log in?07:48
zygamborzecki: ha07:50
zygayank the card07:50
zygaset a password07:50
zygaand put it back in07:50
zygamvo: so to recap07:50
zygamvo: I tested the scenario where sshd.service was disabled07:51
zygaand that the new patched snapd can fix that state simply by rebooting (which happens on update) and starting ssh.service07:51
zygaby setting the core config value07:51
zygaso I think this is a fix for the issue in general and for the machines affected in particular07:51
mvozyga: we do more than just disable, we also mask, I think this will result in a different outcome07:52
mvomy feeling is that we need to at least unmask (opportunisticly and ignore the error)07:53
zygaI will test that now07:55
zygayes, perhaps07:55
zygamvo: I think masking doesn't work07:56
zygazyga@localhost:~$ sudo systemctl disable sshd.service07:56
zygazyga@localhost:~$ sudo systemctl mask sshd.service07:56
zygaFailed to execute operation: File exists07:56
zygathe initial state was that neither sshd nor ssh were masked07:57
mvozyga: let me check07:57
zygahmm07:57
zygamaybe I'm wrong07:57
zygalet me reset everything07:57
mvozyga: it used to be the case, we masked ssh.service and then sshd.service would reappear via the writable-magic. but let me double check :)07:58
zygaok07:58
zygatested that now07:58
mvomy vm will take a moment to be ready, it has an older core07:59
zygainitial state https://www.irccloud.com/pastebin/BMckHA77/07:59
zygaso sshd is disabled and masked07:59
zyganow I'm setting the core config07:59
zygaand ssh starts and I can log in07:59
zygaah07:59
zygabecause I *start* it07:59
zyganot enable it08:00
zygalet's reboot now08:00
zygaI mean, the hack just starts ssh directly08:00
zygaso even if sshd is masked08:00
zygathat's not an issue08:00
zygaI think we should unmaks it though08:00
zygabut it's not for corectness08:00
zygabut more for purity08:00
zygamvo: it's also good after reboot08:01
zygaeven though sshd is masked08:01
mvozyga: even with the mask?08:01
zygalrwxrwxrwx 1 root root 9 Mar 23  2018 /etc/systemd/system/sshd.service -> /dev/null08:01
mvozyga: wuuut? crazy08:01
zyga(this is after reboot)08:01
zygawoah08:02
zygahttps://www.irccloud.com/pastebin/LHAlPevl/08:02
zygalook at this08:02
zygasshd.service changed on disk08:02
zygathis is after reboot, no touching at all08:02
zygahow come?08:02
zygaI'll reboot again just to be 100% sure08:02
mvozyga: I think its confusion from ssh.service vs sshd.service but let me double check08:03
zygaso  +1 to unmask it08:03
mvozyga: I'm so happy we have the /etc/ssh_no_start thing, this unit aliases seem to be crazy08:03
zygayes, I have the same feeling08:04
zygatesting the new build now08:04
zygaI need to buy some more serial ports08:07
=== pstolowski|afk is now known as pstolowski
pstolowskimornings08:07
zygathis makes testing so much easier08:07
mborzeckimvo: pstolowski: morning guys08:08
mvohey mborzecki08:09
mborzeckizyga: mvo: could we straighten out the situation with ssh instead of using that ssh_no_run.. monstrosity?08:09
mvomborzecki: maybe08:09
mvomborzecki: so there are multiple issues08:09
zygamborzecki: do you have a proposal?08:09
zygaI mean, yes, but not trivially08:10
kalikianagood morning08:10
zygawe'd have to remove writable-path sync on systemd services08:10
mvomborzecki: our ssh.service is using Alias=sshd.service08:10
mvomborzecki: our core snap has "/etc/systemd/system/sshd.service" which is in writable-paths marked as "sync". this means that when the file is missing it will be copied from the core snap08:11
mborzeckias i understand we could fix it in the core snap, have just one ssh.service and no aliases08:11
mvomborzecki: so we need to unmask both ssh and sshd service08:11
mvomborzecki: yes, I think that is a good idea, fix the core snap to a) remove the alias b) not use /etc/ but /lib for the unit08:11
mvomborzecki: but its more risky than what zyga is doing08:12
mvomborzecki: what I really like about the zyga PR is that its dead-simple08:12
zygamvo: force pushed with the unmask08:12
zygatested that it works all the way08:12
mvota08:12
mborzeckimhm, it is :)08:12
mvomborzecki: but gustavo also wants to talk about this so maybe it will be solved differently :)08:13
zygamvo, mborzecki: one note, we need to tie this to core1608:13
zygawith core18 the approach should be direct and without hacks08:13
mvozyga, mborzecki I think its fine to revisit this very soon and fix core16 directly, i.e. fix our ssh install and go back to straight systemctl stop/disable ssh08:13
mborzeckias far as i'm considered, it's +1, but ideally we should fix this and use ssh.service (or sshd, just make sure that it's a single one, no aliases or whatnot)08:14
mvoagreed08:14
zyga(sshd cannot be used)08:14
zygawe must use ssh.service08:14
zygaand get rid of that sync stanza on that director08:14
zyga*directory08:14
mvomborzecki: whats also annoying is that even if we do stop/disable/mask ssh and sshd it fails on enable because the operations are not symetric08:14
mvomborzecki: i.e. unmask/enable/start ssh sshd will fail during enable sshd because it says there is a symlink alsready08:15
mborzeckimvo: yeah, i did this in the console :P08:15
mvomborzecki: which boggles my mind, but maybe I'm missing something08:15
mborzeckiand then locked out myself :P08:15
zygahaha08:15
zyga:D08:15
zygawell08:15
mvomborzecki: like how can you sanely stop a service with aliases08:15
mvomborzecki: haha08:15
zygaI'm happy with this patch08:15
* mvo hugs zyga 08:15
zygaI won't touch it until we talk to gustavo08:15
* mvo hugs zyga harder08:15
mborzeckimvo: 'Failed to execute operation: Too many levels of symbolic links' maybe it's something in system(d,ctl) too08:17
mvomborzecki: yeah08:19
mvomborzecki: like "wuuuuut"?08:19
mvomborzecki: what kind of error is that to begin with08:19
pedronismvo: hi, there's quite bit of PRs marked for 2.32,  anything in particular that I could help with a review?08:19
mvomborzecki: anyway, sorry, I'm still perplexed how difficult it is to disable ssh via systemctl08:20
mborzeckizyga: pushed an update to #490208:20
mupPR #4902: cmd/snap-confine: nvidia: preserve globbed file prefix <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4902>08:20
mvopedronis: let me unmark some08:20
mborzeckimvo: disabling is simple, making sure that it never gets enabled is a bit harder :P08:20
mvopedronis: 4882 might be a good one, that is the most critical one right now08:20
mvomborzecki: heh, exactly08:21
zygaI'll push a unmask of ssh.service now08:22
zygajust testing that after reboot08:22
mvomborzecki: quick question about 4902 - that fixes nvidia issues on arch?08:24
pedronismvo: does 4882 need a follow up to use 4911 ?08:25
mborzeckimvo: yes, there's another pr for ubuntu #4908 (though still RFC) that includes the patch from 490208:25
mvopedronis: yes, I will talk to gustavo about it08:25
mupPR #4908: [RFC] cmd/snap-confine: attempt to detect if multiarch host uses arch triplets <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4908>08:25
mvopedronis: I think technically 4882 will solve the problems we are seeing but gustavo was keen to add the second layer of check (snap run talking to snapd on system-key mismatch)08:26
mvomborzecki: ta08:26
zygamvo: unmasking both now08:27
mvozyga: ta08:27
mupPR snapd#4909 closed: interfaces: harden snap-update-ns profile (2.32) <Created by zyga> <Merged by mvo5> <https://github.com/snapcore/snapd/pull/4909>08:28
mupPR snapd#4907 closed: advisor: deal with missing commands.db file <Created by mvo5> <Merged by mvo5> <https://github.com/snapcore/snapd/pull/4907>08:30
mupPR snapd#4913 closed: interfaces,release: probe seccomp features lazily <Created by zyga> <Merged by mvo5> <https://github.com/snapcore/snapd/pull/4913>08:30
mupPR snapd#4914 opened: advisor: add comment why osutil.FileExists(dirs.SnapCommandsDB) is needed <Created by mvo5> <https://github.com/snapcore/snapd/pull/4914>08:34
mupPR snapd#4915 opened: interfaces,release: probe seccomp features lazily <Created by mvo5> <https://github.com/snapcore/snapd/pull/4915>08:35
pedronismvo: left some questions/comments08:38
mvopedronis: yay,thank you!08:38
zygamborzecki: https://github.com/snapcore/snapd/pull/4902/files#r17667068508:54
mupPR #4902: cmd/snap-confine: nvidia: preserve globbed file prefix <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4902>08:54
mborzeckizyga: responded :)08:56
zygareplied too08:57
mborzeckizyga: ok, makes sense, btw. i was surprised by it too08:59
pedronismvo: not urgent but I'm confused/see some duplication of things in the roadmap page (for 2.32 and 2.33 and after)09:01
zygamvo: did you notice https://github.com/snapcore/core/commit/dc782a37a3aef3fb11f256260f5205f2ff256acd09:03
zygamborzecki pointed me to it09:03
mborzeckizyga: force pushed09:05
zygathank you09:05
mborzeckidamn, i feel sick, finally caught something from the junior09:08
mvozyga: no, huh09:12
zygamborzecki take it easy09:12
zygamvo: fun, right?09:12
zygawhere is that, it's gone now?09:12
zygait was lost when we went to in-core config09:13
mvozyga: yeah :(09:13
mvozyga: sadly09:13
zygafull circle09:13
Chipaca'sup?09:14
zygaChipaca: fire, smoke, screams and tears09:14
Chipacazyga: I didn't ask about your bedroom09:14
zygaChipaca: it's slowly getting better onw09:17
Chipacazyga: good to hear :-)09:18
Chipacawhat did we lose by moving to in-core config?09:18
* Chipaca curious09:18
zygaChipaca: ssh control09:19
zygahttps://github.com/snapcore/snapd/pull/491209:19
mupPR #4912: overlord/configstate: change how ssh is stopped/started (2.32) <Critical> <Created by zyga> <https://github.com/snapcore/snapd/pull/4912>09:19
kalikianaso, I have a question about the ssh-keys interface. I added it to my snap, did `snap connect` to enable it but when the snap runs I'm getting `Permissions 0644 for '/home/rumo/.ssh/id_local' are too open.`09:23
kalikianaThe permissions that ls reports within --shell as the same as on the real system, though. And of course the permissions are correct on the real system.09:24
mvozyga: quick question about reverts from 2.32 -> 2.31. the new/updated security profiles will break reverts, no? I mean, suppose 2.32 has the stricter profiles and it loads the per-snap update-ns profiles. now on revert the old snap run will start network-manager and that will use the new 2.32 profiles which are strict and won't let n-m run, is that correct?09:26
zygaon revert snap-confine will load the old profile which is permissive for content and doesn't support layouts09:27
Chipacakalikiana: that's very weird09:27
Chipacakalikiana: is the snap in the store? i'd like to take a look09:27
zygamvo: the snap-confine profile is from core directly09:27
mvozyga: awsome09:27
zygamvo: and the issue didn't affect the regular profiles, just s-u-n profile09:27
mvozyga: so no problem here, just wanted to double check09:28
zygamvo: I think it should work09:28
zygabut I did not check09:28
mvozyga: its part of our standard tests so we will know09:28
zygaok09:28
kalikianaChipaca: I haven't pushed that bit yet since it's not working, but I can push it09:28
Chipacakalikiana: is it an easy to build snap?09:29
Chipacai mean, just run snapcraft and wait sort of thing?09:29
Chipacain that case i could use just the snapcraft.yaml :-)09:29
Chipacato be clear, this is just me being lazy: i'd rather tinker with it to figure out what's going on, than think about it in the abstract09:30
mvozyga: hey, does https://paste.ubuntu.com/p/jxtYz7z9CD/ ring any bells?09:43
zygammm09:43
mvozyga: happend on master09:43
zygais that on release09:43
zygaaww09:43
zygaI didn't push the debug thing to master09:43
zygamvo: can you please force-land that https://github.com/snapcore/snapd/pull/491609:45
mupPR #4916: tests: change debug for layout test <Created by zyga> <https://github.com/snapcore/snapd/pull/4916>09:45
mupPR snapd#4916 opened: tests: change debug for layout test <Created by zyga> <https://github.com/snapcore/snapd/pull/4916>09:46
Chipacakalikiana: so, I can't reproduce your issue09:56
Chipacakalikiana: wondering what's different between here and there09:56
Chipacakalikiana: one thing though, i don't think your snap needs to include openssh-client, as that's part of core and ssh-keys gives you access to it (at least to /usr/bin/ssh)09:58
Chipacabut here it works with both the ssh in the snap, and the one in core09:58
kalikianaAh, I didn't realize that. So I can drop the package.10:00
Chipacakalikiana: but that doesn't answer where the 0644 error is coming from :-)10:00
kalikianaChipaca: Could it be a bug in snap connect then? Vaguely thinking of content interface issues depending on the order of install/connect/snap executation10:01
Chipacakalikiana: wait, can you ssh from within --shell?10:01
Chipacathat's what i am doing and it works, if it works for you as well the issue is even weirder10:01
kalikianaChipaca: No, same error, even on a simple "ssh helga" (helga being an arbitrary host that works outside confinement)10:02
Chipacaphew10:03
kalikianaChipaca: Also getting a `Control socket connect(/home/rumo/.ssh/socket-root@***.***.***.***:22): Permission denied`with just ssh, before the other error10:04
Chipacakalikiana: just to make sure, can you ‘find /home/rumo/.ssh -ls’?10:04
kalikianaChipaca: That works and lists each key with -rw-r--r--10:06
Chipacakalikiana: inside, or outside?10:06
kalikianaChipaca: both are the same10:07
Chipacakalikiana: why are your private keys world-readable?10:07
Chipacakalikiana: (see also: how is your ssh 'outside' not complaining about it all the time?)10:08
kalikianaHmmm now that you mention it, they probably shouldn't be...10:08
kalikianaChipaca: Outside is fine10:08
Chipacakalikiana: fine how?10:09
kalikianaAs in, I get no errors10:09
Chipacaright10:09
kalikianaBut lemme try and change them10:09
Chipacakalikiana: but you should :-)10:09
Chipacakalikiana: only thing I can think of is that you have seahorse auto-adding your ssh keys on session start, and it's less picky about permissions, and ssh never looks at your actual keys because by the time you run it they're in the agent10:10
Chipacakalikiana: if that's the case, it sounds like a bug in seahorse to me ;)10:11
Chipacakalikiana: but also, sounds like an easy thing to detect and warn about from your snap10:12
Chipacaie, can i access the keys? no: tell user about snap connect, yes: are they the right perms? no: tell user about it10:12
kalikianaDropping the group/world reads seems to fix it indeed.10:13
kalikianaNow ssh just tells me `bind: Permission denied` and `unix_listener: cannot bind to path`10:14
mborzeckizyga: i think i need to check for a specific library in autodetection, one thats' always there, libnvidia-glcore.so.<maj>.<min> looks like a good candidate10:14
kalikianaBut it unlocked the key/passphrase fine10:14
* kalikiana tests git push from the snap10:14
Chipacakalikiana: that socket might be a control master thing, which isn't supported from inside10:16
Chipacakalikiana: (the thing that lets you multiplex ssh sessions over a single connection)10:16
kalikianaChipaca: You were right, disabling it explictly works10:21
kalikianaChipaca: Thanks a lot for helping me track this down!10:25
cachioSaviq, a new fedora image with the correct size is uploaded10:26
kalikianaNow if it'd actually use the ssh agent, that'd be sweet, but it works perfectly10:26
Saviqcachio: ack, me tries10:26
cachioSaviq, great, tell me how if goes10:27
Saviqcachio: -64 suffix, too?10:27
Chipacakalikiana: talk with jdstrand, but I suspect much laughter to be had :-)10:28
kalikiana:-D10:29
cachioSaviq, fedora-27-6410:40
cachiothis is the name you have to use10:40
cachioSaviq, you don't need to specify the image10:41
mborzeckizyga: can you take another look at #4902?10:44
mupPR #4902: cmd/snap-confine: nvidia: preserve globbed file prefix <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4902>10:44
mborzeckizyga: also i'm finishing with fixes for 4908 and will be switching back to ubuntu to check if things didn't break :)10:45
Chipacazyga: might #1757284 be an instance of the 'interface connections go away' bug?10:45
mupBug #1757284: Several snap apps fail to launch <amd64> <apport-bug> <bionic> <snapd (Ubuntu):Confirmed> <https://launchpad.net/bugs/1757284>10:45
Chipaca(now that i think about it, it probably is -- but don't know that bug #)10:46
zygamborzecki: looking10:48
mupPR snapd#4916 closed: tests: change debug for layout test <Created by zyga> <Merged by zyga> <https://github.com/snapcore/snapd/pull/4916>10:49
* Chipaca attempts to dad up and get out of bed10:49
zygaChipaca: updated the bug with a question10:50
Saviqcachio: yeah, it's lookin' good https://travis-ci.org/MirServer/mir/builds/35732500310:57
* Chipaca AFK for a while (physio, etc)11:01
mvopedronis: hrm, hrm, my system-key pr has a bit of a problem on core. on firstboot on core snapd starts, generates the system-key and the core revision is empty (because things are not seeded yet). then stuff gets seeded but that means the systme-key changes because the core revision is part of the inputs11:04
mvozyga: (cc -^)11:05
* mvo will need to think about this while taking a short lunch break11:05
zygahmm hmm hmm11:05
zygamvo: if we drop core revision?11:05
zygawe keep it because apparmor profiles are there (includes)11:05
zygahmm :/11:06
mvozyga: yeah, feels strange to just drop it, but maybe we can drop it on core?11:06
zygawell, if apparmor gets updated11:07
mvozyga: let me try that11:07
zygaand snapd doesn't11:07
zygait's not correct then11:07
mvozyga: hm, good point11:07
pedroniswill apparmor come from the snapd snap?11:08
pedronisfun questions11:08
cachioSaviq, nice11:09
cachiogood to hear that11:09
pedronisdon't we check the feature of apparmor becuase of that?11:09
mvopedronis: partly, the abstractions are also part of the mix and those come from the core snap11:10
mvopedronis: apparmor has two inputs: the kernel capabilities and the abstractions with pre-writen rules11:11
mvowe could force a re-gen of the profiles after seeding but thats a bit ugly11:11
mvofull of dragons this pr11:11
mborzecki-ubuntuhttps://paste.ubuntu.com/p/77k9YJbVQK/11:12
mborzecki-ubuntuhumanSuite.TestHuman failed here, but was passing yesterday11:12
zygapedronis: yes, apparmor will come from core snap (on core)11:12
zygapedronis: I mean the include statements, we check for kernel features but each profile we make has include statements that reach to apparmor profiles from either native package or from the core snap on all-snap boxes11:13
mborzecki-ubuntuChipaca: DST change https://paste.ubuntu.com/p/77k9YJbVQK/11:14
zygaLOL11:14
zygastuff will never stop to amaze me11:14
mborzecki-ubuntuotoh, i had no clue that we're changing clocks on saturday ;)11:15
pedronismvo: also the issue with seeding exists also on classic11:22
pedronismvo: we support seeding there with snaps11:22
zygamvo: I have an idea11:23
pedronismvo: it's a bit unclear to me why are we writing a key before being seeded ?11:23
pedronismvo: we should wait/skip that, if there are not snaps, we are not seeded11:24
zygamvo: instead of revision we can inspect a single file in /meta/manifest11:24
zyga(we don't have that file yet)11:24
zygabut as long as it encodes the version of every package11:24
zygait caputres the essence of core11:25
zygapedronis: we need that on boot to setup profiles for core itself11:25
zygapedronis: on snapd daemon startup actually11:25
Chipacamborzecki: on the one hand, huzzah it works11:25
Chipacamborzecki: on the other, booo11:25
pedroniszyga: ok, there's a recursion problem here11:25
Chipacamborzecki: on the last one, a test that works 350 days a year is fine, right?11:25
pedronisthe double build-id doesn't make sense then11:25
* Chipaca now really goes11:26
zygapedronis: can you explain?11:26
pedroniszyga: we generate profile with a key id without build-id  for a thing that by definition has one11:26
Chipacamborzecki: actually that might be a real bug, need to look when i get back11:26
pedronisthen we regenerate just because11:26
zygapedronis: why without a build-id?11:26
pedroniszyga: because there's no /snap/core/current11:27
pedronisat that point11:27
pedronisat least that's what I understood was the problem11:27
pedronismvo mentioned11:27
zygapedronis: on core we could just use /usr/lib/snapd/snapd11:27
zygano need for core's build-id11:27
pedroniszyga: as I said the same issue exists with classic seeding11:27
pedronisfirst boot is not a core only thing11:27
zygaon classic we can use /proc/self/exe11:28
zygabut we cannot capture apparmor changes in the distro11:28
zygabut I think that is done by apparmor itself, it has a trick to detect that in the init script11:28
pedroniszyga: did you look at the PR, the system-key has two build-ids in it11:28
pedronisatm11:28
zygayes, we discussed that last night11:28
mupPR snapd#4917 opened: repo: added repo ConnectionsInfo method (for the new snap connections API) <Created by stolowski> <https://github.com/snapcore/snapd/pull/4917>11:28
zygaI think we can evolve that pattern11:28
pedronison core we need only one11:28
pedronisbut on classic11:28
pedroniswe have a problem11:28
zygaon classic we also need only one but we did two because that's easier11:29
pedronisbut then seeding11:29
zyga(because we don't know which)11:29
pedronisit's not easy anymore11:29
zygamvo: what do you think?11:29
zygapedronis: I agree11:29
zygapedronis: no easy way out yetr11:29
zyga*yet11:29
pedronisone issue is also that we think reexec11:29
pedronisis something you can turn on and off11:30
pedroniseasily11:30
pedronisbut is quite true11:30
pedronisespecial if you are outside snapd and the inside snapd11:30
pedronisare far apart11:30
pedronis*is not quite true11:30
zygamborzecki: https://github.com/snapcore/snapd/pull/4908#issuecomment-37539238411:45
mupPR #4908: [RFC] cmd/snap-confine: attempt to detect if multiarch host uses arch triplets <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4908>11:45
* cachio afk11:45
mupPR snapd#4903 closed: tests: [WIP] split prepare-restore.sh into prepare-restore.d modules <Created by zyga> <Closed by zyga> <https://github.com/snapcore/snapd/pull/4903>11:46
mborzeckizyga: it's alredy fixed, i also improved detection and look for a specific nvidia lib (libnvidia-glcore.so), the packaging is also updated, now i'm cleaning up s-c apparmor profile11:47
mborzeckizyga: https://github.com/bboozzoo/snapd/commits/bboozzoo/nvidia-glob-prefix-ubuntu-triplet-wip if you want to take a look11:48
pedronismvo: zyga: I made a comment,  one idea  could be to simply delay writint the key when we are not yet seeded11:50
zygathanks,11:50
zygabut this is going to change one important property11:51
zyga(perhaps)11:51
zygathat daemon doesn't respond to api requests before having stable security profiles11:51
pedronis?11:51
zygamaybe if we are not seeded (rare) we should just always rewrite profiles on startup (mvo: opinion?)11:51
zygaactually, I'm confused: writing the system key is not the same as computing it in memory11:51
zygaso perhaps that's okay11:51
pedronisif we are seeded there no snap profiles11:51
pedronissorry11:52
zygathat's true11:52
pedronisif we are not seeeded11:52
zygayes, that's a good point11:52
pedronisI'm  saying,  if not seeded delay until we really need it11:52
pedroniswhich is about when we generate profiles for core11:52
pedronis(which is the first thing we seed)11:52
zygaright, I understand that now, I forgot that not seeded == nothins is installed11:52
zygaso I think this is okay11:52
pedronisI might be missing something11:53
zygamborzecki: interesting https://build.opensuse.org/request/show/59039011:56
zygapstolowski: https://github.com/snapcore/snapd/pull/4917#pullrequestreview-10646971512:16
mupPR #4917: repo: added repo ConnectionsInfo method (for the new snap connections API) <Created by stolowski> <https://github.com/snapcore/snapd/pull/4917>12:16
mborzecki-ubuntuzyga: that opensuse review looks good, maybe advise him to update to 2.31.2 while at it12:23
zygaI'll merge it and encourage him to iterate12:24
zygamborzecki-ubuntu: the package is not buildable on leap though12:25
zygaactually, I didn't merge it but I added some comments12:27
* zyga lunch12:32
mborzecki-ubuntuzyga: i'll force push to #490812:33
mupPR #4908: [RFC] cmd/snap-confine: attempt to detect if multiarch host uses arch triplets <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4908>12:33
zygak12:33
mborzecki-ubuntuand pushed12:35
mborzecki-ubuntuohmygiraffe works with nvidia and confinement now ;)12:36
mborzecki-ubuntulet me try the the flare snap12:37
niemeyerGlad for your giraffe :P12:41
niemeyerMorning all12:41
mvopedronis: delaying sounds reasonable, let me have a look at this12:46
pedronisChipaca: is you that moved the waitMixin stuff to its own wait.go file?12:50
Chipacapedronis: that sounds like me12:58
Chipacapedronis: why12:58
pedronisChipaca: there's a function in cmd_snap_op.go  that probably should have moved too, given that is used only in the wait code, lastLogStr12:58
Chipacapedronis: ah, good call12:59
pedronisChipaca: I noticed because I'm solving conflicts in one of my old PRs12:59
mvopedronis, zyga yet another `snap run` system-key issue on core. when we install a new core we update "current". but we only reboot ~10min later. during that time the system-key on disk and the one that snap run calcualtes are different because /snap/core/current is part of the inputs13:25
pedronismvo: I'm fixed the 10 minutes things but that's part of the problem13:25
pedronis*that's only13:26
mvopedronis:13:26
mvopedronis: yes13:26
pedronismvo: I suppose there the endpoint might help13:26
sergiusensmvo: is there a way to `snap run --shell core18....`?13:30
Chipacasergiusens: no, snap run only runs apps*13:36
Chipaca* and hooks but you don't want to do that13:37
mvosergiusens: a trivial snap that just has a /bin/sh and uses base: core1813:38
mvosergiusens: might be the solution13:38
sergiusensmvo: yeah, I am doing just that; to verify core18 is in good shape (using the test job I created, but using yours should be fine as well, I couldn't find it from the store just yet though)13:39
sergiusenspython/asciinema base: core18 and install to test the story end2end13:39
sergiusensmvo ok, everything is working now13:42
mvosergiusens: sys13:43
mvosergiusens: eh, I mean yay13:43
sergiusenslol, I was trying to decipher that :-P13:44
pedronisChipaca: this bit of code seems nonsensical or am I confused?:   https://github.com/snapcore/snapd/blob/master/cmd/snap/cmd_snap_op.go#L54313:44
zygaE: Type 'curity' is not known on line 50 in source list /etc/apt/sources.list13:45
zygaE: The list of sources could not be read.13:45
zygahow did we get 'curity' out of 'security'13:45
Chipacapedronis: that looks broken indeed13:45
zygahttps://api.travis-ci.org/v3/job/357284871/log.txt13:45
pedronisChipaca: I find it by chance, I suppose not all the error paths are tested :/13:47
zygamborzecki-ubuntu: https://github.com/snapcore/snapd/pull/4908 broken13:51
mupPR #4908: [RFC] cmd/snap-confine: attempt to detect if multiarch host uses arch triplets <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4908>13:51
mborzecki-ubuntuzyga: force pushed  just now13:51
mborzecki-ubuntuzyga: `dpkg-architecture -a i386` doesn't work on 14.04, you need to do `dpkg-architecture -ai386` :/13:52
Chipacapedronis: the wait one is hard to test i guess13:52
Chipacaseems to be the only one with that particular bug though :-)13:52
pedronisyes13:53
pedronisfunnily enough my test used that one command13:53
niemeyerHangouts going crazy13:53
Chipacapedronis: do you want me to fix it, or will you?13:56
pedronisChipaca: I can make a PR in a bit, also moving that other function13:58
pedronisChipaca: trying to finish the merge right now13:58
Chipacapedronis: k13:58
mvocachio: for the sru validation, could you pastebin the failures again for me please (or link)14:02
mborzeckiChipaca: if you're willing to try the nvidia stuff, please grab https://github.com/snapcore/snapd/pull/490814:12
mupPR #4908: [RFC] cmd/snap-confine: attempt to detect if multiarch host uses arch triplets <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4908>14:12
Chipacamborzecki: actual nvidia, right?14:13
ChipacaI need to restart my session for that, give me a mo'14:13
mborzeckiChipaca: yes14:13
ackkmvo, hi, WRT the discussion at the sprint about having a "nobody" user snaps, has there been further discussion on whether that's acceptable?14:14
Chipacamborzecki: what do i need to build against this to test?14:17
mborzeckiChipaca: just build and reinstall the package, add SNAP_REEXEC=0 to /etc/environment to make sure you're runing the right thing14:18
mborzeckiChipaca: once snapd deb is installed, please snap install graphics-debug-tools-bboozzoo too14:20
Chipacamborzecki: … i'm getting the DST change error also14:28
mborzeckiChipaca: i've commented it out locally :) sorry14:29
Chipacamborzecki: is this your revenge for that code :-)14:29
* Chipaca runs it agian with DEB_BUILD_OPTIONS=nocheck14:29
mborzeckiChipaca: can you also do `cat /sys/module/nvidia/version`?14:34
Chipacamborzecki: 384.11114:34
Chipacamborzecki: snapd build and running14:35
Chipacamborzecki: an' now?14:35
mborzeckiChipaca: have you installed graphics-debug-tools-bboozzoo --edge?14:35
* mborzecki forgot to mention --edge before14:36
Chipacamborzecki: graphics-debug-tools-bboozzoo  1.0      3    edge      maciek-borzecki  -14:36
cachiomvo, https://paste.ubuntu.com/p/KB7SyvgRn4/14:36
mborzeckiChipaca: SNAP_CONFINE_DEBUG=1 SNAPD_DEBUG=1 snap run graphics-debug-tools-bboozzoo.glxinfo |& tee log and paste the log somewhere14:36
cachiothese are the spread tests14:36
mborzeckioh, and SNAP_REEXEC=014:36
cachiomvo, give me 5 minutes14:37
mborzeckiChipaca: SNAP_CONFINE_DEBUG=1 SNAPD_DEBUG=1 SNAP_REEXEC=0 snap run ..14:37
cachiomvo, also many of the autopkgtets failed because of "package context: unrecognized import path "context" (import path does not begin with hostname)"14:38
cachioseem to be a different go version installed there14:39
Chipacamborzecki: http://paste.ubuntu.com/p/94hmf6KcHN/14:40
Chipacamborzecki: do you also want me to test it with the intel board (ie with prime turned off)14:40
Chipacaor turned to nvidia14:40
mborzeckiChipaca: looking good, can you try some snap that does graphics? eg. ohmygiraffe14:40
* Chipaca doesn't know what these things are called14:41
Chipacamborzecki: ohmygiraffe, supertuxkart and minecraft worked14:41
pedronisChipaca: I managed to add tests, not super interesting tests but better than nothing14:41
* Chipaca hugs pedronis 14:42
Chipacapedronis: thank you14:42
Chipacamborzecki: so as far as i'm concerned there is no regression with this :-)14:42
mborzeckiChipaca: one more thing, can you sudo nsenter -m/run/snap/ns/ohmygiraffe.mnt14:43
mborzeckiChipaca: and then `cat /proc/self/mountinfo` and paste it14:43
Chipacamborzecki: snapd, but yes14:43
mborzeckiright /run/snapd/ns/..14:44
Chipacamborzecki: whoa, leaving the mountspace makes x unhappy for a bit14:44
mborzeckihm? how so?14:45
mborzeckiChipaca: if you could also check that intel works :)14:48
mborzeckizyga: mvo: do we want the nvidia fixes for 2.32?14:48
Chipacamborzecki: http://paste.ubuntu.com/p/4GhMJmt22t/14:52
Chipacamborzecki: or http://paste.ubuntu.com/p/Rp7TQVjsDY/14:52
Chipacamborzecki: I can try nvidia in a bit14:53
mborzeckiChipaca: great, so it doesn't regress and mounts the right thing :)14:53
Chipacamborzecki: wrt leaving the mount namespace, it's as if i'd gone to console and back14:53
Chipacaeven redshift gets reset14:54
mborzeckiChipaca: that's weird, maybe it's because the env is still there14:54
Chipacamborzecki: you don't see that?14:54
mborzeckinope14:54
Chipacaok, switching to nvidia14:54
* Chipaca in a rush14:54
Chipacamborzecki: http://paste.ubuntu.com/p/yKb7RhVffw/14:56
Chipacamborzecki: with intel14:56
Chipacamborzecki: but … the ns are still there, i guess, maybe that's the problem?14:56
Chipacamborzecki: http://paste.ubuntu.com/p/8xhdRfjnZw/14:56
Chipacanow 3d apps don't work14:57
mborzeckiChipaca: is the driver loaded now? lsmod|grep nvidia?14:58
Chipacamborzecki: i've got to go, but i'll be back in ~4514:58
Chipacamborzecki: it is not14:58
mupPR snapd#4918 opened: cmd/snap:  fix one issue with noWait error handling logic, add tests <Created by pedronis> <https://github.com/snapcore/snapd/pull/4918>14:58
* Chipaca runs14:59
mborzeckiChipaca: sudo /usr/lib/snapd/snap-discard-ns graphics-debug-tools-bboozzoo14:59
mborzeckiah ok :) i'll leave you some notes here14:59
mborzeckiChipaca: thank you for testing this14:59
mupPR snapcraft#2022 opened: Polish our landing page ✨ <Created by evandandrea> <https://github.com/snapcore/snapcraft/pull/2022>15:12
pedronismvo: zyga: on classic we use the apparmor abstractions from the host? or from core ?15:24
mvopedronis: host15:24
pedronismvo: that is not really part of the system key atm, no?15:25
pedronisthen15:25
mvopedronis: yes, we just discussed htis that we can drop15:25
mvopedronis: which makes the whole problem much easier15:28
mvopedronis: also on core we apparently don't need this input because one of the init scripts (apparmor init) will rebuild the profiles if the abstraction change15:29
mvopedronis: this makes the problem on core go away when the current symlink is missing15:29
mvopedronis: eh, missing or updated too early15:29
mvopedronis: which is really nice, I will update my PR now with this, I hope we finally have figured it out15:29
Chipacazyga: ping15:30
Chipacaah, maybe nothing15:31
pedronisChipaca: I pushed #491815:31
mupPR #4918: cmd/snap: fix one issue with noWait error handling logic, add tests plus other cleanups <Created by pedronis> <https://github.com/snapcore/snapd/pull/4918>15:31
Chipacapedronis: ok15:31
zygaI’m on a walk15:32
cachiomvo, there?15:34
pedronismvo: let me know when/if I should look again15:34
mvocachio: yes, sorry15:35
mvopedronis: will do15:35
cachionp15:35
cachioso, about the sru15:36
mvoyes15:36
cachiomvo, did you see the expect error?15:36
mvocachio: what was the pastebin again, sorry, missed it was in a very long meeting15:36
cachionp15:37
cachiomvo, many execs failed because of this https://objectstorage.prodstack4-5.canonical.com/v1/AUTH_77e2ada1e7a84929a74ba3b87153c0ac/autopkgtest-xenial/xenial/i386/s/snapd/20180319_122711_e1003@/log.gz15:37
cachioin autopkgtests15:37
mvocachio: uhhh, I don't know where this one comes from :(15:38
cachiomvo, in xenial all the execs failed because of that15:39
cachioseem to be a problem with the go version15:39
pedronismvo: cachio: seems like spread doesn't build with 1.6 anymore15:40
pedronisthat's the issue I think15:40
cachiopedronis, yes15:40
pedronisit's using context directly15:40
cachioI am builing with 1.915:40
cachioto avoid issues on spread15:40
pedroniswell but that test15:40
pedronisis trying to build spread15:41
pedronison xenail15:41
pedronisgo get -u github.com/snapcore/spread/cmd/spread15:41
pedronisboom15:41
cachiobut, why it is doing go get instead of downloading spread?15:41
pedronisdownloading from where?15:41
pedronisthis is autopkgtests15:42
cachiofrom aws15:42
pedronisnot even sure they will let us download a random binary15:42
cachioor the snap15:42
pedronisthe snap might be an option I suppose15:42
pedronisor spread need to be made to build 1.6 again15:43
pedroniswith 1.615:43
cachiopedronis, well that's something for niemeyer15:43
pedronisthough I'm not quire sure if the autopackage test for snapd can use snap15:44
pedronismvo: we cannot build spread anymore on xenial15:44
mupPR snapcraft#2008 closed: Release changelog for 2.40 <Created by sergiusens> <Merged by sergiusens> <https://github.com/snapcore/snapcraft/pull/2008>15:45
mvoChipaca: what was the status of "human_tsest.go:67" ? it fails here with http://paste.ubuntu.com/p/nyxKqcQFqQ/15:56
Chipaca:-(15:56
Chipacamvo: yeah, mborzecki alerted me of that, and indeed it's failing here as well15:56
Chipacaso there's two bugs in it15:57
mborzeckimvo: it will be fixed tomorrow, or a day after15:57
Chipacaone in the test, which happily pretends it'll never get run on dst, and one in the code because there's a day missing15:57
mborzeckiuntil the next dst change15:57
Chipacaneed to look into that15:57
Chipacamborzecki: glxinfo is still not finding the intel driver, but the games work15:58
Chipacamborzecki: http://paste.ubuntu.com/p/XcmQZmvN5z/15:58
Chipacamborzecki: and http://paste.ubuntu.com/p/cZBYzXvX73/15:59
mborzeckiChipaca: hmm interesting, can you check if it behaves the same with the regular snapd?15:59
Chipacamborzecki: it did15:59
mborzeckiChipaca: ah ok, then there's no regression :)16:00
Chipacabah, i can double check just in case16:00
mvomborzecki: I can't wait that long …16:01
Chipacamborzecki: hmm, hold on, some of these tests were run without SNAP_REEXEC=0 :-(16:02
Chipacaaugh16:02
cachiomvo, also there errors I saw in linode exec for sru https://paste.ubuntu.com/p/KB7SyvgRn4/16:09
cachiomvo, many "Permission denied"16:09
mvocachio: I think these are the ones i'm concerned about16:10
Chipacamborzecki: yeah, same16:10
cachiomvo, I am gonna debug those16:10
zygamvo: did we land https://github.com/snapcore/snapd/pull/4877 for 2.32?16:10
mupPR #4877: snap-confine: fallback to /lib/udev/snappy-app-dev if the core is older <Created by mvo5> <https://github.com/snapcore/snapd/pull/4877>16:10
zygamvo: I think we need too16:10
zygato too16:10
mborzeckiChipaca: that's sort of great :) thank you for checking16:10
Chipacamvo: I can fix that human time bug16:10
Chipacamvo: probably :-p16:11
mvoChipaca: please16:11
mvozyga: I think so16:12
mvozyga: but not for 2.31 .(16:12
mvozyga: that might be it16:12
zygaall the failures there look crazy16:13
zygaI restarted it now16:13
mvozyga: I suspect its the symlink for snappy-app-dev to snap-device-helper16:15
zygamvo: symlink => copy16:15
zygaactually, symlink is fine16:15
zygaI think16:15
mvozyga: I think we did not do that16:15
mupPR snapd#4891 closed: tests: add support for phased prepare-restore logic <Created by zyga> <Merged by zyga> <https://github.com/snapcore/snapd/pull/4891>16:16
mupPR snapd#4914 closed: advisor: add comment why osutil.FileExists(dirs.SnapCommandsDB) is needed <Created by mvo5> <Merged by zyga> <https://github.com/snapcore/snapd/pull/4914>16:16
mupPR snapd#4915 closed: interfaces,release: probe seccomp features lazily <Created by mvo5> <Merged by zyga> <https://github.com/snapcore/snapd/pull/4915>16:16
pedroniszyga: did you merge #4891 with one review?16:19
mupPR #4891: tests: add support for phased prepare-restore logic <Created by zyga> <Merged by zyga> <https://github.com/snapcore/snapd/pull/4891>16:19
zygamvo: I will tweak the ssh fix now16:19
zygapedronis: yes, because it's tiny and I have useful fixes on top16:19
zygapedronis: (and doesn't do anything yet)16:19
pedronisdid you discuss this with niemeyer?16:20
pedronisI remember we had long code org discussion around this16:20
zygaI wasn't aware of that, what was the discussion about specifically?16:21
pedronishow stuff should be organized16:21
pedronisthe usual contentious topic16:21
zygawas the outcome captured somewhere?16:22
pedronisno16:22
zygawell, I can still back that out, I just want to do something because we have real issues and no solution in sight16:23
pedronisI'm just saying that merging somethign that changes deeply how we structure spread bits16:23
zygawell, it's not doing that yet16:23
pedroniswith one review and without niemeyer input16:23
pedronisis perilious16:23
pedroniszyga: also is the usual issue with do nothing code, it's hard to tell how it will be used in practice16:24
niemeyerzyga: Yeah, please revert this..16:24
zygasure, in a moment, just working on ssh now16:25
niemeyerzyga: This sounds like a recipe for shell sausage16:25
zygaI explained how it will be used in the example file but I'm fine with reverting it16:25
niemeyerzyga: Yes, it's an example of sausage16:26
zygaI don't know what a sausage is16:26
niemeyerzyga: It's also not the right way of doing these changes.. please hear pedronis16:26
niemeyerzyga: I'll send you a picture later16:26
zygaour code is a mess there, I want someone to own it and fix it; I may do it in a way some people don't agree with but my motivation was to fix this16:27
niemeyerzyga: We have a forum here: https://forum.snapcraft.io16:27
zygaI don't mind if someone fixes our test suite to run better, really16:28
niemeyerzyga: Merging these changes, which sound controversial and unclear to begin with, with no actual discussion despite the fact we spent hours talking today, on a Friday, before a release, when people are leaving on holiday... man...16:28
niemeyerWith one review!16:28
zyganiemeyer: none of that is in the release, none of that is contoversial or unclear, did you see what that code did? IMO that is an over reaction now16:29
niemeyerzyga: Please step back and breath.. it's not the way we do things..16:30
niemeyerzyga: If you want to reorganize that logic, please open a forum topic and let's talk16:30
niemeyerzyga: As perdronis insightfully pointed out, we are very careful to not create a mess of shell on our tests.. we simplified things out of seemingly innocent practices several times..16:31
pablo_Hi: I'm trying to use wifi-ap snap on ubuntu core in Raspberry Pi 3, but status keeps changing to active=false after a couple of seconds i activate it. Any hints?16:33
cachiozyga, any news on https://travis-ci.org/snapcore/snapd/builds/357331248#L8138 ?16:34
zygacachio: ha, interesting16:34
mvocachio: can you look at the denial for the permission denied failures please?16:35
zygacachio: only that I still don't understand something there16:35
mvocachio: can you pastebin those?16:35
cachio1 sec16:36
cachio[Fri Mar 23 11:08:29 2018] audit: type=1400 audit(1521803309.374:626): apparmor="DENIED" operation="open" profile="snap-update-ns.test-snapd-layout" name="/proc/sys/kernel/seccomp/actions_avail" pid=16648 comm="3" requested_mask="r" denied_mask="r" fsuid=0 ouid=016:36
cachio[Fri Mar 23 11:08:29 2018] audit: type=1400 audit(1521803309.402:627): apparmor="DENIED" operation="mount" info="Failed name lookup - deleted entry" error=-2 profile="snap-update-ns.test-snapd-layout" name="/etc/group" pid=16648 comm="3" flags="rw, bind"16:36
zygacachio: the 1st one is fixed in another pr16:36
cachiotoday at least 3 builds failed on master because of this :(16:37
zygacachio: the 2nd one says something removed /etc/group while we were looking and I don't understand it16:37
zygajdstrand: ^ do you know what could cause the denial above?16:37
zyga(the second one)16:38
mupPR snapd#4919 opened: Revert "tests: add support for phased prepare-restore logic" <Created by niemeyer> <https://github.com/snapcore/snapd/pull/4919>16:39
zyganiemeyer: I pushed the ssh changes16:39
niemeyercachio: After you're done with the current topic, what spread images are ready to merge?16:39
mupPR snapd#4919 closed: Revert "tests: add support for phased prepare-restore logic" <Created by niemeyer> <Merged by niemeyer> <https://github.com/snapcore/snapd/pull/4919>16:40
zygais that what you had on your mind earlier?16:40
zygahttps://github.com/snapcore/snapd/pull/4912/files16:40
mupPR #4912: overlord/configstate: change how ssh is stopped/started (2.32) <Critical> <Created by zyga> <https://github.com/snapcore/snapd/pull/4912>16:40
niemeyercachio: Sorry.. I mean, what snapd PRs moving images to Google16:40
niemeyerzyga: Looking16:40
cachioniemeyer, opensuse is almost ready, I think today will be ready to be merged16:40
cachiothen, just fedora is missing, we need to see how to fix those deniails from selinux16:41
pedroniszyga: I'm aware that we have issues with tests,  it's a bit unclear that they are organisation problem though,   the feeling in that prepare/restore do a lot, unclear if doing a lot differently is what we need, or just try to be more careful/streamline16:42
zygayes, I know16:42
niemeyerzyga: One comment and LGTM16:42
zygaanyway, we can discuss this later when there's more time to see what I wanted to do16:42
cachioniemeyer, well then just missing debian sid which is currently set as manual16:42
zygaI have a refactor of the original code, that does what it did before but is more readable, that I wanted to use a base for refactoring16:43
zygato restore the property that setup sets stuff up from pristine and restore undoes that to pristine16:43
zygawith hard checks that ensure this is happening for real16:43
niemeyerpedronis: Right, exactly.. the moment we allow ourselves to just throw files at a directory, which in fact *need* to be ordered because they depend on each other, is the day we lose complete control over the sanity of something that is already not as clean as it should be16:43
cachioniemeyer, are we gonna try arch or centos ?16:43
zyganiemeyer: they don't depend on each other much; I'd love to discuss this but I think you are jumping to conclusions now16:44
niemeyercachio: Yeah, mborzecki had arch working already.. I closed the PR because it was still on Linode, but you two should be able to quickly get it up again on Google once you have everything else sorted16:44
zygaupdating ssh again, just a sec16:44
cachioniemeyer, ok16:44
niemeyerzyga: I'm not jumping into conclusions.. I'm concluding from experience from many years looking at similar systems evolve16:44
niemeyerzyga: What's the rationale for the change in the first place?16:45
cachioniemeyer, then we need to discuss the changes needed to make fedora work on google with selinux16:45
cachioniemeyer, that's missing to move fedora16:45
mborzeckicachio: ping me when you'd want to give arch a try, IIRC there was just a couple of tests failing there (one of those was merged usr which got fixed in master not long ago)16:45
niemeyercachio: Ok.. but gback to the original question: what can I merge?16:46
zyganiemeyer: pushed ssh,16:46
zygathe rationale...16:46
zyga" This patch will help to modularize the prepare/restore logic and hopefully make it more testable and easy to reason about."16:46
cachioniemeyer, opensuse will be ready but later doday16:47
zygaalso that related prepare/restore code is in one file16:47
niemeyercachio: So nothing yet?16:47
cachioniemeyer, #488616:47
mupPR #4886: tests: adding opensuse-42.3 to google <Created by sergiocazzolato> <https://github.com/snapcore/snapd/pull/4886>16:47
zygaand not far away from each other16:47
niemeyerzyga: Yep.. so exactly what I said..16:47
cachioyesterday we merged debian16:47
zyganiemeyer: I disagree but I dont' want to argue abut it this way now16:47
niemeyerzyga: Me neither.. that was part of the point16:48
cachioniemeyer, I pushed a fix for opensuse, tests are waiting to be executed16:48
zyganiemeyer: is https://github.com/snapcore/snapd/pull/4912/commits/1213460ccc6b96baf28f478bde752d25b8212dbb ok? I think that's it for this fx16:48
mupPR #4912: overlord/configstate: change how ssh is stopped/started (2.32) <Critical> <Created by zyga> <https://github.com/snapcore/snapd/pull/4912>16:48
niemeyerzyga: Yeah, that was all, thank you16:48
zygaexcellent, now what's left for the system key work16:48
zygamvo: how can I help?16:48
niemeyermvo, zyga: I had a quick question/comment.. can we hang out for a few (hopefully short) minutes?16:49
zygayeah16:49
mvozyga, niemeyer sure16:50
niemeyerI'm there16:50
pablo_ogra_: hi, i followed your wifi-ap tutorial, but i cannot make it work... could you help me pls?16:58
ogra_pablo_, on what board is that ?16:58
ogra_(did you check jrounalctl for errors ?)16:58
ogra_*journalctl16:59
pablo_ogra_: raspberrypi 3. I'll check right now16:59
ogra_any other network related snaps installed ?16:59
pablo_ogra_: nmcli17:01
ogra_ah, i think that will take over the wlan device ... remove NM17:01
ogra_(if youo need to configure ethernet, use the config in /etc/netplan/17:02
ogra_=17:02
ogra_)17:02
pablo_ogra_:journalctl shows lots of messages i don't understand.. i'll pastebin them.17:02
pablo_ogra_: i'm using raspberry through ssh17:02
ogra_well, likely that wifi-ap cn not switch the interface to ap mode17:02
pablo_if i remove nmcli would i be disconnecteD?17:02
ogra_well, if you did the initial setup with console-conf (to get the user set up etc) you should have a working config for eth0 already ... but you can also double check in /etc/netplan/17:03
pablo_ogra_: i'll check... if eth0 is configured there i can safely remove NM? then i install wifi-ap?17:04
ogra_yes, that should work17:04
cachiojdstrand, I see this denial https://paste.ubuntu.com/p/8ZQVRQjZvZ/ running the test security-device-cgroups17:06
cachioit is happening when execute test-snapd-tools.env17:06
cachiojdstrand, any idea about what could be causing that?17:07
pablo_ogra_, i removed it, then executer wifi-ap.setup_wizard. Connection keeps geting ap.active=false17:07
ogra_did you reboot ?17:08
pablo_no :S sorry17:08
ogra_the wlan driver might be in a weird state17:08
pablo_ogra_: same thing... after a couple of seconds, ap.active changes to false17:10
ogra_well, then lets check the log ...17:10
pablo_ogra_: how do i check the logg? sorry i'm very noob with this17:12
pablo_journalctl? https://pastebin.com/hcRmcvbq17:14
ogra_Mar 23 17:10:07 ema systemd-networkd[577]: wlan0: DHCPv4 address 192.168.2.111/24 via 192.168.2.117:15
ogra_thats clearly in client mode17:15
ogra_do you have an entry for wlan0 in your netplan config ?17:16
ogra_(in /etc/netplan/00-snapd-config.yaml )17:16
=== pstolowski is now known as pstolowski|afk
pablo_ogra_, yes i have... it was created when i flashed ubuntu core..17:18
pablo_ogra_, how i change it?17:18
ogra_well, if you use the wlan device in clinet mode it can not run as AP17:18
ogra_*client17:18
pablo_I cannot even use the wifi as client right now.. how can i change client mode?17:19
ogra_you can re-run console-conf with "sudo console-conf" and make sure to only configure wired ... or you can edit /etc/netplan/00-snapd-config.yaml and remove the "wifis" block (but make sure you keep ethernet configured to still get into the system)17:20
pablo_thanks! i'll try with console-conf17:20
Chipacamvo: 4920 fixes the human thing17:21
mupPR snapd#4920 opened: timeutil: in Human, count days with fingers <Created by chipaca> <https://github.com/snapcore/snapd/pull/4920>17:21
pablo_ogra_, in console-conf, should i erase everything in wlan0? is that enough? or how should i config it to work as ap17:22
ogra_just pick "do not use"17:22
pablo_ogra_, i can select "do not use" in ipv4 and ipv5 settings, but it says "associated to mywifi"17:24
ogra_well, then change the wifi settings too17:24
ogra_not sure how the "dont use" option in there is named17:24
pablo_there isn't... i try to erase configuration.17:25
mvopedronis: 4882 is ready for a second look I think17:29
pablo_ogra_, excellent! now it works! thanks a lot..17:29
ogra_:)17:29
ogra_enjoy17:29
pedronismvo: ok,  almost dinner here17:30
pedronisthough17:30
mvopedronis: yeah, same here17:30
pablo_ogra_, is it possible to connecto trhough ssh if i connecto to the ap also? i mainly want it to do a simple webhost, but connecting trhough ssh could be helpful17:30
ogra_sure, as long as you use the same key on the client side it doesnt matter through which network device you connect17:31
pablo_thanks a lot! i'll disconnect a little time to connect to the boards ap17:32
=== sparkieg` is now known as sparkiegeek
mupPR snapcraft#2023 opened: repo: catch error due to broken build packages <Created by kalikiana> <https://github.com/snapcore/snapcraft/pull/2023>17:43
Chipacathe BBC is wondering about Ubuntu's future: https://www.bbc.co.uk/cbbc/quizzes/beyond-bionic-mega-quiz-117:43
* kalikiana wrapping up for the day17:43
niemeyerkalikiana: Heya.. didn't forget our conversation.. still need to write the notes into the forum.. let me do that now17:50
niemeyerkalikiana: Enjoy the weekend17:50
niemeyercachio: Haha :)17:51
niemeyerOops.. that was Chipaca17:51
kalikiananiemeyer: Grand! Looking forward to reading it.17:53
ChipacaI think this is a mostly-EOD from me17:54
ChipacaI'm off to do housework, might pop back in when the neighbours object to the hoovering17:55
mupPR snapcraft#2022 closed: Polish our landing page ✨ <Created by evandandrea> <Merged by sergiusens> <https://github.com/snapcore/snapcraft/pull/2022>17:58
niemeyerChipaca: Sweet.. I'm might have a review done for you, but you shouldn't care anyway because it's your evening isn't it :)18:00
Chipacaniemeyer: sorry i can't hear you over all the hoovering18:00
niemeyer:)18:00
=== alan_g is now known as alan_g|EOW
mupPR snapcraft#1999 closed: tests: document the SRU testing process <Created by elopio> <Closed by sergiusens> <https://github.com/snapcore/snapcraft/pull/1999>18:04
cachiomborzecki, hey I'll start preparing the image18:08
mborzeckicachio: ok18:08
cachiomborzecki, which version should I use?18:09
cachiomborzecki, I think we can use this https://github.com/GoogleCloudPlatform/compute-archlinux-image-builder18:10
mborzeckicachio: take the last one, 2018.03.01, it's a rolling release anyway, so we'll have to update the image (bi-)weekly18:10
cachiothe ones tht are prebuilt are pretty old18:10
mborzeckicachio: you can start with those scripts, if they work then we'll have an image, if not we'll try another way :)18:11
cachiomborzecki, ok, I'll try those scripts and see18:14
cachiomborzecki, shnould we try first with some stable image?18:15
cachioor it is ok with the last one?18:15
mborzeckicachio: there's no stable image per se, it's just a snapshot of current repos, it's always updating18:16
cachiomborzecki, ah, ok18:16
mborzeckicachio: https://www.archlinux.org/download/ 2018.03.01 is a good starting point18:16
cachiolet's try with this last one18:16
cachiomborzecki, nice thanks18:16
mborzeckicachio: let me check one more thing though, there was some discussion in the mailing liast about a cloud/vagrant image of arch18:16
mborzeckicachio: does google require cloud init?18:19
cachiomborzecki, I don't think so18:19
cachioit requires its own cloud dependencies18:19
cachiomborzecki, why?18:20
mborzeckicachio: do you know what these deps are?18:20
cachiomborzecki, some of these ones https://github.com/GoogleCloudPlatform/compute-image-packages18:22
mborzeckicachio: ok, let me know if you run into problems, mayble i'll need to package something18:23
mupPR snapd#4902 closed: cmd/snap-confine: nvidia: preserve globbed file prefix <Created by bboozzoo> <Merged by mvo5> <https://github.com/snapcore/snapd/pull/4902>18:26
pedronismvo: back from dinner, I will look at the PR now18:28
mvopedronis: great, thank you18:28
zygaThank you pedronis18:32
niemeyerHere is a short teaser18:41
niemeyerroot@spread-cluster:/proc# cat cpuinfo | grep processor | wc -l18:42
niemeyer9618:42
niemeyerroot@spread-cluster:/proc# ls -ls kcore18:42
niemeyer0 -r-------- 1 root root 141905719468032 Mar 23 18:42 kcore18:42
mupPR snapcraft#2024 opened: errors: improve the UX for sending error data <Created by sergiusens> <https://github.com/snapcore/snapcraft/pull/2024>18:43
pedronismvo: left a few comments,   I'm wondering what the move from yaml to json means in terms of the behavior on 2.31->2.32 transition18:45
cachioniemeyer, awesome18:45
cachio141905719468032 is bigggg18:45
niemeyercachio: Yeah, and wrong.. sorry.. :)  It's close to the real one, but kcore shows virtual memory.. the actual size is 128GB, which is close, but in GB not TB18:46
cachioniemeyer, so, spread should be in charge of creating vms inside this server?18:47
cachioniemeyer, is it possible to create vms with arm32 architecture too?18:48
mvopedronis: yeah, thats a critical point indeed18:48
niemeyercachio: Yeah.. I'll do a tiny daemon that basically allows a conversation like "give me a server" => "okay, server 142 has ssh at ...:8782" => "thanks, destroy 142"18:48
mvopedronis: I think we need to handle this as a mismatch without18:48
pedronismvo: bug we ingore mismatches18:48
pedroniss/bug/but/18:49
mvopedronis: I mean, I think we need to return that we have a mismatch and no errror here which sucks a bit18:49
mvopedronis: or we could first try json and if that fails try nyaml it18:49
pedronisbut the content is different, no?18:50
mvopedronis: yes, the content is different so it would be a case of "there is a mismatch, try to talk to snapd before contiue"18:50
cachioniemeyer, opensuse tests passed 100% but I had to retrigger because the layout test failed for ubuntu-sore18:50
niemeyercachio: Due a known issue or hiccup?18:51
mvopedronis: maybe that is actually the answer, on error we try to reach snapd instead of silently ignoring the issue?18:51
cachioniemeyer, there are 2 issues related, for one there is a PR18:51
cachioniemeyer, the second one is under research18:51
niemeyercachio: Ack18:51
cachioniemeyer, zyga has more info for sure18:52
mupPR snapd#4921 opened: skip test if no user "daemon" in build jail <Created by rkitover> <https://github.com/snapcore/snapd/pull/4921>18:52
pedronismvo: the issue is that there are various scenarios18:52
pedronisthe race described in the comment is only one of them18:52
pedronisthere's also switchin on or off  REEXEC18:52
pedronispotentially18:52
pedronisand this yaml to json transition18:53
zygamvo: I sent my review18:54
mvopedronis: indeed, at least for the race described in the comment and for the yaml -> json handling this by waiting for snapd should be ok. i.e. on error try to reach snapd and then continue18:54
mvozyga: cool, thanks18:54
zygasorry for taking so long, I took a break from this18:54
mvozyga: no problem, thanks for the review. will address things in a little bit18:55
mvopedronis: I'm not sure about the switching on/off of re-exec though18:56
pedronismvo: there is nothing we can do in general there18:56
* zyga EOWs18:56
pedronisexcept for testing/development it shouldn't really be done18:57
pedronismvo: anyway hopfully people don't have things trying to run while they switch18:58
zygawe just have to keep the packages up to date ;-)19:00
niemeyermvo: Just reviewed 4882 as well.. very nice, and looks super close19:07
mupPR snapd#4918 closed: cmd/snap: fix one issue with noWait error handling logic, add tests plus other cleanups <Created by pedronis> <Merged by pedronis> <https://github.com/snapcore/snapd/pull/4918>19:20
mvoniemeyer: yay, thank you all. fixing the comments now19:31
cachiomborzecki, the images that that project is creating are about 100GB19:31
cachio107 GB19:32
cachioI am aborting this, creating arch linux image from scratch19:32
pedronismvo: niemeyer: the comment about connection to old snapd means we really need #4911 too ?19:42
mupPR #4911: daemon,client: add build-id to /v2/system-info <Created by mvo5> <https://github.com/snapcore/snapd/pull/4911>19:42
cachioniemeyer, #4886 in green19:48
mupPR #4886: tests: adding opensuse-42.3 to google <Created by sergiocazzolato> <https://github.com/snapcore/snapd/pull/4886>19:48
niemeyercachio: Is it stable, or are there issues to sort out still?19:48
cachioniemeyer, opensuse is stable19:48
cachioI ran more than 10 times and I didn't see any error19:49
cachiolast 3 executions in travis 0 errors for opensuse19:49
niemeyercachio: LGTM then!19:51
* niemeyer needs to step out.. back later19:53
SyntistAs Snap is container based solution, does it effect the performance?19:59
mvoniemeyer, pedronis updated the PR20:04
mupPR snapd#4912 closed: overlord/configstate: change how ssh is stopped/started (2.32) <Critical> <Squash-merge> <Created by zyga> <Merged by mvo5> <https://github.com/snapcore/snapd/pull/4912>20:14
mvozyga: quick question - 4912 needs a port to master, right?20:14
mupPR snapd#4922 opened: overlord/configstate: change how ssh is stopped/started (2.32) (#4912) <Created by mvo5> <https://github.com/snapcore/snapd/pull/4922>20:16
zygaRe20:27
mupPR snapd#4886 closed: tests: adding opensuse-42.3 to google <Created by sergiocazzolato> <Merged by sergiocazzolato> <https://github.com/snapcore/snapd/pull/4886>20:31
* zyga checks 491220:43
zygamvo: yes b20:43
mvozyga: I added it20:43
zygamvo: I assumed it would not be an issue if this lands with the beta release you plan on making today20:43
mvozyga: all except 4882 are now in20:43
mvozyga: its fine20:44
mvozyga: I was just asking to not duplicate work :)20:44
zygasorry I wasn't around, I didn't hear the pings20:44
zygaI was on headphones watching a movie20:44
mvozyga: toally fine20:45
mvozyga: no worries20:45
* mvo hugs pedronis for his review20:46
* zyga makes tea and reads the reviews20:46
mvozyga: just 488220:49
mvozyga: :)20:49
zygamvo: I added one nitpick and +120:49
zygait's not important though20:49
zyganot worth waiting another round of tests if they go green20:49
* mvo hugs zyga 20:52
mvozyga: the fearful kernel20:52
zygahaha, did I write fearful?20:52
mvozyga: no but that is what I read at first20:52
zygaYeah they sound kind of similar20:54
zygaSo after this branch lands then what?21:01
zygaDo you need a PPA build21:01
mvozyga: yeah, when it lands-> ppa build21:05
zygaoooooh21:24
zygaI reproduced the /etc issue!!!!!!21:24
zygafor the first time, I have debug shell21:24
zygathis is like candy21:24
zygabut diet21:24
mvozyga: yay21:27
* mvo hugs zyga 21:27
pedronismvo: do we need #4877 ?21:28
mupPR #4877: snap-confine: fallback to /lib/udev/snappy-app-dev if the core is older <Created by mvo5> <https://github.com/snapcore/snapd/pull/4877>21:28
zygaand I have a strace21:31
mvopedronis: thats the master version of a pr that landed in 2.32 already21:31
zyga[pid  2951] mount("/tmp/.snap/etc/group", "/etc/group", 0xc82010fb6b, MS_BIND, NULL) = -1 ENOENT (No such file or directory)21:31
zygathis is what fails21:31
pedronismvo: ah, ok21:31
mvopedronis: its "funny" becasue the tests for this one in master fail all the time with snap-confine.dsajlkjfslda (tmpname) leftovers21:31
mvopedronis: and its not clear what is writing these leftover files21:31
pedronismvo: I have a PR that gets layout problems most of the time, interestingly follow up PRs are all green21:32
* zyga has a feeling this is a kernel bug21:35
zygaI'll paste the strace21:35
zygahttps://pastebin.ubuntu.com/p/qKyPqTnT3N/21:37
mvopedronis: hm,hm,I saw the layout ones too but not on recent test runs21:37
* cachio afk21:38
zygasuper interesting facts:21:39
zyga1) /dev/sda3 on /etc/group type ext4 (rw,relatime,data=ordered)21:39
zyga: /etc/group is a bind-mount21:39
zygahmmm21:39
pedronisis that the result of threspassing ?21:40
zygano21:40
zygathis is just /etc/ on core21:41
zygaI don't get this thing: [pid  2951] lstat("/etc/group", 0xc8201097c8) = -1 ENOENT (No such file or directory)21:41
zyga[pid  2951] mount("/tmp/.snap/etc/group", "/etc/group", 0xc82010fb6b, MS_BIND, NULL) = -1 ENOENT (No such file or directory)21:41
zygathis last error is the error we are seeing21:42
zygawe carefully create /tmp/.snap/etc/group21:42
zygaand it exits, we can see that on line 1478 in the pastebin21:42
zygabut when we try to bind mount /tmp/.snap/etc/group over to /etc/group, something fails and we get ENOENT21:43
zygathat pastebin is really weird,21:43
pedroniswe are bind mounting over a bind-mount?21:43
zygaI will need to study it21:43
zygano, technically we are mount --rbind /etc /tmp/.snap/etc21:44
zygamounting tmpfs over /etc21:44
zygaand bind-mounting things back. one by one21:44
zygaat this stage /etc/group should be an empty file21:44
zygabut I think the code gets confused21:44
pedronisbut you said it's a bind-mount?21:44
zygabecause we check if it's a file or directory21:44
zygaon the host21:44
zygaon core it seems to be21:45
zygalet me look on my dragon21:45
zygahmm21:45
zygait's not21:45
zygait's squashfs there21:46
* zyga wonders what's the google core-16 image then21:46
pedroniswe build the image21:46
pedronisin the tests21:46
zygamountinfo on google ubuntu-core-16 https://www.irccloud.com/pastebin/h9nNyYdf/21:46
pedronismmh21:47
zygaahhh21:47
zygaI see what's wrong now21:47
zyga117 25 8:3 /system-data/var/lib/extrausers/group//deleted /etc/group rw,relatime shared:7 - ext4 /dev/sda3 rw,data=ordered21:47
zyga!21:47
zygaafter booting21:47
zygasomething wiped /writable21:47
zygaand all hell broke loose21:47
zygait's a very peculiar thing in mounts21:48
zygaI need to add code to detect that21:48
zygaand bail early21:48
zygamaybe then we'll know more21:48
zygaso.......21:48
zygaI would say that some of our tests killed /writable21:48
zygathis broke /etc in the real system21:48
zygaand then my code doesn't cope with a file being there but not really being there because it's a //deleted mount21:48
zygaI need to check the kernel what the semantics is then21:49
zygabut I bet a beer that this is broken prepare/restore21:49
pedroniszyga: notice that we do special things to the image to get the spread user in it21:49
pedronislet me find a pointer21:49
pedroniszyga: we add special units that bind mount  /var/lib/extrausers/x  to  /etc/x21:50
zygaI don't understand why /etc/group is the extrausers/group in this image but not in my dragon21:50
zygaooooh21:50
zygawhere is this?21:50
zygaman,21:50
zygaI didn't know that21:50
pedronisone sec21:50
zygasure, thank you!21:50
zygaI bet it's suite-level restore21:51
zygaas I only ran this test (layout) with -repeat21:51
pedroniszyga:  no,  it's done with the image21:51
pedronisis not a restore thing21:51
pedroniszyga: https://github.com/snapcore/snapd/blob/master/tests/lib/prepare.sh#L38821:51
zygabut it doesn't always fail21:52
pedronisthat is a different question21:52
zygait must be the ordering or tests so that layout runs after something restored suite in the same system21:52
pedronisbut this operation is done with the image once21:52
zygawhen I ran layout (and just layout) with repeat21:52
pedronisbut the mounts will happen at boot each time21:52
zygayes, but it doesn't hurt normally21:52
zygawhat is wrong is that something removed them21:52
zygaso reset_all_snap has this at the end21:54
zygahttps://www.irccloud.com/pastebin/SPpOyIr1/21:54
zyga-rw-r--r-- 1 root root 453M Mar 23 21:22 /home/gopath/src/github.com/snapcore/snapd/snapd-state.tar.gz21:56
zygathis doesn't feel right21:56
zyga450MB?21:56
zygaha21:56
zygait's not tar.gz21:56
zygait's tar21:56
zygawhat?21:56
zygasome this is clearly wrong here21:56
pedronis?21:57
pedronisit's probably mostly /var/lib/snapd/snaps that make up that size22:03
pedroniszyga: I see a few tests that manipulate things in /var/lib/extrausers but nothing removes things totally22:14
pedronisah but22:15
zygare22:22
zygabut?22:23
zygahttps://github.com/snapcore/snapd/pull/4923 may help us find the culprit22:23
mupPR #4923: spread.yaml: look for signs of //deleted mounts <Created by zyga> <https://github.com/snapcore/snapd/pull/4923>22:23
mupPR snapd#4923 opened: spread.yaml: look for signs of //deleted mounts <Created by zyga> <https://github.com/snapcore/snapd/pull/4923>22:23
zygaI need to check what the kernel semantics is22:23
zygabut I think we are much closer to understanding the issue22:24
zygawhere //deleted comes from https://www.irccloud.com/pastebin/jtTaEdO3/22:26
* pedronis -> rest22:28
zygapedronis: let's catch up with this next week, have a great weekend22:28

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!