[00:41] <mup> PR snapcraft#1929 closed: sources: proper errors for invalid handlers <bug> <Created by kalikiana> <Merged by sergiusens> <https://github.com/snapcore/snapcraft/pull/1929>
[00:44] <mup> PR snapcraft#1942 opened: Release changelog for 2.39.2 <Created by sergiusens> <https://github.com/snapcore/snapcraft/pull/1942>
[01:05] <elopio> snappy-m-o autopkgtest 1926 xenial:armhf xenial:amd64 xenial:arm64
[01:05] <snappy-m-o> elopio: I've just triggered your test.
[01:52] <elopio> snappy-m-o autopkgtest 1942 xenial:armhf xenial:amd64 xenial:arm64
[01:52] <snappy-m-o> elopio: I've just triggered your test.
[03:39] <hego555> yo guys, my snaps seem to not have any internet connection... Spotify says no internet connection and so does Writefull
[03:40] <hego555> i made sure the network plug was enabled
[04:41] <sergiusens> snappy-m-o autopkgtest 1943 bionic:amd64 bionic:arm64 bionic:armhf
[04:41] <snappy-m-o> sergiusens: I've just triggered your test.
[04:42] <mup> PR snapcraft#1943 opened: Release/2.39.2+18.04 (DO NOT MERGE) <Created by sergiusens> <https://github.com/snapcore/snapcraft/pull/1943>
[06:07] <mborzecki> morning
[06:24] <mborzecki> mvo: morning
[06:24] <mvo> hey mborzecki ! good morning
[06:25] <mborzecki> mvo: i noticed this in one of the travis jobs: https://paste.ubuntu.com/p/qYKjfqYdxF/ is this the restart on refresh mode thing?
[06:27] <mvo> mborzecki: yes - looks like a race, do you have the full log? there should be logging to syslog around this to help tracking this down
[06:28] <mborzecki> mvo: https://api.travis-ci.org/v3/job/343470218/log.txt
[06:35] <mvo> mborzecki: hm, hm, the log looks ok, the right restart reason and all that. puzzling
[06:38] <mborzecki> mvo: maybe the log comes from before the actual test action happened?
[06:39] <mvo> mborzecki: yeah, I wonder why
[06:41] <mborzecki> mvo: backend calls systemd.Restart(), that's implemented as stop & start
[06:43] <mvo> mborzecki: oh, that's an interesting clue, a restart with no restart reason. when is backend calling this?
[06:44] <mborzecki> mvo: i believe the path is ifacestate.setupSnapSecurity() -> backend.Setup() -> systemd.Restart()
[06:46] <mvo> ta
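The path mborzecki traces above (ifacestate.setupSnapSecurity() -> backend.Setup() -> systemd.Restart(), with Restart implemented as stop & start) can be modelled as a toy sketch. This is purely illustrative, not snapd's actual Go code, and the unit name is made up; it just shows why a restart implemented this way carries no restart reason:

```python
# Toy model (illustrative only) of a Restart implemented as stop & start.
# Because restart() never passes a reason down to stop(), the stop is logged
# without one - matching the "restart with no restart reason" clue above.
class FakeSystemd:
    def __init__(self):
        self.log = []

    def stop(self, unit, reason=None):
        entry = "stop %s" % unit
        if reason:
            entry += " (%s)" % reason
        self.log.append(entry)

    def start(self, unit):
        self.log.append("start %s" % unit)

    def restart(self, unit):
        # implemented as stop & start; any higher-level "restart reason"
        # is lost at this point
        self.stop(unit)
        self.start(unit)

sysd = FakeSystemd()
sysd.restart("snap.example.service")  # hypothetical unit name
print(sysd.log)
```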
[07:45] <mborzecki> mvo: do you need any help with that issue?
[07:46] <mvo> mborzecki: help would be great, I'm still fighting with the 2.31.1 stable release and the oom error and the autopkgtests
[07:47] <mborzecki> mvo: oom error? you got me interested now
[07:47] <mvo> mborzecki: on bionic/i386 the interfaces-many test triggers an oom error
[07:47] <mvo> mborzecki: it's not a new bug, I can reproduce it with 2.29.x too
[07:48] <mvo> mborzecki: but it's very unclear right now what is causing it
[07:48] <mvo> mborzecki: it's reliably reproducible on i386
[07:49] <mborzecki> mvo: is snapd getting killed?
[07:50] <mvo> mborzecki: everything is getting killed :) it's also strange as htop does not show any user space growing like mad. also the switch from ok(ish) to oom comes very fast
[07:50] <mvo> mborzecki: http://paste.ubuntu.com/p/gVgDtTqPBg/ this is the script
[07:51] <mvo> mborzecki: I can prepare a tar that includes the two snaps to run this
[07:53] <mborzecki> mvo: no need, if they are in tests/lib/snaps i can build them myself
[07:54] <mvo> mborzecki: I will do it anyway I think because I want to run it on an amd64 and compare the logs. it seems like the oom condition is not triggered there
[07:54] <mvo> mborzecki: might be kernel, might be udev
[07:55] <mvo> mborzecki: each connect seems to take in the range of 12mb
[07:55] <mvo> mborzecki: it could be apparmor too, we load profiles on each connect
[07:55] <mborzecki> mvo: wow, that's a lot
[07:55] <mvo> mborzecki: well, that's what "free" tells me, it's not clear (to me yet) where this memory actually goes
[07:56] <mborzecki> mvo: i'll try to play with pprof, see if anything comes up
[07:57] <mborzecki> mvo: there's a bunch of maps flying around in the code, also we may be collecting some output from processes that we run
[08:00] <mvo> mborzecki: my gut feeling is that it's external to snapd, we need more data, but afaict this is fine on xenial/i386
[08:01] <mborzecki> mvo: interesting, is it possible to, say, disable apparmor and not call apparmor_parser?
[08:01] <mvo> mborzecki: yeah, that might be worth a shot
[08:02] <mvo> mborzecki: it's puzzling, free tells me I have "479404" used and "784648" free - that's confusing (but memory reporting apparently is)
[08:03] <mvo> mborzecki: I also see a lot of kmalloc-8192 in slab-info
[08:03] <mvo> mborzecki: the objects there grow from 500 to 5000
[08:03] <mvo> (which is still not that much in total memory size about 40mb)
[08:06] <pstolowski> mornings
[08:09] <mvo> mborzecki: my current theory is that for some reason "normal" memory runs out on i386. there is plenty of "highmem" free but the normal memory falls below the threshold of 20mb
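The lowmem theory can be checked against /proc/meminfo, which on 32-bit kernels exposes LowTotal/LowFree (the fields mborzecki pastes later). A small parser, run here against a sample text since LowFree does not exist on 64-bit kernels; the field names are real, the numbers are invented for illustration:

```python
# Parse a /proc/meminfo-style text. LowTotal/LowFree only appear on 32-bit
# kernels; the sample values below are made up for the demo.
def meminfo_field(text, field):
    """Return the value (in kB) of a meminfo field, or None if absent."""
    for line in text.splitlines():
        key, _, rest = line.partition(":")
        if key.strip() == field and rest.split():
            return int(rest.split()[0])
    return None

SAMPLE = """\
MemTotal:        1535704 kB
MemFree:          784648 kB
LowTotal:         409600 kB
LowFree:           19500 kB
HighFree:         765148 kB
"""

low_free = meminfo_field(SAMPLE, "LowFree")
# the ~20mb lowmem threshold mentioned above, expressed in kB
print("below 20 MB lowmem threshold:", low_free < 20 * 1024)
```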
[08:10] <mborzecki> pstolowski: morning
[08:10] <mvo> hey pstolowski ! good morning
[08:12] <pstolowski> mvo, heya, is this re to the 'killed...' issue on connect from yesterday?
[08:15] <mvo> pstolowski: yes
[08:16] <pstolowski> interesting
[08:17] <mup> PR snapd#4676 closed: timeutil: introduce helpers for checking it time falls inside the schedule <Created by bboozzoo> <Merged by stolowski> <https://github.com/snapcore/snapd/pull/4676>
[08:20] <mborzecki> mvo: hm the test only connects the interfaces, so maybe temporarily linking apparmor_parser to /bin/true would allow skipping this path, if that's what you suspect
[08:23] <mborzecki> pstolowski: if you're up for reviews https://github.com/snapcore/snapd/pull/4695 needs looking at :)
[08:23] <mup> PR #4695: wrappers: generator for systemd OnCalendar schedules <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4695>
[08:26] <mborzecki> mvo: xenial i386 vm is updating, meanwhile i'll look at refresh mode thing
[08:26] <pstolowski> mborzecki, sure!
[08:26] <mborzecki> pstolowski: great, thanks :)
[09:02] <mvo> mborzecki: I will write a forum post in a wee bit that summarizes my current findings
[09:05] <sergiusens> snappy-m-o autopkgtest 1942 xenial:armhf xenial:amd64 xenial:arm64
[09:05] <snappy-m-o> sergiusens: I've just triggered your test.
[09:06] <Chipaca> sergiusens: morning! (?)
[09:06] <sergiusens> Chipaca: intermittent, just woke up to trigger these darn tests :-)
[09:12] <sergiusens> snappy-m-o autopkgtest 1943 bionic:amd64 bionic:arm64 bionic:armhf
[09:12] <snappy-m-o> sergiusens: I've just triggered your test.
[09:31] <mup> PR snapd#4708 opened: overlord/snapstate: use spread in the default refresh schedule <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4708>
[09:36] <mborzecki> mvo: so no apparmor no leak?
[09:37] <mvo> mborzecki: thats what it looks like to me at least
[09:39] <mborzecki> that's an interesting observation
[09:39] <mborzecki> udev/seccomp/cgroups are all still there, right?
[09:39] <mvo> mborzecki: yes, I just disabled this one
[09:42] <mup> PR snapd#4709 opened: tests: fixes for autopkgtest in bionic <Created by mvo5> <https://github.com/snapcore/snapd/pull/4709>
[09:43] <mborzecki> mvo: can you try with something more recent, like 4.15.4?
[09:43] <mborzecki> mvo: i mean the kernel ofc
[09:47] <mvo> mborzecki: yeah, I am preparing a more automatic thing now
[09:48] <mvo> mborzecki: I mean a tarfile so that one can just run ./test.sh
[09:48] <mvo> mborzecki: and then I can test on more systems (artful etc) to see when it started
[09:49] <mborzecki> mvo: i did not observe any issues on xenial, but that's a really old kernel
[09:51] <mvo> mborzecki: interesting
[09:53] <mborzecki> mvo: https://paste.ubuntu.com/p/bWVBw4Yp6v/
[09:53] <mborzecki> LowTotal:         858144 kB
[09:53] <mborzecki> LowFree:          723696 kB
[09:53] <mvo> mborzecki: this is xenial?
[09:54] <mborzecki> mvo: yes
[09:55] <mvo> mborzecki: it seems to have a different lowmem split
[09:55] <mvo> mborzecki: i.e. more available (my system only had ~400mb)
[09:55] <mvo> mborzecki: is it also shrinking fast?
[10:02] <mvo> mborzecki: same here, no craziness on 16.04-32
[10:03] <seb128> mvo, hey
[10:03] <seb128> mvo, can you help us? andyrock needs some input on changes he would like to suggest for snapd that he needs for his livepatch work
[10:04] <mborzecki> mvo: https://imgur.com/a/xDeVR
[10:04] <mvo> seb128: sure, what kind of changes does he have in mind?
[10:04] <mborzecki> mvo: the test script starts around the 20s mark
[10:05] <andyrock> mvo: so we need to log in to snapd from the installer. The problem is that we can't talk to the to-be-installed snapd in ubiquity
[10:06] <andyrock> mvo: the chroot hack does not work (in a nutshell the chroot is not ready when we need it)
[10:06] <mvo> mborzecki: what is the y axis?
[10:06] <mborzecki> mvo: kB, LowFree
[10:06] <mvo> mborzecki: ta
[10:06] <andyrock> mvo: would it be possible to have a way to seed the auth state? something like this: https://paste.ubuntu.com/p/sSbKstQ2Sy/
[10:08] <mvo> andyrock: interesting! pedronis, what do you think about the ability to put auth.json into the seed directory and import it on firstboot? this is for the desktop installer use-case
[10:10] <pedronis> mvo: I'm missing a bit of the context
[10:11] <andyrock> pedronis: I've been asked to add snapd login in the installer
[10:11] <mvo> pedronis: they allow people to log in to the store/snapd in the livecd session. but the snapd that is running in there is not the snapd that is copied to the hard disk. the hard disk snapd will have an empty state.json
[10:11] <mvo> pedronis: so the user would have to login again after the system was installed
[10:13] <pedronis> what is that auth.json?  the  one from the homedir?
[10:13] <pedronis> or something else
[10:13] <andyrock> something else
[10:13] <pedronis> the one in the homedir doesn't have all the data
[10:13] <andyrock> something we need to take from the livecd state.json
[10:14] <andyrock> we can take all under auth: {...}
[10:14] <pedronis> no you don't want everything
[10:14] <pedronis> because there's device info there
[10:14] <andyrock> kk
[10:14] <pedronis> and I think we are trying not to make livecd sessions count the same way as full installs
[10:18] <andyrock> pedronis: so last-id, users, and macaroon-key?
[10:18] <andyrock> or are last-id and users enough?
[10:27] <pedronis> andyrock: you need the macaroon-key as well, if you plan to port ~/.snap/auth.json over or the gnome-software equivalent
[10:27] <andyrock> kk
[10:28] <andyrock> pedronis: so are you willing to accept something like that?
[10:28] <pedronis> maybe, it's not just my decision
[10:29] <pedronis> it also needs to be a bit more paranoid about checking the perms of that file
[10:29] <andyrock> kk who else should we ask?
[10:31] <pedronis> you need niemeyer +1 as well
[10:33] <pedronis> I'm mildly worried that it allows cloning the same creds onto a lot of images; it's not your use case but it can be used that way
[10:36] <andyrock> pedronis: should we move the discussion to the forum?
[10:37] <pedronis> yes
[10:37] <pedronis> you should make a proposal there I suppose
[10:49] <mvo> pedronis, andyrock another option might be to join the standup as a guest
[10:50] <zyga_> o/
[10:50] <zyga_> I'm sick and I won't be around today
[10:50] <zyga_> I just got off the bed to let you guys know
[10:50] <zyga_> sorry about that :/
[10:50] <mvo> zyga_: get well
[10:50]  * mvo hugs zyga_ 
[10:50] <zyga_> thank you, I'll be back as soon as I can
[10:51] <mup> PR snapd#4704 closed: tests: store journal in autopkgtest artiffacts and add 18.04-32 to qemu <Created by mvo5> <Closed by mvo5> <https://github.com/snapcore/snapd/pull/4704>
[11:09] <mvo> mborzecki: fwiw, artful seems to show the same behavior
[11:10] <magicaltrout> random question, is there a pattern for dealing with migrating configs and stuff on a snap package upgrade?
[11:11] <magicaltrout> or do you just bake something into the start script?
[11:15] <popey> mvo: I'm told there's a PR in flight somewhere which will enable a snap (like gnome-calculator) to auto-download & install a platform snap (like the gnome one). Do you know what stage that's at?
[11:15] <mvo> popey: that is #4103
[11:15] <mup> PR #4103: snapstate: auto install default-providers for content snaps <Created by mvo5> <https://github.com/snapcore/snapd/pull/4103>
[11:16] <mvo> popey: we want to get it in for 2.32, it's a bit hard because there is still some work to do but it's super important to us
[11:16] <popey> mvo: excellent, thanks.
[11:16] <popey> Yes, that would be awesome.
[11:32]  * pstolowski lunch
[11:35] <mborzecki> mvo: that problem with test-snapd-service endure: the only idea i have atm is that we do journalctl --rotate but no journalctl --sync; if a test using test-snapd-service was run before that one, we could end up with a log from the previous run
[11:36] <mvo> mborzecki: interesting, that sounds like a good theory
[11:37] <mborzecki> funnily enough, looking at the source code of journalctl, you cannot use `journalctl --sync --rotate` in a single command; both flags are set to the same variable
[11:45] <mup> PR snapd#4710 opened: tests/lib/prepare-restore: sync journal before rotating and vacuuming <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4710>
[11:46] <mborzecki> mvo: ^^ a simple fix for now, let's see if this is enough
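The fix boils down to running --sync before --rotate as two separate journalctl invocations, since (per the note above) they cannot be combined in one command. A minimal sketch of the command sequence; the commands are only built, not executed, since journalctl needs a systemd journal and root:

```python
# Sketch of the workaround: journalctl --sync and --rotate must be two
# separate invocations. Commands are built but not executed here.
def journal_prepare_cmds():
    return [
        ["journalctl", "--sync"],    # flush pending entries to disk first
        ["journalctl", "--rotate"],  # then rotate, so stale logs can't leak in
    ]

for cmd in journal_prepare_cmds():
    print(" ".join(cmd))
```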
[11:46] <mup> PR snapd#4709 closed: tests: fixes for autopkgtest in bionic <Created by mvo5> <Merged by sergiocazzolato> <https://github.com/snapcore/snapd/pull/4709>
[11:47] <mvo> mborzecki: \o/
[12:00] <Chipaca> mborzecki: mvo: #4708 gtg
[12:00] <mup> PR #4708: overlord/snapstate: use spread in the default refresh schedule <Critical> <Created by bboozzoo> <https://github.com/snapcore/snapd/pull/4708>
[12:02] <mup> PR snapd#4708 closed: overlord/snapstate: use spread in the default refresh schedule <Critical> <Created by bboozzoo> <Merged by bboozzoo> <https://github.com/snapcore/snapd/pull/4708>
[12:07] <mvo> Chipaca, mborzecki thanks, cherry-picked into 2.31.1
[12:09] <cachio> mvo, hey
[12:10] <mvo> hey cachio
[12:10] <cachio> test-snapd-hello-classic is in the store for s390x, still waiting for a review
[12:10] <cachio> mvo, updating the kernel in the qemu machine
[12:10] <mvo> cachio: aha, cool, I can do that after lunch
[12:30] <cachio> mvo, same issue with an old kernel
[12:30] <mborzecki> cachio: oom?
[12:31] <cachio> mborzecki, yes
[12:32] <mborzecki> cachio: how much ram are those vms getting?
[12:32] <cachio> mborzecki, well, I could not connect anymore
[12:32] <cachio> mborzecki, 3500 MB
[12:34] <mborzecki> cachio: tried this today on 16.04 32bit, with -mem 4096 and haven't observed any issues, lowmem did drop a bit though, see ~20s mark https://imgur.com/a/xDeVR
[12:36] <cachio> mborzecki, I ran some scripts yesterday and it looks like on 18.04 i386 the garbage collection is not done
[12:37] <cachio> so you connect, disconnect, and do stuff, and the memory is not released
[12:40] <mborzecki> cachio: can you try to collect this `while true; do echo "$(date +%s) $(awk '/LowFree/ {print $2}' < /proc/meminfo)"; sleep 1; done` while the test is run and paste it somewhere?
[12:40] <cachio> mborzecki, sure
[13:05] <cachio> mborzecki, https://paste.ubuntu.com/p/JTBkvh5PqN/
[13:14] <mborzecki> hmm it's really low to begin with
[13:21] <Chipaca> jdstrand: ping
[13:22] <jdstrand> hey Chipaca
[13:22] <Chipaca> jdstrand: pm
[13:23] <mvo> jdstrand: hey, https://forum.snapcraft.io/t/oom-for-interfaces-many-on-bionic-i386/4101 might be interesting for you guys, I suspect it is apparmor related (but maybe not)
[13:25] <sergiusens> stgraber: hi there, was wondering if there is a probe command in lxd to figure out that we have been `init'ed`
[13:54] <mborzecki> i'm experimenting with `snap set core refresh.timer=<...>` and it's a bit weird: you see the new setting appear right away in `snap refresh --time`, but it becomes effective only after a restart or when the current next expires
[13:57] <pedronis> mborzecki: shouldn't be like that looking at the code, but it might take 5 minutes to take effect if we are not careful
[13:59] <mborzecki> pedronis: how often is autorefresh.Ensure() called?
[14:02] <cachio> mvo, mborzecki I ran with this kernel and I got the oom issue too https://paste.ubuntu.com/p/4X9s7jtybd/
[14:02] <pedronis> mborzecki: if nothing is going on, each 5 minutes
[14:02] <pedronis> that's where my 5 minutes comes from
[14:03] <mup> PR snapd#4710 closed: tests/lib/prepare-restore: sync journal before rotating and vacuuming <Created by bboozzoo> <Merged by mvo5> <https://github.com/snapcore/snapd/pull/4710>
[14:04] <mvo> cachio: oh, interesting.
[14:04] <pedronis> mborzecki: it's   ensureInterval  in overlord/overlord.go, probably worth getting familiar with that bit
[14:04] <mborzecki> pedronis: yup, looking at it now
[14:05] <pedronis> mborzecki: we have state.EnsureBefore  to trigger it earlier, various places use it
[14:05] <pedronis> some in taskrunner.go
[14:06] <mborzecki> right, i've seen this one before :)
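The ensure loop pedronis describes (managers' Ensure() running every ensureInterval, with state.EnsureBefore waking it earlier) can be modelled as a toy sketch. The names and structure here are illustrative, not snapd's actual overlord API:

```python
# Toy model of the overlord Ensure loop: run_once() normally waits a full
# ensure interval (5 minutes in the discussion above), but ensure_before(0)
# wakes it immediately - e.g. right after a new refresh.timer is set.
import threading

class ToyOverlord:
    def __init__(self, interval):
        self.interval = interval
        self._wake = threading.Event()
        self.ensure_count = 0

    def ensure_before(self, seconds):
        # ask for the next Ensure() to happen within `seconds`; the toy
        # only supports the "as soon as possible" case
        if seconds == 0:
            self._wake.set()

    def run_once(self):
        woke_early = self._wake.wait(timeout=self.interval)
        self._wake.clear()
        self.ensure_count += 1
        return woke_early

o = ToyOverlord(interval=300)
o.ensure_before(0)   # e.g. a config change wants to take effect now
print(o.run_once())  # returns immediately instead of after 5 minutes
```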
[14:07] <cachio> mvo, is there any way to disable apparmor and make the tests work?
[14:07] <cachio> mvo, to rule that out?
[14:09] <cachio> mvo, I'll try just removing it
[14:09] <mvo> cachio: I did: "mv /sbin/apparmor_parser /sbin/apparmor_parser.saved"
[14:09] <mborzecki> pedronis: there's a funny effect: once a refresh runs and is successful, next becomes time.Time{}, so when you do `snap refresh --time`, next will be 'n/a'
[14:10] <mvo> cachio: cp /bin/true /sbin/apparmor_parser
[14:10] <jdstrand> mvo: interesting. As you said, this sounds like a kernel issue since you couldn't reproduce on 4.10 or lower
[14:11] <cachio> mvo, you already tried that and it didn't work?
[14:11] <jdstrand> mvo: you said 4.10 was not affected. 4.13 (artful) is? why are we only seeing this now? is it just that now is the first time you've dug into the artful and bionic issues?
[14:11] <mvo> jdstrand: take it with a grain of salt, cachio said he sees it with 4.10 too, I will double check
[14:11] <mvo> cachio: that worked for me
[14:11] <mvo> cachio: when I moved the apparmor_parser away I did not see the mem leak anymore
[14:11] <jdstrand> mvo: how much memory is in the vm?
[14:12] <mvo> jdstrand: I think it's a few factors: the interfaces-many test got added relatively recently, we don't run much on i386, and autopkgtest for 2.30 did not run because we had problems getting this snap accepted into the archive at all
[14:12] <mvo> jdstrand: the vm has 1500mb
[14:12] <mvo> jdstrand: but it seems like the LowMem is what triggers the oom on i386, which is just 400mb
[14:12] <jdstrand> mvo: this is i386 only/
[14:13] <jdstrand> ?
[14:13] <mvo> jdstrand: I suspect it's amd64 too, it just takes a lot longer as it has the full 1500mb of memory
[14:13] <mvo> jdstrand: for some reason it's the lowmem on i386 that runs out
[14:13] <mvo> jdstrand: the oom-test.tar.gz should have all that is needed to reproduce
[14:14] <mvo> jdstrand: let me run it (in a loop) on amd64 to see if it eventually also dies
[14:14] <jdstrand> mvo: yes. I'd like to remove snapd from the equation though so I commented in the topic
[14:15] <mvo> jdstrand: I see ~15mb per "snap connect" decrease in free memory with 4.13. maybe with 4.10 (and below) the leak is just smaller; that is why cachio sees it (because he was looking harder) and I don't (because I was just looking for the oom to trigger)
[14:15] <Chipaca> mvo: the interfaces-many test is a bit of a hog, it has so many changes that towards the end it takes seconds to do each one
[14:16] <mvo> Chipaca: indeed
[14:16] <mvo> jdstrand: I run it now on amd64 in a loop just to see what will happen (bionic with 4.13)
[14:16] <mborzecki> mvo: what if you try to reload apparmor profiles in a loop?
[14:16] <jdstrand> mvo: note that an apparmor profile does take kernel memory. it should not take that much and it shouldn't lose 15mb with a replace operation
[14:17] <mvo> jdstrand: *nod*
[14:18] <mborzecki> off to pick up the kids
[14:26] <mvo> jdstrand: fwiw, it looks like amd64 is also affected, it just takes longer because of the different memory available (1480 vs 390)
[14:28] <cachio> mvo, https://paste.ubuntu.com/p/YsCpXvfgNg/
[14:28] <cachio> 56MB free
[14:28] <cachio> preparing the suite
[14:36] <kalikiana> re
[14:49] <sergiusens> snappy-m-o autopkgtest 1942 xenial:armhf
[14:49] <snappy-m-o> sergiusens: I've just triggered your test.
[15:18] <mvo> cachio: yeah, same here, eventually amd64 also runs out of memory
[15:22] <cachio> mvo, I am gonna try with an older bionic image
[15:23] <mvo> cachio: thank you
[15:24] <jdstrand> mvo: https://forum.snapcraft.io/t/oom-for-interfaces-many-on-bionic-i386/4101/5
[15:25] <jdstrand> oh heh, you already liked it
[15:25] <mvo> jdstrand: very much indeed! thanks for the reproducer
[15:25] <mvo> cachio: -^
[15:25] <mvo> cachio: looks like jdstrand has an even better reproducer that does not involve snapd, probably easy(ish) to bisect using this
[15:26] <brunosfer> Hi guys! I installed bluez in Ubuntu Core Snappy and I'm having this problem:
[15:26] <brunosfer> bluez.sdptool browse local
[15:26] <brunosfer> Failed to connect to SDP server on FF:FF:FF:00:00:00: No such file or directory
[15:26] <jdstrand> mvo: I think for now the test should be disabled on i386
[15:26] <brunosfer> Does anyone knows how can I solve this issue?
[15:26] <jdstrand> mvo: the test really only needs to be run on one arch imho
[15:27] <mvo> yes+
[15:27] <jdstrand> well
[15:27] <jdstrand> the snapd part of the test is only useful on one arch
[15:28]  * mvo nods
[15:28] <jdstrand> the fact that it showed an issue on i386 is certainly helpful :)
[15:38] <marcoceppi> o/ can anyone help me with this: https://forum.snapcraft.io/t/refreshing-disabled-snap-kubectl-not-supported/4060/11
[15:39] <brunosfer> Hi guys! I installed bluez in Ubuntu Core Snappy and I'm having this problem: $bluez.sdptool browse local Failed to connect to SDP server on FF:FF:FF:00:00:00: No such file or directory
[15:39] <brunosfer> Does anyone knows how can I solve this issue?
[15:40] <ogra_> brunosfer, it is probably best to ask this on the forum ... koza can probably help but i think he is not around currently (so he can more easily pick up the question on the forum later)
[15:40] <ogra_> also make sure to describe what HW you run and if all your interfaces are connected properly etc
[15:41] <brunosfer> how much later? because I posted that issue back on December 17th and so far I'm stuck and nobody has replied...
[15:42] <ogra_> brunosfer, where did you post it ?
[15:42] <marcoceppi> Chipaca: o/
[15:42] <brunosfer> Every time I ask this question here people redirect me to the forum and it's a dead end...
[15:42] <ogra_> (got a link to the forum post ?)
[15:42] <brunosfer> https://forum.snapcraft.io/t/failed-to-connect-to-sdp-server-permission-problem/3085
[15:47] <mvo> hm, hm, today https://forum.snapcraft.io/t/all-revisions-of-snaps-are-mounted-when-they-dont-need-to-be/2294 came up, I wonder if anyone ever looked what is taking >500ms
[15:48] <jdstrand> mvo: fyi, jjohansen is aware of a profile replace memory leak and will continue to look into it
[15:55] <Chipaca> marcoceppi: o/!
[15:55] <Chipaca> marcoceppi: how's things?
[15:55] <Chipaca> marcoceppi: so, tell me more about this messed-up snapd you have
[15:55] <marcoceppi> great except for my broken snaps :(
[15:55] <marcoceppi> I mean, it's my laptop
[15:55] <Chipaca> marcoceppi: ok. Can you enabled debug, if you haven't already?
[15:56] <Chipaca> enable*
[15:56] <marcoceppi> 16.04 fully patched; a few days ago a new kubectl snap was published and since then it's not been well
[15:56] <Chipaca> marcoceppi: that's add SNAPD_DEBUG=1 to /etc/environment and 'systemctl restart snapd'
[15:56] <marcoceppi> yeah, restarting snapd hangs
[15:57] <Chipaca> niiice
[15:57] <Chipaca> marcoceppi: hangs how?
[15:57] <mvo> marcoceppi: what exactly is broken? does snapd itself misbehave? or one (or muliple) snaps?
[15:57] <marcoceppi> mvo: https://forum.snapcraft.io/t/refreshing-disabled-snap-kubectl-not-supported/4060/11
[15:57] <Chipaca> mvo: https://forum.snapcraft.io/t/refreshing-disabled-snap-kubectl-not-supported/4060/7
[15:57] <marcoceppi> Chipaca: restart command blocks indefinitely
[15:57] <Chipaca> haha
[15:57] <Chipaca> youch
[15:57] <Chipaca> marcoceppi: what does 'journalctl -u snapd' show?
[15:57] <marcoceppi> I can straight up kill it
[15:58] <marcoceppi> Chipaca: https://paste.ubuntu.com/p/fDhqFCPGkC/
[15:59] <Chipaca> marcoceppi: is that the _end_ of the journal?
[15:59] <marcoceppi> yeah, I mean, I just did an -f and it's printing a few more things
[15:59] <marcoceppi> like systemctl just SIGKILL'd the proc
[15:59] <Chipaca> with the SNAPD_DEBUG, and it hanging, i'd expect lots of stuff
[15:59] <Chipaca> nnngh
[15:59] <marcoceppi> oh, no, still waiting for it to restart
[15:59] <marcoceppi> so it can get SNAPD_DEBUG
[16:00] <Chipaca> ok, so just kill it
[16:00] <Chipaca> something is bad
[16:00]  * marcoceppi nods
[16:00] <Chipaca> and the oom will kill one thread and leave go wondering
[16:00] <Chipaca> so when that happens you need to manually kill the whole thing
[16:00] <pedronis> kubectl is classic fwiw
[16:00] <Chipaca> ack
[16:01] <marcoceppi> yes, but lxd isn't, pedronis, and it has the same problem
[16:01] <Chipaca> pedronis: i mean, if it were just the snap misbehaving i'd kick it over to sergio :-)
[16:01] <Chipaca> pedronis: but this is snapd doing something weeeird
[16:01] <pedronis> but you said it started when you got the new kubectl
[16:01] <marcoceppi> that's when I noticed it exhibiting issues, yes
[16:01] <Chipaca> marcoceppi: how's the kill-everything-and-start-over going?
[16:01] <marcoceppi> it's started
[16:02] <marcoceppi> Chipaca: https://paste.ubuntu.com/p/vR9Nky4bhx/
[16:03] <Chipaca> marcoceppi: can you 'curl -s -N -0 --unix-socket /run/snapd.socket http://localhost/v2/changes?select=all | python -m json.tool | pastebinit'?
[16:03] <Chipaca> marcoceppi: assuming you have pastebinit, otherwise you know what i mean
[16:03] <marcoceppi> that's a mouthful
[16:04] <marcoceppi> http://paste.ubuntu.com/p/nRTM6jyyvT/
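For reference, the curl command above speaks plain HTTP over snapd's unix socket. A self-contained sketch of the same mechanism: a fake one-shot server on a temporary socket stands in for snapd (the real endpoint is /run/snapd.socket), and the JSON body is invented for the demo.

```python
# Talk HTTP over a unix socket, like `curl --unix-socket /run/snapd.socket`.
# A fake one-shot server stands in for snapd here.
import http.client
import json
import os
import socket
import tempfile
import threading

SOCK = os.path.join(tempfile.mkdtemp(), "snapd.socket")

server = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
server.bind(SOCK)
server.listen(1)

def serve_once():
    c, _ = server.accept()
    c.recv(65536)  # read the request; a real server would parse it
    body = json.dumps({"type": "sync", "result": []}).encode()
    c.sendall(b"HTTP/1.1 200 OK\r\nContent-Type: application/json\r\n"
              b"Content-Length: %d\r\n\r\n" % len(body) + body)
    c.close()

t = threading.Thread(target=serve_once)
t.start()

class UnixHTTPConnection(http.client.HTTPConnection):
    """HTTPConnection that dials a unix socket instead of TCP."""
    def connect(self):
        self.sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
        self.sock.connect(SOCK)

conn = UnixHTTPConnection("localhost")
conn.request("GET", "/v2/changes?select=all")
resp = conn.getresponse()
data = json.loads(resp.read())
t.join()
server.close()
print(resp.status, data["type"])
```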
[16:06] <Chipaca> marcoceppi: can you do a 'sudo du -sh /root/snap/* /home/user/snap/* /var/snap/*' ?
[16:06] <Chipaca> (depending on your permissions you might need to tweak that for the glob to expand everywhere)
[16:07] <marcoceppi> Chipaca: https://paste.ubuntu.com/p/HNq9hGGm66/
[16:08] <Chipaca> marcoceppi: if you say 'snap watch 481' does it still say it's copying stuff?
[16:08] <marcoceppi> yes
[16:08] <Chipaca> marcoceppi: that's kubefed, yes?
[16:08] <marcoceppi> correct
[16:08] <Chipaca> marcoceppi: so my question for you is, how does it take this long to copy 28k of data
[16:09] <marcoceppi> well, that's my question for you - isn't it :D
[16:09] <marcoceppi> SSD with over 60GB of free space on EXT4
[16:09] <Chipaca> marcoceppi: can you try copying that directory somewhere, by hand? ie 'cp -a /var/snap/kubefed /var/tmp' ?
[16:10] <marcoceppi> Chipaca:
[16:10] <marcoceppi> time cp -a /var/snap/kubefed /var/tmp/
[16:10] <marcoceppi> real	0m0.003s
[16:10] <marcoceppi> user	0m0.000s
[16:10] <marcoceppi> sys	0m0.000s
[16:10] <marcoceppi> worked without issue
[16:11] <cachio> mvo, using an 2 months old bionic image I see the same oom error
[16:11] <mvo> cachio: ta
[16:11] <Chipaca> marcoceppi: the other one you had in Doing was copying the lxd data
[16:11] <Chipaca> marcoceppi: that one could take a bit longer, it being 5 gigs
[16:12] <mvo> cachio: jdstrand told me the memory leak is something that jjohansen is investigating. maybe he knows (roughly) when this started ? if not we might be able to help bisecting
[16:12] <marcoceppi> yeah, but it's been over 24 hours
[16:12] <marcoceppi> and LXD was a problem after 24 hours of the kubectl issue
[16:12] <marcoceppi> I feel like snapd is unable to perform this action
[16:13] <Chipaca> marcoceppi: is there anything new in the logs?
[16:13] <marcoceppi> Feb 20 11:03:59 T430 snapd[28114]: 2018/02/20 11:03:59.146915 daemon.go:233: DEBUG: pid=28499;uid=1000;@ GET /v2/changes?select=all 1.369681ms 200
[16:13] <marcoceppi> that is the only new line
[16:13] <Chipaca> that's you
[16:14] <marcoceppi> yes, I realize that, but wanted to be complete in my reporting
[16:14] <Chipaca> yes yes, i was just confirming
[16:14] <Chipaca> I don't understand what it thinks it's doing
[16:14] <marcoceppi> can I - like delete this change attempt and just start over?
[16:15] <marcoceppi> I really /really/ DO not want to lose my lxd containers
[16:15] <marcoceppi> but kubectl has no userdata
[16:15] <Chipaca> marcoceppi: i mean, you could, but it'd be hairy
[16:15] <Chipaca> marcoceppi: can you stop snapd, and start it straced?
[16:16] <marcoceppi> you got what I should invoke for snapd to be strace'd?
[16:17] <Chipaca> marcoceppi: so, first, systemctl stop snapd.\*
[16:18] <cachio> mvo, so, are you preparing 2.31.1 today, or do we need to wait until this is fixed?
[16:18] <Chipaca> marcoceppi: and then: sudo strace -E SNAPD_DEBUG=1 -f -s 9999 -o /tmp/snapd.trace /usr/lib/snapd/snapd
[16:18] <cachio> mvo, or any other fix?
[16:20] <pedronis> Chipaca: 5G in versioned data, that's a problem in itself; we thought common is for that
[16:20] <Chipaca> pedronis: is it versioned?
[16:20] <pedronis> well if it's being copied
[16:20] <pedronis> we don't copy common data, no?
[16:20] <Chipaca> we don't
[16:20] <Chipaca> but I don't know if that's what it's doing
[16:20] <Chipaca> it says it's copying a 28k directory, also
[16:20] <pedronis> ok, ignore me
[16:20] <Chipaca> for hours
[16:20] <Chipaca> pedronis: no, no, i need ideas :-)
[16:21] <Chipaca> pedronis: especially if nothing turns up in the trace
[16:21] <pedronis> anyway both these snaps are classic I suppose
[16:21] <marcoceppi> Chipaca: so, I can't start snapd manually, says the socket is in use but no other proc is really running
[16:21] <pedronis> so they could do strange stuff on their own
[16:21] <pedronis> fwiw
[16:21] <pedronis> though hooks have timeouts
[16:21] <pedronis> in theory
[16:21] <marcoceppi> Chipaca: nvm, fixed it
[16:21] <Chipaca> marcoceppi: what was it?
[16:22] <marcoceppi> I have like 100% lack of chill right now, heh
[16:22] <marcoceppi> how long do you want me to collect strace data?
[16:22] <mvo> cachio: today
[16:23] <mvo> cachio: it looks like we need test-snapd-password-manager-consumer for s390x
[16:23] <mvo> cachio: I'm just inspecting the s390x autopkgtest failure
[16:23] <Chipaca> marcoceppi: a couple of minutes after it says 'Running task 5559 on Doing: Copy snap "kubefed" data'
[16:23] <mvo> cachio: but no blocker, I will go ahead with 2.31.1 onw
[16:23] <marcoceppi> Chipaca: cool, it's been doing that, I'll let it run a wee bit
[16:24] <Chipaca> marcoceppi: you can tail /tmp/snapd.trace if you're anxious (i know i'd be)
[16:25] <cachio> mvo, ok, I can't update the snap recipe
[16:25] <mvo> cachio: oh, ok. let me check
[16:25] <cachio> to generate the snap for s390x
[16:26] <mup> PR snapd#4711 opened: tests: make restore of interfaces-password-manager-service more robust <Created by mvo5> <https://github.com/snapcore/snapd/pull/4711>
[16:26] <cachio> mvo, it is yours
[16:26] <marcoceppi> Chipaca: Okay 5 mins of strace data since copy message
[16:26] <mvo> cachio: *cough* indeed - fixed
[16:27] <marcoceppi> Chipaca: https://paste.ubuntu.com/p/4CphPf7stb/
[16:28] <Chipaca> marcoceppi: and /tmp/snapd.trace ?
[16:28] <marcoceppi> Chipaca: yeah, let me paste it
[16:28] <Chipaca> ah :-)
[16:28] <Chipaca> i was worried and perplexed, there
[16:28] <marcoceppi> although - heh - I can't get the proc to interrupt
[16:29] <marcoceppi> Chipaca: http://paste.ubuntu.com/p/7bCBVTMMTV/
[16:29] <marcoceppi> I hope I didn't crash pastebin
[16:30] <Chipaca> marcoceppi: you're probably going to have to sudo killall strace
[16:30] <marcoceppi> sure, I'll also gz upload that file
[16:30] <Chipaca> can confirm that pastebin is not a happy bunny
[16:32] <marcoceppi> Chipaca: http://marcoceppi.com/snapd.trace.gz
[16:33] <mup> PR snapcraft#1944 opened: tests: split the plugins tests in the same directory <Created by elopio> <https://github.com/snapcore/snapcraft/pull/1944>
[16:33] <cachio> mvo, I see several errors on the arm64 execution for autopkgtest
[16:33] <cachio> mvo, should I fix them?
[16:34] <marcoceppi> Chipaca: interestingly, I have a bunch of orphaned sync commands like in the paste above
[16:34] <mvo> cachio: yeah, but not high priority, it was not green before so it will not be a blocker
[16:35] <cachio> mvo, yes, same for ppc64el and s390x
[16:35] <mvo> cachio: the issues on ppc64el and s390x are considered regressions by adt so that is higher priority
[16:35] <mvo> cachio: afaict ppc64el is mostly good
[16:35] <mvo> cachio: and s390x has some snaps missing, then we need to retrigger and see
[16:36] <cachio> ppc64el just 4 errors
[16:36] <mvo> cachio: its "funny", s390x was considered working before because we skipped it because it was only a lxd container
[16:36] <mvo> cachio: the timezone ones, right?
[16:36] <marcoceppi> brb, going to restart
[16:36] <mvo> cachio: those are fixed with the systemd in -proposed
[16:37] <cachio> mvo, yes and confinement-classic because the snap was not released
[16:37] <mvo> cachio: indeed
[16:38] <mvo> cachio: apparently it is possible to retrigger the adt with &trigger=systemd/proposed&trigger=snapd/proposed at the end of the retrigger url - this will cause snapd to be tested with the right systemd version
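The retrigger trick mvo describes is just URL construction; a minimal sketch (the base URL below is a placeholder, not a real autopkgtest endpoint — the real one comes from the results page):

```shell
# Placeholder base URL standing in for the adt retrigger link.
BASE="https://autopkgtest.example/request.cgi?package=snapd&release=xenial&arch=s390x"
# Appending two trigger= parameters makes adt test snapd together with the
# systemd currently in -proposed, instead of the release-pocket systemd.
RETRIGGER="${BASE}&trigger=systemd/proposed&trigger=snapd/proposed"
echo "$RETRIGGER"
```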
[16:38] <cachio> mvo, perfect
[16:40] <pedronis> Chipaca: the copies are cp -av afaict
[16:41] <Chipaca> yep, "cp", "-av", "/home/marco/snap/kubectl/303", "/home/marco/snap/kubectl/328"
[16:41] <pedronis> so ps should tell us which one is taking forever?
[16:41] <mvo> cachio: I uploaded snapd 2.31.1 to the ppa, if all goes well you should have a beta core in ~45min or so
[16:42] <cachio> mvo, great, thanks
[16:43] <pedronis> mvo: I just noticed that this bit of test  mock := testutil.MockCommand(c, "kdialog", "sleep 9999999") seems to leave around the sleep
[16:43] <Chipaca> pedronis: it looks like the cp's finished
[16:43] <cachio> mvo, we need test-snapd-wayland for arm64 in edge
[16:44] <cachio> mvo, could you please share it to me?
[16:44] <cachio> tx
[16:44] <Chipaca> pedronis: lxd and kubectl at least, looking at the other two
[16:44] <marcoceppi> Chipaca: I restarted my machine and everything's fixed
[16:45] <Chipaca> marcoceppi: oh no
[16:45] <pedronis> for a bit, usually snapd will restart doing what it was doing before
[16:45] <marcoceppi> I mean, the changes are all flushed, and the snaps are enabled and the correct current versions
[16:45] <marcoceppi> ID   Status  Spawn                 Ready                 Summary
[16:45] <marcoceppi> 481  Done    2018-02-17T01:48:48Z  2018-02-20T16:39:28Z
[16:45] <marcoceppi> 482  Done    2018-02-19T05:32:01Z  2018-02-20T16:39:45Z  Refresh snaps "lxd", "go"
[16:45] <marcoceppi> 483  Done    2018-02-20T16:39:41Z  2018-02-20T16:40:23Z  Auto-refresh snap "core"
[16:46] <marcoceppi> Only problem is debug is still enabled despite removing the environment line and restarting snapd
[16:46] <Chipaca> marcoceppi: what's "snap tasks 481"?
[16:47] <marcoceppi> that was the first one that got stuck, the kubectl and kubefed one
[16:47] <pedronis> Chipaca: we had a bug when you refreshed some exact number of snaps the summary would come out empty
[16:47] <Chipaca> ah ok
[16:47] <Chipaca> marcoceppi: this wasn't the first time you restarted since this started, right?
[16:47] <pedronis> it's fixed, I don't remember what the magic number was
[16:48] <marcoceppi> no, this was the first time
[16:48] <pedronis> it was a confusion bug around go switch stmt != C switch stmt
[16:48] <marcoceppi> I don't like restarting, messes up my state
[16:48] <Chipaca> pedronis: heh
[16:48] <Chipaca> marcoceppi: fair enough
[16:48] <pedronis> Chipaca: a missing fallthrough
[16:48] <pedronis> or something like that
[16:48] <Chipaca> marcoceppi: I hate losing state, and i hate problems that disappear without me learning from them
[16:48] <marcoceppi> I suppose I should have done so earlier; either way it seems something about sync getting stuck caused a backlog
[16:48] <Chipaca> marcoceppi: but, you're fixed, so yay :-)
[16:49] <marcoceppi> hah, yes, back to work now
[16:49] <Chipaca> marcoceppi: does running 'sync' hang, for you?
[16:49] <marcoceppi> not anymore
[16:49] <marcoceppi> but I should have tested earlier
[16:49] <Chipaca> marcoceppi: was it doing so before the reboot?
[16:49] <marcoceppi> did not test that
[16:49] <Chipaca> ah
[16:50] <Chipaca> if it was, I'd start shopping for SSDs
[16:50] <marcoceppi> considering there were sync processes from days ago, I'm inclined to believe that may be the problem
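Days-old orphaned sync processes like the ones marcoceppi mentions can be spotted from their elapsed time; a small diagnostic sketch (field names are standard ps(1) format keywords):

```shell
# List any running "sync" processes with their elapsed time. An etime in
# day notation (e.g. "2-03:14:07") matches the days-old processes seen in
# the log and points at I/O that never completes, not at snapd itself.
stuck=$(ps -eo pid,etime,comm | awk '$3 == "sync"')
if [ -n "$stuck" ]; then
    echo "long-running sync processes found:"
    echo "$stuck"
else
    echo "no sync processes running"
fi
```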
[16:50] <Chipaca> but ¯\_(ツ)_/¯
[16:51] <marcoceppi> hum, I don't have any smart errors
[16:51] <marcoceppi> but I'll start a disk copy anyways
[16:52] <Chipaca> marcoceppi: it might be something weirder, some corner case in the mount handling code that we're tripping up
[16:52] <Chipaca> marcoceppi: (we've fixed so many of those already...)
[16:53] <Chipaca> marcoceppi: if it ever happens again, jump on here
[16:53] <Chipaca> marcoceppi: glad you're unstuck, and good luck
[16:53] <marcoceppi> Chipaca: I'll keep an eye out - thanks!
[16:58] <marcoceppi> Chipaca: any way to get rid of the debug output?
[16:58] <Chipaca> marcoceppi: it should be enough to remove it from /etc/environment and restart
[16:59] <marcoceppi> yeah, did that :|
[16:59] <Chipaca> marcoceppi: but, note you'll still have it in your shell and stuff
[16:59] <marcoceppi> oh, yeah, it's in my shell
[16:59] <marcoceppi> DUH
[16:59] <Chipaca> marcoceppi: but systemctl shouldn't pass them to the units
[16:59] <Chipaca> marcoceppi: but if you're running it by hand because that's cool, then that's why
[17:00] <marcoceppi> I mean, it's showing in all my snap commands
[17:00] <marcoceppi> but because snap_debug was in my shell
[17:00] <marcoceppi> so unset and ready to go!
[17:00] <Chipaca> ah, i thought you were seeing it in the journal
[17:00] <Chipaca> :-) ok
[17:02] <mvo> jdstrand: I noticed something curious - it looks like c1093d46437a33d60d7378b1c6676818655bc359 is not in master currently, I noticed when merging back the changes from 2.31.1
[17:02] <mup> PR snapd#4712 opened: release: version 2.31.1 <Created by mvo5> <https://github.com/snapcore/snapd/pull/4712>
[17:06] <cachio> mvo, in fact I can't see where the test-snapd-wayland code is? do you know?
[17:06] <cachio> mvo, I found it
[17:10] <cachio> jdstrand, hey, we need the test-snapd-wayland for arm64
[17:11] <cachio> to make autopkgtests pass
[17:11] <cachio> jdstrand, could you please create the snap?
[17:28]  * kalikiana wrapping up for the day
[18:14] <mvo> cachio: i386/amd64 core 2.31.1 ready in beta
[18:16] <mvo> cachio: arm* should be ready in ~1h or so
[18:22] <mup> PR snapd#4711 closed: tests: make restore of interfaces-password-manager-service more robust <Created by mvo5> <Merged by mvo5> <https://github.com/snapcore/snapd/pull/4711>
[18:46] <mvo> cachio: hrm, hrm, armhf/arm64 does not give me a build slot, maybe the 1h was too optimistic - I'll keep an eye on things
[18:50] <phoenix_firebrd> what version of intel vaapi driver is present in snappy core package in stable channel
[18:54] <niemeyer> jdstrand: Looks like the review tools need to learn about the "adapter" field:
[18:55] <niemeyer>   - unknown fields for app 'jmes': 'adapter'
[19:01] <jdstrand> niemeyer: ack, noted. I'll get that fixed
[19:02] <niemeyer> jdstrand: Thanks!
[19:02] <niemeyer> jdstrand: I just added a note to the original topic, btw: https://forum.snapcraft.io/t/telling-snapcraft-to-skip-generating-wrapper-scripts/1635
[19:04] <jdstrand> thanks
[19:05] <niemeyer> jdstrand: Meanwhile, would you mind unblocking the review and letting it through.. it's a pretty boring snap otherwise
[19:05] <niemeyer> strict, no interfaces, etc
[19:05] <jdstrand> yes, I was heading over there
[19:05] <niemeyer> Thanks!
[19:09] <jdstrand> niemeyer: done. you'll need to release it to a channel
[19:09] <niemeyer> jdstrand: Thanks again!
[19:10] <jdstrand> np :)
[19:43] <mvo> cachio: all arches for 2.31.1 in beta now
[19:43] <cachio> great, 32 and 64 bits already running
[19:43] <cachio> mvo, no errors so far
[19:43] <mvo> cachio: \o/
[19:44] <mvo> cachio: 32bit is 2.31.1? I think I did something wrong with the promotion before but its correct now (sorry for that)
[19:45] <cachio> yes, 2.31.1
[20:45] <cmars> I'm trying to use this sockets feature in a snap, doesn't seem to be supported by snapcraft. is it supported yet? https://docs.snapcraft.io/build-snaps/syntax#sockets
[20:48] <niemeyer> cmars: It is supported, but not with that syntax.. this documentation is out of date as of a couple of years ago :)
[20:48] <cmars> niemeyer: gotcha.. do you have an example of how it's supposed to look?
[20:49] <niemeyer> cmars: It took us a while to reimplement support for it, but it's now in.. let me provide you with something you can go forward with.. just a sec
[20:49] <cmars> niemeyer: awesome to hear, thanks!
[20:51] <niemeyer> cmars: https://github.com/snapcore/snapd/blob/master/tests/lib/snaps/socket-activation/meta/snap.yaml
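For reference, the linked test snap's snap.yaml has roughly this shape (a sketch, not a verbatim copy — the socket name, paths, and mode here are illustrative, so check the linked file for the authoritative field names):

```yaml
name: socket-activation
version: '1.0'
apps:
  sleep-daemon:
    command: bin/sleep-daemon
    daemon: simple
    sockets:
      unix:
        listen-stream: $SNAP_COMMON/socket
        socket-mode: 0640
```

Note that this is snap.yaml (snapd) syntax; as the rest of the conversation shows, snapcraft did not yet pass the stanza through at the time.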
[20:52] <cmars> trying..
[20:53] <cmars> hmm
[20:54] <jdstrand> niemeyer: hey, so looking at adapter. this seems to be a snapcraft.yaml thing and not a snap.yaml thing. I don't see anything for processing adapter in snapd. shouldn't snapcraft remove adapter when generating snap.yaml?
[20:54] <cmars> snapcraft still doesn't accept it, https://paste.ubuntu.com/p/8FKdbnHmTt/
[20:54] <cmars> i guess i'll open a bug?
[20:55] <niemeyer> jdstrand: Waaait
[20:55] <niemeyer> jdstrand: Yes!
[20:55] <niemeyer> jdstrand: Good catch.. it makes no sense to have this in snap.yaml
[20:55] <jdstrand> niemeyer: ok, I'll file a snapcraft bug then and follow up in the forum
[20:56] <niemeyer> jdstrand: Thanks!
[20:56] <niemeyer> cmars: :/
[20:57] <niemeyer> cmars: It's been there for a few months.. let me find the topic..
[20:57] <niemeyer> cmars: https://forum.snapcraft.io/t/socket-activation-support/2050
[21:09] <niemeyer> cmars: Meanwhile, after you get the snap fully cooked, you can work around the problem by changing the meta/snap.yaml in prime/ and then packing the snap
[21:13] <cmars> niemeyer: interesting, ok, thanks
[21:31] <phoenix_firebrd> how to know what version of intel vaapi driver is used in ubuntu snap core in stable channel?
[21:33] <popey> phoenix_firebrd: I dont believe we ship vaapi drivers in core
[21:35] <phoenix_firebrd> popey: so vlc snap uses driver from the host?
[21:36] <popey> yes
[21:36] <phoenix_firebrd> popey: in general how can you find the versions of software used in a core snap?
[21:36] <popey> it uses whatever the right driver is for the gpu
[21:36] <popey> that's a question for someone who works on the core snap I think
[21:37] <popey> (not me)
[21:37] <popey> (I imagine ogra_ knows) :D
[21:37] <popey> the yaml is probably somewhere abouts
[21:37] <phoenix_firebrd> popey: I patched my vaapi driver, the normal install of software reflects that, but it has no effect on the snap version of vlc
[21:38] <ogra_> phoenix_firebrd, http://people.canonical.com/~ogra/core-builds/ has links to manifest files for all auto-built core snaps
[21:38] <phoenix_firebrd> ogra_: thanks, I will take a look at that
[21:39] <ogra_> and no, i dont think vaapi from the host is used, the vlc snap definitely uses something it ships itself
[21:40] <ogra_> (my laptop has no working vaapi setup yet, i can configure vlc from the snap with the vaapi driver and get accelerated playback)
[21:42] <ogra_> libva info: VA-API version 0.39.0
[21:42] <ogra_> libva info: va_getDriverName() returns 0
[21:42] <ogra_> libva info: Trying to open /snap/vlc/158/usr/lib/x86_64-linux-gnu/dri/i965_drv_video.so
[21:42] <ogra_> libva info: Found init function __vaDriverInit_0_39
[21:42] <ogra_> libva info: va_openDriver() returns 0
[21:42] <ogra_> [00007f3ced0781c0] avcodec decoder: Using Intel i965 driver for Intel(R) Ivybridge Mobile - 1.7.0 for hardware decoding
[21:43] <ogra_> this pretty clearly points to the in-snap driver ...
[21:43] <phoenix_firebrd> ogra_: vainfo from where?
[21:43] <ogra_> thats simply from starting vlc in a terminal
[21:44] <popey> oh, it depends :)
[21:44] <phoenix_firebrd> ogra_: ok. can you show me a link to the manifest of this package ubuntu-core-libs
[21:44] <ogra_> yeah, probably different on non-intel
[21:44] <popey> yeah
[21:44] <ogra_> phoenix_firebrd, no, i have no idea where a manifest for the vlc snap would live
[21:45] <ogra_> phoenix_firebrd, the vaapi lib is included in the vlc snap ... not anywhere in core
[21:45] <popey> for intel, yeah, and it builds using the 16.04 archive, so whatever is in 16.04
[21:45] <ogra_> cant be
[21:45] <ogra_> the vaapi in the archive is broken
[21:46] <phoenix_firebrd> ogra_: thats right
[21:46] <ogra_> i assume the vlc snap builds some upstream version
[21:46] <popey> (I made that bit of the vlc snap)
[21:47] <phoenix_firebrd> ogra_: who maintains the vlc snap, the videolan team or a ubuntu dev?
[21:47] <popey> vlc
[21:47] <popey> https://github.com/videolan/vlc/blob/master/extras/package/snap/snapcraft.yaml
[21:47] <ogra_> $ snap info vlc|grep publisher
[21:47] <ogra_> publisher: videolan
[21:47] <phoenix_firebrd> ogra_: it cant be an upstream version, the version as you saw is pretty old
[21:47] <popey> note the libva stage packages
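The part popey points at looks roughly like the following (illustrative fragment, not copied from the real file — the part and package names are assumptions, so see the linked snapcraft.yaml for the actual list):

```yaml
parts:
  vlc:
    # These come from the 16.04 archive at build time, which is why the
    # bundled driver lags behind upstream releases.
    stage-packages:
      - libva1
      - i965-va-driver
```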
[21:47] <phoenix_firebrd> popey: the latest is version 2.1.0
[21:48] <popey> the latest version of what?
[21:48] <phoenix_firebrd> ogra_: maybe i should file a bug report to ask them to bump the version
[21:49] <popey> What's up with the one it ships?
[21:49] <phoenix_firebrd> popey: libva
[21:49]  * ogra_ hasnt had any issues with it on ivybridge and kabylake HW
[21:49] <popey> ok, well currently it's using the one from 16.04
[21:50] <phoenix_firebrd> popey: it has bug that when playing a vp9 codec video (4k videos from youtube) shows some corrupted video frames
[21:50] <ogra_> i still dont get why i cant get vaapi to work at all with the archive version while vlc just bundles it and it works ... very weird
[21:50] <popey> phoenix_firebrd: interesting, I'd file a bug upstream in vlc
[21:50] <ogra_> yeah
[21:50] <phoenix_firebrd> popey: nice
[21:50] <popey> thresh: is the maintainer of the vlc snap, maybe they can comment :)
[21:50] <phoenix_firebrd> nice
[21:51] <ogra_> note that bumping such a version is quite some work ... (the snap would have to build a newer upstream version at build time)
[21:51] <phoenix_firebrd> ogra_: popey let me find the upstream bug report and the official patch, so that we can have some idea
[21:53] <ogra_> note that snaps are currently bound to 16.04 versions of the dependencies they use ...
[21:53] <ogra_> regardless where they are executed
[21:54] <ogra_> (this is about to change in 18.04 at some point but currently it means if you want to patch a dependency it gets kind of awkward)
[21:55] <phoenix_firebrd> https://github.com/intel/intel-vaapi-driver/issues/297
[21:55] <phoenix_firebrd> https://github.com/intel/intel-vaapi-driver/commit/9d66570032fb02b1e79a883af7697b035d700a8e
[21:56] <phoenix_firebrd> thats the bug report and the next is the patch
[21:56] <ogra_> fun, a one liner
[21:56] <phoenix_firebrd> ogra_: what about in the edge channel?
[21:56] <phoenix_firebrd> ya
[21:56] <ogra_> that has nothing to do with channels
[21:56] <ogra_> snaps use deb packages as build dependencies and as runtime deps
[21:57] <phoenix_firebrd> ogra_: both stable and edge are 16.04?
[21:57] <ogra_> with the current design these debs have to be 16.04 ones
[21:57] <ogra_> yes
[21:57] <ogra_> all snaps are 16.04 based currently
[21:57] <phoenix_firebrd> I understand
[21:57] <ogra_> you *can* build your deps from source
[21:58] <phoenix_firebrd> you mean for snaps or normal packages?
[21:58] <ogra_> or you can maintain a PPA with backports of newer debs for your dependencies and hack usage of that PPA into your snapcraft.yaml
[21:58] <ogra_> for snaps
[21:58] <ogra_> they usually ship their dependencies ...
[21:58] <phoenix_firebrd> ogra_: I did that, but the point is i dont want this issue to be present in 18.04
[21:59] <ogra_> if you read the snapcraft.yaml popey linked above you see "build-packages" and "stage-packages" ... these are typically 16.04 deb packages
[21:59] <ogra_> yes, i understand
[22:00] <ogra_> but the version in the snap is 16.04 ... unless someone backports the fix into the ubuntu archive or the vlc maintainer jumps through some hoops (build vaapi from source or backport the 18.04 version in a PPA and maintain it there ... (and hack the snapcraft.yaml of vlc to use that PPA))
[22:01] <ogra_> there is no easy solution for this atm ...
[22:01] <phoenix_firebrd> ok
[22:02] <ogra_> the design is changing to allow newer dependencies via "base snaps" in the future (then your whole snap can be built against e.g. 18.04 but still run on 14.04 or 16.04)
[22:02] <ogra_> but thats not done yet
[22:02] <ogra_> so for now its a big amount of extra work for the snap packager to include such a patch
[22:03] <phoenix_firebrd> ok
[22:03] <phoenix_firebrd> what happens in case of a security patch for a dependency?
[22:04] <popey> the developer rebuilds the snap
[22:04] <popey> (or it's automatic)
[22:05] <phoenix_firebrd> if the core snap is rebuilt does the vlc snap for example have to be rebuilt?
[22:05] <ogra_> no
[22:05] <phoenix_firebrd> ok
[22:05] <phoenix_firebrd> vlc snap is like a static executable?
[22:06] <ogra_> not really :)
[22:06] <phoenix_firebrd> ok
[22:06] <ogra_> it is a dynamically linked executable that ships all the libs it is linked against and sets a library path pointing to them
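What ogra_ describes can be sketched as the snap's generated wrapper exporting a library search path into the snap itself (the vlc path below is illustrative, borrowed from the revision visible in the libva log earlier, not taken from a real wrapper script):

```shell
# A snap's launcher points the dynamic linker at the bundled libraries
# before the host's /usr/lib, roughly like this:
SNAP=/snap/vlc/158
LD_LIBRARY_PATH="$SNAP/usr/lib/x86_64-linux-gnu:$SNAP/lib"
export LD_LIBRARY_PATH
echo "$LD_LIBRARY_PATH"
```

Because the linker consults LD_LIBRARY_PATH first, the binary stays dynamically linked yet resolves against the copies shipped inside the snap.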
[22:07] <phoenix_firebrd> ogra_: understood
[22:08] <phoenix_firebrd> ogra_: do you know/guess what version of libva will be shipped during release of 18.04?
[22:08] <ogra_> nope
[22:09] <ogra_> and until there is snap support for 18.04 (beyond the 16.04 base we have today) it will still take some time
[22:09] <ogra_> so for a while snaps will still be 16.04 based ... even in 18.04 ... until the base snap spec is fully implemented
[22:10] <ogra_> (which will likely happen within the 18.10 timeframe)
[22:11] <phoenix_firebrd> ogra_: If a snap is strictly confined, is it denied gpu access? I read that in a forum. Is that true?
[22:11] <ogra_> it needs to define an interface for access
[22:11] <popey> https://docs.snapcraft.io/reference/interfaces
[22:12] <popey> there's an opengl interface
[22:12] <ogra_> right
[22:12] <ogra_> and an x11 one ... and a wayland one
[22:12] <ogra_> (and various others)
[22:13] <phoenix_firebrd> popey: Its like policykit
[22:16] <ogra_> lol
[22:16] <phoenix_firebrd> :)
[22:16] <ogra_> thats quite a simplification :)
[22:18] <phoenix_firebrd> ogra_: Can a theme be installed to be used for snaps?
[22:18] <ogra_> currently only if you ship it along with your app
[22:18] <ogra_> there is work going on to solve this though ...
[22:19] <phoenix_firebrd> ogra_: oh o_O
[22:19] <phoenix_firebrd> ok
[22:26] <phoenix_firebrd> popey: ogra_ thanks for the support
[22:27] <ogra_> np :)
[22:27] <popey> no problem
[23:38] <elopio> snappy-m-o autopkgtest 1943 xenial:armhf xenial:amd64 xenial:arm64
[23:38] <snappy-m-o> elopio: I've just triggered your test.