[03:00] kyrofa, I think you've made the steps in snapcraft.yaml too magic [03:00] to me, prep, build, etc should be stages where you explicitly describe what it should do [03:01] rather than having to do the backwards thing of overriding default behaviors === chihchun_afk is now known as chihchun [05:05] Caelum: do you have more info? [05:07] morning [05:11] Hey [05:11] Maciej, can you please boot tumbleweed [05:11] If you have one [05:12] don't have one installed atm [05:12] Ah, ok [05:12] Ill check [05:12] what's with tw? [05:13] Something broke apparently [05:13] Just woke up, will check soon [05:14] PR snapd#5003 closed: cmd/snap-seccomp: graceful handling of non-multilib host [05:14] zyga: ok, let me know if you need help, i have a spare t60 i could try it on, though i don't remember if there's an hdd inside [05:15] I thought you have VMs on your beefy box :-) [05:15] zyga: cloud vms yes, graphical ones, not so much [05:36] re [05:41] PR snapd#5028 opened: cmd/snap-seccomp: graceful handling of non-multilib host (2.32) [05:44] trivial review ^^ [05:51] * zyga updates his opensuse tunmbleweed snapd to what is in system:snappy repo [05:55] Caelum: I cannot see any issue [05:55] please tell me what you experienced [05:59] mborzecki: can you look at https://github.com/snapcore/snapd/pull/4957 [05:59] PR #4957: cmd/snap-update-ns: remove the need for stash directory in secure bind mount implementation [06:00] 3rd review just to be sure [06:00] ok [06:05] * zyga -> breakfast [06:26] chihchun: tee-hee-hee [06:27] zyga: tee-hee-hee [06:27] I hope this boosts code reviews [06:28] zyga: I guess you pinged the wrong person [06:28] oh [06:29] sorry, yes [06:29] chipaca sometimes has a nickname similar to the one you use [06:29] I missed that :) [06:29] :) [06:31] morning mvo [06:31] zyga: good morning! how are you? [06:32] good morning, I'm not taking a longish walk today :) [06:32] so hopefully I'll make some code [06:40] zyga: :) [06:49] zyga: no reports yet about any issues with 2.32.3 afaict? [06:50] zyga: did you see anything? [06:50] no, nothing about 2.32.3 yet [06:54] mvo: hey, can we land #5028? [06:54] PR #5028: cmd/snap-seccomp: graceful handling of non-multilib host (2.32) [07:07] PR snapd#5028 closed: cmd/snap-seccomp: graceful handling of non-multilib host (2.32) [07:07] mvo: thanks! [07:08] np! 5026 needs a second review, very trivial [07:11] PR snapd#5026 closed: tests: add check for OOM error after each test [07:12] PR snapd#4987 closed: tests: add test to ensure `snap refresh --amend` works with different channels [07:12] mborzecki: 5024 looks very good, just some testing tweaks and it is golden === pstolowski|afk is now known as pstolowski [07:12] mornings [07:13] morning [07:13] hey [07:13] mvo: yes, thanks for you suggestions, i'll be pushing a patch soon [07:13] pstolowski: hello [07:13] and a second look at 5020 would be great, sorry for being pushy, want to get the SRU out this morning :) [07:13] hey pstolowski [07:17] good morning [07:17] hey ho [07:17] o/ zyga [07:19] PR snapd#4845 closed: snap, store: store version numbers in the commands database [07:20] mborzecki: curious why is 4942 marked as blocked? it looks like its ready to go in, no? modulo the small ugliness in the journal but thats ok IMO for now [07:21] mvo: heh, forgot about the label :) let me fix that [07:21] mvo: squash merge right? [07:24] mborzecki: yes please [07:27] PR snapd#4942 closed: cmd/snap: user session application autostart v3 [07:34] PR snapd#5029 opened: cmd/snap: user session application autostart v3 (2.32) [07:34] PR snapd#5030 opened: packaging/amzn2: initial packaging of 2.32.3 for Amazon Linux 2 [07:35] need more coffee [07:35] PR snapd#5020 closed: errtracker: check for whoopsie.service instead of reading /etc/whoopsie [07:37] mborzecki: question about 5030, why are we using /var/lib/snapd/snap there? [07:38] zyga: it's very rhel like and also lists `ID_LIKE="centos rhel fedora"` in /etc/os-release [07:39] so just because? [07:39] zyga: yeah, we already have the right switches to cover rhel/fedora [07:40] sure but we can add a new distro to the list [07:40] and just use /snap [07:41] PR snapd#5031 opened: errtracker: check for whoopsie.service instead of reading /etc/whoopsie (2.32) [07:43] zyga: no strong opinions here, this just saves vetting DistroLike switches and there's one rpmlint warning less [07:43] I'd go for /snap [07:49] mborzecki: added three comments [07:49] zyga: thanks [08:01] willcooke: hey, there's an issue that skype crashes wayland on startup [08:01] is that something on anyone's radar on your team? [08:02] zyga, oh, no, not something I was aware of [08:03] zyga: why do you torture yourself with wayland :-) [08:03] willcooke: just login into wayland ssh in and look at the journal [08:03] run Skype and boom [08:03] Chipaca: x11 corrupts screen on my thinkpad [08:03] Chipaca: not sure why, wayland works better for that [08:03] (apart from taking down gnome-session when things go) [08:03] zyga: intel board? [08:03] popey: ping [08:03] Chipaca: my thinkpad [08:04] om26er: good morning [08:04] good day popey :) [08:04] om26er: I spoke to the store team, turns out we cannot rename snaps, so we should publish s-t and remove s-t-3 [08:04] zyga: btw is the fix for #1760841 in a core already? [08:04] Bug #1760841: snapd does not parse /etc/fstab properly when using mhddfs [08:04] Chipaca: yes, it's in .3 [08:05] aha [08:05] popey: so lets merge that one then ? [08:05] well actually my current "ping" was for https://github.com/snapcrafters/android-studio/pull/19 :) [08:05] PR snapcrafters/android-studio#19: Fix app startup based on latest snapcraft changes [08:08] popey: android studio release that we pushed yesterday won't start so this fixes it [08:09] speaking of which, I think if we(I) enable the CI for that snap, we won't have that kind of issue again. [08:10] "build and try to run" [08:17] Chipaca: some comments on 5027 [08:17] and sorry for the branch summary change ;) [08:18] lol :D [08:18] 10 best patches you read this week [08:22] zyga, confirmed. LP: #1762954 [08:22] Bug #1762954: Running the skype snap under the Wayland session crashes the whole session [08:22] thank you! [08:22] np, thanks for the ping [08:23] probably a good chunk of our users will just use wayland for some reason and will bump into this [08:24] thanks for the review on 4957 [08:24] I'll get right to it [08:31] willcooke: zyga: FWIW I think there's already a bug for that issue [08:31] or is it an issue for that bug [08:32] ah, maybe i'm thinking of https://forum.snapcraft.io/t/skype-crashes-gnome-on-ubuntu-18-04/4927 [08:33] PR snapd#5031 closed: errtracker: check for whoopsie.service instead of reading /etc/whoopsie (2.32) [08:35] or https://forum.snapcraft.io/t/vs-code-makes-shell-crash/4362 [08:36] willcooke: zyga: from that last one, https://bugs.launchpad.net/snappy/+bug/1760252 [08:36] Bug #1760252: starting slack crashes xwayland on 18.04 [08:36] so generally classic snaps then [08:36] on wayland [08:36] i mean, it's wayland (or xwayland?) that crashes [08:36] maybe bundled lib input? [08:37] yeah, was just looking at that, seems like it could be xwayland [08:37] * willcooke uploading stack traces [08:37] it was xwayland from what I've seen [08:38] call me ornery but imo if a client can crash wayland, there might be a bug in the client but there is a bug in wayland [08:38] one or two [08:38] :-) [08:38] zyga: sorry, looks like the problem disappeared [08:39] no worries, what was the problem? [08:39] snaps weren't starting from gnome app menu [08:42] k, hopefully LP will do some retracing and I will tidy up all the various bugs once that's happened. [08:42] Off to the office now, will be back later on [08:42] sure, thank you willcooke [08:42] willcooke, you are on a wayland session? [08:43] seb128, I was to test that [08:43] oh, k [08:43] thx [08:52] * Chipaca is off to the dentist's. ttfn! [09:09] om26er: ok. === chihchun is now known as chihchun_afk === chihchun_afk is now known as chihchun [09:46] PR snapd#4957 closed: cmd/snap-update-ns: remove the need for stash directory in secure bind mount implementation [09:46] PR snapd#5032 opened: repo: pass and return ConnRef via pointers [09:47] pstolowski: why by pointer? [09:48] zyga: to save on passing, that was pointed out by pedronis in interface hooks review [09:54] mvo: is .4 ready? [09:55] zyga: .3.1 [09:56] zyga: yes it just got pushed to xenial-proposed and once my test builds finished also to the other releases [09:56] 3.1? is that a deb only or a real upstream release? [09:56] zyga: its pretty small and *finally* use git-buildpackage which makes me incredible happy [09:56] nice, thank you! [09:56] zyga: its deb only but we could still put it into the beta channel for consistency [09:57] is that only the OOM thing? [09:57] zyga: either way .4 needs to wait a little bit until we are sure we don't need to do anything for 2.32.3, i.e. no regressions [09:57] zyga: a little bit more, one sec [09:57] should it be on releases page on GitHub? [09:58] zyga: https://github.com/snapcore/snapd/blob/2.32.3.1/packaging/ubuntu-16.04/changelog [09:58] mvo: let me know when you push the tag [09:58] zyga: probably, I'm a bit hesitant because .4 will happen soon and I don't want to put too much burden on the downstream packagers but otoh I think its more correct to have a release on GH [09:58] zyga: it is pushed [09:58] I can drop it on the release page [09:59] thanks! [09:59] zyga: also the release script can now be simplified because the orig.tar.xz is sane now (snapd-2.32.3.1) [09:59] whee, good [10:00] so orig.tar.gz doesn't need renaming internally [10:02] zyga: yeah, I upload the file to the ppa in a sec [10:06] mvo: we should discuss at the standup when to cut .4 [10:07] pedronis: +1 [10:07] pedronis: my strawman would be tomorrow morning [10:07] pedronis: but happy to discuss [10:16] 14.04 journalctl, no --show-cursor, no --after-cursor, no --identifier, damn [10:19] yeah [10:19] lack of cursor sucks [10:19] does anyone mind not running user session app autostart spread test on 14.04? [10:22] it's fine, 14.04 was supposed to be server-only [10:28] mvo: want me to look into #5029? [10:28] PR #5029: cmd/snap: user session application autostart v3 (2.32) [10:39] mborzecki: want to look at https://github.com/snapcore/snapd/pull/5033 [10:39] PR #5033: cmd: generalize locking to global, snap and per-user locks [10:39] it's very straightforward [10:39] PR snapd#5033 opened: cmd: generalize locking to global, snap and per-user locks [11:07] PR snapd#5034 opened: userd: set up journal logging streams for autostarted apps [11:14] mborzecki: hey, how do we go vendor stuff in fedora? it's seems to be the only one unhappy about my new dependencies (go-udev..)? [11:15] pstolowski: which pr? [11:15] pstolowski: we package [11:15] mborzecki: https://github.com/snapcore/snapd/pull/4940 [11:15] PR #4940: RFC: added UDevMonitor for future hotplug support [11:16] zyga: package as separate rpm for every go package we need? [11:16] pstolowski: package according to the way we need to in fedora, yes [11:16] pstolowski: btw. do have a solution for go-dev already? other deps from from rps [11:17] and iirc go-udev was not packaged for fedora yet [11:17] mborzecki yes, that's very probable, it's a small and not very popular project i think [11:17] gofed is pretty nice [11:18] zyga. mborzecki ok, if that's the case then I think I need a general +1 on the PR first before investing time into packaging (it's ~3 go packages as go-udev pulls others) [11:18] anyways i think the tests were using vendored deps on fedora [11:19] pstolowski: do you know what the transitive deps are? govendor list -v should show it [11:28] mborzecki, they'd better not be [11:28] I delete the vendor directory in prep [11:29] I've already had that happen once and found out that we were testing Fedora wrong [11:29] Son_Goku: yeah, saw that, i'm thinking of something to just unblock spread in that pr [11:29] or maybe we should just package go-udev for fedora and submit it for review [11:29] if you package the godeps, I'll review and merge them into the archive [11:30] as zyga knows, I can be very fast with go package reviews [11:30] yes [11:30] and go deps are easy to package because the gofed tool does nearly all the work [11:31] i can probably take a look, unless pstolowski you want to give it a 'go' :) [11:32] mborzecki: happy to do it, although as I said perhaps it makes sense to get general +1 on using go-udev first, because it hasn't been decided yet [11:33] dependencies aren't free :) [11:33] pstolowski: hm thought we were already good with using it [11:35] PR snapcraft#2058 closed: python plugin: install python-distutils when run on bionic [11:35] mborzecki: I'm not sure, Gustavo hasn't yet commented on the branch (and on the HLD) [11:36] Son_Goku: thanks for the hints! [11:37] gofed is in .... python? [11:37] yes [11:37] most of Fedora tooling is in Python [11:37] most of openSUSE tooling is in ruby :P [11:38] and most of Debian tooling is in Perl [11:47] PR snapcraft#2062 closed: packaging: simplify snapcraft.yaml [11:57] Son_Goku: gofed generates a busted spec, https://paste.debian.net/1019710/ there's no newline after %changelog [11:58] I know :/ [11:58] the changelog entry is also wrong [11:59] other than that the rpm gets built :) [12:01] off to pick up the kids, coming back for standup [12:02] cachio: mvo: I'm looking again at tests again staging, seems we need to copy canonical-livepatch to staging, and test-snapd-only-in-edge [12:03] we probably want to copy core from edge to edge as well [12:07] pedronis, sure,I'll do it [12:08] cachio: once we have solved the cred problems with spread-cron, we should revisit the various store tests branches there and add back staging I think [12:08] pedronis, ok, we could have a nightly exec [12:08] I think we could trigger on store deploys [12:09] but we'll see [12:09] probably next week at this point [12:09] ok [12:25] whoo, dentist anesthetic waring off [12:25] * Chipaca is not having fun [12:25] wearing off* [12:41] pedronis, my internet is so slow today [12:41] it is taking long time to upload the core === sergiusens_ is now known as sergiusens [12:59] pedronis, snaps ready [13:03] pedronis: standup? [13:13] something new: 2018-04-11 11:08:43 Cannot allocate linode:fedora-27-64: cannot allocate new Linode server for fedora-27-64: no open slots for this plan!t [13:25] pstolowski: sorry, i've just restarted the travis build in 4940 [13:32] mborzecki: np, it's going to fail unless you pushed something to the branch? [13:33] pstolowski: no, i did not, it's going to fail ;) [13:33] ok ;) [14:11] mvo: should I squash-merge #5002 given that it has many commits? otoh it means that the next two might need to be changed because of conflicts with the squash [14:11] PR #5002: many: use the new install/refresh /v2/snaps/refresh store API (2.32) [14:11] pedronis: no squash please [14:11] ok [14:11] pedronis: this will be a super messy merge back otherwise [14:12] pedronis: thanks [14:13] PR snapd#5002 closed: many: use the new install/refresh /v2/snaps/refresh store API (2.32) [14:13] mvo: merging [14:14] PR snapd#5022 closed: overlord/snapstate: on multi-snap refresh make sure bases and core are finished before dependent snaps (2.32) [14:16] mvo: my 2.32 stuff is merged [14:16] PR snapd#5021 closed: overlord/snapstate: introduce envvars to control the channels for bases and prereqs (2.32) [14:18] pedronis: thank you [14:19] mborzecki: 5029 needs a review - just double checking that its the right commit etc [14:20] PR snapd#4840 closed: many: add `core.problem-reports.disabled` option [14:26] * cachio afk [14:29] PR snapd#5029 closed: cmd/snap: user session application autostart v3 (2.32) [14:39] Wimpress and kenvandine does this (https://forum.snapcraft.io/t/snap-application-and-snap-themes/4946) not somewhat align with what was discussed around a month ago? That theme does a bit more, I'll let you comment on that though :-) [14:46] sergiusens, yes, i replied with a link to jamesh's post [14:49] PR snapd#5035 opened: release: snapd 2.32.4 [15:01] Chipaca: hey, could you look at https://github.com/snapcore/snapd/pull/5033 [15:01] PR #5033: cmd: generalize locking to global, snap and per-user locks [15:01] nothing major [15:03] kenvandine: your the man! [15:03] 're [15:12] who fancies digging into snapcraft? https://forum.snapcraft.io/t/error-while-building-argument-list-too-long/4948 [15:14] diddledan, yuck [15:14] I aim to please :-p [15:14] diddledan, can you try on edge? [15:14] I did [15:14] the second paste [15:14] diddledan, scriptlets give better errors there [15:14] Oh [15:14] * kyrofa scrolls down further [15:15] Dang, still not helpful [15:16] diddledan, but yeah, I'll own this one [15:16] yey [15:16] did you look at the yaml yet? ;-) [15:16] a mere snip at 1350 lines [15:16] diddledan, no, you're scaring me [15:16] * kyrofa cries [15:17] haha [15:17] * diddledan cuddles kyrofa [15:17] diddledan: hey, at least three of those lines are comments now [15:18] :-p [15:25] diddledan, ignoring the actual cause, the fact that this is so opaque is problematic. In an ideal world, how would this be presented to you? [15:25] well to begin with I think actually telling me what command was being executed would help me to understand what it's doing when it fails === chihchun is now known as chihchun_afk [15:26] at least then I'd be able to determine whether there is a workaround while the problem gets fixed [15:28] I guess it would also help me discern whether it's my fault with wonky build configuration or an endemic problem with snapcraft itself [15:29] zyga: is #5033 part of not landing new stuff? [15:29] PR #5033: cmd: generalize locking to global, snap and per-user locks [15:29] Chipaca: it was proposed in the morning [15:33] pstolowski: Do you want to have a call in a few mins to discuss it? [15:33] haha, but that was posted in the morning [15:33] :) [15:33] no more newies [15:34] niemeyer: what is that? [15:35] We tried to have a call on Monday but it didn't work.. wondering if you'd like to discuss now.. there's not hurry since per the meeting we should be stabilizing until next week [15:37] mvo: I completely missed that 2.32.4 is now in beta [15:38] pedronis: .4 has new refresh API? [15:39] noise][: yes [15:40] niemeyer: ah, about udev? if so then yes, i'd be happy to do it in a moment [15:45] pedronis: it happend ~15sec ago [15:45] pedronis: ok, maybe a bit longer but armhf literally finished in the last 5min [15:45] I just saw it in info for amd64 [15:46] pedronis: yeah, that finished ~ 15min ago or so [15:49] jdstrand: bad news, the issue https://forum.snapcraft.io/t/oom-for-interfaces-many-on-bionic-i386/4101 is back it seems https://objectstorage.prodstack4-5.canonical.com/v1/AUTH_77e2ada1e7a84929a74ba3b87153c0ac/autopkgtest-bionic/bionic/i386/s/snapd/20180411_152411_beda4@/log.gz is the most recent autopkgtest, we added a check for lowmem and it seems like its running out of low kernel mem during the tests [15:49] :-( [15:50] jdstrand: for refrence this is from http://people.canonical.com/~ubuntu-archive/proposed-migration/update_excuses.html#snapd [15:50] low memory is that special area that is below 4G (not-PAE) or is that the special memory limit so that PCI devices can map it with their limited address space? [15:51] jdstrand, jjohansen: do we have any news on the kernel memory leak when apparmor profiles are unloaded/reloaded [15:51] mvo: I have alternative solution that would make that test pass [15:51] mvo: but it's new code and definitely risky at this stage [15:51] mvo: I talked about this to jdstrand without association to the leak issue that I was unaware of at the time [15:52] mvo: I need to double check by looking at the test first [15:52] but the general idea is that we would defer apparmor parser invocation until end of transaction [15:52] Bug #1763071 opened: Error message installing a paid snap as unauthenticated user [15:53] mvo: now we run it way more often than needed, in some cases [15:54] cachio: all well with the sru validation so far? [15:55] mvo: actually, I think it would not help with this issue but let me read this more carefully :/ [15:57] mvo: no, it would not help here [15:58] Son_Goku: 2.32.4 is out [15:58] :-) [15:58] Son_Goku: how is that server WG proposal coming along [15:59] can I help in writing down the plan somehwere? [16:00] I will have something drafted in a few minutes [16:00] gotta grab lunch :) [16:00] zyga: thanks, well, we need to figure out what is going on. it also relatively new [16:01] aww, my pohne just died [16:01] zyga: lets see if we also see it in artful [16:01] Son_Goku: awesome, I'll say in touch [16:01] zyga: that can't be - its an iphone ;) [16:01] I forgot to charge it yesterday [16:01] mvo: but I take the joke [16:02] mvo: still, way happier with it than with my android phones [16:02] zyga: I know, just teasing [16:03] mvo: the current software shows battery health now and it claims my battery is at 83% [16:03] not so low that I'd be tempted to replace it yet [16:04] * zyga -> grocieres [16:04] mvo: and btw, thinkpad modems are worth every penny :) [16:04] I love working outside [16:04] * mvo hugs zyga [16:36] mvo: i have had a user ping me that they have a serious issue when installing a snap and had a full lock up and reboot [16:36] wondered if i get them in here you or someone else on the team might be able to help if you're around [16:37] the machine may have state info on it that could help debugging perhaps? [16:37] PR snapd#4832 closed: tests: move fedora 27 to google backend [16:38] popey: do we know with what version and what snap? [16:38] a full lock and reboot is quite a lot [16:38] I mean snapd version [16:38] pedronis: a snap he made, he installed devmode [16:38] not yet, he's booting a live cd now [16:38] will hopefully drop by here soon [16:38] out of diskspace ? === jkridner|pd is now known as jkridner_ === jkridner_ is now known as jkridner__ [16:41] we'll see. === jkridner__ is now known as jkridner [16:45] * zyga is back from shopping [16:45] popey: I could help [16:46] popey: ping me if you have anything I can jump on [16:46] thanks. he's having trouble getting online [16:49] * zyga hopes vmware workstation will work on the next wave of LTS distros [16:50] I cannot use the zoo of libvirt-based things as they all are more or less broken in more or less obvious ways :/ [16:52] Son_Goku: what's the process of proposing go-udev package in fc once i've it ready (currently spinning fc27 vm up)? do you have a wiki describing it? [16:52] yes [16:53] Howdy, popey led me here: I ran into the following issue with snap: I installed a package I created (daemon snap, devmode) but unfortunately it led to completely locking up/freezing the whole system. After some time I did a hardware shutdown and tried to reboot. The system starts until displaying the cursor but then completely locks up/freezes again (tried several times now). Is there anything I can do to try debug this issue before tr [16:54] zyga: ^ [17:05] pstolowski: https://fedoraproject.org/wiki/Package_Review_Process [17:05] Son_Goku: thanks [17:06] pstolowski, zyga and kyrofa can walk you through it too, as they've both done it [17:06] pstolowski, yeah, docs are pretty good, let me know if you want any help [17:10] mvo (cc zyga and jjohansen): we never did anything about that leak. the conclusion was that while there was a small apparmor leak, it was just that. I recorded that in the forum. jjohansen was going to look at the small leak [17:11] mvo (cc jjohansen): zyga's idea of batching the loads makes a lot of sense to me-- I think it would might help with timeouts in the spread tests (separate issue) [17:11] s/would might/might/ [17:11] thanks kyrofa! I think i'll attack this tomorrow morning; do you have a link to any of your specs? i wonder how much cleanup is expected from the gofed-generated spec [17:11] pstolowski, totally, although I'm not sure I'd suggest mine as model specs, haha. Let me get them for you [17:12] kyrofa: if it passed the review, it's model ;) [17:12] :D [17:12] https://src.fedoraproject.org/user/zyga [17:12] you can see zyga's packages [17:12] and kyrofa's: https://src.fedoraproject.org/user/kyrofa [17:13] Yep, there you go. With git repos to everything [17:13] awesome, thanks! [17:14] and then mine... https://src.fedoraproject.org/user/ngompa [17:14] Don't look at Son_Goku's, they'll make you feel inferior [17:15] :D [17:15] lol [17:16] so we found another instance where there reading of snaps ignoring errors is hiding other problems [17:21] i believe zyga is away, pedronis are you able to help suebt ? [17:26] popey: probably needs to boot into emergency mode and look around at logs , it's a bit hard to understand what from snapd would take over the system so much [17:27] Hey pedronis. I'm currently logged into a live session on the machine. Can I access logs from there? [17:29] suebt, that sounds a bit like your daemon simply goes mad [17:29] do you have the surce for that snap public ? [17:30] Yeah, I assume so. Could be constantly rebooting or something. Wondering whether snapd doesn't stop it at some point from doing that, though? [17:30] *source [17:30] zyga: you said there was some other gnome software that needs packaging, can I help with this? [17:31] Nope, it's neither public nor finished yet, unfortunately, was just the first try of snapping it up ._. [17:31] there are some self checks for snaps, but if the daemon first starts fine and then ... say ... fills up all RAM, snapd wont be able to do anything about that [17:31] s/rebooting/restarting/ [17:31] ogra_: yeah, right ... [17:31] suebt: once is installed snapd mostly let the management be done by systemd [17:31] for services [17:31] yeah [17:31] Okay, is there any log I could check to find out? [17:31] you should be able to just remove the systemd units from disk [17:32] from your live session [17:32] then you can reboot into your normal system [17:32] /var/log/syslog and then you would need to point journalctl from the live session to the logs on your main disk [17:33] which should be possible, but I never tried myself it seems (I see there are relevant options though) [17:33] well, syslog from disk should be enough [17:33] at least if the daemon logs anything [17:34] if it just goes crazy in I/O or fills your ram (just enough to not hit OOM) you might not see anything at all [17:34] otherwise it seems journalctl --root=your/main/disk/root .. could help [17:34] Okay, thanks, I'm not seeing anything helpful, just a bunch of 00\00\00\00\00\00 at the end [17:36] Okay, well, when there are no additional snap-specific logs that could be helpful, I'll try to restore my system as you proposed ogra_. Thanks! [17:37] suebt: btw it seems you can also disable your daemon using systemctl --root=... disable snapd.snap-service... [17:37] snaps usually just log to logger ... which writes to syslog and journald ... [17:37] oh, wow, i didnt know that one [17:37] oh ok, thanks ogra_ pedronis [17:38] ogra_: yea, haven't quite tried but apparently both journactl and systemctl can take an alternative root dir [17:38] very cool [17:42] pedronis: systemctl --root worked just fine, so that was easier than I thought it would be. Thanks! Okay, will report in case I find anything interesting which is not related to my application going wild for some reason .-. [17:44] make a wrapper script around your daemon that reports ram and disk usage or some such ... and prehaps add a timeout so you dont kill the system again [17:46] re [17:46] I'm home [17:46] zyga: hey, you said earlier we need to package some other things for OS, can I help with any of that? [17:47] yes, there is a glib wrapper for talking to snapd that is a dependency of gnome-software [17:47] but I think we should see what it would take to submit the package to factory [17:48] as the extra package I'm talking about won't work without snapd [17:48] ah ok [17:48] well let me know if you need help with anything [17:51] zyga: seems there's a real bug where during install we think a snap is mounted but is not, somebody hit it with beta, but it's not a regression (it fails strangely because of the ignore error code we have in readInfo) [17:53] it's rare though we have a report now and one from long ago , Chipaca might know more [17:56] pedronis: interesting, so we think it is mounted and then go an and try to use it [17:56] pedronis: I added some code that prevents snaps with snapInfo.Broken from entering the repository [17:56] pedronis: but it is a new thing that landed in 2.32.3 first [17:56] Caelum: I think we should send a mail out to the packaging mailing list [17:56] Caelum: and propose the current incarnation of snapd [17:56] Caelum: and see what the response is [17:56] zyga: but it fails much earlier in this case, we really need to revist readInfo [17:57] and really decide when returning broken vs error makes sense [17:57] Caelum: I bet we will get some quick things like "change that, fix this" etc [17:57] pedronis: I agree [17:57] zyga: we did to support listing and removals [17:57] pedronis: but I also think this is a secondary effect, we should understand what is wrong with mounting [17:57] pedronis: yes, I remember [17:57] but right now it makes other places fail in very strange ways [17:57] pedronis: for interfaces we should probably fail [17:57] because they were never changed to look at broken [17:58] nor tbh it makes a lot of sense for them [17:58] pedronis: though now we kind of do (but again, since just a moment ago) [17:58] they would much better get an error [17:58] zyga: yea, but it is earlier, it seems doMountSnap itself can explode (strangely because of the ignored error) on a not really mounted snap [17:59] jdstrand: re oom> thanks for your update. so you say that apparmor leak is too small to trigger this oom situation and something else is mostly likely eating the lowmem? [17:59] Caelum: so I think we should definitely just do that [17:59] zyga: it means do that our mount code has problems , we trust something that isn't correct (or yet something else is going on but unclear what) [17:59] Caelum: let's propose what we have [18:00] Caelum: and see what the feedback is [18:00] Caelum: can you do that? [18:00] s/do/though/ [18:00] pedronis: I would like to know if it happens inside a container or inside a non-container [18:00] pedronis: perhaps FUSE is the thing that makes it more "broken" [18:00] I can ask about the last case [18:01] pedronis: who was the last case? what do we know about it? [18:01] somebody on the store team [18:02] I'm asking him (but he was having lunch) [18:03] zyga: not a container apparently [18:03] that's good information, so it probably is a generic issue [18:03] xenial [18:03] I'll get some tea, keep talking about what we know [18:04] mvo: that was the conclusion before: https://forum.snapcraft.io/t/oom-for-interfaces-many-on-bionic-i386/4101/7 [18:04] back [18:05] * zyga checks what systemd does for mount units in xenial [18:05] mvo: nothing was changed in apparmor wrt this. the fact that it went away and came back coupled with ^ suggests it is something else. I can't comment on the progress of the work to make the 30M smaller or the 2M leak go away [18:06] zyga: he also said he remove the snap and reinstalled, might also be some interaction between lazy detaching and remounting [18:08] mvo: somebody hit a old bug with the beta, seems we really have some corner cases in which SetupSnap returns without errors but the snap is not really mounted (yet) [18:09] pedronis, mvo: let's add some simple code that after the mount call, checks [18:09] we can wait, raise red flags, etc [18:09] and then run it in a very long loop [18:09] maybe something will come up [18:09] opinions? [18:11] zyga: sure [18:12] jdstrand: well, we have more tests I think it never went away we just didn't trigger it because we ran less tests. but fair enough, I start looking tomorrow again, its pretty important as it blocks us from entering bionic right now [18:13] pedronis: oh, interessting [18:13] mvo: is it easy to reproduce? [18:13] mvo, do you know who is the owner of the dragonboard-kernel snap [18:13] dragonboard-kernel_45.snap is getting stuck starting [18:13] zyga: I think so, I think its a matter of installing removing a snap [18:13] cachio: I think the kernel team owns all kernel snaps [18:14] zyga, do you know who? [18:14] zyga: http://paste.ubuntu.com/p/26xzSJ8NHJ/ [18:15] mvo: on which kernel? any? [18:15] specifically i386 bionic? [18:15] zyga: bionic/i386 [18:15] ok, I'll try to reproduce it now [18:16] jdstrand: hi there, we are having an isolated issue and might find it interesting if you could provide insight (snapcraft snap inside lxd). Here's a paste I have https://paste.ubuntu.com/p/mkhz73q7K7/ where on first run everything works but on a second one (where we do not install core nor snapcraft again) it fails with "cannot change profile for the next exec call: No such file or directory" [18:16] zyga: also artful :/ [18:16] zyga: https://objectstorage.prodstack4-5.canonical.com/v1/AUTH_77e2ada1e7a84929a74ba3b87153c0ac/autopkgtest-artful/artful/i386/s/snapd/20180411_144134_835a7@/log.gz [18:16] sergiusens: what prints that line [18:16] cannot change profile for the next exec call: No such file or directory [18:16] is that snap-confine? [18:16] zyga: running snapcraft [18:17] zyga: yeah, most likely [18:17] sergiusens: right, but as a part of that you are runnig some snaps, right? [18:17] can you add export SNAP_CONFINE_DEBUG=yes please [18:17] snapcraft is a snap [18:17] and reproduce [18:17] may be some hint [18:17] what this tells us is that we're trying to switch to some profile [18:17] but that profile doesn't exist [18:17] zyga: yeah, not me, popey and it seems he can consistently reproduce. This also only happens for him on on lxd 3.0.0 [18:17] you can also get one more thing [18:17] when it fials [18:17] niemeyer: hm, did spread change and is no longer compatible with go1.6 or something? I see in the xenial autopkgtest output: + go get -u github.com/snapcore/spread/cmd/spread [18:17] package context: unrecognized import path "context" (import path does not begin with hostname) [18:18] zyga: yeah, interesting that it works on the first pass and fails on the second, so it seems something is lost on container stop/start [18:18] when it fails run "sudo cat /sys/kernel/security/apparmor/profiles" [18:18] sergiusens: yeah, likely so [18:18] mvo: yes, spread changed and it is not compatible with go 1.6 anymore [18:18] mvo: we discussed this last week [18:18] zyga: the `cat` inside the container or outside or is it irrelevant? [18:19] zyga: right, the next question is - what can we do about it :) [18:19] zyga: also, snapcraft is classic, so there should be no profile to switch to, right? [18:19] mvo: with the way that interfaces are connected (run apparmor_parser after each snap connect/disconnect, even if a particular command has multiple interfaces rather than only running it once) and sooo many tests, I wonder if the leak, even though it is small, might compound and therefore aggravate the situation. I don't really know how the leak happens though, so that might be a question for jjohansen when he is around again. he seems timme shifted, [18:19] mvo: we discussed this at length, one idea is to get the pre switch spread in a branch and build it for go 1.6 [18:19] niemeyer: anything we can do to make spread 1.6 compatible again? right now this blocks the autopkgtests in xenial. or should we use a fork for spread on xenial? [18:20] zyga: right, did we discuss with gustavo? [18:20] zyga: yet? [18:21] mvo: no, not yet [18:21] mvo: both you and gustavo were away at that time [18:22] mvo: you could probably enable backports and pin in your adt test to get the latest go [18:22] jdstrand: ok, thank you. its late in my TZ but i can try to gather data tomorrow [18:22] zyga: aha, indeed [18:22] or install the go snap, prior to enabling squashfuse to get the arches running in containers covered [18:23] sergiusens: that is a good idea [18:23] sergiusens: we don't support testing in containers currently so that is ok [18:23] unless spread is part of the packaging, then I have no ideas [18:23] sergiusens: also 2.32 fixes the squashfuse issue :) [18:23] sergiusens: actually, I don't think that is true [18:23] sergiusens: classic has profiles, just very open [18:23] * zyga checks [18:23] sergiusens: yes, classic has profiles and is confined [18:23] mvo: but armhf runs in a container, or do you get special hw for adt? [18:24] mvo: did you see the bit about jjohansen time-shifted? looking back, it seems my comment may have been too long [18:24] zyga: ok, just a lean and mean one :-) [18:24] sergiusens: we skip if we detect containers, we don't run tests there right now unfortunately [18:24] jdstrand: do you know which timezone jj currently inhabits/ [18:24] jdstrand: heh, I did see that, thank you :) [18:25] mvo: ok, I was looking at installing squashfuse in debian/test/control to figure out if we could get that going [18:25] mvo: did you see the bit about jjohansen time-shifted? looking back, it seems my comment may have been too longwhen he is around again. he seems timme shifted, so you might ask him when you come online tomorrow" [18:25] I'll report back, as it might interest you (we currently skip build-snaps tests on armhf and such) [18:25] man [18:25] mvo: ok, you saw it, I'll stop trying to paste it :) [18:26] niemeyer: about spread and go1.6 - a fix in the spread upstream repo would be great as it would allow us to avoid another upload. if that is hard/impossible I can try to workaround it via installing a different go when building spread or using the snap or trying to be creative in other ways. [18:26] ogra_, pedronis: Hey, regarding my lockup issue we just talked about: I found out that the application basically just crashes and exits. I just reproduced it with a one-line app that just panics and quits. Looks like snap by default makes daemons auto-restart in case of failure. Is it expected that this will lead to locking up the whole system when the app is quitting straight away? [18:26] suebt: hey, sorry for being absent earlier [18:26] I think system will back off eventually [18:26] hey zyga, no problem :) [18:27] but even if you essentially create a "while true; crash; done" app [18:27] it should not bring the system down [18:27] I can post the code, give me a sec [18:27] sergiusens: fyi, what zyga asked for is what is needed to understand what is happening. the profile (likely for the lxc command if I were to guess) seems to have been unloaded [18:27] can you provide as much information as possible please [18:27] suebt: oh, perfect [18:27] jdstrand: maybe when the container is stopped and started something is off and apparmor profiles are not loaded [18:28] mvo: I think the "stable quiet period" is a good thing for now [18:28] we have plenty of things to attack [18:29] zyga: oh yes! [18:29] zyga (cc sergiusens): that's interesting and plausible. a snapd restart on container start would workaround that [18:29] jdstrand: note that what is super odd [18:29] jdstrand: is that snap-confine would say "aha, I'm not confined" [18:29] jdstrand: and would bail out way before reaching that code [18:29] jdstrand: so its own profile must have been loaded [18:30] jdstrand: but it is only loaded by apparmor init scripts [18:30] jdstrand: and whenever we install core [18:30] jdstrand: so ... ? [18:30] something is off [18:30] or [18:30] well, that's silly [18:30] or the profiles from /var/lib/snapd/apparmor are not loaded [18:30] snapd will load it if it detects overlay or nfs too [18:30] we changed one thing recently [18:30] zyga: here you go: https://github.com/tim-sueberkrueb/snap-daemon-lockup-example I'm on Ubuntu 17.10 [18:30] we have the system key [18:30] not that this is the case, just mentionign that [18:30] we don't reload profiles like we used to do, all the time [18:31] does reexec make sure it is there? [18:31] anyway, need more data [18:31] suebt: perfect, me too [18:31] get a live stick ready xD [18:31] jdstrand: reexec yes but only when core is installed (that is during that operation) [18:31] jdstrand: what I am saying is that now with the system key we are not loading profiles on startu [18:31] jdstrand: so if there was a bug since forever in lxd [18:31] jdstrand: it was masked [18:31] jdstrand: but not anymore [18:31] suebt: my desktop runs 17.10 [18:32] ok, I just mean in case of locking your system up ^^ [18:32] suebt: inspecting now [18:32] thanks :3 [18:33] thank you for using snaps :) [18:33] and sorry for the bad experience [18:33] * zyga loves rust [18:33] Yeah, rust is awesome [18:34] just started learning it though :) [18:34] * suebt is still a noob in rust [18:34] I wonder if this is related to https://bugs.launchpad.net/ubuntu/+source/apparmor/+bug/1746463 [18:34] Bug #1746463: apparmor profile load in stacked policy container fails (Ubuntu Xenial):Fix Released> [18:34] perhaps the kernel is out of date [18:35] sergiusens (cc zyga): ^ [18:35] anyway, I need to move to another task now. if there is more data, we can look [18:35] jdstrand: perhaps, good call [18:37] suebt: what does snap version say? [18:37] are you on 2.32.3/ [18:37] yep [18:37] perfect [18:37] ok, installing now [18:37] (I hope it doesn't explode that hard :)" [18:37] ok, hope you can reproduce it :D [18:38] suebt: nope [18:38] it starts, it is restarted a few times [18:39] nothing happens [18:39] I'm typing this from that system [18:39] Well, this is both good and bad ^^ [18:39] Hmm ... [18:39] can you ssh into your machine [18:39] from something else [18:39] I didn't build it with cleanbuild [18:39] run journalctl -f [18:39] But that doesn't make any difference [18:39] neither did I [18:40] and then install the snap [18:40] and let's see what you get [18:40] Okay [18:41] jdstrand: well popey is on kde neon [18:41] sergiusens: perhaps popey should be talking to us :) popey, make sure your kernel is up to date (there was a fix last week for the above bug) and try the lxd stuff again [18:46] zyga: review? https://paste.fedoraproject.org/paste/IBruVUkBu79voKY439~bOg [18:46] zyga: uhm, ya, writing this from the other machine: https://paste.ubuntu.com/p/nxqmQydfrp/ that's all [18:46] jdstrand: heh, I wanted to rule out snapcraft any lxd interaction, but it seems the error is not snapcraft originated (while it is snapcraft driven) [18:47] Pharaoh_Atem: ack [18:47] suebt_: nothing there, what happens with your system [18:47] is the ssh session dead? [18:47] Yep it is xD [18:47] it completely locked up again [18:47] The other suebt just died [18:47] hmmmm [18:47] sergiusens: yeah, I think it is how snapcraft is driving lxd that is uncovering the issue [18:47] I wonder if your kernel crashed [18:47] is it a laptop/desktop? [18:48] do you have a screen [18:48] It's a laptop. [18:48] can you reboot it [18:48] go to vt4 [18:48] or something without X [18:48] log in into the consoel [18:48] ssh in remotely [18:48] trigger the error remotely [18:48] if the kernel crashed there's bound to be something on the tty [18:48] jdstrand: so the line that fails almost looks like `lxc exec -- snapcraft` [18:48] Okay, will try [18:49] suebt_: ok, perfect, thank you [18:49] Pharaoh_Atem: reading now [18:49] sergiusens: yeah, that is what I figured [18:49] jdstrand: the difference from the first run and second is that on the first run we snap install core and snapcraft while on the second run it is already there (so we do not install) [18:49] * jdstrand nods [18:50] jdstrand: and it works for me too; but I did not go through an upgrade path of 2.0 to 3.0.0 as he has (I installed 3.0.0 from scratch) [18:50] oh, and I am on bionic [18:50] sergiusens: that hints at the fact that core installation triggers profile setupo [18:51] sergiusens: I did go from 2 to 3. what do I need to do to reuse the container? (I use cleanbuild all the time) [18:51] that said, I suspect it has nothing to do with lxd [18:51] jdstrand: look at the top of the paste for the feature flag [18:51] export SNAPCRAFT_CONTAINER_BUILDS=1 [18:52] it behaves like a local build but inside a container, so you can run `snapcraft pull` and it will do the pull in the container and you get to see the bits locally [18:53] sergiusens: that's nice [18:54] sergiusens: yes, it works fine here (bionic) === grumblr is now known as grumble [18:54] sergiusens: I'd like to see what kernel popey has [18:55] ok, we can probaly use a smaller test case for this when he's back by installing a small classic confined snap and go with that [18:57] sergiusens: a good reproducer would be nice. that said, I can snapcraft twice on a small snap just fine and fast [19:01] Pharaoh_Atem: it looks good [19:01] let me think if there's anything to tweak [19:02] Hey zyga: On the laptop screen I get: "watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [snap-exec:10B49] ..." [19:02] aha [19:02] let it run [19:02] it's not dead [19:02] how long should I let it run? [19:02] I wonder what's the kernel shortcut to get a backtrace from stuff [19:03] a few minutes [19:03] okay [19:03] looks like a deadlock [19:03] suebt_: can you send me the snap you are running [19:03] the built one [19:03] (next time after reboot) [19:03] but don't reboot yet [19:03] and uname -a [19:04] and tell me if you have any kernel modules you've built from source or dkms [19:04] Sure, as soon as I'm able too reboot again. Okay, then I'm going to wait 5 more minutes? [19:04] Pharaoh_Atem: +1 from me [19:05] I think it's a start of a new era :) [19:05] suebt_: yes, I think there's a keyboard combo that can be useful [19:05] but I don't recall it [19:05] zyga: sysrq? [19:05] yes [19:06] mborzecki: do you know which one panics the kernel or shows something useful (backtrace) [19:06] hmm i have muscle memory for sending one over uart ;) [19:06] zyga: echo l > /proc/sysrq-trigger [19:07] zyga: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/Documentation/admin-guide/sysrq.rst#n51 [19:08] Issue snapcraft#2026 closed: Implement a passthrough feature [19:08] PR snapcraft#2053 closed: meta: implement pass-through of properties to snap.yaml [19:08] mborzecki: do you know what to press on suebt_'s keyboard to get useful backtrace? [19:09] zyga: On x86 - You press the key combo :kbd:`ALT-SysRq-`. and then l or t, probably l if it's a lockup [19:10] Pharaoh_Atem: can you please cross-post this to the forum [19:10] suebt_: ^^ what mborzecki said [19:10] can you try? [19:11] okay, sure, so alt print l? [19:14] suebt_: did it work? [19:14] hmm nothing happens, maybe I try the wrong combo for my laptop [19:15] tried alt+print+l [19:15] if it's a laptop you may need to press Fn+prtscr [19:15] ok [19:15] Nothing, also tried with the external keyboard [19:16] hm it may be disabled then :/ [19:17] on arch & thinpad x220, it's press alt, press Fn, press prscr, release prscr, release fn, press l, and then ther's a nice message in dmesg "This sysrq operation is disabled" [19:20] Issue snapcraft#2064 opened: Support for set-grade [19:20] hmm [19:21] in the lenovo forum FN + S seems to be also a thing, but that doesn't work either [19:22] okay, meh, sorry, then I'll reboot now, I gues? [19:22] *ss [19:24] suebt_: yeah, go for it [19:25] suebt_: send me uname -a [19:25] and your snap [19:25] I will look into it [19:25] but not tonight, I have some high priority work now [19:29] okay, will provide the info you requested in a few minutes, thanks! [19:31] zyga: why would I post this to the forum? [19:31] this is the email I'm going to send to server WG [19:31] Pharaoh_Atem: cross post, this is a big and important topic [19:31] this way people will know it happens [19:32] * Pharaoh_Atem sighs [19:32] I guess [19:34] snap package: https://drive.google.com/open?id=1eXHPcZLn-lVFf38EWvke-k8SWflOW6l- [19:34] system info: https://paste.ubuntu.com/p/ZySVSw8GQn/ [19:34] No custom kernel [19:34] anything I forgot? [19:35] zyga ^^^ [19:36] looking [19:36] ah, [19:36] I'm on a custom kernel! [19:36] I'll boot to -38 and try [19:36] thank you, that's all for now [19:36] can you hop in tomorrow [19:36] and check with us again please? [19:37] Sure, I will try to be around 18 UTC :) [19:37] *18 pm [19:38] 18 pm is not a thing nevermind xD [19:38] ok, thanks! [19:53] meh, too late - I have the answer for sysreq on the new lenovo keyboards: https://mvogt.wordpress.com/2017/05/24/sysreq-on-lenovo-x250/ - oh well [19:53] is there a way to display very old builds? [19:54] I see only like last 10 [19:54] I need to check some like 100 builds ago [19:55] petan: snapcraft history iirc [19:55] ok [19:56] mwhudson: did I mention today how great "snap install go --channel=1.6/stable" is? [19:56] mwhudson: thank you so much for this [19:59] jdstrand: 4.13.0-37-generic. KDE neon [20:03] * diddledan peeks [20:10] niemeyer: I pushed a possible fix for the spread go1.6 compat in https://github.com/snapcore/spread/pull/56 - as a stop-gap. if that is acceptable I can trigger the autopkgtests again on xenial (after that is merged of course). but please do let me know if you prefer a different approach [20:10] PR spread#56: add go1.6 compatibility [20:12] mvo: What's that about? [20:12] * niemeyer ooks [20:12] looks [20:12] niemeyer: in a nutshell our autopkgtests on xenial are broken because they try to build spread with go1.6 [20:13] mvo: That's an easy one [20:13] mvo: It's in [20:13] niemeyer: \o/ [20:13] niemeyer: man, thank you so much [20:13] mvo: Thanks, and sorry for missing this earlier [20:14] niemeyer: I will trigger the tests to re-run now, thank you! === sergiusens_ is now known as sergiusens === TinoGuest_ is now known as TinoGuest === MrGeneral_ is now known as MrGeneral === davdunc_ is now known as davdunc === iatrou_ is now known as iatrou === icey_ is now known as icey [20:31] popey: linux (4.13.0-38.43) artful; urgency=medium [20:32] popey: that ^ has the fix for the bug I mentioned. please upgrade to that and try again [20:53] jdstrand: I am on xenial [21:08] popey: linux-hwe (4.13.0-38.43~16.04.1) xenial; urgency=medium === sergiusens_ is now known as sergiusens === sergiusens_ is now known as sergiusens === sergiusens_ is now known as sergiusens === sergiusens_ is now known as sergiusens