[00:00] no, not :) [00:00] :( [00:00] wallyworld: one common idiom is an 'integration' branch holding things merged from other people that is merged to trunk [00:00] another is a 'bits' branch where you do little trivial things [00:00] see e.g. stubs pending-db-patches branch [00:00] yes, i totally ignored those [00:00] in my thnking [00:00] wallyworld: its ok, really. LP has -massive- use cases [00:00] bad mistake. mad at myself for making it [00:01] and our object model being so heavy warps our thinking. [00:01] I do it all the time, and have to back up and do another run at the problem. [00:01] that's true actually, about the object model [00:01] especially for newer people like me [00:01] yes [00:02] its a map, with lots of here-b-dragons, and not the territory. [00:02] wallyworld: here's another case - one I'm looking at at the moment [00:02] it's also "decay" in the codebase [00:02] wallyworld: https://bugs.qastaging.launchpad.net/ubuntu/+source/afflib/+bug/230350/+index [00:02] <_mup_> Bug #230350: Missing Debian Maintainer field that has 160 bug tasks. [00:03] *160* [00:03] * wallyworld looks [00:03] wallyworld: this is just an example of where users go waaaaay outside the expected bounds us mere developers created. [00:03] * wallyworld tried to look but got a timeout [00:03] right [00:04] thus I'm looking at it :> [00:04] :-) [00:05] * lifeless goes back to bug-279513 [00:06] Hm [00:06] Jenkins, I'm disappointed. [00:06] lifeless: Thanks. [00:06] I was expecting a "trigger another build when this one is started." [00:07] StevenK: I think the advanced triggers plugin can do that [00:09] lifeless: I can't see that on the plugins page [00:11] StevenK: #jenkins I think [00:11] a quick google doesn't show me anything [00:11] it would be easy to do a plugin for it [00:11] but I'm reasonably sure there is one [00:13] I can't see one either, but I can work around it [00:14] Project db-devel build #407: STILL FAILING in 5 hr 36 min: https://hudson.wedontsleep.org/job/db-devel/407/ [00:14] wgrant: So, what do I do to this to make it only run windmill tests? [00:15] StevenK: How do you call the test suite at the moment? [00:15] make TESTOPTS="--subunit" check | subunit2junitxml -f -o test-results.xml [00:15] Add '--layer=WindmillLayer' to TESTOPTS [00:16] make TESTOPTS="--subunit --layer=WindmillLayer" check | subunit2junitxml -f -o test-results.xml [00:16] Yup. [00:16] Added [00:16] Thanks. [00:17] Scheduled a new build of it [00:17] They only take ~35 minutes on ec2. [00:19] StevenK: Is there a bug for the checkwatches.txt thing? [00:19] Not that I can recall [00:19] OK. [00:19] It's only 500 lines, it should be shot. [00:21] I need to rewrite most of it to get rid of more OOPSes. [00:21] So I might. [00:21] But checkwatches :( [00:21] Haha [00:22] Ah, good. [00:30] oops [00:30] forgot to add a particular tag to that landing [00:32] thumper: Which? [00:32] wgrant: the mantis reland [00:33] I did an ec2 land [00:33] It seems to have been rejected anyway. [00:33] Ahh. [00:33] What was missing? It doesn't want to be --rollback. [00:33] It looks fine to me. [00:33] oh, ok [00:33] I already rolled it back last night. [00:33] So we are clear QA-wise. [00:34] ok [00:34] good [00:34] * thumper needs lunch === Ursinha is now known as Ursinha-afk [01:28] Finished: SUCCESS [01:29] wgrant? [01:29] StevenK: windmill worked. [01:29] \o/ [01:29] Thanks! [01:30] It took an hour, though. :-( [01:30] Yeah. [01:30] I should be able to cut about 30 minutes off Jenkins build times by working on EBS [01:30] But I have no idea where to start [01:31] is bug 181368 fixed? [01:31] Maybe. [01:32] Certainly not released. [01:32] But maybe fixed. [01:35] wgrant: I thought the publisher was in nodwontime [01:35] lifeless: No, because of poppy. [01:35] No separate tree yet. [01:47] https://code.launchpad.net/~thumper/lazr.enum/case-insensitive-getTokenByTerm/+merge/51843 anyone? [01:52] why does jenkins now show recent success for db-devel? [01:54] It doesn't? [01:56] last success 6 days ago [01:56] that isn't recent to me [01:57] Oh. s/now/not/ ? [01:57] oh [01:57] yeah [01:57] * thumper is on school run [01:57] Because checkwatches.txt has been failing pretty much every test run [01:58] But not on ec2 or buildbot :-( [02:15] hmm.... [02:15] why? [02:16] Neither wgrant or I have any clue [02:18] thumper: I adjusted a couple of things relating to that test. [02:18] thumper: So I'm going to look at it this afternoon. [02:18] And either fix or delete it. [02:19] ok [02:19] lifeless: you mentioned before that the lp templates mostly show bug tasks and not bugs..... however i think we want to stick with just displaying bugs for revisions listing on the branch index page [02:20] why? [02:20] a bug can have many bug tasks and we really just want the one bug item which the user can click on don't we? there's already a lot of info displayed [02:20] wallyworld: bugtask:+index already picks a single task [02:20] wallyworld: see the method I pointed you at ;) [02:20] the bug summarises the issue the mp is solving [02:21] if you mean the getLinkedBugTasks(), i think we need an anlogous getLinkedBugs() method [02:21] we don't always want the entire list of bug tasks, sometimes just the bugs [02:22] We have too many h2s. [02:22] But it is tempting to keep them because they just look so nice. [02:23] lifeless: i think the current extended revision listing looks ok as is, but we just want to be able to hide private bugs properly [02:23] wallyworld: you're not making any sense. [02:24] wallyworld: get linked bug tasks already chooses *precisely one task* [02:24] wallyworld: the difference you are arguing for is precisely empty. [02:24] but a bug can have many associated bug tasks, no? [02:24] yes, but thats unrelated to the discussion [02:24] because getLinkedBugtasks never returns more than one task per found bug [02:25] just saw that now [02:25] wallyworld: you *must* have a task to show importance, status, tags, context. Bugs on there own are approximately never interesting. [02:25] why is that? [02:26] why only return just the one task? [02:26] wallyworld: because thats what bugtask:+index spent 500 queries figuring out. [02:26] wallyworld: I was merely fixing the performance not the definition. [02:27] ok. fair enough. where in lp then can the user see all bug tasks associated with a branch? [02:27] and why is the method called getLinkedBugTasks if it only returns one? [02:28] because it returns one task per bug - it returns many tasks [02:28] just not many tasks per linked bug [02:29] of course, reading comprehension problem :-( [02:29] there is nowhere in LP that you can see all the bug tasks associated with a branch. The model links to bug, but linking to bug isn't useful for the rest of the system. [02:29] s/model/schema/ [02:29] ok [02:30] wallyworld: series branches only show bugtask for bugs which have an open bugtask; regular branches show a bugtask for all their bugs. [02:30] wallyworld: whether this is right or wrong is a separate discussion IMO : I want to just highlight a few key things: [02:31] - everything about a bug except visibility, description, comments,attachments - is on bugtask [02:32] - our /convention/ throughout the system is to not show bugs, only bugtasks. [02:32] and also on bug are the associated branches from memory [02:32] wallyworld: bugs, questions, specs and other htings are indeed related by bu. [02:32] *bug* [02:33] - we show bugtasks because we show status and importance. And we badge things with 'has brnach', 'has spec' etc etc. [02:33] ok. there's currently functionality which constructs mp emails using bugs not bugtasks. i guess this needs changing too? [02:33] wallyworld: case by case: we're talking web UI here. [02:34] the web UI is different to email because: [02:34] - its live: folk expect data to be presented rather than clicking through [02:34] - email they expect it to be stale - to be given deltas rather than seeing it as it is right now [02:35] i ask because i think (from memory) the email stuff uses branch.linked_bugs and that needs to be removed [02:36] because that also exposes private bugs to email recipients [02:37] so it possibly needs to replaced with something using getLinkedBugTasks [02:37] The codebrowse OOPSes have stopped! [02:37] Surely not! [02:38] Maybe it's just hung again :P [02:39] wallyworld: right, I'd use getLinkedBugTasks [02:39] wallyworld: you could use task.bug [02:40] wallyworld: or you could show bug status as well in the email [02:40] lifeless: yes, i'm doing just that elsewhere atm [02:40] using task.bug i mean [02:41] have to look at performance though. hopefully any alternatives are no worse that using branch.linked_bugs [02:42] wgrant: saw the email about windmill. me sad. a windmill test of mine actually caught a legitimate bug so we do need those tests [02:43] wallyworld: Which are being run on Jenkins [02:43] wallyworld: Sure, we need to turn them back on at some point. [02:43] But for now they are out of the critical path, but Jenkins will tell us if things go bad. [02:43] StevenK: yes but not as part of ec2 land hence we can now land bad code [02:44] wallyworld: It was turned off for most of last year anyway. [02:44] The number of issues that it catches is miniscule. [02:44] sure, we just got lucky [02:44] We would be better served by finding them later and being able to fix them more quickly. [02:44] Which is what this gives us. [02:44] Because branches will no longer be delayed by 10 hours due to Windmill being shit. [02:45] well if we use more and more XHR then miniscule won't be necessarily true [02:45] This is a short-term thing. [02:45] huh [02:45] qatagger is lying [02:45] Eventually we will convince deryck to fix it. [02:45] wgrant: just wanted to hear that it is short term :-) [02:45] lifeless: Oh? [02:45] look at it now [02:45] then file a bug :) [02:45] You mean how it shows it as deployable? [02:46] It has always done that. [02:46] no [02:46] I mean the 2nd line [02:46] earlier today it was showing 12486 [02:46] which was correct [02:46] * wallyworld needs food [02:46] now its showing 12488 [02:46] rather than 'nothing can be deployed' [02:46] lifeless: That was probably before the rollback was in place. [02:50] wgrant: no [02:50] wgrant: it wasn't [02:50] the /rev/ can be ok without it being show deployable at the top [02:50] Ah. [02:51] wgrant: this is a previously unobserved corner case [02:51] the rule fo rdeployable is 'all the revs before deployable and any rollbacks for broken revs included in the deploy [02:55] wgrant: https://bugs.launchpad.net/qa-tagger/+bug/727552 [02:55] <_mup_> Bug #727552: when first rev is bad, with a later rollback, qatagger incorrectly reports it deployable atthe top < https://launchpad.net/bugs/727552 > [02:55] lifeless: Thanks. [02:55] its probably an off by one [02:57] * thumper hangs head [02:57] WTF... [02:57] Project devel build #492: FAILURE in 6 hr 8 min: https://hudson.wedontsleep.org/job/devel/492/ [02:58] WTF @ Hudson [02:59] But checkwatches.txt passed again... [03:00] what's the sop for deprecating something in the web services interface? [03:00] we don't [03:00] getUtility(ILaunchBag).user.preferredemail.email inside the email handler [03:00] if its in beta, leave it there. [03:00] even if it's a security/privacy flaw? [03:00] if its in 1.0 leave it there. [03:00] if its devel, just delete it from devel. [03:01] wallyworld: What is? [03:01] there isn't a user in the launchbag for process mail [03:01] wallyworld: change its behaviour [03:01] because we export branch.linked_bugs [03:01] wallyworld: lazr.restful checks launchpad.View on each object in a collection before it returns it. [03:01] wallyworld: just change it to only return visible bug. But I fixed this already I thought. [03:01] it is handled by lazr.restful [03:01] we don't need to mess with it [03:01] currently it's still defined as an SQLRelatedJoin [03:02] wallyworld: do you *think* its broken, or have you *shown* it [03:02] ok. didn't realise lazr.restful did some filtering [03:02] wallyworld: it will be doing sucky late evaluation at the moment [03:02] wallyworld: and the collection size will be wrong. [03:02] lifeless, i guess "7% headroom" comes from 13/14 [03:02] wallyworld: so its improvable, but it won't actually be disclosing [03:02] poolie: 1/14, yes. [03:03] poolie: I realised the flaw right after I hit send. [03:03] but i don't really see how that corresponds to requests/day [03:03] oh ok [03:03] nice mail otherwise though [03:04] poolie: we'll free up 1 second for every request that is > 13 seconds today. [03:04] poolie: which is a smaller number. A thousand or so [03:04] * wallyworld still thinks we should be able to mark stuff as deprecated if required whether it be a security hole or implementation issue [03:04] wallyworld: we can in extremis, but it needs to be last-resort [03:05] python, java etc all do it :-) [03:05] wallyworld: this is why I don't want *any* new verbs / attributes or types in the 1.0 web service. [03:05] so you'll save ~701s of cpu time per day? [03:05] oh, a bit more than that if you also kill those in the >13s range [03:05] right [03:05] wallyworld: we made a promise to not break 1.0 for the folk using it in shipped Ubuntu versions. [03:05] over 32 cores? [03:05] poolie: the 701 is the >14 requests. [03:06] right, got that [03:06] so say 1000 requests, and you'll cut off one second each [03:06] poolie: currently over 18 python processes. [03:06] so we could mark something as deprecated and then support it until the lts runs out [03:06] wallyworld: there is no deprecation marker. [03:06] wallyworld: we'd just remove it from devel. [03:06] so it's less than .001% more headroom [03:06] yes, and i'm suggesting we need such a marker :-) [03:07] wallyworld: I think it would be counterproductive. [03:07] still worth doing [03:07] wallyworld: 1.0 users cannot use the replacement. [03:07] wallyworld: devel users don't need the old method anymore. [03:08] but in the general case, marking things as deprecated is a fairly common paradigm [03:08] sure [03:08] this isn't the general case [03:08] its a webservice with fairly specific constraints [03:09] one of which is that things work smoothly and seamlessly with the 1.0 version of the service. [03:09] Separately, I don't want use carrying the debt of things we want to remove *in devel* until we delete every prior edition. [03:09] we have enough debt. [03:10] sure, but we should still be able to work towards a 1.1 version, no? ie we need a story around how we evolve the interface [03:10] wallyworld: we have one [03:10] wallyworld: whats the problem you are trying to solve? [03:11] talking generalities. the problem of being able to marked certain things as "do not use, will disappear in a future version". give people time to migrate away [03:11] wallyworld: but it doesn't for us. [03:12] this is more a pub discussion :-) [03:12] wallyworld: the timescale impedance is immense. [03:12] wallyworld: 5 years vs daily change. [03:12] wallyworld: and new things *are not accessible* until you step over the version, at which point - unless we carry that *5 year* burden for *10 years* - the old stuff is flat gone. [03:13] yes, the timeframes are large [03:14] it would be interesting to know which apis were being actively used. i don't suppose we gather those sorts of metrics [03:14] we have logs that can tell us reasonably well [03:15] moot point for now. i was more just thinking out loud. i have to stop doing that [03:15] wallyworld: its fine - I do that too [03:15] wallyworld: but expect to be engaged on such utterances ;) [03:16] :-) [03:16] wallyworld: happens to me all the time - surely you've seen me go 'I have no real answer to hand' :) [03:18] yes :-) [03:19] \o/ I now can start coding today. [03:19] this is why I'm so unproductive, no cycles to write code [03:20] doesn't help you get dragged into various discussions all the time [03:24] You know your jet lag is bad when you catch yourself thinking that you want database sharting. [03:30] it seems to me that there should never be a user in the ILaunchBag during process-mail [03:32] why not ? [03:32] when the sender of the mail is identified we should setup a participation and security policy for them. [03:33] I filed a bug about this. [03:37] Rargh, and then devel #492 fails with _lock_actions garbage [03:46] lifeless: https://code.launchpad.net/~wgrant/launchpad/unbreak-incoming-mail/+merge/51850 [03:46] Slow scan and diff generation are slow. === StevenK changed the topic of #launchpad-dev to: https://dev.launchpad.net/ | firefighting: - | On call reviewer: StevenK | https://code.launchpad.net/launchpad-project/+activereviews [03:55] is lib/lp/soyuz/tests/../stories/soyuz/xx-distributionsourcepackagerelease-pages.txt of any use [03:56] or is it redundant with unit tests? [04:02] lifeless: That doctest makes me sad [04:02] StevenK: It's a doctest. [04:02] Haha [04:15] can I just delete soyuuz ? [04:15] Please. [04:15] But delete checkwatches first. [04:15] lifeless: Patches welcome [04:15] sob [04:16] why is default distroseries different to current [04:16] Default distroseries? [04:16] - http://launchpad.dev/ubuntutest/+source/testing-dspr/1.0 [04:16] + http://launchpad.dev/ubuntutest/hoary-test/+source/testing-dspr/1.0 [04:16] in that test [04:17] after I tell getPubSource where to publish - the ubuntutest currentseries [04:17] That's a DSPR wersus a DSSPR... [04:17] Wow. versus. [04:17] >>> anon_browser.open( [04:17] ... 'http://launchpad.dev/ubuntutest/+source/testing-dspr') [04:18] >>> anon_browser.getLink('1.0').click() [04:18] are the lead in lines [04:18] >>> print anon_browser.url [04:20] getLink('1.0') is a little ambiguous. [04:21] wgrant: I'd really like to just delete the test :) [04:31] wgrant: so [04:31] you think there are multiple 1.0 links ? [04:32] lifeless: Quite possibly. [04:32] Can't check details right now though, sorry. [04:57] wgrant: argh [04:58] Oh? [04:58] wgrant: I *think* I may have found a reason for distro.getCurrentSourceReleases. [04:58] wgrant: perhaps not a good reason [04:58] wgrant: /ubuntu/+source/packagedeletedinseries-1 [04:59] lifeless: Latest upload? [04:59] yeah [05:07] what about dropping 'latest upload' as uninteresting [05:10] Component, Maintainer and Architectures are potentially useful. [05:10] Urgency and Latest upload are not. [05:15] sob [05:15] maintainer etc all come from 'current' [05:15] so, does this need to handle packages never published in 'currentseries' ? [05:15] * lifeless thinks [05:16] It would be nice. But it's not really critical. [05:16] I'm going to shelve the branch [05:16] its taking up too much time [05:16] I say use the current series' publications, otherwise say it's not published. [05:17] makes sense to me too [05:17] but the perf win for this is no where near big enough to justify the time investment [05:19] https://code.launchpad.net/~lifeless/launchpad/bug-279513/+merge/51063 - abandoned [05:37] anyone know - does storm sniff tables in SQL(...) ? [05:39] whats a good test class for Archive:getPublishedSources? [05:48] wgrant: ^ [05:49] wgrant: also, soyuz permissions - are they lists of attributes, or just-add-to-interface-and-its-done ? [05:50] lifeless: You may need to create a new class in test_archive. [05:50] lifeless: Most of Archive is tested in archive.txt. [05:50] lifeless: As for permissions, it depends on the class. [05:50] You'll have to check configure.zcml. [05:59] wgrant: thanks [06:01] lifeless: No, SQL is essentially a string. Storm doesn't know how to parse SQL - just generate it. [06:01] stub: thanks [06:01] I'll use a using clause [06:03] whats the easiest way to find what a Archive:EntryResource:getPublishedSources?ws.op=getPublishedSources looks like given that its timing out [06:04] You want to know the data from a specific URL that is timing out? [06:04] (also, QA party) [06:05] wgrant: yes [06:05] wgrant: we're qa'd ? [06:06] No, it's time for a QA party to QA all the stuff that is now on qas. [06:06] wgrant: ah [06:06] I'll do mine straight up [06:07] !!! frell [06:07] ? [06:07] Still broken? [06:07] TypeError [06:07] just getting a traceback [06:07] ('Could not adapt', , ) [06:07] Hah. [06:07] I suggest rolling both back. [06:08] ... and adding tests. [06:08] doit [06:08] Me, or you? [06:08] you, if you would; its 7pm here and I'm about to get snatched for dinner [06:08] k [06:09] I landed this by accident if you recall :( [06:10] But it passed buildbot [06:10] yes [06:10] so ec2 would not have helper. [06:10] *helped* [06:10] I'm just saying its cursed. [06:10] Heh. [06:14] In PQM. [06:15] thanks [06:23] OK, how do I delete launchpadlib's credentials? [06:23] I am trying to log in as one of my alternate SSO accounts on qas. [06:24] But I can't work out how to remove my system authorization. [06:24] Hm, maybe my system launchpadlib still uses GNOME Keyring. [06:25] Ah, there. [06:26] so, we get 800 hits a day on /branches [06:27] > 800 - 800 soft timeouts [06:27] so probably 0.02% of traffic or so, if we wanted to let it die for a day. [06:27] just a thought [06:28] It should be through buildbot in 7 hoursish. [06:29] And there is no QA required after what is there now. [06:29] So, in the likely event that nobody requests one overnight, we can do it in the morning. [06:31] wallyworld: I like your fix for the AJAX build request. [06:32] wgrant: great. it seems to work better now [06:32] Assuming that this revert works, we are QA'd up. [06:35] Project windmill build #2: FAILURE in 1 hr 9 min: https://hudson.wedontsleep.org/job/windmill/2/ [06:35] My point exactly. [06:37] wallyworld: Could you check that test? It passes locally, but this probably indicates that it has a race. [06:37] grr [06:37] TacException: Error running ['/usr/bin/python2.6', '-Wignore::DeprecationWarning', '/home/robertc/launchpad/lp-branches/working/bin/twistd', '-o', '-y', '/home/robertc/launchpad/lp-branches/working/daemons/librarian.tac', '--pidfile', '/tmp/tmpZckiy9/librarian.pid', '--logfile', '/tmp/tmpZckiy9/librarian.log']: unclean stdout/err: No handlers could be found for logger "QueueProcessorThread" [06:37] Oh, it's setting daily_build, not requesting a build. [06:37] So not your fault :P [06:37] Project db-devel build #408: STILL FAILING in 2 min 18 sec: https://hudson.wedontsleep.org/job/db-devel/408/ [06:38] bzr: ERROR: http://bazaar.launchpad.net/~launchpad-pqm/launchpad/db-devel/.bzr/repository/packs/09fb66d3dfb9df78e2bb1ca7b6c7aac9.pack is redirected to https://launchpad.net [06:38] Not again :/ [06:38] and poolie is gone [06:38] wgrant: i don't think that test is one of mine. i would have to look to be sure [06:38] lifeless: It doesn't appear to have been during a push. [06:39] ok [06:39] something solidly else then [06:39] the redirect is berk [06:39] PQM has been checking my branch for 25 minutes. [06:39] speak of the man [06:39] Shouldn't push for another couple. [06:39] It always used to redirect on 404, if branch-rewrite failed. [06:39] Not sure if it still does. [06:40] It doesn't. [06:40] poolie: https://hudson.wedontsleep.org/job/db-devel/408/ [06:40] Another pack redirect. [06:40] But PQM wasn't pushing at the time. [06:40] well [06:40] lifeless: Just a guess, but shouldn't that command line be bin/py or python2.6 -S for all the buildout deps and environment config to be pulled in? [06:41] stub: bin/twistd is a similar alias [06:41] hi wgrant [06:41] stub: that wraps /usr/bin/twistd with the local python etc etc [06:41] yup. ic. [06:41] the fail is the 'no handlers' message [06:42] lifeless: We silence some loggers in lib/lp_sitecustomize.py if that is appropriate here. [06:42] wgrant, well, the big thing is to just work out what the redirect message is, and who is sending it [06:42] but i'll reopen the bug [06:43] stub: I haven't looked [06:44] poolie: I don't think devpad has codehosting HTTP logs :( [06:45] stub: it may be a message we want - won't know until we dig [06:45] wtf do I feel like I have a hangover? Maybe I should have gone out last night and at least had a reason? [06:45] wgrant: for crowberry ? [06:45] stub: cause you slept in :) [06:45] lifeless: Yes. [06:45] lifeless: Getting better. Awake at noon today. Might even make our call tomorrow :-P [06:45] Project db-devel build #409: STILL FAILING in 0.47 sec: https://hudson.wedontsleep.org/job/db-devel/409/ [06:45] are there logs at all for this? [06:45] Er, still? [06:46] Ah, looks like the slave needs a good killing. [06:47] poolie: I presume there are logs from Apache... [06:48] i discovered that's not a reliable kind of presumption [06:48] I know we don't keep haproxy logs. [06:48] Hi poolie, funny to see you online when I'm waking up :) [06:49] Morning jam. [06:49] jam: You'll be pleased to know that loggerhead has stopped OOPSing. [06:49] \o/ [06:49] wgrant: what was up ? [06:49] Thanks for fixing that. [06:49] Or that is the 16k oops / day thing? [06:49] jam: The 16000 daily socket errors. [06:49] :) [06:50] congratulations on waking up [06:50] there are more fixes in loggerhead trunk, to handle various bits of that correctly all the way through the stack [06:50] at what seems like a reasonable time in that tz [06:50] as it happens i'm just answering your mail [06:50] poolie: Still getting my head around the area, and setting up K with the new schools, etc. So I'll probably only be around a half-day today [06:50] jam: Although when QAing it I found bug #726985 [06:50] <_mup_> Bug #726985: codebrowse OOPSes with GeneratorExit when connection closed early < https://launchpad.net/bugs/726985 > [06:50] but yeah, jet lag doesn't seem to be too bad for me this time [06:50] my son isn't quite sure about the new TZ, though. [06:50] Heh. [06:51] lifeless: So we only gain using pgbouncer for appserver connections. At the moment, at any point in time we have 160 connections to the master db. With pgbouncer, that will usually be sitting at 40-50 (about half the main connections are active at any point in time, only < 10 of the session connections are active at any point in time - they run in autocommit and normally are touched only at the start of an appserver request). [06:52] wgrant: seems pretty easy to fix up. I'm curious who is killing the generator, or how it is falling out of the stack early, etc. [06:52] I think there is already a StopIteration check, probably fine to add GeneratorExit [06:52] jam: Yeah, I think so. [06:52] It's not OOPSing on prod, at least. [06:52] right [06:52] hmm... can use it too for librarian, codehosting and friends too. I think everything but the cronjobs. [06:52] I think the issue is just that *any* exception is getting logged [06:52] even the ones that don't mean much. [06:52] like GeneratorExit means we exited early [06:53] but is likely to not actually matter [06:54] btw, poolie, welcome back from your bike ride, sounds like you had a good time [06:54] stub: we only have 18*4 = 72 appserver threads [06:55] stub: so 1/2 active would be a reduction of 32 on that 160 ? [06:56] lpnet1-8,lpnet11-14 each using 4 threads. lpnet15 using 32. [06:56] Morning, all. [06:57] I'm not even counting edge actually - guess I should be here. [06:57] Morning deryck. [06:57] stub: 15 is getting reconfigured [06:57] Looks like lpnet15 is a bogus config [06:57] stub: yes, along with 9 and 10 that are not currently running [06:58] deryck: I have some questions about writing Windmill tests properly, if you have a moment. [06:58] we have 13 lpnet servers, 5 edge [06:58] for 18*4 once lpnet15 is fixed [06:58] wgrant, I think the sprint I'm in will start shortly. But ask away and I'll respond as I can. [06:58] deryck: Ah, didn't realise you were at one. [06:58] stub: will be going up to 80 with singlre threaded (but we'll actually have two threads per appserver to permit opstats queries [06:58] wgrant, yes, in South Africa. Ensemble sprint. [06:58] lifeless: So If the 50% active connections remains, it could be a win. If you are correct about GIL contention and stuff the activity will grow. I need to decide if it is enough of a win anyway to bother. [06:59] wgrant, this is why I haven't looked at Windmill this week yet. ;) [06:59] stub: I'd be inclined to move ahead with it, if only because we can more easily kick everyone off [06:59] stub: though you were saying we had cross transaction temp tables? [07:00] deryck: Looking at lib/lp/code/windmill/tests/test_recipe_index.py, test_inline_recipe_daily_build just failed on our Windmill-specific Jenkins job. [07:00] deryck: It is a race of some sort. [07:00] deryck: The lack of a wait between the getUserClient and click looks very suspicious to me. [07:00] Should there be one there? [07:00] Project db-devel build #410: STILL FAILING in 0.49 sec: https://hudson.wedontsleep.org/job/db-devel/410/ [07:00] I don't really know how much waiting a page load does. [07:00] * deryck looks [07:01] StevenK: Could you please kill/clean the db-devel Jenkins slave? [07:02] wgrant, what line in the test? I don't see a getUserClient call. do you mean the getClientFor stuff? [07:02] deryck: https://hudson.wedontsleep.org/job/windmill/lastCompletedBuild/testReport/ has the failure. [07:02] lifeless: Yes. A pgbouncer instance runs in a single pooling mode. Session or Transaction interest us. Transaction would save us idle database connections, but doesn't support cross transaction resources. Session doesn't change our utilization, so we would be trading some extra connection control for crappier authentication and extra complexity. [07:02] deryck: Er, getClientFor, yes. [07:02] ok [07:02] wgrant: i-3EDE07B4 [07:02] ? [07:02] deryck: It loads the page them immediately tries to click on a JS widget. [07:02] deryck: Which sounds prone to races. [07:03] stub: So I think what I care about is perf and reliability [07:03] stub: if we won't gain measurable perf, and will code reliability, lets defer it. [07:03] StevenK: Yes. [07:03] stub: we can work on getting rid of those tables and cursors later. [07:03] wgrant, there's a waits.forPageLoad in the getClientFor method before it returns. So I think it's fine to click immediately after the method. [07:03] I'm looking at pgpool-ii to see if it gives us better options. [07:03] deryck: :( [07:04] stub: cool [07:04] stub: go with what makes the most sense to you [07:04] deryck: So that waits for all the JS to finish? [07:04] I don't think we want to drop temporary tables and cursors entirely - they are the only sane way of doing some bulk operations. The rosetta scripts make use of them extensively. [07:05] wgrant, sorry, sprint starting. will look again when I can. [07:05] deryck: Thanks, no rush. [07:05] It's not blocking anything any more. [07:05] wgrant, but yes, the js should be loaded and done by that method return. [07:06] stub: they aren't crash proof though [07:06] stub: I think if we make a crash proof implementation for those scripts, we wouldn't need cursors / cross trnasaction temp tables. [07:07] wgrant: Done. Why? [07:07] lifeless, re bug heat, perhaps it would be useful to try to gauge how many users actually find it useful? [07:07] StevenK: It failed to check out the branch due to an odd pack redirect. [07:07] AGAIN?! [07:07] Yes. [07:07] i think it could be potentially useful but it never actually seems useful [07:08] poolie: the distro do use it [07:08] and like it? [07:08] that's good to know [07:08] poolie: I think we should finish and polish it before assessing whether its good or not. [07:08] like many things its been done just-enough, rather than made excellent [07:08] hm [07:08] that doesn't seem like a very general algorithm [07:09] but, i think it's worth working out whether the problems are essential or just bugs [07:09] lifeless: The pattern is to prepare the data into a holding area and then perform bulk operations in multiple transactions with small batches. [07:09] stub: yup, thats what I thought it was. [07:10] stub: the crash proofness I refer to is the assumption [or fact] that identifying the work is expensive. [07:10] I think you are proposing the holding area be a real table rather than a temporary one, which is doable but a little more effort (db patch to create the table, extra code to clean out the holding area when necessary) [07:10] stub: for instance, yes. alternatively make identifying the remaining work cheap. [07:12] lifeless: Even when identifying the remaining work is cheap, it is still more expensive than doing it upfront. Garbo and the librarian-gc have made use of pretty much every possible technique. [07:13] do storm StringColumns support a .like() call for queries? [07:13] e.g. foo.like('%bnar%') [07:13] I think they do, or there is a LIKE function. [07:14] stub: I think it depends on the operation whether upfront checking is needed, is all I'm saying [07:14] stub: I recognise the need for variety. [07:14] yes. For our current use cases, I think we could minimize their use if we wanted to, but I don't think it would be sane to eliminate them altogether. [07:15] stub: if we *were* to eliminate them, we could: [07:15] - parallelise more easily [07:15] - kill the tasks during db deploys more easily [07:15] - use pgbouncer more easily [07:15] its just a thought [07:16] storm.expr has a Like binary operator. Don't know about foo.like() syntax. [07:17] yup === almaisan-away is now known as al-maisan [07:41] Project devel build #493: STILL FAILING in 4 hr 43 min: https://hudson.wedontsleep.org/job/devel/493/ [07:42] That's not good. [07:43] does IntColumn.__eq__(Enum) work ? [07:44] I don't know, but it would certainly make you a bad person. [07:44] Why? [07:49] wgrant: migrating Archive.getPublishedSources to storm so I can DRS it [07:52] who has permission to mark a branch as an official source package branch? [07:52] the tb [07:55] because they're the owner of the distro series it's going into? [08:04] lifeless: I think you just need ~ubuntu-branches membership. [08:04] Because they are in the special group created by jmls initial work [08:04] Celebrities, yay. [08:05] and that group is meant to only have the TB in it now [08:05] because it effectively controls upload rights [08:05] we should indeed move that to a right granted by owning the distro [08:05] ok [08:05] so allowing them to only push the button if they also own the branch would fix this? [08:05] We also need to delete bazaar-experts. [08:06] it would avoid them shooting themselves in the foot by accidentally trusting someone else's branch [08:07] poolie: uh [08:07] poolie: I am missing a lot of context here I presume [08:07] what button, what problem, what accidental trust ? [08:07] nm [08:07] i sent mail [08:07] bug 516709 [08:07] <_mup_> Bug #516709: revisit official package branch permissions < https://launchpad.net/bugs/516709 > [08:08] poolie: Owner == Maintainer [08:08] not urgent [08:08] And that is shown on Distribution:+index. [08:09] ok [08:09] there's still a bug as to how you're supposed to know owner==maintainer [08:09] Pillars don't have the "owner" term exposed in the UI. It's called "Maintainer" instead (and "Registered by" is the owner for both of those objects, but that is a bug) [08:10] Where have you seen a reference to a distribution owner? [08:10] Or a project owner? [08:10] robert's comment in the thread about that bug, and the api [08:12] poolie: its not clear what you are trying to optimise for in that discussion [08:13] > Lets aim for 'excellent' not 'least work' (while still aiming for 'done soon'). [08:13] that's kind of vague [08:13] who could disagree?L [08:13] well [08:13] but it's not very helpful in working out what is actually simplest [08:13] simplest to implement? use? describe? [08:14] any [08:14] I'm essentially getting confused [08:15] I'm not sure if you're trying to redesign the distro branches feature [which isn't fully implemented, and the design calls for the permissions to override - thats why there is a bug open for that] [08:15] or something else [08:16] i'm trying to clarify what if anything needs to be done for https://bugs.launchpad.net/udd/+bug/516709 [08:16] <_mup_> Bug #516709: revisit official package branch permissions < https://launchpad.net/bugs/516709 > [08:16] poolie: we need to ensure the identity statement that only uploaders to the distro can upload to the distro [08:16] sure [08:17] there are various possibilities to get there [08:17] what are your constraints [08:17] what are your requirements, and the requirements of your stakeholders? [08:18] i think the only hard requirement is, just as you said, that only package uploaders should be able to write to these branches [08:18] afaict that is in fact already met [08:18] well, its not. [08:18] then there are two soft requirements [08:18] 1- it should not be too easy to make mistakes that open up access- [08:18] because the permissions | together rather than excluding. [08:18] 2- it should not complicate lp's access control user model any more than it is at present [08:19] which permissions? [08:19] poolie: I'm going to put on my paranoid hat and say that 1 is not a soft requirement. [08:19] poolie: its a *hard* requirement that it be effectively impossible to open up access. [08:19] poolie: this has been well established over the years of UDS discussions. [08:19] what i mean is it is not a black and white easy [08:19] how is 'too easy'? [08:20] if its possible to have something look like its distro uploader only, it must be distro uploader only. [08:20] well, that's really my second point, that the user model should be understandable [08:20] I'd rather for 2) we say 'we want it to be easy to understand and use' [08:21] s//black and white issue [08:21] sure, i agree with that too, i'm just trying to be more specific [08:21] ok [08:23] poolie: so we have a trivial to code tolerable solution mooted in the bug [08:23] poolie: I assert that something needs to be done because we don't satisfy the identity requirement today. [08:23] it has the drawback that we will then have branches owned by say ~mbp that can't be written by ~mbp [08:23] right [08:24] i think this does poorly on transparency and understandability [08:24] if you want to incrementally improve things, and aim for shortest path to tolerable source uploads from bzr [08:24] then I think you should accept that we already have 'branches own by ~mbp that ~ubuntu-devel can write to' [08:24] this neither much worse, nor much better than the status quo. [08:25] i didn't know we could already have such branches [08:25] where? [08:25] but this is on the path to bzr source uploads [08:25] because having them be adequately secure is important [08:25] poolie: if we bless, for instance, ~mbp/bzr/packaging as lp:ubuntu/bzr [08:26] then ubuntu-devel get write access to ~mbp/bzr/packaging [08:26] right, and i keep it [08:26] that's what i ithkn 516709 is about [08:26] yes [08:26] is blessing existing branches actually useful? [08:26] I can't tell if we're violently agreeing or what [08:26] poolie: unless you rework the feature from groundup, its necessary. [08:27] I'd love to see it reworked. [08:27] it seems to me, from this thread, that we should get away from blessing existing branches [08:27] why? [08:27] i mean, why would it require redoing the feature? [08:27] because there is *no* lp:ubuntu/bzr branch - at all, and no space for it. [08:27] lp:ubuntu/bzr is *defined* as being an alias. [08:29] so, given the thread so far, it seemed like it would be cleanest to make it an alias to a branch owned by, say, ~techboard or something similar [08:29] that will still require lp changes [08:29] and will need care to accomodate linaro etc [08:29] who have a different governance structure [08:29] [but the distro permissions wouldn't need reworking, if we go with 'distro permissions are all that matter for blessed branches'] [08:30] hm [08:30] good point about linaro [08:31] however, i don't think that making the branches owned by the series owner should be a problem [08:31] (I'm pretty sure that source package branches need a complete rethink for when we want to support multiple distributions. More complete than you suggest.) [08:31] wgrant: Possibly. [08:31] that just effectively formalizes that the only way to them is through the distro permissions [08:31] poolie: this sets off all sorts of alarm bells [08:31] Yippie, build fixed! [08:31] Project windmill build #3: FIXED in 1 hr 9 min: https://hudson.wedontsleep.org/job/windmill/3/ [08:32] poolie: and I really don't want to spend a lot of time painting the bikeshed [08:32] sure [08:32] and it's late [08:32] poolie: the requirement we were given was 'match the distro permissions'. [08:32] I can see two routes to that: [08:32] - genuine *distro* branches that are not aliases. [08:32] - *completely* overriding the permissions when blessed. [08:33] those both have some problems [08:33] I've seen no proposals that meet the requirement other than those two. The former is a moderate amount of work. The latter is going to be confusing if the feature is misused but easy to do and secure. [08:33] the first is apparently a fair amount of work; the second complicates the already hellacious security model [08:34] the incremental complication is near-zero. [08:34] Because the elephant is inside the room already. [08:34] i'm confused because i think i'm proposing exactly what you proposed on 18/2 [08:34] its fine to argue we shouldn't arbitarily bless branches. But thats orthogonal to meeting the requirement. [08:34] > I'd make *that* owner for these branches the owner [08:34] of the distro series the branch is for, not a celebrity. [08:35] poolie: you've elided the context [08:35] 'We probably want an owner in the same sense that a team has an owner: [08:35] someone that has administrative privilege over the thing but no direct [08:35] access to the content of the thing' [08:36] sure [08:36] anyhow, it's late [08:37] that is, people that are members of distro.maintainer can say 'this is the branch' or 'thats the branch' but cannot write to the content of the branch unless they separately have upload permissions [08:37] this is part of doing genuine distro branches [08:37] i'm trying to work out a path to unblock the problems identified in with the existing branch [08:37] <_mup_> Bug #516709: revisit official package branch permissions < https://launchpad.net/bugs/516709 > [08:37] i don't want to bake into the user model the idea that branches owned by random users can be blessed [08:37] poolie: its already there [08:38] I appreciate the motivation, but *its already there* [08:38] poolie: conflating 'fix this previous design shortcut' and 'meet the stakeholder requirements' will push your critical path way back. [08:39] hm [08:39] ok, good night [08:39] heh [08:39] i'll think about it [08:39] poolie: I'm in town in the weekend [08:40] which reminds me, I need to mail folk and say 'hi, meetup?' [08:41] jam: Around? [08:43] lifeless, so essentially you're disagreeing with jml's assertion in #8 that the usability consequences need to be fixed at the same time? [08:43] you may be write that will be faster [08:43] s//right [08:43] yes [08:43] its a bit like your thing on reviews [08:44] that asking for more work at the same time often just stalls things. [08:44] true [08:44] designing a better solution than we collectively came up with the first time around is possible. [08:45] Buts its going to be more work than finishing our initial design. [08:45] yeah [08:46] i wonder what people do own official branches at the moment and how they use them [08:46] does anyone rely on having separate access through that, or does anyone have it but not need it [08:46] it may not actually cause a problem [08:48] lifeless, this feels like the kind of thing where lp puts in a poor user model, and then spends more time explaining and supporting it than it would have taken to just finish it properly in the first place [08:48] poolie: quite possibly. its a 4 year old design [08:49] poolie: the design was optimised for delivery not explainability or usability. [08:49] like many things at the time. [08:51] ok, perhaps you're right we should split them, then separately get away from blessing existing branches [08:52] is that in fact what you're saying? [08:53] yes [08:53] I'm not trying to defend blessing at all [08:53] its fugly [08:55] good morning [08:56] ok, i see [08:56] well, really good night then [09:07] morning all [09:07] Morning bigjools. [09:07] P-a-s makes me cry. [09:08] it does that to most people :-) [09:08] Hi [09:11] wgrant: I'm sure a boy of you calibre can fix it [09:11] your* [09:11] Hah. [09:27] lifeless: How do I push a config overlay onto the FS? [09:27] wgrant: see layers.py [09:27] Thanks. [09:35] wgrant: so, sparc builders [09:35] bigjools: We worked out the problem. [09:36] wgrant: yeah, I'm interested to hear details, steve said it was a corrupt bq record? [09:36] bigjools: https://wiki.canonical.com/IncidentReports/2011-03-02-LP-build-with-wrong-status [09:36] No BQ. [09:36] But the BFJ was BUILDING, when it should have been FAILEDTOBUILD. [09:37] Very similar to Bug #717969 [09:37] <_mup_> Bug #717969: storeBuildInfo is sometimes ineffective < https://launchpad.net/bugs/717969 > [09:37] Except the opposite. [09:37] Oh, yes, sorry, a BFJ record [09:38] wgrant: the report doesn't say *why* the build candidates were not being considered [09:38] bigjools: The 80% rule. [09:38] Due to the 80% PPA rule, this phantom build prevented further sparc builds in that PPA from being dispatched. [09:38] which PPA? [09:38] It's in the description! [09:39] https://launchpad.net/~ubuntu-mozilla-security/+archive/ppa [09:39] urgh [09:39] ok I need to teach you how to write incident reports [09:39] :) [09:39] Yes. [09:39] this was all written well afterwards, so it was sort of not very well done. [09:40] the other bug is that nonvirtual PPAs should not be doing the 80% check [09:40] Indeed. [09:40] so the sBI bug is serious [09:41] I have seen it only three times. But yes, it is rather serious. [09:41] I concur that there's a missing commit [09:41] NFI where, though. [09:41] but what's worse is that we have a problem with transaction boundaries [09:41] Oh? [09:41] in that the boundary is wrong, if we can get in an unrecoverable situation [09:41] the b-m should have seen the build and fixed it [09:42] Yeah. [09:42] the first step is a minimal test cast to re-create it [09:43] But the first step of that is working out what has gone wrong. [09:43] no, it's not [09:44] it's working out a test case that re-creates it :) [09:44] How do you propose to write a test case when we don't know how to reproduce it? [09:44] Well, yes. [09:44] "what has gone wrong" is not the same as "how to reproduce it [09:44] " [09:44] but we're on the same wavelength, so no worries [09:55] wgrant: need to talk to you at some point about folding the SoyuzTestPublisher into the LaunchpadObjectFactory [09:56] the stuff the LOF does is completely wrong in some cases [09:57] bigjools: james_w had a branch to do that at one point. [09:57] And I've wanted to get that landed for a long time. [09:57] I occasionally improve STP when writing tests... but some of it is just so wrong that it needs a big rethink. [09:57] yes [09:57] I want to deprecate STP [09:57] but the LOF needs fixing, it creates bad data [09:58] Yup. [09:58] Still, not data that is as bad as the sampledata :) [09:58] [10:07] stub, hi, I'd like to create a (constraint) trigger that stops a row from being removed if it's the last one for a foreign key [10:08] stub, I have https://pastebin.canonical.com/44140/ so far: I am using a subselect to be able to add a LIMIT 1, and COUNT() to ensure I get a value in the variable ("IF FOUND" didn't work for me) [10:08] henninge: Hi. [10:08] Hi wgrant! [10:08] a foreign key references a single row... [10:08] stub, also, one thing I am worried about is race conditions [10:08] henninge: Did you land that sharing information thing just now? [10:08] stub, right, but I might have multiple rows that reference that same single row in the other table [10:08] wgrant: yes [10:08] stub, basically 1 StructuralSubscription -> n BugSubscriptionFilters [10:08] wgrant: PQM was *very* quick. [10:09] henninge: How long did it take from when you submitted it? [10:09] henninge: It should have been less than a minute. [10:09] stub, BugSubscriptionFilter.structuralsubscription is what references StructuralSubscription [10:09] wgrant: seemed like it [10:09] henninge: Great, thanks. [10:09] danilos: So normally, we would just allow the removal and have a garbo job clear out the lost rows. [10:09] wgrant: did you do that? Great work! [10:09] thanks a lot [10:09] stub, oh, I want to stop removal of the final row, to avoid race conditions [10:10] stub, iow, I stop the removal in python, but it's potentially easy to cause a race condition and still remove it [10:11] And you want to trigger an OOPS instead? [10:11] henninge: Yeah, previously we ran buildout and then bin/test to run a test that ran one line of shell... now we just do that directly in PQM. [10:11] It's a little bit faster. [10:11] just a tiny bit ... ;) [10:11] stub, will that happen if delete removes 0 rows? [10:12] stub, (and even so, I'd prefer that) [10:12] stub, or, I could allow removal and then insert another row, would that be better? [10:12] I don't understand why you want to prevent removal. [10:13] wgrant, so python code can assume that there is always at least one BugSubscriptionFilter for every StructuralSubscription (that's what we have already agreed on as the model) [10:13] danilos: The DELETE trigger can allow or block the deletion. I don't think it can make it fail silently or remove a different row. That would be lying. [10:13] danilos: Right, but that doesn't really explain why :) [10:15] wgrant, well, to clean up the semantics, allow easier linking etc (i.e. use just bugsubscriptionfilter for linking instead of having to have complex logic for choosing structuralsubscription of bugsubscriptionfilter) [10:15] wgrant, today, you can have "no filtering" in two different ways: SS with no BSFs, or SS with BSF with no filters [10:15] wgrant, we don't like the duality [10:16] Ah, right, forgot that you wanted to link them to notifications. [10:16] I don't see why a structuralsubscription must have >= 1 bugsubscriptionfilter's myself. [10:17] stub, right, the trigger I have so far seems to give me the right behaviour, I am just wondering if that's the optimal or good way to do it? [10:17] stub, see above, though we can argue if they are strong enough reasons [10:18] danilos: It doesn't work if two transactions delete the last two referencing rows in separate transactions at the same time. [10:18] Both sessions think there is another reference, both succeed, both rows are removed. [10:19] stub, that's what I was wondering; I see that one can have a CONSTRAINT TRIGGER to happen at the end of transaction, which would work better, but that one can only be an AFTER trigger, meaning it can't stop deletion (naturally, since everything else in the transaction might be messed up) [10:19] stub, also, would it be possible to explicitely lock rows that I care about? [10:19] Your just shuffling the race condition around [10:19] Yes, SELECT FOR UPDATE will do that. [10:19] But this is a complex way of proceeding. Ideally we find a non-complex way of solving the problem. [10:20] stub, agreed re shuffling race condition around, but having more safeguards in place reduces the chances of it happening, and ideally, I'd be avoiding it completely [10:20] stub, any suggestions on a non-complex way? (not that I see this as overly complex) [10:21] We just keep outerjoining with the table and not fighting the model SQL gives us. [10:22] stub, that's something I'd have to (re)discuss with my team, since we are generally relying on this already in a few places; if it's not too many places, perhaps we can go back [10:25] stub, one of the reasons we are going this way is to allow easier merging of structuralsubscription and bugsubscriptionfilter tables in the future [10:26] bigjools: Do you want to review https://code.launchpad.net/~wgrant/launchpad/generalise-add-missing-builds/+merge/51873, or should I grab someone else? [10:26] fwiw, there is nothing in the data model preventing a bug from having no linked bugtasks, but that hasn't been a problem. [10:27] bigjools: Also, is there an existing wiki page on adding new archs, or should I create one? [10:27] wgrant: I can do it in a bit [10:27] I searched around earlier, but found nothing :/ [10:28] Which I guess is unsurprising, given the questionable results of the last two new ones :) [10:28] wgrant: feel free to start one [10:34] stub, right, so your suggestion is to just ignore the potential race condition? (in our particular case, it is very unlikely, but if we don't put any safeguards in, a malicious person will be able to do it, especially if we allow API access) [10:34] danilos: So to do this. In Python, you do 'SELECT * FROM BugSubscriptionFilter FOR UPDATE', then check there is at least one other link. If there is, proceed with the deletion. [10:35] If not, render an error message [10:35] If two transactions try that at the same time, one will block until the other commits, then a serialization exception is raised and Zope retries the request [10:36] danilos: Doing it in the Python level means you catch the case with a nice error [10:36] stub, right, the reason I wanted this as the trigger is because I ran into a place where someone was not using the appropriate Python API, but instead removed a row directly using store.find(...).remove() [10:36] stub, true, agreed [10:36] danilos: The trigger would need to duplicate this logic, but it will cause an OOPS. [10:37] stub, right, but if we guard the proper API in Python, this would indicate a bug, which is exactly what we want :) though, I can't make it work in the trigger [10:38] danilos: I'd ignore the race condition - do you really care if there is a structuralsubscription with no bugsubscription? Malicious isn't an issue - you can only shoot yourself in the food. [10:38] stub, do you know how to "discard results of a SELECT using PERFORM"? [10:38] stub, right, but I hate to see food flying around :) [10:38] stub, and you are definitely right, we can cause problems for ourselves in many more different ways, so I'll just guard against it in the python code [10:38] stub, thanks [10:39] danilos: Yes, but the fallout from all the extra complexity you are adding is likely worse than the problem. [10:40] PERFORM is a pl/pgSQL statement so only useful in a stored procedure. Syntax similar to SELECT but it throws the results away. [10:40] stub, I assume that last comment is re trigger implementation? [10:41] It depends on how contrived your race condition is. [10:43] If you have a form """( ) bugsubscription a (x) bugsubscription b""", it is reasonable to assume someone clicks 'delete', thinks 'oh shit', changes the check box and clicks submit again. So catching this in Python is reasonable. [10:43] If you need to contrive an example with two browser windows opened to the correct page and the user submitting both at once, you might not want to bother. [10:43] stub, atm it's the latter, but the UI is not settled yet [10:44] I certainly wouldn't bother catching the case of the form submission being processed at the same time as some background script running that does the same removal (esp. as I doubt we would have a background script performing that removal) [10:45] stub, I can imagine the UI becoming what you described first, and it seems it's not hard to guard against it with SELECT FOR UPDATE [10:47] stub, ok, and just for fun, the trigger works with PERFORM FOR UPDATE, so something I learned today :) [10:49] Right. Should be able to simplify it too. Once you have locked all the rows referencing the structural subscription, you only care if there is a result from SELECT TRUE FROM BugSubscriptionFilter WHERE structuralsubscription=OLD.structuralsubscription AND id <> OLD.id' [10:50] I think you can lie and silently not do the deletion, but I think it is better to raise an exception here. [11:00] stub, ah, SELECT TRUE trick is a very nice one [11:00] stub, is it worth doing a LIMIT on that query as well? (just for my education, it seems it is :) [11:00] danilos: That wouldn't lock all the rows then [11:01] Oh... the check if the result exists. Yes. [11:01] stub, ah, so I would put the FOR UPDATE in this query as well? [11:01] Thought you meant on the FOR UPDATE, which should not have a LIMIT :) [11:01] stub, right :) [11:27] wgrant: DistroSeries:+queue just hove into view on the oops reports [11:27] I hate that page [11:27] bigjools: Yes. :( [11:27] I remember when it used to have 3000+ queries on it. I was quite chuffed when I got it to ~600 [11:27] that looks big now :/ [11:28] A lot of it is the same problem as +copy-packages: createMissingBuilds is shit. [11:28] there's also a massive problem when there are many packages with custom files [11:28] Yeah. [11:28] I only optimised binary and source uploads, custom packages are still bad [11:29] It's fairly easy to optimise now that we're allowed to do caching on the model. [11:29] GETs, that is. [11:29] POSTs are still hard. [11:29] we can redirect [11:29] I think it does that already [11:29] We already do. Most of the work is skipped on POST. [11:30] *Most*. [11:30] :) [12:19] bigjools, I /just/ noticed bug #610491 is fixed... but I can't get it to work... getPublishedSources(package_signer=IPerson)... i would have thought would return a packageset? [12:19] <_mup_> Bug #610491: [API] Please expose getPublishedSources(package_creator,package_signer) < https://launchpad.net/bugs/610491 > [12:20] Daviey: why do you think it was fixed? [12:21] getPublishedSources returns publications, not packagesets BTW [12:22] bigjools, ah yeah... that is what i want [12:23] we can add the filtering as per the bug, but it's low priority [12:23] StevenK, hi, are you still awake? :) [12:23] I'm sure everyone wants a fast, non-oopsing Launchpad first :) === danilos changed the topic of #launchpad-dev to: https://dev.launchpad.net/ | firefighting: - | On call reviewer: danilos | https://code.launchpad.net/launchpad-project/+activereviews [12:26] bigjools, i thought it was fixed because i confused it with bug #372704 [12:26] <_mup_> Bug #372704: expose Signed-by and Changed-by via API < https://launchpad.net/bugs/372704 > [12:26] heh [12:27] ETOOMANYSIMILARBUGS! :) [12:35] My laptop is busted. [12:36] danilos: Hai? [12:36] StevenK, you were listed as OCR, I've removed you thinking you might be done with that [12:37] danilos: It's 1130pm, I so am :-) [12:37] StevenK, ok, cool, enjoy the night then :) [12:39] jml: that sucks [12:41] jam: Hi. [12:42] hi wgrant [12:42] jam: We have another chance to roll out the forking service next week. Do you think we should try? [12:43] wgrant: I would like to land one more patch, but I think I can make it by next week [12:43] https://code.launchpad.net/~jameinel/launchpad/lp-serve-child-hangup/+merge/50055 [12:43] but poolie approved it, pending switching to signal.alarm() [12:43] jam: Right, I've seen that sitting around. [12:43] I think I would prefer an explicit handler. === henninge is now known as henninge-lunch [12:43] well, it seems approved from the conversation, I'll certainly put it up again for review [12:44] wgrant: something to handle SIGALRM? [12:44] Yeah. [12:44] But I guess it doesn't really matter. [12:45] wgrant: it makes the testing easier [12:45] since otherwise I have to worry about it when testing that specific code [12:45] True. [12:45] I'll think about it [12:45] I can certainly just install a handler for the test itself [12:48] Thanks. [12:49] wgrant: any idea why ^C in the middle of the test cancels the current test with OK, rather than generating a traceback and failure where it is blocked? [12:49] I think this changed with testtools [12:49] maybe jml ^^ is better to ask (though prob not in this timezone :) [12:50] jam: it's complex. depends on the test. [12:50] jml: I've seen it on just about all test runs I've tried [12:58] wgrant: do we have a way to tell that a process, which is not a direct child, has exited? [12:59] jam: "We" meaning victims of POSIX? I don't think so. [13:00] poll ps :P [13:00] jml: yeah, that's what I thought was required. which... I'm not doing. But thanks for confirming [13:01] jam: I guess you could do something like upstart [13:02] * danilos -> food [13:10] wgrant: https://code.launchpad.net/~jameinel/launchpad/lp-serve-child-hangup/+merge/51893 [13:10] if you want to give it a look ovre [13:10] the only other thing I can think of, is wanting to have the full TIMESTAMP and PID in log files, rather than mutter's default of time-since-started [13:10] but that doesn't seem like a blocker for rollout [13:12] jam: Thanks. I'll have a look tomorrow. === Ursinha-afk is now known as Ursinha [13:34] hey, benji, i'd like to talk about the way in which exceptions in launchpad are turned into http responses in lazr.restful [13:35] the more i look at this, the more it looks like two systems that do the same thing [13:35] for instance, you filed bug 631711 [13:35] <_mup_> Bug #631711: Ability to specify HTTP result code when using lazr.restful.error.expose < https://launchpad.net/bugs/631711 > [13:35] but we have that--it's the webservice_error() decorator [13:35] but the webservice_error() decorator on its own doesn't work [13:36] you have to call expose() on the exception object as well [13:36] and i can't change webservice_error() to call expose() on the exception class, because expose() is for objects [13:36] so you have to change it in two places--call webservice_error() on the class and expose() on the object [13:36] this has been bothering people for a while and i think i'll do something about it [13:37] but, i'd like to check in with you since it looks like you are responsible for the other system [13:38] leonardr: I'm on a call at the moment. I'll be off in about 5 minutes. [13:38] sure [13:43] Yippie, build fixed! [13:43] Project devel build #494: FIXED in 5 hr 11 min: https://hudson.wedontsleep.org/job/devel/494/ [13:46] gary_poster: thanks, we'll need it; I'm almost certain we're upside down, but there's always the option of a short sale [13:46] Project db-devel build #411: STILL FAILING in 6 hr 4 min: https://hudson.wedontsleep.org/job/db-devel/411/ [13:46] gary_poster: thanks for the pointer, it's hard to find a good agent [13:46] benji, ack, and following you around on the channels is exciting ;-) [13:46] LOL [13:47] ok, I really need to do something about that [13:47] * benji starts a local chapter of Channel Hoppers Anonymous [13:48] leonardr: the root difference is that one is about exposing exception classes and the other is about exposing exception instances (especially instances of built-in exceptions) [13:49] right [13:49] the problem is that on its own, exposing an exception class does absolutely nothing [13:49] you also have to expose the instance [13:49] oh, you do? I thought that if the framework caught an instance of an exposed exception class then it would handle it specially [13:50] no, it doesn't [13:50] i don't know if it used to and doesn't anymore, or what [13:50] hmm [13:50] ideally there would be one method that you could call within a class definition, or on a class, or on an instance [13:51] two courses of action suggest themselves: make exposing an exceptoin class actuall do something or do away with the class-based mechanism alltogether [13:53] the class-based mechanism is very useful because you can take care of the whole thing in one place [13:55] the reason you filed that bug is because expose() only works for exception classes that give a 400 error code [13:55] so... let me try to come up with a solution [14:02] henninge: standup? [14:02] abentley, adeuring: Can we do skype? [14:02] henninge: sure. [14:02] surwe [14:03] * adeuring needs to start a machine where skype is installed. 2 minutes, please [14:29] benji: argh, we actually have _three_ systems, only one of which works [14:30] leonardr: yow [14:30] but two of the systems are alternate spellings === abentley changed the topic of #launchpad-dev to: https://dev.launchpad.net/ | firefighting: - | On call reviewer: danilos, abentley | https://code.launchpad.net/launchpad-project/+activereviews [14:42] benji: do you know of any reason why it shouldn't be allowable to call expose() on an exception __class__ (or do the equivalent--basically, force a class to implement a "error handled specially by the web service" interface [14:44] leonardr: that seems reasonable; I'd make it a class decorator (as well as a normal function) so you could use it either way [15:00] sinzui_: will your connection stay up long enough to answer a question? :) [15:01] doubtful. unity cannot decide to show menu in the menu bar or the window. My windows are shaking with anger [15:02] sinzui_: well let me try. rvba is having problems using create_initialized_view in a test with a principal of None [15:02] he is rendering content? [15:02] https://pastebin.canonical.com/44154/ causes https://pastebin.canonical.com/44155/ [15:02] yes [15:03] sinzui_: calling render on the view triggers a LocationError: 'name_link' [15:03] sinzui_: (Hi by the way ;-)) [15:03] He needs to pas the principal because menus links and login often needs to get the user form the interaction [15:04] what about an anonymous user? [15:05] Most Lp pages will not render with an anonymous user. The interaction is not setup right with anon [15:05] hah [15:05] sinzui_: ok [15:05] didn't know that [15:05] he can pass the arg like [15:05] I guess I was lucky to find a page that does when I've done this [15:05] principal=distro.owner to hack the case [15:05] sinzui_: login(someuser) won't do? [15:05] the login() will still work for permission [15:06] no [15:06] rvba: login() is for the zope interaction [15:06] rvba, We prefer login_person() so that we have the person to pass as the principal [15:07] sinzui_: got it [15:08] bigjools: I though every security check was "zope interaction" [15:08] rvba: yes, logging in sets up the user for the interaction [15:09] bigjools: what is the part that is *not* zope interaction? [15:09] benji: see if this makes sense to you: http://pastebin.ubuntu.com/574501/ [15:09] * benji looks [15:09] rvba: anything that's not using database objects, views and utilities generally [15:10] so not much :) [15:10] bigjools: ok, good to know [15:11] sinzui_: thanks for the clarification [15:13] leonardr: that sounds like it accurately describes the status quo [15:13] ok, now to figure out a better system [15:13] i'm kind of thinking of a system in which we do a named operation based on __lazr_webservice_error [15:14] er, a named adapter lookup [15:14] since the way we describe this class of exception is by saying what response code we want to send instead of 500 [15:14] so if you could do @error_status(400) on an exception class, or expose(exception, 400) on an instance [15:15] lazr.restful would do an named lookup adapting to IWebServiceErrorView, '400', and you would get a ClientErrorView === danilos changed the topic of #launchpad-dev to: https://dev.launchpad.net/ | firefighting: - | On call reviewer: abentley | https://code.launchpad.net/launchpad-project/+activereviews [15:57] abentley, hi, do you mind taking a look at https://code.launchpad.net/~danilo/launchpad/devel-bug-720826-delete-race/+merge/51890? [15:58] danilos: sure. OTP. [15:58] abentley, great, thanks, I'll be around for discussion as well [16:14] benji: what do you think of this? http://pastebin.ubuntu.com/574528/ [16:14] (skip to "a better system") [16:15] danilos: back. [16:15] abentley, generally, the branch is quite simple; few things I have on the top of my head: perhaps I should split the delete_final test into a test-per-attribute, though I am not sure about that; second, I am not testing the race condition handling === salgado is now known as salgado-lunch [16:16] "I have on the top of my head" is an interesting way to put it, though :) [16:18] danilos: I haven't seen "FOR UPDATE" before. Is that why you didn't use Storm expressions? [16:18] abentley, yeah [16:18] abentley, that's how one locks rows in Postgres, I just learned that today from stub :) [16:19] danilos: cool way of verifying that an object is deleted. I've been grabbing the ID and then querying for it. [16:20] abentley, this should handle race conditions where someone tries to delete it from a different place, the check succeeds in both, and then you try the delete; this way, one of them gets locked out until the check and delete are done [16:21] danilos: I meant using self.assertIs(None, Store.of(bug_subscription_filter)) [16:21] abentley, ah, right :) [16:22] danilos: what is getDescriptionFor? [16:23] abentley, oh, it returns you things like Choice(blahblah) object from the interface; I've looked it up using help(Interface) [16:24] abentley, perhaps I could say it returns an attribute "descriptor" [16:24] danilos: Okay, cool. [16:26] danilos: if I were writing it, I'd probably look up the default on a separate line, just to avoid clutter. But that's a matter of personal taste. [16:26] danilos: r=me. [16:27] abentley, a thought did occur to me as well, so since it's not just me, I'll do it :) [16:30] abentley, and thanks! [16:30] danilos: np. === al-maisan is now known as almaisan-away === matsubara is now known as matsubara-lunch === salgado-lunch is now known as salgado === beuno is now known as beuno-lunch [17:29] Ursinha: matsubara-lunch: whats the landing process for qatagger ? [17:30] lifeless, it's all explained on the README file (I believe) [17:30] lifeless, wait, let me find the wiki page [17:31] Ursinha: its not :) [17:31] Ursinha: I have this branch diogo approved, I would like to land it [17:31] lifeless, first what I do is to merge it on trunk [17:31] after that, a make dist [17:32] I pick the package and copy over to devpad, untar it, make install [17:32] and create a qa link for that folder [17:32] change the cron entry and watch [17:32] ah, I copy the files last-revno-* from current to qa folder [17:32] benji: got some time for more brainstorming? [17:33] and create a link for logs and current-milestone [17:33] I'm sure that's explained somewhere, I guess a wiki page I wrote [17:33] just a moment [17:33] lifeless, did you manage to get access to lpqateam? [17:33] on devpad, that is [17:33] Ursinha: haven't moved that forward [17:33] Ursinha: thats *deploy* [17:34] Ursinha: how do I get my change into trunk ? [17:34] ah [17:34] for me that's land :) [17:34] Ursinha: I like it :) [17:34] lifeless, if that's approved, merge into devel branch and push [17:34] then you have to deploy [17:34] that's what I described [17:34] Ursinha: ah, so I can't run the test suite at the moment [17:34] some buildout error [17:35] perhaps you could check I haven't broken a test and push it to devel if its ok ? [17:35] ah, I know [17:35] I guess that's trying to find a launchpadlib version that isn't there anymore, just a moment [17:40] yes [17:40] Error: Couldn't find a distribution for 'launchpadlib==1.6.3'. [17:40] lifeless, it seems mars removed his pypi folder that contains the dependencies, I'll add one to ~ursula [17:40] you can fix the find-links to point to ~ursula, I'll fix that [17:41] Ursinha: can you merge this one into trunk - I need to go chase two in-progress timeout bugs [17:42] lifeless, of course :) [17:42] Ursinha: I appreciate you fixing the process blockers immediately - thats great stuff. [17:42] leonardr: sure, give me about 10 minutes [17:42] k [17:42] Ursinha: thanks [17:42] lifeless, np [17:46] bigjools: are you around for much longer ? [17:46] lifeless: 10 mins [17:46] I'm having trouble converting archive.getPublishedSources to storm [17:46] ho ho :) [17:47] I want to do that to change the prejoins to set based eager loaders [17:47] why are you doing that? [17:47] ok [17:47] https://bugs.launchpad.net/launchpad/+bug/727560 [17:47] <_mup_> Bug #727560: Archive:EntryResource:getPublishedSources < https://launchpad.net/bugs/727560 > [17:47] 15 second query atm [17:47] meep [17:47] you could say that [17:50] bigjools: are there unit tests for this, or just 'archive.txt' ? [17:51] lifeless: just the doctest :/ [17:51] thanks [17:51] feel free to rip it out [17:51] migrate to unit tests ? [17:51] however, that method is used *everywhere* - loads of other code and tests use it [17:51] that would make it a lot easier to debug [17:51] yes, migrate if you want [17:52] archive.txt is way too big [17:52] I'll have a look see [17:52] thanks [17:52] np [17:55] lifeless: did you want any more help? === matsubara-lunch is now known as matsubara [17:58] abentley, do you have time to review https://code.launchpad.net/~sinzui/launchpad/sane-definition-status-0/+merge/51932 [17:58] too late if you do, I'm off :) [17:58] good night all [18:15] benji: i think i've reduced my remaining problems do "why doesn't this component lookup succeed?" want to take a look? [18:15] leonardr: sure [18:15] ok, pushing [18:16] lp:~leonardr/lazr.restful/consistent-error-exposing === beuno-lunch is now known as beuno [18:16] if you run bin/test there's a breakpoint right at the bit where everything goes south [18:17] look at the SystemErrorView registration in configure.zcml. that's the one that's failing [18:17] and look in _resource.py, the except clause in ReadWriteResource.__call__ [18:17] that's where the lookup happens [18:19] sinzui_: happy to. [18:20] i've manually verified IException.providedBy(e), IWebServiceLayer.providedBy(self.request), and Interface.providedBy(IWebServiceErrorView) [18:22] hm, it succeeds if i copy simple.py and remove the thing i'm trying to adapt to [18:22] but i get the wrong status code [18:22] i get SystemErrorView [18:23] rather than WebServiceExceptionView [18:25] looking at it now [18:26] i think i got it, actually. i'm just not sure why i had to start adapting to Interface (presumably the default) instead of IWebServiceErrorView [18:27] let me get it working and then you can critique the working version [18:35] benji: pushed working version [18:35] basically, WebServiceExceptionView is now the default view for all exceptions unless another view is registered [18:35] looking [18:35] and if the exception happens to have been annotated (by any mechanism), WebServiceExceptionView will defer to the annotation when it comes to the status of the response [18:37] and the exception handler does a lookup for any exception, not just for a client error [18:39] sinzui_: Why are you addressing this issue by providing a new enum? Why not just check that the value is in a set of acceptable values? [18:43] leonardr: looks good very good to me [18:44] benji: ok, maybe you'll like this other change, which takes it one step further... [18:49] benji: check the new version. it *always* turns an exception into a view, and gets rid of IWebServiceError altogether [18:50] k [18:54] leonardr: wouldn't that mean that an "accidental" exception would result in a 400? It seems a 500 would be more appropriate. [18:54] benji: an accidental exception would turn into an un-annotated WebServiceExceptionView [18:54] and its status would be the default, 500 [18:55] leonardr: ok, sounds like a plan; simpler code and it behaves the way we want [18:56] all right, i'll fix the test failures and submit a formal mp [19:04] benji: this does mean that the publisher will not be handling any errors. is that ok? [19:05] leonardr: will we still get OOPSes on unannotated exceptions? [19:06] leonardr: hmm, does our custom publisher do retries on DB availability errors (like conflicts)? If so, it won't get the error to know that the error happened. [19:07] lifeless: i don't know. lazr.restful will set X-Lazr-OopsId if request.oopsid is present, but i imagine there's more to "getting an oops" than that [19:07] maybe I should install unity-2d. The constant shaking windows is making me ill. [19:07] leonardr: I mean the server side mechanism where raising is called() [19:08] leonardr: lifeless makes a good point; we may want to only catch the errors that were marked as "not exceptional exceptions" instead of intercepting potentially hair-on-fire exceptions (MemoryError comes to mind) [19:09] benji: you credit me with too much... that is also an excellent point [19:09] benji: what if we raised exceptions if their error code was 500, for whatever reason? either because no other code was specified or because the error was explicitly marked as a server error? [19:09] benji: my concern was more prosaic - I want our feedback loop for fixing crasher bugs to remain intack. [19:10] leonardr: benji: may I ask what problem you're trying to solve ? [19:10] s/trying to solve/solving/ [19:11] leonardr: I'm not sure I understood that part ("what if we raised exceptions if their error code was 500...") [19:11] lifeless: we have 2.5 systems for telling lazr.restful to treat a given exception/exception class as a certain http response code, and only one of them works [19:11] that's done; this is ancillary cleanup work, in which the question is "which exceptions can lazr.restful handle on its own, and which should it delegate to the publisher?" [19:12] benji: so, earlier we effectively re-raised exceptions if their status code was not 400 [19:12] oh. Have you considered 'if lazr.restful does not explicitly know, its not its business' ? [19:12] leonardr: I favor reverting that last bit (r181) [19:13] lifeless: yes. the question is what it means to 'explicitly know' this [19:13] leonardr: that would seem to be dependent on the system you've chosen for telling lazr.restful how to determine result codes for exceptions [19:14] benji: if i revert the last bit, then exceptions will be re-raised unless they have been annotated with a return value. i don't think that's necessary or sufficient [19:14] leonardr: that's not the way I read the code. It seems to me that we reraised if the exception wasn't marked as "an exception that the client caused". [19:14] an exception that the client caused -> 4xx response code [19:14] we can make that explicit [19:15] let me do some code and tests and show you waht i mean [19:15] I think we're talking about this bit, right? http://pastebin.ubuntu.com/574616/ [19:16] benji: yeah. the comment is wrong [19:16] leonardr: the comment before or after the change is wrong? [19:16] before [19:17] if an exception is annotated as 301 or 503 (or 200 for some reason) [19:17] lazr.restful would have handled it [19:17] even though its decoration does not indicate that it's the client's fault [19:17] ah, ok [19:17] i think it makes sense to only handle exceptions where the response code is 4xx [19:17] and let the publisher handle the rest [19:18] that way, if you explicitly decorate an exception as 5xx, it will become an oops [19:18] the exceptions-as-control-flow thing was throwing me [19:19] leonardr: +1 [19:20] the issue there would be that (if I'm understaning all the mechanisms correctly) that if you decorate an exception as a 50X but the code in question re-raises the exception, the publisher's error view may well retorn something other than 50X (say it may have 500 hard-coded). Does that jive with your understanding? [19:21] benji: yes, that might happen [19:21] i think that failure mode is minor compared the the failure modes that might happen if lazr.restful handled all the errors [19:24] It seems to me that since we're not crying out for more controll over 500s, we should designate this decoration mechanism as for status codes from 400 and down, that way we don't have to get our fingers in the "a *real* error happened reporting story" while also greatly improving the "client did something wrong" story. [19:25] i think just handing 4xx is good enough--that's effectively what we were doing before (except we only handled 400) [19:25] and the publisher may have some way of handling exceptions that are marked as redirects, or something weird like that [19:26] +1 [19:37] ok, i think we have a winner [19:45] Project db-devel build #412: STILL FAILING in 5 hr 17 min: https://hudson.wedontsleep.org/job/db-devel/412/ [19:45] oh yay, tz boobytraps in archive.getPublishedSources [19:51] benji: https://code.launchpad.net/~leonardr/lazr.restful/consistent-error-exposing/+merge/51946 [19:51] punt to abentley if you don't want to do this right now [19:53] leonardr: I'd love to do the review, but I have some pressing work I really need to put some time into [19:54] abentley, can you take a look? [19:55] Project windmill build #5: FAILURE in 1 hr 13 min: https://hudson.wedontsleep.org/job/windmill/5/ === Ursinha is now known as Ursinha-food [20:41] matsubara: ping [20:42] hi thumper [20:42] matsubara: sorry for not getting back to you earlier [20:42] matsubara: you were asking about some recipe testing [20:42] matsubara: what do you need from me? [20:43] thumper, can we have a quick call? [20:43] matsubara: sure... skype or mumble? [20:43] thumper, let's try mumble [20:44] matsubara: ok [20:44] I'm on the product channel [20:45] thumper, https://dev.launchpad.net/QA/ExploratoryTesting [20:59] benji, i'm not sure why we have both webservice_error and @error_status--they do the same thing with slightly different syntax. do you have any ideas? should we standardize? [21:01] leonardr: nope, no idea; I'm pretty sure both existed when I started looking at them [21:03] * leonardr will leave it alone for now [21:04] leonardr: mumble? [21:04] thumper, yes! === wallyworld___ is now known as wallyworld [21:26] thumper: https://code.launchpad.net/~leonardr/lazr.restful/consistent-error-exposing/+merge/51946 [21:39] leonardr: ok, I'm left a little confused with your merge proposal [21:40] leonardr: I think I was fine up until the last test [21:40] which left me questioning [21:40] thumper: an alternative is to prohibit the caller from marking an exception with a response code not in the 4xx or 5xx series [21:41] leonardr: in test_decorated_exception_instance_becomes_error_view when the resource is called, you get a result [21:41] leonardr: but in the last test, when you call the resource it raises [21:41] why? [21:42] thumper: because of line 27 in the diff [21:42] if the response code is 4xx we handle the response (since the error was the client's fault) [21:42] in all other cases we let the publisher handle it [21:42] ahh [21:42] ok [21:42] although the 200 case seems a bit of an edge case [21:43] I guess it is there for completeness [21:43] thumper: yeah, we could also prohibit it, like i said [21:43] but if the user wants to do that, we don't really know what they're trying to do, so better let the publisher handle it [21:43] approved [21:43] great [21:44] i'll land but not do a release, as integrating into launchpad may require other changes [21:45] ok === Ursinha-food is now known as Ursinha [21:56] matsubara: please triage new bugs :) - 728059 [21:57] lifeless, will do [21:57] I just filed them [21:59] i just got Error 324 (net::ERR_EMPTY_RESPONSE): Unknown error. from https://bugs.launchpad.net/~canonical-bazaar/+assignedbugs [21:59] good morning btw [22:01] wallyworld: BrowsesWithQueryLimits is your best option for that test [22:07] lifeless: yeah. i'll need to modify it to suit. [22:08] currently it takes a context object and i need to give it a url [22:09] i'm keen for the branch to land to improve +daily-builds performance even more [22:12] wallyworld: I would give it an object and a view selector [22:12] defaulting to +index [22:12] wallyworld: then you can use the Root object and +daily-builds [22:13] ok. haven't really thought too much about it yet. have to context switch back. sounds good what you're suggesting [22:14] wallyworld: or you could make a parallel one (perhaps sharing code) that takes a url [22:14] wallyworld: we need to talk separately about your bug task search patch [22:15] ok. i'll take another look and see if i can decorate the rs outside the search implementation [22:15] wallyworld: can you tell me what you need to do ? [22:15] basically the current search functionality only returns bugtask and i need to also access tables from the query eg BugBranch [22:15] [this is why I don't like itsy bitsy landings without context - they are harder to review] [22:15] wallyworld: /why/ [22:16] so i can, for each bugtask, create a data structure that allows be to lookup a bugtask based on branch id, so i can find all info i need with one query [22:17] s/be/me [22:17] otherwise i would have to do one query per branch [22:18] wallyworld: this is the privacy thing you're fixing ? [22:18] wallyworld: I can give you a one liner to do what you need a different way [22:18] and when using the search method and passing in linked_branches=any(), the BugBranch table is in the query but i don't get to access it currently [22:18] yes [22:20] visible_bugs = getUtility(IBugTaskSet).searchBugIds(BugTaskSearchParams(user=self.user, bug=any(*(bug.id for bug in bugs_you_have_today)) [22:21] this is one extra query, but should be fairly lean, and it will be constant. [22:22] the problem with passing a result decorator into the query is that you actually want a wider tuple out, which such a decorator won't give you [22:23] actually mine does. it works :-) [22:23] and doesn't cause any extra sql [22:24] but i'll take a look at your suggestion [22:24] pastebin it ? [22:25] lifeless: https://pastebin.canonical.com/44193/ [22:29] lifeless: with your suggestion, at first look, it still seems like i won't have access to the info i need with one query, which is bugtasks for each branch === matsubara is now known as matsubara-afk [22:30] i'll look in more detail in case i'm missing something [22:30] ok [22:30] uhm, I'm worried about the interface basically [22:30] there is a compiler in there that is fiendish [22:31] what you have will actually cause more queries [22:31] if you have *visible* private bugs [22:31] because their visibilty won't be cached [22:31] hmm. ok [22:31] if you tested with *invisible* private bugs, it will seem fine. [22:32] because you're fixing half the problem [22:32] the main problem is that the search() assumes you only want bugtasks and not other stuff from the query [22:32] sinzui: No critical issues with your CSS changes? [22:32] wallyworld: thats what its designed for [22:32] no critical [22:33] OK, we shall deploy then. [22:33] Thanks. [22:33] hmmm. don't like the design then :-) [22:33] wallyworld: there are some options - eager load (e.g. into a cache property) the branches; ask for related things in the result [22:33] wallyworld: Here is another problem with your fix [22:33] wallyworld: you'll get multiple rows back for the same bugtask. [22:35] wallyworld: (if branch A and B are both linked to bug C, then you'll get a row for (C, A) and for (C, B)) [22:35] so long as the related branch in each row is correct that won't matter [22:35] i building a dict of branch->related bug tasks [22:36] wallyworld: why won't it matter? It will still cost cpu: cache refreshing of duplicate rows in storm. [22:36] but the rows for (C,A) and (C,B) are valid and required [22:36] wallyworld: I would do a separate single query on bugbranch for (BugBranch.bugID.is_in(bug_ids), BugBranch.branchID.is_in(source_branch_ids)) [22:37] sounds reasonable. get the bug ids from the search() and then another query to load the required info [22:38] if visible private bugs were cached, my way would require less queries :-P [22:40] thanks for the input. i'll get back to it after food and after i resubmit the +daily-builds perf improvement stuff [22:41] Looking at the stable merge failure. [22:45] sinzui: Oh, yesterday I also killed Windmill and made PQM fast. [22:45] wgrant: oh just that, I'm surprised you remembered at all! [22:45] wgrant: you made 5 minute PQM (a merge in less than 5 minutes) actually work as promised! [22:46] I think we gave up on that 18 months ago [22:46] Sadly prod config landings are still slow. [22:47] But they are rare enough. [22:47] hi all [22:47] Oh no :( [22:47] Now PQM is going to spam us more often with stable->db-devel conflicts :( [22:48] wgrant: the 5 min for main launchpad stuff is amazing. i think we can all bare the configs being longer. :-) [22:55] Huh. [22:55] A test went missing when r12399 was automatically merged into db-devel. [22:55] orly? [22:56] lifeless: Compare the stable r12399 and db-devel r10214 diffs, lib/lp/registry/tests/test_person.py [22:57] wgrant: if it was already in db-devel [22:57] then it wouldn't show in the diff [22:57] It's not in db-devel now, and doesn't seem to have been removed since. [22:58] ok, thats more worrying then [22:58] But let's see the tree at that rev... [22:58] No, not there in 10214 either. [22:59] ok, thats whack [22:59] if you merge the two revs locally, does bzr do that? [22:59] Waiting for that to happen. [22:59] Warning: criss-cross merge encountered. See bzr help criss-cross. [22:59] Heh. [22:59] And it's missing. [23:00] So the criss-cross kills it without a conflict. [23:00] This is sort of really bad. [23:00] we can teach pqm to abort on criss cross [23:01] I'll revive the test now and look through the stable->db-devel diff to check that nothing else is missing. [23:01] lifeless: not just whack, wiggida wiggida wiggida wack! [23:01] hi daddy :) [23:02] lifeless: well it's a criss-cross problem :P [23:02] ...gonna make you jump, jump. [23:03] spiv: oh, I missed the ref. [23:03] spiv: 5:45 starts on thursday ;) [23:03] I think I may be too young for this. [23:05] wgrant: http://en.wikipedia.org/wiki/Kris_Kross [23:05] Ah, yes, 1992. [23:06] wgrant: http://www.youtube.com/watch?v=SmO4xdnf6Gk, especially around the 40 second mark [23:07] wgrant: That was the year you were born, right? [23:07] huwshimi: '91. [23:07] wgrant: oh, my apologies :D [23:08] I am at least a few months older than this... abomination. [23:08] no excuses then [23:09] wgrant: That would only make you a little younger than they where when the song came out [23:12] wow [23:12] I'm not an apple afficiondo [23:12] but this is sweet hardware: http://www.apple.com/ipad/features/ [23:13] Yes, even I have to admit that :( [23:14] * huwshimi is wondering if Ubuntu will work on his released-last-week MacBook Pro [23:15] * huwshimi is about to find out [23:19] bah [23:20] how is %foo% meant to be escaped for storm [23:20] File "/home/robertc/launchpad/lp-sourcedeps/eggs/storm-0.18-py2.6-linux-x86_64.egg/storm/database.py", line 366, in _check_disconnect [23:20] return function(*args, **kwargs) [23:20] File "/home/robertc/launchpad/lp-branches/working/lib/lp/testing/pgsql.py", line 115, in execute [23:20] return self.real_cursor.execute(*args, **kwargs) [23:20] ("SELECT COUNT(*) FROM SourcePackageName, SourcePackagePublishingHistory, SourcePackageRelease WHERE SourcePackagePublishingHistory.archive = %s AND SourcePackagePublishingHistory.sourcepackagerelease = SourcePackageRelease.id AND SourcePackageRelease.sourcepackagename = SourcePackageName.id AND (SourcePackageName.name LIKE '%' || 'cd' || '%')", (9,)) [23:20] IndexError: tuple index out of range === salgado is now known as salgado-afk [23:28] argh, I'm blind [23:28] sqlobject vs storm difference on % escaping. Baby saviour wept. [23:29] Are you blind for not seeing it, or blind because you saw it? [23:30] blind having fixed it [23:30] " '%%%%' || %s '%%%%'" % quote_like(name) [23:31] damned if I know why the change was needed [23:31] Yippie, build fixed! [23:31] Project windmill build #6: FIXED in 49 min: https://hudson.wedontsleep.org/job/windmill/6/ [23:36] oh fail [23:36] julian used bool(resultset) all over with getPublishedSources [23:38] wgrant: pop quiz [23:38] binary_copies) | clauses.append( [23:38] lib/lp/soyuz/scripts/packagecopier.py line 564 [23:38] would .one() or .first() be appropriate? [23:39] lifeless: Which line? [23:39] source_copy = source_in_destination[0] [23:39] ? [23:39] yes [23:39] first [23:39] one will cras [23:39] +h [23:40] is a failover server? [23:40] \o/ test passes [23:45] Yay, appservers updated. [23:46] hah, and a trivial query reduction in distroseriesdifference.py [23:46] Oh? [23:46] *sql is not python* [23:46] second hunk [23:46] http://pastebin.com/0ijzdQjD [23:48] Ah [23:51] also in findSourcePackageRelease [23:51] same bug [23:52] and getSourceAncestry [23:53] also know as LBYL is slow [23:56] actually, [23:56] - if pubs: [23:56] + try: [23:56] return pubs[0] [23:56] - else: [23:56] + except IndexError: [23:56] is cleaner [23:59] abentley: you still there?