[00:09] wgrant: Down to one failure in my no-more branch [00:11] Excellent. [00:21] How did this fail before? I don't get it [00:25] Hm? [00:26] wgrant: http://pastebin.ubuntu.com/674932/ [00:27] StevenK: It possibly was relying on the branch not existing on disk. [00:27] StevenK: Try checking the output on devel. [00:30] ERROR Job execution raised an exception. [00:30] -> http://localhost:33770/93/bLqgyciAI0wS9vOJ96yNSjGCqMV.txt (Not a branch: "lp-internal:///~person-name-870614/product-name-870620/branch-870616".) [00:32] Whereas that doesn't happen for my change [00:34] So I guess it wants a job for a branch that doesn't exist on disk? [00:35] Right. Use a RevisionAddedJob or so. [00:35] Since RevisionMailJob doesn't touch the disk any more. [00:44] wgrant: Fixed. [00:44] StevenK: Using RevisionAddedJob? [00:44] wgrant: Yes [00:45] Great. [00:45] Now to craft evil queries [00:52] Should be pretty trivial. [00:52] And there's only like 11000. [00:52] Possibly less, if some of them have preview diffs. [00:53] wgrant: what are you doing ? [00:53] lifeless: StevenK is abolishing review diffs. [00:54] SELECT count(distinct(review_diff)) from branchmergeproposal where merge_diff is null; => 5213 [00:55] did you talk to aaron about difftasticd ? [00:55] s/d // [00:56] Not about difftastic, that had utterly slipped my mind [00:56] well, review diffs and difftastic are pretty linked [00:56] They are? [00:56] I'd hate to see that deleted rather than fixed. [00:56] perhaps I misunderstand [00:56] I thought difftastic was used for incremental diffs? [00:57] lifeless: Those are incremental diffs. [00:57] I probably do misunderstand :) [00:57] Review diffs are what came before preview diffs. [00:57] Back in the days of lp:mad. [00:57] Named in DB as *static*diff [00:57] <= 2010/01 [00:57] ah [00:57] cool cool thanks! [00:58] wgrant: So, we have a query to clear review_diff where preview_diff is not null [00:59] launchpad_dogfood=> UPDATE branchmergeproposal set review_diff = null where review_diff is not null and merge_diff is not null; [00:59] UPDATE 6860 [00:59] launchpad_dogfood=> SELECT merge_diff IS NOT NULL, review_diff IS NOT NULL, COUNT(*) FROM branchmergeproposal GROUP BY merge_diff IS NOT NULL, review_diff IS NOT NULL; ?column? | ?column? | count [00:59] ----------+----------+------- f | f | 3061 f | t | 5251 t | f | 36774 t | t | 6860 [00:59] Fail. [00:59] Haha [00:59] But there are 6860 with both, 5251 with just a staticdiff. [01:00] Right, so the update for 6860 rows is fine [01:02] Now a query to create previewdiffs [01:02] Are preview diffs always used in preference to review diffs? [01:02] From my reading of the code, yes [01:06] There's only 746 MPs with a staticdiff, no previewdiffs and aren't merged [01:07] And 4505 where date_merged is not null [01:07] Even merged ones would be nice. [01:07] I'm just a little lost how to loop on branchmergeproposal, creating rows in previewdiff and updating [01:08] Project db-devel build #826: STILL FAILING in 5 hr 29 min: https://lpci.wedontsleep.org/job/db-devel/826/ [01:18] wgrant: StevenK: dumb question of the day - what's the difference between a review diff and a preview diff? [01:18] wgrant: reviewdiffs are static and ancient [01:18] RARGH [01:18] wallyworld_: ^ [01:19] StevenK: ah, so review diffs do not get updated if someone pushes a change to the branch after it is put up for review? [01:20] wallyworld_: They have not been generated in the database for over 18 months, it's probably not worth learning about, TBH [01:20] ok [01:20] delete away.... :-) [01:22] Bah. [01:23] No index on previewdiff(source_revision_id, target_revision_id) [01:24] wgrant: Do we need one? [01:26] StevenK: Probably. We should try on staging without it. [01:26] But DF definitely does. [01:26] http://paste.ubuntu.com/674955/ [01:26] Or you end up with thousands of seq scans of the table. [01:27] Right [01:27] wgrant: Is that in a transaction? [01:28] StevenK: Yes. [01:28] Right [01:28] Should also set review_diff = NULL, I guess. [01:28] But it's fast enough to do all this directly. [01:28] I have a query for that already [01:28] If we do that at the end, we can use it to clean up for us [01:29] We might as well do it in the one UPDATE. [01:29] UPDATE branchmergeproposal SET merge_diff = (SELECT id FROM previewdiff WHERE from_revision_id = source_revision_id AND to_revision_id = target_revision_id) FROM staticdiff WHERE staticdiff.id = review_diff AND merge_diff IS NULL; [01:29] UPDATE branchmergeproposal set review_diff = null where review_diff is not null and merge_diff is not null; [01:29] Are the two of them [01:30] I guess we need to unset all of them, whereas my UPDATE only needed to consider those without a preview_diff. [01:30] So indeed. [01:41] OOPS-2062F5 [01:41] hmm, no bot? I'll have to do this by hand [01:42] those numbers seem small [01:42] not really [01:42] mwhudson_: Which? [01:42] its 0142 [01:42] in the oops [01:42] ah right [01:42] 2 hours in, 60 appservers. [01:42] Date: Aug. 24, 2011, 2:01 a.m. [01:43] which makes it 0101 [01:43] if you use a real tz :P [01:43] :( [01:43] Dropping a foreign key requires a lock on both tables. [01:43] oh or its a couple days back :) [01:43] How stupid. [01:43] either way [01:43] yeah [01:43] wgrant: thats because fk's are triggers. [01:43] I was just making sure the oops had synced :-) [01:43] lifeless: Ah. [01:44] wgrant: (apparently :P) - they get presented differently of course, but are built on that mechanism. [01:44] it [01:44] it's doing the count(*) twice? [01:45] wallyworld_: uhm [01:45] lifeless: ? [01:45] wallyworld_: are you *trying* to make the public -stacked-on-private branch obfuscated? [01:46] no [01:46] why do you need a metaclass then ? [01:46] and __setattr__ [01:47] so that code in storm queries can still go "Branch.private == xxx" [01:47] (the metaclass does this bit) [01:47] yes, but this means that aBranch.private and Branch.private are now totally different htings. [01:47] this is horribly confusing [01:47] and so that exisitng code which creates a branch via Branch(name=xxx, private=True) still works [01:47] (the setattr does this) [01:48] wgrant: Ah, I forgot we need to drop the FK as well [01:48] And then after my evil branch lands we can drop the column and table [01:48] lifeless: the other option then is to s/Branch.private/Branch.explicitly_private [01:48] wallyworld_: please [01:48] wallyworld_: thats what we've done in the past: KISS [01:48] ok. it wasn't confusing to me :-) [01:49] i was trying to do it behind the scenes so to speak [01:49] wallyworld_: its not confusing to me either, cause I've seen the diff. But imagine someone looking at the rest of LP [01:49] I can't find a bug for this timeout, would someone take a look, or should I just file one? [01:49] who comes to this code, and sees no difference visually. [01:49] wallyworld_: unless you rename the column I suggest actually not renaming [01:49] wallyworld_: instead add a property 'effectively_private' or something, which would honour self.private + self.stacking etc. [01:50] wallyworld_: and reference that from security adapters, views etc. [01:50] wallyworld_: this will keep the class .private and the schema .private in sync with each other. [01:50] lifeless: that's a big change [01:50] why not just rename the private property to explicitly private [01:51] and use dbname="private" in the property [01:51] this will have the smallest footprint in terms of change to the code [01:51] and still be quite clear i think [01:52] I think that's better, so that you don't have to remember that the obvious "private" attribute isn't likely what you usually want [01:52] wallyworld_: I'm thinking about the mapping from object to schema [01:52] james_w: file a bug :) [01:52] james_w: agreed. [01:52] wallyworld_: but sure. [01:52] wallyworld_: my main concern is that what folk think they are reading is what they are reading. [01:53] wallyworld_: they should be able to guess at the behaviour of the code, with high accuracy rates, without having read the entire implementation. [01:53] wallyworld_: if private is a property and *not* the db column, then 'foo.private = True' will not do what they expect. [01:54] however, I see that that is not meant to happen here anyway [01:54] folk are meant to call setPrivate [01:54] that bit will do what they expect. ie make the branch private. but if they say foo.private = False it may not [01:54] so +1 [01:55] lifeless: ok. we can iterate later and change the column if needed [01:55] wallyworld_: well, it will only do that with either a property or a setattr [01:55] https://bugs.launchpad.net/launchpad/+bug/834293 [01:55] <_mup_> Bug #834293: Product:+code-index times out < https://launchpad.net/bugs/834293 > [01:55] wallyworld_: ok; so for clarity - you're doing:13:50 < wallyworld_> why not just rename the private property to explicitly private [01:55] 13:50 < wallyworld_> and use dbname="private" in the property [01:55] wallyworld_: dropping the metaclass and setattr [01:55] lifeless: yes [01:56] wallyworld_: to be clear, I don't object to metaclasses and setattr per se; just that they should really be enhancing things, not obfuscating :) [01:56] wallyworld_: thanks! [01:56] james_w: please don't use lp-oops urls [01:56] james_w: just the oops id [01:56] lifeless: np, input is appreciated. i can see how it's clear to me because i'm close to it but not to others who may be looking in from the outside [01:57] lifeless: will you have time to do that loggerhead review before you go and spend your evening changing shitty nappies? [01:57] lifeless, why? [01:58] james_w: a) we linkify anyway so theres no different in click-through, b) its easier to read and copy, c) the garbage collection code ignores url style oops references so they will get deleted. [01:59] ah, I didn't realise there was any integration [01:59] the latter seems like a bug though :-) [02:02] james_w: it is [02:02] james_w: but anyhow, its intended for users that have no access to the reporting UI [02:02] your having access to it is the exception :) [02:03] yeah [02:07] frell [02:07] another regression [02:07] lifeless: Where? [02:09] bug 834266 [02:09] <_mup_> Bug #834266: "831884 is not a valid bug number or nickname" < https://launchpad.net/bugs/834266 > [02:10] lifeless: I think that's the second time you've declared this bug to be a regression. [02:10] lifeless: The case that's handled in marking a master as a dupe. [02:10] s/in/is/ [02:11] Trying to find the dupe.. [02:11] There. [02:35] wgrant: thanks :) [02:36] grar. [02:36] Why do people insist on implementing non-auditable copying. [02:36] Seriously. [02:36] Stupid. [02:55] Hey, good morning everyone! I think I'll implement some non-auditable copying today. [02:59] jtv: No, your squad already did that last week. [02:59] You're too late :( [02:59] Oh, but there's always more to do. [02:59] I insist on implementing non-auditable copying. [03:01] Good, good. [03:01] What could go wrong. [03:06] hi wallyworld___, your collection of underscores is coming along nicely I see. [03:06] Any reviews for me to approve? [03:13] Project devel build #997: STILL FAILING in 6 hr 0 min: https://lpci.wedontsleep.org/job/devel/997/ [03:19] lifeless: O hai -- wgrant and I would like to remove all references to staticdiff -- the queries take approximately 12 seconds on DF and it will be a once-off. Or are you going to make me write a garbo job? [03:20] 12s include an index creation that can be done whenever. [03:20] 4-5s without. [03:24] Over a hundred broken tests in buildbot. That's impressive. [03:25] Yeah. [03:25] A revision was landed that depended on a revision that was reverted a few hours earlier. [03:25] the index creation is nonblocking [03:25] if its done CONCURRENTLY [03:25] Right. [03:26] so, that seems fine to me. [03:26] Hence can be done whenever, including in the patch, if required. [03:26] 4-5s is too long to do as one transaction though if its hitting 6K rows [03:26] We can split it up [03:26] yup [03:26] 2s per transaction or so [03:26] that works for me [03:26] lifeless: 6k merged proposals from before 2010? :) [03:27] Only 5k [03:27] wgrant: FK references. [03:27] wgrant: never underestimate the MVCC lock monster [03:51] lifeless: looks like the django ticket for passing the exception info up has progressed to "Design decision needed". Hopefully that isn't a black hole [03:52] jamesh: It normally is with Django... [03:53] jamesh: win. [03:53] course, we could patch it in Ubuntu :) [03:54] it seems one of the issues is that not all exceptions might make it to handle_uncaught_exception(), so capturing errors there might not be perfect [03:55] I've added a comment that all the errors in my view code generally pass through that code path though [03:55] cool [03:55] if they swallow exceptions and don't reach that code path, that seem like a separate bug :) [03:55] as opposed to a reason not to do this [04:54] wgrant: you asked why I didn't re-implement [BS]PPH.setDeleted in terms of PublishingSet.setMultipleDeleted. The answer is that the latter bypasses the ORM, which is more likely to be harmful in code that requests deletion of individual objects. [04:54] jtv: True. Although it doesn't have to bypass the ORM. [04:54] Do you know about ResultSet.set()? [04:55] It gets far too little use. [04:55] No, I don't! I knew there was something like that for delete, but it gave us endless trouble anyway. [04:55] (There were all sorts of queries it didn't support) [04:55] yeah, but for this sort of thing it's probably fine and a lot less ugly. [04:56] I'll file a bug to fix that. It'd save some test commits. [04:56] Right. [04:59] jtv: Do you like lp.soyuz.model.publishing's wonderful test coverage? [04:59] Truly exemplary stuff. [05:00] How sarcastic are you being? [05:00] wgrant: I'm picking up a hint of sarcasm there, but I'm told that I wouldn't understand because I'm not American. [05:02] Just a bit. [05:02] Thanks. It helps to be clear about these things. [05:02] wgrant: That's like calling a fish 'just a bit wet' [05:03] * wgrant deletes more of Distribution. [05:03] A-khah. This vhat British capitalist call, understatement—da? [05:04] wgrant: Excellent. [05:06] * StevenK fights with Jenkins [05:12] jtv: you managed to create two bugs [05:12] That explains why I got two confirmation boxes. [05:12] jtv: I wonder if we have a bug in our request retry logic or something [05:13] Or a very late failure triggering a retry after db commit maybe? [05:14] I don't suppose the browser would retry the POST unless it got an unambiguous application-level failure response? [05:14] Sure you didn't click submit twice? [05:14] Very. [05:15] POSTs are not permitted to be retried automatically. [05:15] At the browser-level. [05:15] This is chromium though. Full of optimizing cleverness. [05:15] Heh [05:15] BTW I see now that I expressed myself badly: [05:16] I know what you meant [05:16] I meant, I don't suppose the browser would retry the POST just because it did not get an unambiguous application-level failure response. [05:16] and yes I think post-commit retry is the most plausible epxlanation [05:28] jtv: How did you file those bugs? [05:29] wgrant: could you be more specific? [05:29] I only see one bug filing in the appserver logs, and that was 90 minutes ago. [05:29] Are you using edge or something? [05:29] Nope. [05:29] wgrant: appserver bug [05:29] wgrant: we retry requests in the appservers. [05:30] wgrant: on POST, on conflicts. [05:30] lifeless: The bug in question was filed only 30 minutes ago. [05:30] I do not see even one POST for it. [05:30] wgrant: that is how it was posted, no ? [05:30] oh, I see 90 vs 30 [05:30] wgrant: you've got all the appservers ? :> [05:31] actually. EOD. EOW. SOL. [05:31] lifeless: Good plan. See you eventually, I suppose. Good luck! [05:31] lifeless: enjoy your weekend. [05:31] jtv: Weekend? [05:32] wgrant: yes, it's that bit where the office goes nice and quiet so you can work in peace. === almaisan-away is now known as al-maisan [05:32] When that happens, the rest of us are having something called a week-end. [05:32] wgrant: I think jtv is being sensible about disclosure of private data [05:32] Bah. [05:32] which is nice [05:32] I wonder if puppet broke stuff. [05:32] Not all the logs are here. [05:33] Probably means we're missing OOPSes, too. [05:33] that vould be a vorry [05:33] But not surprising. [05:33] hangon, I was going. [05:33] See you. [05:34] wgrant: I'm pushing up my gpfixtures WIP for LP [05:35] lifeless: I may have a poke at it and get it finished. [05:36] lp:~lifeless/launchpad/usegpgfixtures and of course lp:python-gpgfixtures [05:37] it needs a ServerFixture to invoke the process etc, which would live in python-gpgfixtures for now. Its not a perfect demo of the soa test fake layout. [05:37] perhaps it should be, but one step at a time [05:37] Yeah. [05:37] Anyway, shoo. [05:38] both branches pushed. [05:38] enjoy. [05:38] Thanks. [05:39] heh, the keyserver in fixtures is shorter than the one in LP; including the main() stuff and the json API. [05:39] \o/ [05:39] Nice. [05:40] not a totally fair comparison, but shrug [05:51] Right, Jenkins is in money-suck mode [06:12] jtv, hi, can you perhaps give me another short review, mostly for a branch you've already reviewed: https://code.launchpad.net/~danilo/launchpad/bug-826692-take2/+merge/72996 (it was broken for private branches, and the fix is easy, and I add a test for it) [06:12] jtv, you can perhaps just look at the incremental diff in http://paste.ubuntu.com/675046/ [06:12] danilos: okay okay, you've sold me. Don't try to buy it back. :) [06:13] heh, thanks === al-maisan is now known as almaisan-away [06:18] danilos: Given its history, you might want to coerce a LOSA to merge that on staging so you can try it out for real. [06:19] wgrant, sure, though when you do anticipate a problem (or when there are tests for it), it makes a big difference :) [06:20] danilos: Bah, who needs tests. [06:21] not us the real men! our test suite are our users [06:21] wgrant, what was the commercial team name again? I need to find who can make me a few private branches on staging [06:22] danilos: “Merge proposals against private branches are visible to *the* branch owner.” [06:22] danilos: ~commercial-admins and LOSAs. [06:22] danilos: I have access to U1 branches, if that helps. [06:22] wgrant, are they on (qa)staging? I have access to landscape branches, but they ain't there [06:23] danilos: The content doesn't matter, does it? [06:23] jtv, thank you :) [06:23] danilos: is this the branch that caused (some of) those 93 failures and 17 errors we had in devel earlier? [06:23] The DB stuff is there, and that's all that matters. [06:23] wgrant, nope, I just need merge proposals against them [06:23] jtv, nope, the reversion of this branch and gary's revision using some of the stuff in this branch is what caused the failures [06:24] danilos: gary reverted that before you emailed me, btw. [06:24] wgrant, it is? oh right, logging in might help see them [06:24] danilos: oh, I heard about those but didn't think it was the same failure. There are far too many lately. :/ [06:24] jtv, this is just a regular distributed development fallacy, I don't think there's anything we could do about it except make the test suite run shorter thus reducing the chances of something like this happening [06:25] danilos: I agree that that's about all we can do. [06:25] wgrant, yeah, I've seen that as well, sorry if I bothered you :) [06:25] danilos: for a moment I thought you were saying I was completely wrong, but now I suspect "fallacy" isn't really the word you meant. :-) [06:25] If we had a one hour test suite again (which is practical with parallelisation), the occurrence and impact of this sort of thing would be far, far lower. [06:26] Yes. [06:26] Although the past few days, Q/A has played a large role as well. [06:27] jtv: That's made worse by the fact that QA can rarely happen until 10 hours after the branch is submitted. [06:27] Which means a lot more breakage can slip into devel before it's noticed, and it's less likely to be noticed because the wrong people are doing it. [06:28] Absolutely. Getting a branch from submission to Q/A readiness is still an overnight process. [06:28] Which reminds me of another suspect in our current problems: staging updates. [06:28] What staging updates? [06:29] It hasn't updated in a while :) [06:29] :) [06:29] Oh well that's alright then. :) [06:29] The ticket is only 85.. we should probably get it bumped. [06:31] danilos: I rejected the MP based on the missing "the" in the comment. Please try again Monday. [06:31] jtv, fair enough, thank you very much, I'll be back on Monday [06:32] (I'll go spend weekend fixing all the missing articles in the LP tree, however you take that ;) [06:32] np [06:32] danilos: *the* weekend. You did that on purpose, didn't you? [06:32] spend the weekend? [06:32] Yup. :) [06:33] heh, yeah, I can usually find them missing only on re-reading the sentence, and that never happens with comments for tests :P [06:35] Project devel build #998: STILL FAILING in 16 min: https://lpci.wedontsleep.org/job/devel/998/ [06:36] Uh [06:37] bzr: ERROR: http://bazaar.launchpad.net/~launchpad-pqm/launchpad/devel/.bzr/repository/packs/9e62132dbdc7acb42f1156c7a4128cf2.pack is redirected to https://launchpad.net [06:37] RARGH [06:38] Yay [06:38] wgrant: - 1. [[#william_grant|William Grant ]] ''(133 top-level landings)'' [06:38] + 1. [[#william_grant|William Grant ]] ''(134 top-level landings)'' [06:38] wgrant: lies [06:38] lifeless: I merged a rev from a branch from December :) [06:38] Haha [06:38] I thought I cherrypicked it. [06:38] But apparently not well enough. [06:38] Ah, it was the first rev in the branch. [06:43] lifeless: Are you here? [06:43] lifeless: If so, a prio bump on https://rt.admin.canonical.com/Ticket/Display.html?id=47551 would be nice, since staging hasn't restored for two weeks now and we'll likely be fastdowntime-capable next week... [06:45] * 31NAAXP81 hates doc tests. too hard to debug :-( === 31NAAXP81 is now known as wallyworld_ [06:45] Haha [06:45] shutup [06:45] Underscores, random characters... what next? [06:45] Nice nickname [06:45] i'm already way too grumpy === StevenK is now known as W4|_|_YW0R|_D [06:45] @%^@#%^!@^ [06:45] Project devel build #999: STILL FAILING in 3.5 sec: https://lpci.wedontsleep.org/job/devel/999/ === W4|_|_YW0R|_D is now known as StevenK [06:46] smart arse [06:46] Oh damn it! [06:46] JENKINS! [06:46] 3.5 is pretty short [06:46] The slave was already corrupted [06:52] wallyworld_: Hi! ;) [06:53] henninge: hi there [06:53] i assume you saw my comments :-) [06:54] wallyworld_: yes, thank you ;) [06:54] np. i hope they made sense [06:54] wallyworld_: I replied rather lengthy [06:54] very much so [06:54] ah right. i'll look at my email [07:00] henninge: right, i see the disabling of the form submit button now. and it will be re-enabled if a request build succeeds or partially succeeds? [07:01] wallyworld_: not currently, no. [07:01] i think the original request build form did this? [07:01] wallyworld_: AFAIUI the user is supposed to close the dialog and re-open. [07:01] no [07:01] it did not [07:02] or, let me check that ... ;) [07:02] ok. i thought it was either re-enabled or else the user could click an as yet unbuilt distro series and it would be enabled [07:02] without requiring the form to be closed and reopened [07:03] I just checked, the button is only enabled in the connect function. [07:03] s/connect/correct ? [07:03] wallyworld_: There are other things to fix in this dialog, and also in its code but that is outside of the scope of what I am doing here. [07:04] (which as alread widened a lot ;) [07:04] sure. i'm just a bit concerned that the button used to work a certain way but may not have kept the same behaviour] [07:04] wallyworld_: the connect_requestbuilds function that connects the link [07:05] actually, it is done in the click handler, when the dialog is opened [07:05] the enabling, I mean. [07:05] possible, but that change was before me ;-) [07:06] s/was/happened/ [07:06] perhaps [07:07] so in the case hwere the user clicks 2 out of 3 series, and the build for one of them is queued but the other has an "already building" error or something. they should then be able to click the 3rd series that was not part of the original selection and request a build [07:07] without having to close the form [07:09] wallyworld_: I agree completely! [07:09] i'm fairly sure that's how it worked before. perhaps it has changed in a branch previous to yours. if it works diffeently now, it would be cool if a bug were raised so that we didn't forget to fix it. or if the behaviour is different because of the changes in your mp, it would be cool to see it fixed in there [07:09] sorry for being a pain in the arse [07:10] wallyworld_: I just looked at the code previous to mine (bzr cat -r :submit) and the button is never re-enabled. [07:10] ah ok. seems like you are off the hook :-) [07:11] i'll update the mp with r=me and get jtv to mentor [07:11] oh, hi wallyworld_! [07:11] wallyworld_: I am about to start the fourth branch to fix this one bug about error reporting, so I will be very reluctant to add anything else to it ... ;-) [07:11] i thought that would wake yu up :-) [07:11] wallyworld_: Thank you! [07:12] henninge: sure. any fix to the submit button warrents it's own bug [07:12] yup, I'll file that [07:12] awesome. thanks. [07:17] jtv: over to you. it's beer o'clock for me but i'll pop back later to check on things https://code.launchpad.net/~henninge/launchpad/bug-824435-failure-reporting-3/+merge/72908 [07:17] wallyworld_: cheers === wallyworld_ changed the topic of #launchpad-dev to: https://dev.launchpad.net/ | On call reviewer: - | Critical bugs: 237 - 0:[#######=]:256 [07:17] in more sense than one [07:17] no, thank you [07:17] :-) [07:18] * jtv gives wallyworld_ a puzzled frown [07:18] it's been along day. i've resorted to converting a doc test to a unit test so i can debug [07:18] jtv: sorry, the subtlety of my Two Ronnies reference was obviously lost [07:19] Evidently. [07:19] * jtv needs to catch up on his Two Ronnies. [07:19] * wallyworld_ opens a bottle and drinks :-) [07:19] ahhhh [07:19] And "resorted to" converting a doc test to a unit test? You deserve, if not a medal, then at least a cool beer! [07:19] yep :-) [07:19] Wow, that was timely [07:20] still only 3/4 of the way through [07:20] frustrating day today [07:24] wallyworld_: this'll cheer you up… 34 [07:27] jtv: ³¼? [07:28] wallyworld_: bug 834463 [07:28] <_mup_> Bug #834463: Re-enable submit button in request builds overlay. < https://launchpad.net/bugs/834463 > [07:28] henninge: that's cheating! You did ^314, didn't you? [07:29] jtv: well, I think "Alt Gr" is the compose key on the German keyboard and if I hold it down an press 34, I get ³¼ [07:29] * jtv fondly remembers the can't-get-used-to-metric-but-we'll-give-decimal-a-go ²²/₁₀ pricing at US petrol stations. [07:29] henninge: Alt Gr is not quite the same thing. [07:29] It's just a third set of characters, AIUI. [07:29] Compose is a little more powerful :) [07:30] But good to know there's yet another standard to confuse things. ☺ [07:30] * henninge hates to be lacking power … :( [07:30] henninge: I don't know of any layout that has a "Compose" key. In Ubuntu, at least on US keyboards, you have to configure it manually. I use RAlt. [07:31] ah, I see [07:31] It is displeasing that there is not one configured by default. [07:31] RAlt *is* Alt Gr [07:31] But perhaps there are people around who use RAlt or Menu for their traditional purposes. [07:31] For many European keyboard layouts, RAlt is already special. [07:31] We need it here for {[]}€~ [07:31] The menu key... Not so much. [07:33] oh, and for \ [07:33] one more reason to hate DOS paths [07:33] At least it's not a French keyboard. [07:33] rvba is a bad person. [07:34] German keyboards aren't actually too bad. [07:34] But French, aaaaaaa [07:34] wgrant: I think you did very well with a French keyboard. I shall bring you one next time we meet :) [07:34] I can live with it [07:34] :-D [07:40] What happens when you try to type qwertz on an azerty keyboard? [07:41] You probably turn Alsatian. [07:52] good morning [08:13] hi adeuring [08:15] hi jtv! [08:17] Mornin' [08:18] hi mrevell [08:18] henninge: 404 [08:19] wgrant: yes, that's the point ;) [08:19] Ah! === wgrant changed the topic of #launchpad-dev to: https://dev.launchpad.net/ | On call reviewer: - | Critical bugs: 243 - 0:[#######=]:256 [08:20] Anybody know if Matt owns a baseball bat? [08:23] henninge, This Matt does. Is this the Matt you're looking for? [08:23] yup [08:24] mrevell: http://yuilibrary.com/something.html [08:24] heh [08:32] mrevell would need a cricket bat === adeuring changed the topic of #launchpad-dev to: #launchpad-dev ist: https://dev.launchpad.net/ | On call reviewer: adeuring | Critical bugs: 237 - 0:[#######=]:256 [08:33] The unmistakable sound of willow against bugs. [08:47] we really need to block MPs on vcs imported branches [08:47] Yeah. [08:49] * bigjools files a bug [08:54] mrevell. henninge: the lockerz.com website has a good oops page, it's a space man floating around on the page with a message "Houston, we've had a problem!" [08:55] bigjools: they seem to have different ones. Now I see a dog licking my screen .... ;-) [08:55] lol === almaisan-away is now known as al-maisan [09:20] Given we are going to have 5 minute outages a couple of times a week, if anyone is bored... [09:20] No reason we can't get them sub-minute :) [09:21] wgrant: Thanks for qa-ing :D [09:21] * nigelb was asleep [09:21] * wgrant needed to get a regression fix out. [09:21] stub: I now know exactly how you feel when your clock is messed up. [09:21] wgrant: please tell me you fixed that brittle bzr versioning test! [09:21] jtv: 'fraid not. [09:22] Feel free to do the 5-character fix, though. [09:22] int()? [09:22] * nigelb can do it right away [09:22] Yup [09:22] Oh thanks [09:22] I'd be happy to review it. [09:22] have the bug number handy? [09:22] no wait, I have it [09:22] * jtv digs [09:22] oh [09:22] bug 833743 [09:22] <_mup_> Bug #833743: test_versioninfo fails when system has bzr 2.4 < https://launchpad.net/bugs/833743 > [09:25] wgrant: do you know if the ec2 image got updated with the deps for the xz branch? [09:26] Yes [09:26] I did it [09:26] thanks [09:26] and thanks [09:27] I wish the Updating diff page had ajaxy magic not forcing me to refresh every few seconds. [09:28] nigelb: Wait [09:28] It was demo'd at the epic [09:28] oooooooh \o/ [09:28] I need to keep wishing for more :P [09:29] jtv: https://code.launchpad.net/~nigelbabu/launchpad/833743-revno-fail/+merge/73008 [09:29] nigelb: done [09:30] nigelb: need me to land it? [09:30] Yes, please :) [09:41] Well, that should fix that devel failure. Thanks nigelb! [09:48] adeuring: could I beg you for a review? https://code.launchpad.net/~jtv/launchpad/bug-832661/+merge/73009 [09:48] jtv: sure [09:48] thanks [09:59] bigjools: It landed via ec2, so I hope so :) [09:59] bigjools: Image version 519 [09:59] German has infiltrated the channel again :( [10:00] nigelb: my team will be finishing that work after we finish derived distros [10:01] wgrant: good. I got fed up with ec2 randomly kicking out 2 of my branches all week so I just lp-landed them. [10:01] bigjools: Odd, the image was fixed a day or so before it landed. [10:04] adeuring: argh! I just realized (because everyone keeps %$#@ distracting me) I forgot to set up the branch dependency. That diff should be a lot smaller. [10:04] jtv: ah, cool :) [10:05] Branches that were unable to land because buildbot has been near-permanently broken the past few days. [10:06] adeuring: re-submitted as https://code.launchpad.net/~jtv/launchpad/bug-832661/+merge/73012 [10:06] ack [10:09] jtv: \o/ [10:09] bigjools: Yay! [10:10] nigelb: the basic code is working but we need to make it production-worthy which will be tricky as we have to set up some stat counting for Rabbit [10:10] ooh, rabbitmq, neat [10:11] did you guys deal with the HA rabbit thing that was on the list a while back? [10:11] yeah, once this is done we'll be able to add more async updates to pages that wait for jobs [10:11] something like the sync pages you guys did? [10:11] that sort of thing [10:12] we'll do popular pages first though :) [10:12] can't remember where we got to with HA rabbit [10:12] bigjools: Rabbit metrics will be pretty easy with the plugin. [10:12] And I think it has an HA exemption. [10:12] The hardest bit is probably getting OOPSes into txlongpoll. [10:13] yes [10:14] But that won't be terrifyingly difficult. [10:14] which plugin is this? [10:14] With lifeless' work this week. [10:14] bigjools: rabbitmq-management [10:14] ppa:wgrant/rabbitmq [10:14] ah cool, didn't know about that [10:14] Doesn't LP already install rabbitmq by default? [10:14] Has packages. [10:14] nigelb: Yes, but it's not actually used. [10:14] * nigelb remembers having rabbit on his ystem [10:14] bigjools: It provides a web UI and JSON API with counters and rates and stuff. [10:14] nigelb: you can turn it off, the tests that need it will start it [10:15] wgrant: oh, awesome, just what we need [10:15] bigjools: Exactly. [10:15] just need to work with the losas to get that nagiosed [10:16] Works on Lucid->Natty, not sure about Oneiric. [10:17] Oneiric has 2.5, so we should probably upgrade to that everywhere. [10:17] Will investigate that next week. [10:18] bigjools: nah, I use it occasionally with other stuff [10:18] (like celery) [10:18] Oooh. I just realized. With that 5 char fix. I now have 5th non-canonical LP contributor spot. [10:19] LP generally need a very, very solid and easy story to communicate between components [10:19] or SOA will be awful [10:19] Heh, yeah. Fun. [10:20] This means writing tests for that stuff will be further fun. [10:21] bigjools: Well, most SOA stuff will not be through rabbitmq, but some hopefully will ebe. [10:21] wgrant: either way, it needs to be easy to communicate :) [10:32] bigjools: Indeed. [10:32] bigjools: At least OOPSes for WSGi apps are just about trivial now. [10:32] bigjools: Have you seen the zeca replacement? === jtv is now known as jtv-afk [10:39] wgrant: there's a zeca replacement? [10:42] jtv-afk: r=me [10:43] bigjools: lp:python-gpgfixtures [10:43] bigjools: lifeless wrote it, to be less huge and not depend on Twisted. [10:43] So lp:gpgverifyd doesn't need a Twisted dep. [10:43] cool [10:44] thanks adeuring! [10:48] And once gpgverifyd is up, poppy's time in the tree can come to an end. [10:49] That's it for me. Have a good weekend, people! [10:49] and there was much rejoicing [10:49] nn jtv-afk [10:49] Night jtv-afk. [10:49] Night jtv-afk! [10:49] nn [10:50] Waiting for MFBT :/ [10:59] nigelb: I see you are #5 now! [10:59] wgrant: I noticed! \o/ [10:59] 3 more branches to #4 [11:00] I landed 3 branches this week. If I keep this up, I just may have a chance to beat you :D [11:00] Yup. [11:01] activate the HIREBOT [11:01] lol [11:02] Heh. [11:02] Did anyone come up with stats for the LP team yet? [11:02] It would nice to see how outside contributors compare with the actual team :D [11:04] * bigjools wants to eradicate security.cfg [11:04] nigelb: I have a branch which gets it mostly working for ~launchpad too. [11:05] oooh :D === henninge is now known as henninge-lunch [11:09] nigelb: But for some idea, 'bzr stats' is helpful. [11:10] hmm [11:10] need to try it when I get to my computer. (at work) [11:13] wgrant: what is the number that stats is outputting? It doesn't say anywhere, not in the output nor the help. === al-maisan is now known as almaisan-away [11:14] bigjools: number of commits [11:15] jelmer: including all levels I guess? [11:15] bigjools: yep [11:15] jelmer: ok thanks. [11:15] so it shows who does local commits the most :) [11:15] so it's not a terribly good metric for e.g. Launchpad, given some people do a commit for each two lines they write and others only \ [11:15] Yup. [11:15] a couple of times per landing [11:16] the one exception is Celso [11:16] Celso? [11:16] he was basically a coding machine [11:16] heh [11:17] I like commiting multiple times [11:17] (because I keep making stupid mistakes) [11:17] Ste [11:17] StevenK is the worst offender :P [11:17] lol [11:17] One 1100 line commit per branch or so. [11:17] but he uses perl to remove code [11:17] True. [11:17] Makes it even worse. [11:17] That's cheating :D [11:20] it needs to be weighted by lines per commit [11:22] it'd be hard to distinguish between actual lines changed and lines moved [11:22] yeah it's not perfect but better than just commits I think [11:24] ok I need help because security.cfg is such a monstrosity I have no idea what tables I need to add to it so my process can close a bug [11:25] how do I ascertain what I need? [11:25] Hahaha [11:25] this is a leading question :) [11:25] Look at process-accepted [11:25] I am [11:25] but ... [11:26] you see where I am driving with this? [11:26] Yes, it's hard. [11:26] Partly because LP grew into a 700KLOC monstrosity. [11:27] security.cfg just needs to die [11:27] I'm not sure whether it's worth it. [11:27] But it would be a lot easier if it wasn't all one thing. [11:27] given that the webapp has carte blanche anyway, why are we restricting scripts? [11:28] I can only think of a couple of bugs that have been revealed by the restrictions. [11:28] ok how evil is it to inherit queued in sync_packages? [11:28] * jelmer remembers a few incidents with security.cfg being too strict [11:28] because I simply cannot be arsed to work this out [11:28] jelmer: indeed [11:28] it just seems to cause problems, not solve any [11:28] Only a few? :) [11:29] we could do something more intelligent in security.cfg and have groups that represent actions, not scripts [11:30] I've been considering that for a while. [11:30] there's a little of that already - well done to whoever made "translations_approval" [11:30] I don't think there need to be too many. [11:31] Since bug actions need like 20 tables, that probably covers a lot of the lcutter. [11:31] well since PCJ does all of what queued does, I am going to inherit it, screw it [11:32] Sounds like a plan. [11:32] A not very good plan, but an effective one :) [11:32] I'd rather it was too wide than too narrow [11:32] Indeed. [11:32] I'm really not sure what we are achieving with the permissions === mrevell is now known as mrevell-lunch [11:51] Project devel build #1,000: STILL FAILING in 4 hr 48 min: https://lpci.wedontsleep.org/job/devel/1000/ === henninge-lunch is now known as henning === henning is now known as henninge === bac changed the topic of #launchpad-dev to: #launchpad-dev ist: https://dev.launchpad.net/ | On call reviewer: adeuring, bac | Critical bugs: 237 - 0:[#######=]:256 [12:08] good morning adeuring [12:08] good morning bac! [12:21] Anybody got an idea why the LibrarianLayer might be failing locally like this? http://paste.ubuntu.com/675201/ [12:25] henninge: clear out /var/tmp stuff [12:25] including pid file [12:33] bigjools: I don't see anything testrunner or launchapd specific in /var/tmp [12:33] the only pid file was development-google-webservice.pid === mrevell-lunch is now known as mrevell [12:34] le [12:34] matsubara, hey mumble? [12:34] mrevell, hey, yes! [12:34] henninge: Chameleon does some very evil stuff; I need to talk to gary_poster about removing that or it. [12:34] henninge: dpkg -S setuptools.egg-info [12:34] Oh, gary_poster is here. [12:34] gary_poster: Hi. [12:35] wgrant, on call will ping [12:35] k [12:36] henninge: Which Ubuntu release has this machine been upgraded from? [12:36] henninge: I suspect you have a non-dpkg-managed /usr/lib/python2.6/setuptools.egg-info, as we found on loganberry and several other production machines. [12:37] wgrant: from maverick to natty [12:37] henninge: We deleted it, and everything worked. [12:37] henninge: Hmm. [12:37] So, /usr/lib/python2.6/dist-packages/setuptools.egg-info is meant to be a symlink. [12:37] I suspect it's a directory on your machine. [12:38] python-setuptools: /usr/lib/python2.6/dist-packages/setuptools.egg-info [12:38] python-setuptools: /usr/lib/python2.7/dist-packages/setuptools.egg-info [12:38] python-setuptools: /usr/share/pyshared/setuptools.egg-info [12:38] wgrant, I'm aware of one evilness--z3c.ptcompat is perhaps the hairiest Python I've ever seen. Perhaps. It sounds like you have another? [12:39] z3c.ptcompat can eventually go [12:39] which was why I was not overly worried about that one [12:39] gary_poster: I invite you to read the top of chameleon/template.py [12:40] heh, ok, going wgrant [12:40] there are problems with migrating dpkg-managed filesystem entries between symlinks and directories - it has to be done by manual maintainer script code (for some excellent reasons) [12:40] gary_poster: It hashes, in undefined order, the versions of every package it can find on the system. [12:40] unfortunately .egg-info entries became one or the other at the whim of complex build systems [12:40] (i.e. a simple rebuild could change without any obvious reason in the source package) [12:40] cjwatson: Yay [12:41] it's probably simplest to rm -rf the .egg-info file in question and then reinstall the package that owns it [12:41] cjwatson: Interestingly, on the production machines the package was no longer installed. [12:41] I presume it was removed after the upgrade. [12:41] But didn't remove the directory, because it was meant to be a symlink. [12:41] that's a bit harder to explain, but perhaps if dpkg expects it to be a directory it uses rmdir not unlink [12:42] rather than unlink => if EISDIR rmdir [12:42] Yeah. [12:44] Anyway, on production where it's been a problem we've just deleted the directory, and reinstalled the package if it was still installed. [12:44] All seems happy. [12:44] But that part of Chameleon really worries me. === almaisan-away is now known as al-maisan [12:47] wgrant, that's nothing compared to z3c.ptcompat, IMO. :-P [12:47] Anyway, the undefined order is probably barely alright practically for what the purpose appears to be, which is to enforce some kind of "rebuild when things change," given that the undefined order is stable in a Python release, and a new Python release ought to trigger this kind of thing anyway. [12:47] But we certainly can and should convince the person to order them. Beyond that, given that Chameleon is a compile-to-Python template system, the basic idea strikes me as reasonable, within the context of the kind of task Chameleon has set itself to. [12:50] gary_poster: It is revolting and causing cronspam and breaking the test suite. I guess we may need to clean up all production and dev machines. [12:52] wgrant, "revolting" is a wildly unnecessary description, IMO, not that I have anything to do with it. Just seems unproductive to talk that way. Causing cronspam I saw; it is because of broken packages in our distro's packaging system, which is a chronic and annoying problem for anyone who does work with system Pythons. I have not seen a broken test suite. [12:54] wgrant, practically, what machines have been taken care of? [12:54] somebody should file an Ubuntu bug about that python-setuptools problem if they haven't already; we should be able to take care of it in an update [12:54] if you can't rely on system packages, that's a pretty serious problem from our POV [12:56] cjwatson, when I've experience this in the past, I was unable to identify a cause. My suspicion has been distro upgrades not cleaning things up properly. [12:56] gary_poster: I've roughly outlined the cause above. [12:56] gary_poster: a bug about the fact of the problem is sufficient [12:56] I have no hard evidence to support this and have been unable to dupe reliably, so it's the kind of...ok OK! Great, cjwatson, I'll read the backlog and file it then. Thanks [12:57] (or at least, the very likely cause) [12:57] heh, good enough [12:57] we can go through the history and hunt for symlink<->directory chnges [12:57] *changes [12:57] I agree it's likely to be a pain to come up with a coherent reproducer for [12:57] but let's not burn too many people's time on that when we have a pretty good hypothesis [12:58] +1 [12:58] cjwatson: I tried a clean hardy->lucid upgrade with python-setuptools installed, and it worked fine. [12:58] But AFAICT on loganberry that's what broke it. [12:58] It's possible there was some old launchpad PPA package, though... I don't have logs to find out. [12:58] * cjwatson will have a quick hunt through publishing history [13:08] cjwatson, wgrant, https://bugs.launchpad.net/ubuntu/+source/python-setuptools/+bug/834698 [13:08] <_mup_> Bug #834698: setuptools.egg-info can end up as a directory when it is meant to be a symlink < https://launchpad.net/bugs/834698 > [13:09] It's not just setuptools. That's just the most common one. [13:09] Ah, you say that in the bug. [13:09] yeah === salgado_ is now known as salgado [13:42] bigjools: Oh, you're using changelog_entry to create the aggregate changelog? [13:42] bigjools: I was expecting to use the real librarian changelog, but I guess that's probably OK... [13:43] Except not. [13:43] It's bad. [13:43] It searches for all SPRs in any context, doesn't it? [13:43] yes, hmm [13:43] I thought that was ok but maybe not [13:44] It'll count all PPA uploads, Ubuntu uploads, Linaro uploads, etc. [13:44] yes [13:44] As well as the general badness and unreliability that changelog_entry entials. [13:44] entails. [13:44] what is wrong with changelog_entry? [13:45] It is a guess. [13:45] A historically error-prone guess. [13:45] can you be a bit more explicit? [13:45] Let me find some bugs. [13:45] I'm looking :) [13:46] Ah, it's change_summary that is the cause of bug #144620. But changelog_entry for the gina case is a guess, and for Ubuntu it is frequently wrong because people build packages badly. [13:46] <_mup_> Bug #144620: Some displayed sourcepackagerelease changes files don't have attribution < https://launchpad.net/bugs/144620 > [13:46] Worse still, if Linaro then copy from Ubuntu, they will miss everything. [13:46] Because Ubuntu only has the last SPR. [13:47] doesn't matter does it? [13:47] Why not? [13:47] it's only for acceptance emails [13:47] it's right that each SPR has only its changelog [13:47] what sync-source does is crack [13:47] Er, no :) [13:47] can you try to be less condescending? [13:48] An upload's changes file is meant to contain all the intermediate changelog entries. [13:48] Acceptance emails should too. [13:48] So they actually list the changes. [13:48] And not just the last change. [13:48] And isn't this to be used for bugs too? [13:49] If we restrict the SPR search to those published in the origin archive, Debian->Ubuntu copies will work. [13:49] But Debian->Ubuntu->Linaro copies will magically only have the last SPR. [13:49] So the acceptance emails will be truncated, and not enough bugs will be closed. [13:50] This is easily solved by just using the actual changelog, I think. [13:50] including previous version's changelogs in a particular SPR is weird [13:50] We shouldn't include them in the SPR. [13:51] We should consider them as part of the upload/copy. [13:52] That's the whole point of this change, isn't it? [13:52] To include more than just the actual SPR that is being copied. [13:52] correct [13:52] To do that only for the first copy seems less than ideal. [13:52] deryck, abentley: From the w3c spec: [13:52] 10.4.1 400 Bad Request [13:52] The request could not be understood by the server due to malformed syntax. The client SHOULD NOT repeat the request without modifications. [13:53] Particularly when changelog_entry is a reasonably awful thing in the first place, and we have the changelog file a couple of milliseconds away. [13:53] But I will still use 400 if that is how we do it around here ... ;-) [13:53] what does gina put in changelog? [13:53] henninge: right. The HTTP spec provides no way to signal that the request was unacceptable for reasons other than bad syntax. [13:53] bigjools: I think it puts everything between the last SPR published and the current one. [13:53] wgrant: Personally, I'd like to kill changelog_entry [13:54] henninge, I don't think it's a perfect match, but it's close enough. the spec makes me feel better about it actually. :) [13:54] StevenK: I very much would. [13:54] abentley: because it was not unacceptable. [13:54] StevenK: It is very wrong in the new world. [13:54] StevenK: Because it depends on the previous publication. [13:54] henninge: it was unacceptable. The request could not be performed. [13:54] It's really part of the upload, not the source package. [13:54] abentley: no, it was performed, at least in part [13:55] And even so, we don't even keep the .changes file around [13:55] StevenK: We do... [13:55] abentley: I would expect a 4xx to mean that nothing was changed on the server. [13:55] StevenK: We strip the sig, but we keep the changes file for every well-formed upload. [13:55] henninge: perhaps we differ on what "perfomed" means. [13:55] henninge: As would I. [13:56] but as I said, I am not going to go against our customs here ... [13:56] ... at least not in this branch. [13:57] wgrant: oh, so you are aware of our misuse of 400s? ;-) [13:57] bigjools: I think changelog_entry is a near-adequate compromise for displaying in the web UI sometimes, but we should avoid it where possible. And it's now possible to go direct to the changelog and get a correct result, so we should do that. [13:57] henninge: No :( [13:57] henninge: Where? :( [13:57] everywhere, so I am told. [13:57] wgrant: I am verifying what gina does, I don't trust it [13:58] henninge: Most 400s should abort the transaction, so the only server changes will be sessions and OAuth nonces... [13:59] henninge: Is that not the case? [13:59] bigjools: It is not trustworthy. [13:59] bigjools: But I think it looks at the last published SPR, and takes the entries since then. [13:59] and looking at gina code makes me want to scratch my eyeballs out [13:59] bigjools: To do anything else would make little sense. [13:59] wgrant: nope, at least not for the case I got here. [13:59] does that apache configuration for serving static files on launchpad do anything special for etags? [13:59] cr3: gary_poster changed all that to be much better recently, I believe. [13:59] henninge: Which is that? [13:59] wgrant: it returns a 400 if not all builds for a recipe could be requested. [13:59] Ahh. [13:59] wgrant: might that be available somewhere for me to see? [14:03] cr3: I'm hoping gary_poster will be able to tell you :) [14:03] henninge: That sounds bad to me. [14:03] wgrant: lets mention his nick, gary_poster, as often as possible to get his attention :) [14:03] henninge: I would assume that a 400 meant no changes had been made. [14:03] no need to convince me ... ;-) [14:07] I guess the really corect thing to do is use a 207... but aaaaaaaaa [14:10] this Gina code makes me want to go back to C++ [14:11] wgrant, cr3, hiya :-) [14:12] bigjools: gina does not age well. [14:12] deryck: I think abentley understood that situation. [14:12] it's hideous [14:12] bigjools: Although it wasn't ideal in the first place either, no. [14:12] cr3, yes, it does do something special for ETags [14:12] # XXX kiko 2005-11-05: this is very funny. [14:12] heh [14:12] gary_poster: awesome, could I have a looksee at the apache config? [14:12] henninge: Well, not completely, but I do feel that if it wasn't completely successful, the status code should reflect that. [14:12] wgrant: where is it checking to see what was previously published? [14:12] henninge: ideally, we'd have a separate status for each request. [14:12] bigjools: It's not. [14:12] bigjools: It just takes the first entry. [14:12] oh. yay [14:12] bigjools: This is, as we say, "wrong". [14:12] so my hunch was write [14:12] ffs [14:12] right [14:12] bigjools: changelog_entry is not to be trusted. It doesn't really belong on SPR at all. [14:12] ok so I will switch to changelog [14:12] bigjools: It's mainly there to permit fast access for views, and because we didn't store changelogs in the librarian. [14:12] We should remove it from everywhere we can. [14:12] but I think I still need to parse backwards through what's published [14:12] ... starting with this code that will DoS everyone if I upload 100 packages with similar versions to a PPA :) [14:12] bigjools: See the base version calculator. [14:12] abentley: that would either mean the 207 or making seperate xhr requests for each build. [14:12] bigjools: It walks through them until it sees the common ancestor. [14:12] I need to go and lie down [14:12] cr3, that's a question for the LOSAs, I don't have it. I can tell you that we generate ETags for launchpad API docs and WADL and JSON; it is tied to file size and mtime; we synchronize mtime for these files based on the last bzr revision so that it should be stable across machines that share the same revision [14:12] bigjools: python-debian makes it fairly trivial. [14:12] abentley: We can file a bug to fix that. ;-) [14:12] we store last common version [14:12] henninge: yeah, 207 seems the most reasonable. I didn't know about that RFC. [14:12] bigjools: Actually, IIRC we create a set of the common versions, and take the max. You need to do the same, but then walk back through the entry iterator until you hit that version. [14:13] cr3, I'm not sure if the Apache file will tell you more than that; and I'm only familiar with the api.[qastaging.|staging.|edge.]launchpad.net rules [14:13] It'll be pretty painless once you realise how awesome python-debian is. [14:13] what iterator? [14:14] bigjools: the changelog entry iterator from python-debian. [14:14] s/iterator/generator/ [14:14] gary_poster: thanks, that's definately something I can grep in launchpad but I'll still ask for the apache config in #launchpad-ops out of curiosity. I'm sure there's a trick or two I can learn [14:14] hang on [14:14] cool cr3 [14:15] cr3: Default Apache etags are generated from inode number, mtime and size. [14:15] yup [14:15] cr3: inode number obviously isn't portable between machines. [14:15] cr3: But if you reconfigure FileETag and make sure your mtimes are the same, the etags will match across machines. [14:18] bigjools: Check DSD._updateBaseVersion [14:18] And the getAncestry that it calls. [14:18] I need to step back from this [14:18] it's not straight to me exactly what needs to happen any more [14:19] adeuring, bac: hi there, are you available for a review? https://code.launchpad.net/~matsubara/python-oops-datedir-repo/improve-db-statement-matches/+merge/73059 [14:19] what I need is an example case [14:20] matsubara: sure, I'look [14:20] bigjools: Do you know how sync-source.py does this? [14:20] thanks adeuring [14:20] bigjools: It does it the same was as normal uploads: it calls dpkg-genchanges with -v$LATEST_VERSION_IN_UBUNTU. [14:21] This causes dpkg-genchanges to include in the changes file all changelog entries strictly later than $LATEST_VERSION_IN_UBUNTU. [14:21] cr3, now a question for you. :-) could you either make ~launchpad the admin of https://launchpad.net/launchpad-results , add ~launchpad to ~launchpad-results ? Since it is part of the Launchpad project, we are responsible for triaging bugs [14:21] Or remove launchpad-results from launchpad-project, possibly. [14:22] s/ add / or add/ [14:22] yeah that too [14:22] gary_poster: I think the problem with that is that I'm not a member of ~launchpad which might prevent me from working on the project, but please correct me if I might be wrong :) [14:22] Since it's not code managed by our team, it probably shouldn't be in launchpad-project. [14:23] cr3, yeah that leaves you two options [14:23] wgrant: I was following the process here though: https://dev.launchpad.net/CreatingNewProjects [14:24] wgrant: you're not talking about my aggregate changelog stuff any more are you? you're talking about gina? [14:24] cr3: That's for split-out parts of Launchpad. [14:24] cr3, I agree with wgrant that maybe it should not be in launchpad-project. Alternatively, if you add ~launchpad to ~launchpad-results, at least everyone responsible would be able to to what they need. [14:24] bigjools: They are intertwined, but I'm not talking about gina at all now. [14:25] why are you talking about parsing changelogs then? [14:25] bigjools: Because traditional syncs do that. [14:25] cr3, but unless there's a good reason to have it be part of launchpad-project, again, I agree with wgrant [14:25] bigjools: That's the baheviour we're trying to emulate. [14:25] gary_poster, wgrant: I have a call scheduled with lifeless later today, I'll add that to the agenda depending on where we want to go with the project [14:25] cool cr3 [14:25] wgrant: which requires fixing in gina AIUI [14:26] bigjools: Not if we make native syncs parse the changelogs too. [14:26] I'm easy either way, I did feel strange seeing the launchpad folks getting bug reports about the project so I feel your pain :) [14:26] euargh [14:26] for now then, cr3, perhaps you could simply triage https://bugs.launchpad.net/launchpad-results/+bug/833854 ? :-) [14:26] <_mup_> Bug #833854: Unable to submit results with results2launchpad on natty < https://launchpad.net/bugs/833854 > [14:26] bigjools: Classic/legacy/traditional/ohgodgoaway syncs don't use changelog_entry. They create a normal upload, which parses the changelog to work out which entries to include in the changes file. [14:26] wgrant: you said that gina doesn't import all of the changelog [14:26] bigjools: Right. [14:26] so we can't parse it [14:26] gary_poster: fair enough, done :) [14:26] bigjools: But that's not too relevant if we ignore gina. [14:27] thanks cr3 :-) [14:27] bigjools: No, it doesn't import all of the changes into *changelog_entry*. [14:27] bigjools: debian/changelog is imported into the librarian verbatim. [14:27] bigjools: There is no possible error there. [14:27] wgrant: ok *that* is what was confusing me [14:27] bigjools: changelog_entry is generated based on a dodgy heuristic. [14:27] let's ignore changelog_entry then [14:27] Yes :) [14:27] assume changelog is complete [14:28] If it's not, then it's still not a regression from the old method. [14:28] Because normal uploads and legacy syncs just parse the changelog. [14:29] I'm not convinced that changelog parsing is needed [14:30] if gina makes a composite one for a particular import then we never saw the oldver versions anyway [14:33] bigjools: If you fix gina to make a composite one, then maybe. [14:33] bigjools: But using the real changelog is definitely safer, and probably easier. [14:33] wgrant: hang on [14:35] right, I think I am finally tuned in. I need more sleep. [14:36] Perhaps I should change the docstring of changelog_entry to "Red herring." [14:37] bigjools: Anyway, you should probably revert that revision. [14:37] Or things could get *very* confusing and noisy. [14:37] hmmm [14:38] I mean, the current implementation is clearly practically flawed. [14:39] Because it will pick up PPA uploads etc. [14:39] it will yes [14:39] a bad oversight [14:39] Hmm. I wonder what happens if I delete a package and copy it back in. I guess it includes every SPR ever seen by Launchpad. [14:39] That could be amusing. [14:40] With the right SPN, that is. [14:44] I suppose I should sleep at some point. [14:44] bigjools: You're not around Monday, I suppose? [14:44] wgrant: I am actually [14:44] Oh. [14:44] I thought it was a holiday. [14:45] it is but I swapped it [14:45] Ahh. [14:47] matsubara: r=me [14:47] thanks adeuring [14:47] Night all. [14:48] night [14:48] adeuring, I have another one from oops-tools which is related to that MP: https://code.launchpad.net/~matsubara/oops-tools/829460-typeerror-qsmmx/+merge/73062 could you take a look as well, please? [14:48] matsubara: sure [15:15] gary_poster: you mentionned generating etags for api, wadl and json. however, I'm seeing an oddly short etag for .css files, for example, and I'm not seeing anything about FileETag in the apache configuration nor anything special in the launchpad code, any ideas where I should look? === beuno is now known as beuno-lunch [15:21] matsubara: kiko has a good point about your mp === al-maisan is now known as almaisan-away [15:50] adeuring, yep, I'll reply to kiko's comment. Apart from that, are there other things to change in the mp? [15:51] matsubara: no, otherwise, the MP looks good [15:51] adeuring, cool. I'll grab some lunch and sort this out afterwards. thanks for the review! === matsubara is now known as matsubara-lunch [15:51] matsubara-lunch: enjoy lunch! [15:52] matsubara-lunch: note that I'll have EOD in about an hour, and I'll leave home then [15:56] Project devel build #1,001: STILL FAILING in 4 hr 4 min: https://lpci.wedontsleep.org/job/devel/1001/ === beuno-lunch is now known as beuno [16:08] cr3, I don't know. I haven't touched ETags for CSS, at least intentionally [16:10] gary_poster: I think I found what's going on: seems to be a server context FileETag config I'm not seeing in the launchpad virtualhost. I'm double checking with chex just to make sure [16:10] that's all I could assume being from the outside looking in :) [16:10] hm, ok cool cr3 [16:16] can someone help me rationalize what goes under lp.services? for example, I would tend to think there shouldn't be anything that depends on a database layer there, but that probably means I don't get the purpose of that namespace === deryck is now known as deryck[lunch] [16:48] abentley, you looked at https://answers.launchpad.net/launchpad/+question/167859 last week. We have two almost identical SourceForge subversion branches, one of them importing and one of them not. I tried the failing one locally and it worked fine. I was thinking of asking LOSAs to clear out past attempts to see if that would help somehow. Does that sound reasonable, and/or do you have a better idea? [16:50] cr3, I think of it as "general-purpose sorts of things go here, like things that might go into a library, or things that are LP-specific but used as tools throught the code base." That said, I'll deny everything if you claim I told you anything about this. ;-) [17:13] abentley, I spoke too soon. The local checkout eventually failed after almost completing with "bzr: ERROR: A Subversion remote access command failed: REPORT of '/svnroot/ifolder/!svn/vcc/default': SSL handshake failed: Secure connection truncated (https://ifolder.svn.sourceforge.net)" [17:13] Perhaps I should assign this to Jelmer? [17:13] gary_poster: makes sense. [17:13] ok thanks === matsubara-lunch is now known as matsubara [17:19] sinzui: I think I found a bug in addMember: http://pastebin.ubuntu.com/675383/ [17:20] yes you have [17:20] oh, hold on [17:20] approved is a member, invited is not. how can that happen. I think approved would have to go to deactivated first [17:20] sinzui: or you could consider it a bug in the browser code. [17:22] abentley, is this a case where I add a team twice? Once I invite, the other party accept and the state goes to approve, but invite the team again. This should be a no op. there is nothing to do and no status should change [17:23] sinzui: yes, this is adding a team twice. [17:32] sinzui: how much of a no-op should it be? would we still reset the expiration date, etc? [17:33] I would not think so [17:35] I understand your point, but I do not think that is a way team admins force renewals. Lp sends them out and the other team accepts them. If the admin wanted to extend the expiration date, he can do that from +members [17:43] sinzui: Is there a legitimate way for a team to become PROPOSED? I can do it using force_team_add. If so, should the transition from PROPOSED to INVITED legitimate? [17:44] A team is proposed when the admin of the team is the user doing the add. [17:46] abentley, proposed -> invited should not happen. That action is an implicit acceptance by admins of both teams of the user, so I expect the membership to be approved [17:46] sinzui: ISTM that if the admin of the target team then attempts to add the team, the status should go to APPROVED? [17:46] we agree [17:51] Anyone else getting lots of errors in ec2 tests with complaints like this? [17:51] [] [18:00] what's the url that contains info regarding staging's database updates? === deryck[lunch] is now known as deryck [18:05] bdmurray, getting it for you, and it hasn't bee since the 15th IIRC. [18:06] gary_poster: okay, I thought I'd put the url in https://help.launchpad.net/StagingServer [18:06] explanation, btw, from stub: "the account running the update has lost its [18:06] sudo access, probably puppet fallout. I've opened RT #47499" [18:06] <_mup_> Bug #47499: Sources.list are set to Australia < https://launchpad.net/bugs/47499 > [18:07] bdmurray, is this what you had in mind? https://staging.launchpad.net/successful-updates.txt [18:07] gary_poster: yes firefox wasn't helping me remember it [18:07] cool [18:10] sinzui: with my changes, I think I can prove that TeamMemberAddView.add_action never triggers a TeamMembershipTransitionError. Should the same be true for Person.addMember itself? [18:12] abentley, that would be nice, but I think that is impossible given that users call that via poorly written scripts. [19:03] sinzui: ISTM that currently addMember can be used to demote someone from ADMIN to APPROVED or PROPOSED. Does that make sense? [19:04] NO [19:04] oh [19:05] I think there is a related bug. maybe you found the root cause [19:05] * sinzui looks [19:07] abentley, I wonder if you could help me for a moment. If you don't think you can do so easily, no problem; writing this has helped me gather good diagnostic information. I was going to put it on IRC, but it was too much, so here's what I have: http://pastebin.ubuntu.com/675475/ [19:08] mm, lemme try making that wrap... [19:08] abentley, try again: http://pastebin.ubuntu.com/675476/ [19:10] gary_poster: So when we say "garbage", we mean "stuff that the garbage collector should have collected". [19:11] gary_poster: I have poked at this before, and my impression was that the garbage collector is not perfect. [19:11] abentley, yes, I believe so. If I had to guess, I would say that the message I see locally is not being rendered for some reason [19:11] abentley, heh [19:13] abentley, bug 480157 which clearly states what you found [19:13] abentley, can I somehow force the file that the message references to unlock? [19:13] <_mup_> Bug #480157: IPerson.addMember() should not demote admins < https://launchpad.net/bugs/480157 > [19:13] gary_poster: So I can take a look at that test and see if the lock lifetimes are broken. [19:14] ok thanks abentley. Please let me know if I can help. Given that these are keeping my ec2 land from passing but they are occurring oin buildbot and not causing a stop-the-line, I guess I'll submit directly for now; I'll also shoot a quick note about the situation to the list. [19:17] gary_poster: the test looks correct. I think the puller really is failing to unlock the branch. [19:17] abentley, so it smells like a bzr error? [19:17] gary_poster: no. [19:17] oh? [19:17] gary_poster: it smells like a puller error. [19:18] * gary_poster thought puller was bzr, and is corrected [19:19] gary_poster: puller uses bzrlib. [19:19] gary_poster: but it is part of codehosting. [19:20] cr3: hi [19:20] ok, gotcha. abentley, do you want me to pull on that as a matter of my own edification, or should I leave you to it? [19:21] gary_poster: Have a look at it. There are supposed to be symmetrical Branch.lock(read|write)/Branch.unlock calls. It seems like they're not organized correctly. [19:22] cool abentley. I'll take a glance. May I treat you as a mentor for this one? [19:22] lifeless: hola senor [19:22] gary_poster: sure. [19:22] cool thanks [19:22] cr3: do you have skype ? [19:22] lifeless: I might, let me have a looksee [19:22] or I can do voip [19:22] gary_poster: The traditional pattern is that branch.lock_(read|write) is followed by a try/finally block that does branch.unlock. [19:23] makes sense [19:23] I think we have a decorator too - a with write_locked(branch): [19:23] lifeless: yep, marcatinterunion [19:23] IMBW [19:23] lifeless: indeed. [19:23] gary_poster: anyhow, you should be able to tell from the URL which branch isn't being unlocked appropriately. [19:27] gary_poster: Oh, it might be relevant that locks are tracked on a per-instance basis. So another instance of "default-stack-on" is probably being opened and locked elsewhere. [19:28] gary_poster: that would be why stack_on.unlock said "LockNotHeld" [19:29] abentley, hm, ok. per instance of what? is the puller in another process, or at least in another bzrlib instance? [19:30] gary_poster: per instance of the branch class. [19:30] gary_poster: class instance, not bzrlib instance or process. [19:31] ah ok abentley. [19:31] abentley, they shared directory names though. That's to be exoected? [19:31] expected [19:31] That is, the lock file when it said "LockNotHeld" was the same [19:32] path as the lock file that gc complained about [19:32] gary_poster: that's to be expected. [19:32] oh ok [19:33] (that seems mystifying on the surface of it, but I don't think digging into it will be necessary.) [19:33] gary_poster: if you open a branch twice, you get two instances of the branch class, referring to the same files. [19:33] abentley, ah so the instance does not hold the lock that exists? [19:34] (for another instance, and the whole multiple instance thing is presumably the whole point) [19:34] gary_poster: I don't understand the question. [19:34] yeah I think I just realized how obvious the answer to my confusion was [19:36] you can have multiple class instances, each pointing to the same file. In order to manage threads and processes and so on, bzr gives locks to its files. When I tried to unlock the instance, the error message was not "there is no lock" but "the lock over there isn't mine" [19:36] that was the obvious bit, yeah, abentley? [19:39] gary_poster: bzrlib isn't really built with threading in mind. The read-locking enables caching. The write-locking pysically prevents other bzrlib clients from making changes. [19:39] abentley, ack [19:40] gary_poster: The error message was "I'm not locked", without saying anything what other objects may or may not be locked. [19:40] ack [19:42] abentley, well...let me make sure I am on the right track here. if I have branch file F, and bzrlib branch instance A of F, and bzrlib branch instance B of F, and A has a lock, if I tried to unlock B, it would say "I'm not locked." That's what you mean here, yes? [19:42] gary_poster: right. [19:42] cool [20:03] abentley, the bug is within bzrlib, or we aren't using it correctly somehow. We never get the repo pack to unlock it ourselves, and bzrlib doesn't do it because the code path apparently expects "fallback.unlock()" (whatever that is) will do it, but that is not around for us. Here are the details: http://pastebin.ubuntu.com/675505/ [20:04] s/repo pack/repo back/ [20:04] cr3: bug 799271 [20:04] <_mup_> Bug #799271: cache entries get corrupted and lead to hard to diagnose failures < https://launchpad.net/bugs/799271 > [20:06] gary_poster: Okay, it looks like a bzrlib bug. [20:06] Yippie, build fixed! [20:06] Project devel build #1,002: FIXED in 4 hr 10 min: https://lpci.wedontsleep.org/job/devel/1002/ [20:07] gary_poster: I think the lock_read must not happen until we are sure that we'll add it as a fallback repository. [20:08] abentley, ok, that makes sense. Let me see if that works... [20:08] gary_poster: or else we need to manually unlock if we we're exiting without adding it. [20:08] right [20:10] cr3: https://dev.launchpad.net/ArchitectureGuide/ServicesAnalysis [20:14] abentley, I don't know how much of an indication of correctness this is, but ./bin/test -vvc lp.codehosting.puller.tests.test_worker passes (and without the locked file complaint) with the reordering you suggested. I'm inclined to branch bzr and make the change, but I'll probably need some serious help with a test. But you agree that I should make a bug, work up a full branch for an MP, and make an egg for LP, yes? [20:15] and I'll try for the test with help first :-) [20:15] though maybe hints on what tests to run would be good--I saw that the bzr test suite is 3.5 hours now [20:15] mm, I saw your faultline plugin; that might help [20:15] gary_poster: True. [20:17] gary_poster: Yes. For bzr itself, faulline adds an "--auto" flag that will automatically determine the tests to run. [20:17] gary_poster: IOW, "bzr selftest --auto" is your friend. [20:18] ok cool, abentley. So...in order to get faultline to trigger I just need to add a comment to where I intend to make the change? [20:18] then I can look at those tests, try to figure out what to write new, and go from there [20:19] gary_poster: You can do it that way, or you can just specify the file. [20:19] oh ok === bac changed the topic of #launchpad-dev to: #launchpad-dev ist: https://dev.launchpad.net/ | On call reviewer: - | Critical bugs: 237 - 0:[#######=]:256 [20:56] jcsackett, do you have a few minutes to mumble? === matsubara is now known as matsubara-afk [21:31] abentley, had trouble with faultline fwiw. the --auto hack uses FaultFinder.from_tree, while the trunk has FaultFinde.from_trees. Simply fixing the name is insufficient. [21:31] File "/home/gary/.bazaar/plugins/faultline/faultline.py", line 250, in run [21:32] finder = FaultFinder.from_trees(target.basis(), target, [21:32] AttributeError: 'WorkingTree6' object has no attribute 'basis' [21:32] bzr fault-line works fine [22:25] gary_poster: sorry 'bout that. [22:27] abentley, no apology needed, just telling you. you want me to file a bug? [22:38] i have an launchpad openid sso problem: [22:39] when lp:~gandelman-a logs into my site, he shows up with the local_id https://login.launchpad.net/+id/QAxGXJy [22:39] but his account page, https://launchpad.net/~gandelman-a [22:39] says his local_id should be https://login.launchpad.net/+id/KeeKF87 [22:39] and i can't work backwards from QAxGXJy to find out what account it's for. [22:40] why would it show up as something different, and how can i verify his identity?