[00:18] StevenK: howdy, how goes the convoy branch? [00:24] rick_h: I've not worked on it. If this critical regression branch gets finished off, I'll toss it at ec2 today. [00:25] StevenK: awesome, ok. wanted to check. I've got deryck starting to reivew my JS test branches so was curious [00:27] Hahahawefwefw [00:28] quick! someone get wgrant some water before he passes out! [00:29] I just discovered why Product:+index is so terrible. [00:30] ProductPackagesPortletView always calculates linkage suggestions, even if it's not going to show them. [00:30] And that's an expensive FTI query. [00:31] Is there a way to merge pipes, or push changes back into the parent? I accidentally created a pipe when I had uncommitted changes and now want these changes to be in the parent pipe. [00:32] huwshimi: They're just normal branches with handy aliases. [00:32] You should be able to say 'bzr merge :next' [00:32] That sort of thing. [00:33] wgrant: Ah awesome, that makes sense [00:36] * wallyworld punches wireless keyboard and mouse that just died [00:37] wallyworld: wired > * [00:37] i think it's the receiver. bollocks [00:38] * StevenK loves his wireless keyboard and mouse [00:38] mine is a logitech which should be better than a no-name brand you would think [00:39] I think they're due for replacement. I've played enough first person shooters that the labels for WASD are either completly or just about worn off [00:39] hah [00:39] That's why real keyboards don't have labels :) [00:39] Well, that and Dvorak. [00:39] i've never tried a Dvorak keyboard [00:40] vim + dvorak scares me too much [00:40] though I do love my buckling spring unicomp :) [00:41] wallyworld: I do hate that the two major choices for keyboard/mouse are Microsoft and Logitech [00:41] bah [00:41] http://elitekeyboards.com/ http://www.pckeyboard.com/ [00:41] yeah. i've have to buy another one today. can't not have one :-( [00:42] there's your real keyboards [00:42] rick_h: How much do you want to bet they don't ship to .au? [00:42] wow. the pckeyboard one even has a clitoris [00:43] wallyworld: yea, but no middle scroll mouse button :( guess it's a trademark thingy they can't do [00:43] * wgrant hates non-mechanical switches now. [00:43] StevenK: ah, ok got me there. [00:43] wgrant: you tried a topre yet? I can't get my wallet out to go that route [00:43] rick_h: No, but they intrigue me. [00:44] * StevenK prefers keyboards that are quiet [00:44] But they feel terrible :/ [00:44] * wallyworld prefers keyboard that work [00:44] wallyworld: You seem to be typing okay. [00:44] wgrant: do they? it's one of the few switches I've used [00:44] StevenK: check out the cherry brown [00:44] StevenK: on my laptop keyboard [00:44] those are pretty quiet switches [00:45] i'll have to undock for today [00:45] Googling for Cherry Brown isn't so helpful [00:45] and whenever i undock, it kernel panics [00:45] rick_h: I was replying to StevenK's quiet keyboards thing. [00:45] wallyworld: Handy. [00:45] yeah [00:45] well here goes..... [00:46] I believe mine has Cherry Browns. [00:46] They're certainly not quiet. [00:46] wgrant: oic [00:46] But they're a bit quieter than Blues :) [00:46] Bwaha [00:46] That sounds like a panic [00:46] wgrant: yea, I've got two of the leopolds, one in brown and one in blue [00:46] wgrant: So you prefer keyboards that sound like a typewriter factory? [00:46] from the elite keyboards place [00:46] StevenK: Yes, because they are so nice to type on. [00:47] wgrant: ++ [00:47] I used to prefer my ThinkPad keyboard to every other keyboard I'd tried. [00:47] But now it feels terrible compared to this :/ [00:48] rick_h: $54USD to ship a keyboard from elitekeyboards [00:48] StevenK: ouch [00:49] pccasegear has some reasonable keyboards. [00:49] They're Australian. [00:50] http://www.pccasegear.com/index.php?main_page=index&cPath=113_1277 [00:50] All four of them are corded :-) [00:50] cool, the leopolds are two that I have used. wgrant you used cherry reds? another one I'm interseted in [00:51] Why would you want a wireless keyboard :/ [00:51] You have to have wires to your monitor, and your keyboard is right in front of your monitor... [00:51] rick_h: I just have a Das Keyboard Ultimate S [00:51] Which has Browns AFAIK [00:51] wgrant: ah, gotcha [00:52] I've never tried reds. [00:52] heh, tonight at the local coders meet up is keyboard night. Brought out the leopold blue, brown, happy hacker, and unicomp [00:52] I need help... [00:52] wgrant: My keyboard isn't [00:52] I have a keyboard drawer [00:53] I used an actual Model M for a while. [00:53] * StevenK wonders why calling this view is returning "" [00:54] wgrant: my unicorn is one of those split model M's. Like a model M and a MS Natural had babies, but they for for like $1k on ebay :/ [00:54] is bug 732510 f-r really? [00:54] <_mup_> Bug #732510: poppy-sftp should connect to the database with a unique user ID < https://launchpad.net/bugs/732510 > [00:58] wallyworld_: That was a nice panic? [00:58] * StevenK puts together a deployment [00:59] StevenK: i went and got a drink before i rebooted [01:11] lifeless: No [01:11] lifeless: Needs downtime. [01:21] wgrant: paramses?! [01:23] lifeless: Well, it's a sequence of sets of parameters :( [01:23] Yes, it's terrible, but params is already used for the singular. [01:35] rules, searches, clauses, conditions, param_sets, ... [01:35] I dunno, but accepting it as terrible seems undesirable :) [01:36] lifeless: It's a sequence of BugTaskSearchParams objects. [01:36] yes [01:37] which all get unioned together [01:37] not anded [01:37] this is worth calling out perhaps [01:37] Yes [01:37] -> alternatives [01:37] -> alternate_parameters [01:40] alternatives it is [01:46] wgrant: bug 929241 [01:46] <_mup_> Bug #929241: ProductPackagesPortletView calculates suggestions when it knows it won't show them < https://launchpad.net/bugs/929241 > [01:46] wgrant: why can't that just be the first line of setUpFields ? [01:54] lifeless: Because then setUpWidgets complains that there aren't any fields. [01:55] ah. thanks [01:55] If I avoid setUpFields, I have to also avoid the rest of the form code. [01:57] yea [02:02] WidgetInputError('name', u'Name', LaunchpadValidationError(u'unique-from-test-team-py-line529-100007 is already in use by another person or team.')) [02:02] I call shenanigans [02:12] maybe i was dreaming but i recall hearing about a C API for launchpad. [02:12] if it exists, where can i download it? [02:12] I'm not aware of one. [03:09] lifeless: the docs say any alter table patch cannot be applied live. is this true even for "alter table xxxx drop constraint...." patches? [03:10] Doesn't that take a full write lock on the table? [03:10] no idea [03:10] wallyworld_: It should still be fine to apply during FDT [03:10] but i would have thought a drop would be virtually instant [03:10] It still needs a lock [03:11] sure, but a very short one [03:11] which hopefully we could tolerate? [03:11] wallyworld_: Applied live refers to "Do not needs us to go down, can be done live" [03:11] I think we should tolerate it. [03:11] Although it will cause timeouts. [03:11] Ah, except slony. [03:11] So no, it requires a full lock across the cluster. [03:12] Which is FDT only [03:12] If we weren't using slony, I think it would be acceptable, as it would only cause things to time out for a few seconds. [03:12] Yes. [03:12] ok, will redo patch nr [03:12] nr? [03:13] wallyworld_: yes, it is true. [03:13] nr is an (i thought) common abbreviation for number [03:13] wallyworld_: Right, but this doesn't involve a number change. [03:13] The docs in allocated.txt are out of date. [03:13] wgrant: my patch nr ends in -1 [03:13] wallyworld_: the reason it says cannot be applied live is because it cannot be applied live. [03:13] 'tis irrelevant nowadays. [03:14] so do i make the patch nr end in -0? [03:14] wallyworld_: in a busy cluster with e.g. backups, we might wait up to 24 hours to get a lock. [03:14] wallyworld_: It no longer matters. [03:14] wallyworld_: so even without slony, it would be unsafe to apply such a patch live. [03:14] lifeless: It's safe if you are careful. [03:14] ok, will mark the mp as needs review then [03:14] wgrant: FSVO [03:14] wgrant: its not trivially repeatedly safe [03:14] * StevenK stops looking at this visibility branch for a bit, since it is making him very very frustrated. [03:15] lifeless: It involves less downtime than fastdowntime (up to ~9s of timeouts, rather than 90s of outage) [03:15] * wallyworld_ taps fingers waiting for diff [03:15] wgrant: the goal of the process is to make it trivially and repeatedly safe to evolve the schema; having complex rules that only db gurus can get right is not a good way to achieve this. [03:15] But because of slony we can't do it. [03:15] Mmm [03:15] True [03:15] wgrant: more than 9 seconds, even without slony. not unless and until we have sorted a bunch of other stuff out. [03:16] wgrant: (other stuff being e.g. backups - anywhere in the cluster. Cronscripts with locks. Etc. [03:16] lifeless: Assuming that you are smart and don't apply when there are long transactions. [03:16] (read locks) [03:16] Which is what I meant by "careful" [03:16] wgrant: hard to automate -well- because you cannot tell what /will/ be long when you start, only what *is* long already. [03:16] lifeless: Lies. [03:17] crontab disablement is handy. [03:17] wgrant: e.g. the fdt kick everyone off logic would still be needed. [03:17] Nope. [03:17] hurry up diff!! [03:17] You set a global flag which tells longrunning jobs to GTFO [03:17] s/GTFO/not start/ [03:17] wgrant: I agree that in an ideal world it could be made moderately reliable, but I won't go further than that :0 [03:17] Wait for any stragglers to go away. [03:17] Apply patch, which waits for contending requests to go away [03:17] Done [03:18] wgrant: modulo bugs, modulo stale connections, modulo another sysadmin doing something ... [03:18] Yes. [03:18] wgrant: the patch during that period breaks the world, of course. [03:19] wgrant: taking everything out as fdt does, ignoring cluster sync, we can get less than that 9 second downtime window, and do it reliably (ignoring cluster sync because this is a post-slony discussion) [03:19] lifeless: It breaks things that use that table for up to $REQUEST_TIMEOUT. [03:19] Rather than breaking the entire application for $SLONY_TIME [03:19] wgrant: and it also pauses all other incoming queries for that same window [03:19] Isn't that [03:19] "breaks things"? [03:20] That was my definition of "breaks things". [03:20] Oh, you mean it will hang threads and therefore cause queueing? [03:20] True. [03:20] yes [03:20] knock on effects [03:20] rather than 'LP is down, come back soon' folk see 'LP is slow with no notice' [03:20] this is undesirable [03:21] Mmm [03:21] Slow for 5 seconds is better than down ever. [03:21] s/5/10/ [03:21] debatable; there is no reason FDT can't be that fast *anyhow*, so ... [03:22] given a less risky and highly repeatable process, with similar limits on performance, why would you want a more risky process? [03:23] wgrant: are you working on that new loggerhead bug right now? should I delay releasing? [03:24] lifeless: I'm 30s from pushing [03:24] Just found another couple [03:24] cool [03:28] * wgrant stomps on the branch scanner's face [03:28] Hard [03:28] That's okay, it has another one [03:29] Unfortunately. [03:29] wallyworld_: this ajax submit thing is very slow [03:29] Or is it broken... [03:29] wasn't slow for me before [03:29] I'm leaning towards broken at this point. [03:30] Oh [03:30] It's the longpoll bug [03:30] Had too many MPs open. [03:31] rotfl [03:31] cat just fell off the desk [03:31] meow [03:31] I have pillows behind my LCD [03:31] wgrant: what's the longpoll bug? [03:31] nice big one stacked two deep; both cats can lie up there [03:31] wallyworld_: Web browsers suck [03:31] * wallyworld_ hates cats [03:31] wallyworld_: They try to avoid DoSing, by limiting the number of connections per hostname [03:32] the white one stretched out, arched back, and slid off the edge of the pillows, and the desk, and fell onto my minitower case :) [03:32] ah :-) [03:32] lifeless: https://code.launchpad.net/~wgrant/loggerhead/bug-929275/+merge/92194 [03:32] Finally [03:32] is that a request for a code review ? [03:33] s/request/fishing attempt/ [03:33] 31 from paste import httpexceptions [03:33] 32 +from paste.httpexceptions import HTTPNotFound [03:33] Fail [03:34] bah [03:34] wgrant: so, I wonder, would a wsgi middleware to map NoSuchId and NoSuchFile -> 404 be a good idea, or hide to many legit issues [03:34] lifeless: I don't think that's a good idea. [03:34] right now, for instance, the cod eyou've added can mask some classes of file system error [03:35] because they just trigger NoSuchFile [03:35] I used NoSuchId and NoSuchRevision [03:35] Aren't those pretty specific? [03:35] oh hmm, [03:35] I misread slightly [03:35] so yes, that should be pretty narrow [03:36] anyhow, fine at first glance, I'll let StevenK who seems to be in a reviewing mood do it ;) [03:36] StevenK: FIxed [03:36] lifeless: I think that could mask legitimate bugs. [03:36] And encouraging people to write terrible code is not what Loggerhead needs more of :) [03:41] wgrant: r=me [03:42] StevenK: Thanks. [03:42] * wgrant lands. [03:47] lifeless: Should I upgrade Launchpad's? [03:47] Or do you have stuff planned? [03:52] wgrant: just release stuff from toshio, so please go ahead [03:52] k [03:54] WidgetInputError('name', u'Name', LaunchpadValidationError(u'unique-from-test-team-py-line529-100007 is already in use by another person or team.')) [03:54] LIES! [04:05] lifeless: the GIL contention between xmlrpc-private and app seems to be growing :( [04:05] we see lots of https://lp-oops.canonical.com/oops/?oopsid=OOPS-1e8d134f2de3a8101bf09a55c20c00ae now [04:07] 4 seconds getting feature flags? [04:07] Precisely. [04:07] ugh. [04:07] ugh ugh ugh ugh ugh guh [04:07] It's not unexpected. [04:08] doesn't make it /nice/ :) [04:08] so, we need to figure an appropriate ratio and then manually split the cluster, I guess [04:08] Yeah [04:08] I haven't seen anything suggesting haproxy knows how to count across two clusters yet [04:08] I don't think so, no. [04:08] that, or we teach lp how to server both from the same port [04:09] But we can do the rebalancing entirely in haproxy [04:09] which is nice [04:09] I considered that, but that is scary. [04:10] it could be a pgbouncer limit being hit [04:10] Unlikely [04:10] first query [04:10] Hm, really? [04:10] https://lp-oops.canonical.com/oops/?oopsid=OOPS-1e8d134f2de3a8101bf09a55c20c00ae#statementlog [04:10] Oh, xmlrpc, so no auth [04:11] I've never seen that on a webapp request, though. [04:11] in main servers we get the ff early too, to determine query timeout [04:11] should be the first query everywhere [04:11] I don't think we get it before auth. [04:11] * wgrant checks. [04:11] webapp threads are probably never idle [04:12] fsvo idle [04:12] SQL-main-slave SELECT getlocalnodeid() [04:12] is the first for webapp [04:12] or more importantly, the number of webapp errors probably dwarfs the number that would match this... [04:12] Right. [04:12] which is on -slave [04:12] Mmm, true. [04:12] so, two theories; whats the cheapest thing to do [04:13] Graph pgbouncer connections, which UI has been trying to do this week? :) [04:13] At least that's what #webops has looked like. [04:13] excellent [04:13] I've only glanced, though. [04:13] as a starting point that makes sense [04:13] I wonder if pgbouncer logs enough that we can short circuit that [04:13] I'll check when these started [04:13] Because we hit the FD limit last week [04:14] when we moved session to go through pgbouncer. [04:14] e.g. we've just added X appservers (have we?) -> more connections used [04:14] No, they've not been added yet [04:14] Also someone needs to add the new prefixes to oops-tools once they are hactive [04:14] At least I hope not [04:14] Since there's still queue depth issues [04:17] Last I heard we were still waiting for more RAM and the extra CPU in one appserver [04:17] HMmm, you may be right. [04:17] On the pgbouncer thing [04:17] Unfortunately I started pruning properly again yesterday, so we don't have full records. [04:17] But I can't see too many similar OOPSes before the 31st. [04:26] I think we should investigate pgbouncer [04:26] 4 seconds in the GIL is pretty extreme, we were seeing that when we had 20 threads running on private-xmlrpc full-tilt, or something crazy like it [04:27] its more plausible to me that this is a queued connection in pgbouncer [04:27] at least on the data we have [04:32] grrr, wtf, quoting *everything*. sob. [04:35] What's quoting everything? [04:35] handlebars [04:35] shouldCompileTo("{{awesome}}", {awesome: "&\"'`\\<>"}, '&"'`\\<>' [04:35] its quoting " and ' ` [04:36] some might say this is overkill [04:36] "' need quoting [04:36] ` not really [04:36] why does ' need quoting in a template expansion ? [04:45] lifeless: Attributes [04:45] Some misguided folk use ' to delimit XML attributes. [04:45] It's perfectly valid, if a little discouraged and quite rare. [04:52] nah [04:52] ah [05:01] hi [05:01] is it a regression, or a known problem, that attaching a patch to a bug no longer shows up in the comment timeline? [05:03] oh well, https://bugs.launchpad.net/launchpad/+bug/929313 [05:04] <_mup_> Bug #929313: patches aren't mentioned in the bug comment where they were attached < https://launchpad.net/bugs/929313 > [05:05] I'm not sure it is either [05:05] o/ [05:07] wgrant: Can haz help? With http://pastebin.ubuntu.com/834814/ and running the new feature flag test I get WidgetInputError('name', u'Name', LaunchpadValidationError(u'unique-from-test-team-py-line529-100007 is already in use by another person or team.')) [05:09] StevenK: Looking [05:14] StevenK: I'm not sure that's a problem. [05:14] StevenK: (Pdb) view.request.response._headers [05:14] {'location': ['http://launchpad.dev/~unique-from-test-team-py-line529-100007']} [05:14] It created it anyway, and redirected. [05:14] it's possible that you're not meant to render the form in that case. [05:18] So I'm not supposed to call view()? [05:18] IIRC redirects have no body -> no need to render ever [05:19] I'm just a bit unclear what I should do in that case. [05:19] Just call c_i_v() and then check the team exists and is private? [05:21] StevenK: I believe so. [05:22] AssertionError: !=: [05:22] reference = [05:22] actual = [05:22] :-( [05:25] StevenK: Also, lint. [05:25] Trailing whitespace everywhere! [05:26] If everywhere is 3 places [05:26] Anyway, fixed. [05:26] It's bright red. [05:26] So yes, everywhere. [05:27] * StevenK isn't sure why visibility isn't set [06:45] ImportError: No module named pqm.pqm_submit [06:49] Anyone know where that is supposed to be pulled in from? I don't see this bzr plugin in sourcecode. [06:49] stub: The bzr-pqm package [06:50] Ta [06:50] Are you on precise? [06:51] Which is installed [06:51] Oneiric [06:52] Hmm [06:52] Possibly 2.6 vs 2.7 [06:53] My system bzr is happy with the pqm plugin. Not sure how Launchpad's bzrlib is supposed to load the system plugin. [06:54] Could be right about 2.7 - system bzr uses 2.7 and lp uses 2.6 [07:05] Nah [07:05] So how is the pqm plugin from the package supposed to get loaded when bzrlib.plugins is an egg? [07:06] The system path is shadowed [07:06] bzrlib.plugins is magical. [07:06] It can pull stuff in from lots of places. [07:06] I don't know how, though. [07:15] stub: Will postgres plan through a view as if it wasn't there? [07:15] Yes [07:15] It gets expanded inline and the whole thing planned [07:15] Right. Thanks. [07:21] stub: You're not really concerned with say, planner changes between 8.4 and 9.1? [07:23] StevenK: Usually things get better with a few regressions. [07:36] webops ping: Need the pqm pre-commit hook tuned [07:37] Or maybe just removed is better no [07:37] w [07:37] Need 'Makefile' added to the list of exceptions along with *.sql and fti.py [07:38] stub: Why? [07:40] stub: We've never had problems with database/schema/Makefile changes accidentally landing on devel, so I don't see any point in restricting them. [07:50] sorry... on the phoe [07:50] Had to change Makefile rules for the newly labotimized fti.py invocation [07:51] lobotomised even [07:51] stub: *-0.sql and fti.py are the only things that are *forbidden* [07:51] I think I should give up typing today [07:51] Everything else is allowed. [07:51] Ahh... so my fti.py changes are the trigger [07:51] Yep [07:52] webops ping: Please remove the precommit hook then [07:52] Heh [08:00] stub: where would I find this? [08:00] ah, think I've found it [08:02] stub: is this for /home/pqm/archives/rocketfuel/launchpad/devel or /home/pqm/archives/rocketfuel/launchpad/db-devel or both? [08:05] precommit_hook=[ -z "$(bzr status -S database/schema/ | grep -e '\(-0\.sql\|/fti.py\)$')" ] [08:05] this one? [08:10] hloeung: that is the one [08:11] hloeung: It is no longer helpful, and a hindrance for my branch [08:14] lifeless: Do you really think it is worth keeping memcache around for just the blog info? [08:22] stub: ok, commented out [08:22] Ta [08:26] stub: well its also used as a cheap temp store for some migrations [08:27] stub: I nearly wrote 'and move the blog to something else' before I remember that [08:27] StevenK: I expect us to get fucked over in short order by pg 9.1, but the really traumatic stuff I think was in 8.4 [08:28] StevenK: when a bunch of old implicit optimisations got derailed by CTE AFAICT [08:51] wgrant: I'm not sure I see the logic of "we've never had a problem, so let's remove the prevention of a problem" [08:53] mthaddon: For some things, sure. [08:54] can we change it rather than removing it altogether? [08:54] mthaddon: The particular case I was arguing against was a suggestion to *add* a new restriction that would not have stopped any previous problems. [08:54] But in general I think that restriction does more harm than good. [09:04] good morning === almaisan-away is now known as al-maisan [10:19] rick_h: JS on qastaging seems to be pretty broken. [10:19] LPJS is not defined [10:20] rick_h: At least on +register-merge === matsubara-afk is now known as matsubara [11:47] wgrant: looking [11:52] wgrant: @#$#@ see it. Putting up a mp right now. Very stupid [11:53] rick_h: I got annoyed with the regression branch so much I pushed convoy to the PPA and put up a small MP for convoy. [11:54] rick_h: I also tossed combo-url at ec2, which failed with codehosting failures. [11:54] StevenK: or wgrant can you review: https://code.launchpad.net/~rharding/launchpad/graph_lpjs_928500/+merge/92257 [11:54] very small stupid me change that's got qastaging broken JS [11:55] StevenK: ok, thanks for the heads up. Want to shoot me the failed tests and I can peek at them today? [12:05] rick_h: Forwarded. Not sure if it's my fault. === rick_h changed the topic of #launchpad-dev to: https://dev.launchpad.net/ | On call reviewer: rick_h* | Firefighting: - | Critical bugtasks: 4*10^2 [12:23] adeuring: ping, can you peek at the MP please? https://code.launchpad.net/~rharding/launchpad/graph_lpjs_928500/+merge/92257 [12:23] rick_h: sure [12:41] rick_h: r=me [12:42] adeuring: thanks === Ursinha` is now known as Ursinha [13:48] bah, whats that thing where you can't sleep? [13:51] day? [13:55] hah yes [13:55] hey, so I've got the db hardware quote moving again [13:56] should have updated options soon and will rope flacoste into it when he is back on deck [13:56] stub: herb is proposing SSD, perhaps with the same RAM we have today, or even a reduction. Your thoughts? [13:57] SSDs are great if you pick the right hardware (and a disaster waiting to happen if you pick the wrong ones, such as the ones with volatile write caches on board). But not sure if they help our performance since we see little disk activity overall. === jcsackett changed the topic of #launchpad-dev to: https://dev.launchpad.net/ | On call reviewer: rick_h*, jcsackett | Firefighting: - | Critical bugtasks: 4*10^2 [14:00] Not sure what RAID setup we would need - need to scan the section in the PG performance tomb. [14:00] Maybe a single RAID5 would be fine offsetting cost, or maybe we would still need two channels or an expensive RAID0+1 setup. [14:00] stub: we do have a number of cold-cache bugs open [14:01] stub: I know we see little absolute disk IO, but when we do its all concentrated AFAICT :) [14:04] gmb: I'm taking on branchscanner-timeout-bug-808930. Can we chat? [14:06] abentley: Sure, can you give me a little while to get to a point where I can switch? Say 14:30 UTC? [14:07] gmb: That's stand-up time. 15:00? [14:07] abentley: Sure, 15:00 would be fine. [14:07] gmb: great. [14:07] good night lifeless ;-) [14:10] flacoste: heh I wish. [14:11] flacoste: we finally got cynthia to sleep properly, fingers crossed. And now *I* have insomnia. [14:11] lifeless: that's so not unusual! [14:11] stub: whats your feeling about RAM usage with SSD's? Could we keep it at todays levels, even with more appservers; or could we even reduce it? [14:12] stub: and yeah, please do scan the PG performance tomb and get back to me [14:12] flacoste: \i/ [14:12] lifeless: I'd need to do research or benchmarking. Spinning plates of rust could end up faster if we can afford more RAM [14:12] so, herb is going to get options on SSD w/128GB of RAM [14:13] lifeless: similarly, you are right that we might be able to get away with less RAM with SSDs (although SSD access is still slower than cache) [14:13] I think we need to advise on the raid setup for those options? [14:13] my intention would be raid 10 [14:13] Yes. There have been several threads on the mailing lists. [14:13] herb: oh hai! [14:13] even with SSD. :) [14:14] herb: over hardware raid 5? For paranoia or throughput reasons? [14:14] * lifeless guesses at less rewrites-of-sectors [14:14] (+ performance) [14:14] stub: both [14:15] herb: ok, since thats about as good as it gets, I don't think we need to ask you for a specific config :) [14:15] herb: And one or two channels? We have two partitions, but that might be unnecessary since we are not queuing up writes for the right bit of rust to be in the right point in space/time. [14:16] my intention was a single channel SAS. even with SSD RAID 10 we won't be able to saturate the channel (nor could with with 16 or more spindles) [14:16] stub: the controller would still have a limited tagging queue, which can still saturate; but I doubt we would have an issue given our current write load [14:17] but I'm happy to take input on that. [14:17] elmo has the same book I havem but I'll trawl the mailing lists for the more resent threads. Only major thing I recall is many drives cheat by having write caches that fail for disaster recovery (including 'enterprise' devices) [14:17] herb: we could add another channel later, right ? [14:18] in any case, let's get some order of magnitude pricing. it might show that SSD is a non-starter and RAM is the way to go. [14:18] I suspect it is unnecessary [14:18] herb: if we can add another channel later, I think its totaly fine to start with one; SSD is -very- nice these days :) [14:18] lifeless: indeed we could. [14:19] so, we all seem agreed - lets defer the question [14:19] I think we should save money on RAM and buy me a coffee machine. Much better performance improvement. [14:19] use your frequent flyer points :) [14:19] I find your claim to be suspect. [14:20] herb: lol :) Also, I can point you at some hair raising queries/schema bits, which no amount of hardware can compensate for :) [14:20] KLM, Air France, Emirates.... don't think I've earned a real FF point in ages... [14:22] Anyway, off for dinner at 9:20pm because some of us have this non-sleeping thing from the other end. [14:24] ciao [14:26] herb: Consider a combination of SSD and spinning metal. No point wasting SSD on logs and dumps. [14:27] stub: good call. will do. [14:30] abentley, adeuring, rick_h -- https://plus.google.com/hangouts/extras/talk.google.com/orange-standup [14:31] stub: herb: May I suggest the san rather than spinning metal for those offcuts ? [14:31] probably faster than spinning metal ... [15:01] gmb: mum-ble or han-gout? [15:01] abentley: Hangout works for me - besides, I can't guarantee that Mumble will. Bear with me a sec... [15:03] * gmb waits for firefox to get its act together [15:13] adeuring, deryck: are we done with bug 829074? or is there some other branches that need to land? [15:13] <_mup_> Bug #829074: Show bugs that are not known to affect "official" upstream < https://launchpad.net/bugs/829074 > [15:15] flacoste, on call with adeuring now. explain shortly. [15:15] deryck: ok [15:21] flacoste: there is a minimal fix in place: you can filter by upstream target, if a packaging link exists [15:22] flacoste: i am working on a more comprehensive fix: [15:22] under the label "bug 2325" [15:22] <_mup_> Bug #2325: Distro CVE report: permission denied! < https://launchpad.net/bugs/2325 > [15:22] the idea is to porvide an option to select arbitrary upstream targets [15:23] well, not completely arbitrary, but any product or other source pachkage [15:23] erm, i meant bug 232545 [15:24] <_mup_> Bug #232545: resolved_upstream list does not do product / source package matching < https://launchpad.net/bugs/232545 > [15:24] adeuring: is the comprehensive fix requested by the stakeholders also? [15:24] anyone else unable to use ec2 commands b/c of pqm issue? did i miss an email about something i need to change? [15:24] the bug report was a bit vague... [15:24] jcsackett: I am, abentley is/has fixed it. [15:24] lifeless: ah fantastic. thanks. [15:25] jcsackett: I haven't tried again since, so i don't know where teh fix has gotten up to. [15:25] lifeless: mur? [15:25] abentley: the pqm-submit config stacks patch [15:25] abentley: (IIRC the various bits correctly) - a few days ago. [15:26] lifeless: Yes, I've fixed the lp-land command. Does that also fix ec2 land? [15:26] abentley: oh, I thought it did/would :) [15:26] abentley: IIRC the error in ec2 land happens when it invokes bzr lp-land to get the signed email for later delivery [15:27] jcsackett: can you pastebin the transcript showing hte error? [15:27] lifeless: sure, one moment. [15:28] adeuring: might be worth talking to brian and bryce about it [15:28] lifeless: Oh, I didn't think it shelled out to lp-land. [15:28] it might be the thing rick has fixed [15:28] adeuring: just to make sure we actually implement what they need [15:28] lifeless: http://pastebin.ubuntu.com/835337/ [15:28] in which case pulling trunk will fix it [15:28] flacoste: ok [15:29] jcsackett: ah, thats not what I saw a few days back [15:29] jcsackett: That looks unrelated to the issue I fixed. [15:29] lifeless, abentley: well, darn. [15:29] jcsackett: that looks like what stub experienced [15:29] lots of different issues cropping up here, huh? :-P [15:29] jcsackett: system plugin path isn't on your bzr plugins path (is all I know) [15:29] jcsackett: Possibly pqm-submit isn't installed on the box in question. [15:29] jcsackett: stub filed a bug [15:30] adeuring: from the launchpadlib pseudo-code that Bryce shows in the bug report, i think it was fine to make this work for linked_product only [15:30] adeuring: best to validate with them directly [15:30] jcsackett: look in latest-bugs for LP [15:30] pqm-submit is installed, abentley. i was hoping that was the issue too, since it was easily resolved. :-p [15:30] flacoste: ok [15:30] lifeless: ok, thanks. [15:33] lifeless: yup, i'm experiencing the same thing. thanks for pointing me to the bug report. [15:41] danhg, I have 3 draft posts on blog.launchpad.net. Do you want to read them? [15:49] adeuring, abentley, rick_h -- I'm starting on the open questions now. for real. :) [15:49] deryck: excellent [16:04] allenap or gmb -- if an external tracker is a Trac bug tracker, and we're syncing comments, shouldn't the status also update based on upstream bug? [16:04] Yes [16:04] deryck: Yes, it ought to be. [16:04] there's no mapping required between trac and us, right? === al-maisan is now known as almaisan-away [16:08] ah, not updated in awhile. resetting watch it is. [16:13] bigjools or jelmer -- I've got a question about failed recipe build I don't know how to answer. Can I assign to one of you, or you point me to who? :) [16:14] deryck: link? jelmer was just helping someone in #lp and curious if it's the same [16:14] rick_h, https://answers.launchpad.net/launchpad/+question/186193 [16:15] deryck: ok, different one, nvm [16:15] rick_h, thanks anyway :) [16:19] deryck: I really have NFI what's going on there. [16:19] other than something wants more memory than it can get [16:20] deryck: however, it's Java, so all bets are off, we've had this before with java builds [16:22] deryck: that just sounds like the issue with java using too much memory on the buildds [16:22] adeuring: that launchpadlib bug is a dupe of a server side bug, i forget the # though. [16:23] adeuring: you may want to mark it as such to avoid duplicate analysis [16:23] lifeless: right [16:23] bigjools, jelmer -- ok thanks guys. I'll find the bug and point the user at it. [16:23] adeuring: its almost certainly fallout from the materialised INCOMPLETE enums [16:23] adeuring: (but you know that :P) [16:23] bigjools: bug 693524 might be related [16:23] <_mup_> Bug #693524: Daily builds of Java packages fail: "Could not reserve enough space for object heap" < https://launchpad.net/bugs/693524 > [16:23] yeah ;) === fjlacoste is now known as flacoste [16:42] abentley, adeuring, rick_h -- questions are all caught up. answered or assigned them all. [16:42] deryck: win! [16:43] \o/ [16:45] adeuring, have you done interrupts today? ;) [16:45] deryck: working on roject review [16:45] adeuring, awesome! [16:45] adeuring, I shall quit nagging you now. :) [16:45] ;) [16:46] adeuring, was about to ratchet it up to public internet shaming, but all is well now. ;) [16:46] I had alreading registered adeuringsucksatinterruptduties.com [16:46] :) === deryck is now known as deryck[lunch] [17:33] rick_h, thanks for reviewing https://code.launchpad.net/~linaro-infrastructure/launchpad/workitems-model-classes/+merge/92174 . could you have a look at my update and let me know if I'm on the right track? [17:34] mabac: will do, sec [17:38] rick_h, absolutely no rush. :) thanks! [18:00] lifeless: What's the best way to examine an OOPS that occurred on a dev instance (launchpad.dev) ? [18:00] abentley: my preferred way is to glue oops-tools into the rabbit instance and have them come up in the normal web UI [18:01] any oopses before you set that up will have been tossed by rabbit (as the exchange would have had no queue attached to it) [18:01] on https://dev.launchpad.net/QA/OopsToolsSetup there is a 'deploying locally' section [18:01] lifeless: I have an oops in /var/tmp/codehosting.test/2012-02-09 [18:02] abentley: ah, interesting - must have been running without rabbit active [18:02] in which csae the simplest thing is dump-bson [18:02] sorry, bsondump [18:02] which is in utilities/bsondump, or if you have a buildout of oops-amqp in bin/bsondump of it [18:02] e.g. bsondump $path [18:03] lifeless: cool. [18:04] lifeless: utilities/bsondump says "No module named bson", but "bin/py utilities/bsondump" says "'module' object has no attribute 'decode_all'" [18:04] you can load that into the web UI if you want using datedir2amqp from python-oops-datedir2amqp [18:05] abentley: argh, that blows. [18:05] abentley: I know the one in oops-amqp works well [18:06] erm, wrong project :< [18:06] abentley: quick start: [18:06] branch python-oops-datedir-repo [18:06] link in the eggs and download-cache from your LP work area [18:07] lifeless: I installed python-bson with synaptic, and it works with that. [18:07] ./bootstrap ... [18:07] abentley: ok cool === deryck[lunch] is now known as deryck [18:17] mabac: hey, sorry, was going to ping you in irc but you stepped out [18:17] mabac: replied to your MP, the model definition is different with stormbase [18:17] rick_h, sorry. I'm going in and out of sessions at Linaro Connect [18:17] rick_h, thanks. I'll check in a bit [18:17] mabac: I hooked you up with an example in the blueprints directory there with you [18:18] let me know if that doesn't help [18:22] rick_h, awesome. thank you! [18:23] mabac: thank you for working on it [18:23] my pleasure :) [19:09] gmb: The transaction killer is not related to bug 808930 AFAICT. The jobs are killed by the job-running infrastructure, because they run too long. It's not length-of-transaction; it's length-of-job, which is tougher to optimize, because you'll be lucky if you only have O(n) complexity. [19:09] <_mup_> Bug #808930: Timeout running branch scanner job < https://launchpad.net/bugs/808930 > [19:10] And with O(n) complexity, it's always possible to time out by increasing n. [19:27] hi, abentley, i've been looking around and cannot figure out the story for nested branches with bzr. is it in 2.5? can you point me somewhere to find out about them? [19:28] bac: Nested trees? Not implemented yet AFAIK. [19:28] abentley: ok. i'd heard they were [19:28] bac: You don't mean colocated branches, do you? [19:29] no [19:29] nested, as in the equivalent of svn externs [19:29] bac: jelmer is planning to implement them, but he was nowhere near finished at the thunderdome. [19:30] abentley: ok, thanks === matsubara is now known as matsubara-afk === almaisan-away is now known as al-maisan [22:04] wallyworld_, wgrant, jcsackett, StevenK, I think this summarises my day reading the voucer/commercial subscription code: http://people.canonical.com/~curtis/wtfpm.jpg [22:07] lol [22:08] I shall have to show that picture to my colleagues tomorrow :-) [22:14] sinzui: http://pastebin.ubuntu.com/835839/ [22:17] sinzui, wallyworld: bug 929352 is what has hit me. [22:17] <_mup_> Bug #929352: ec2 land unable to find pqm plugin < https://launchpad.net/bugs/929352 > [22:19] jcsackett, set the env before calling the script BZR_PLUGINS_AT=gtk@/path/to/your/plugins/pqm c [22:19] sinzui: thanks! [22:19] ^ that is how I test plugin changes in the branch I am hacking on [22:20] oops [22:20] jcsackett, set the env before calling the script BZR_PLUGINS_AT=pqm@/path/to/your/plugins/pqm c [22:21] abentley: Around? [22:21] wgrant: yes. [22:22] abentley: I think your bzr plugin loading changes break ec2 land [22:22] It can't load the pqm plugin [22:23] wgrant: You're talking about the site-customize changes? === al-maisan is now known as almaisan-away [22:25] abentley: Yes. Just finding exactly what it is now. [22:26] wgrant: wouldn't have thought that ec2* ran Launpad in-process. [22:26] abentley: bin/ec2 is a buildout script [22:26] So it uses lp_sitecustomize [22:33] Hmm [22:33] So loading lp.codehosting breaks everything. [22:33] abentley: Can we avoid that, or should I revert the whole rev? [22:34] wgrant: I don't think we should avoid that. If we do, we get weirdness where LoomBranch isn't treated as a subclass of Branch. [22:35] abentley: I suspect we want to rip that out of lp.codehosting, then. [22:35] lp_sitecustomize is imported extremely early, and pulls in no other big bit of lP. [22:35] Just a few isolated bits from lp.services. [22:36] wgrant: I don't know what you're saying would be ripped out of lp.codehosting. [22:36] wgrant: I'm sure the fact that it's loading bzr plugins is related to the fact that pqm-submit isn't loading. [22:37] Yeah, but separately I don't think lp_sitecustomize should be importing codehosting. [22:38] wgrant: Okay, but the main efffect of importing codehosting is initializing the plugins. [22:40] abentley: True [22:40] Anyway, I think we have little choice but to revert this. [22:40] As it breaks the dev toolchain. [22:41] wgrant: Doesn't break *my* dev toolchain, but I guess you should. [22:41] abentley: Have you rerun bin/buildout? [22:41] wgrant: I don't use ec2. [22:41] Hah [22:59] wallyworld_, http://www.youtube.com/watch?v=YWMVVpOsjko [22:59] Its from 2005 I think [23:00] sinzui: thanks, looking now :-) [23:02] omg buildout is annoying [23:03] "no you can't have that" [23:03] Error: Picked: distribute = 0.6.24 [23:03] ok, add to to versions.cfg [23:03] Error: There is a version conflict. [23:03] We already have: distribute 0.6.16dev-r0 [23:04] ok, so why didn't you pick that one? [23:04] change versions.cfg [23:04] Error: Couldn't find a distribution for 'distribute==0.6.16dev-r0'. [23:04] but you said you already had it! [23:08] james_w: That's what you get for daring to use a packaged Python. [23:08] Upstream doesn't like distributions much. [23:09] clearly [23:09] we're considering migrating to buildout though [23:10] because it's possible that other alternative suck more [23:11] I think buildout sucks less than virtualenv(+pip) [23:11] I find that combination easier to use [23:12] In some respects. [23:12] but I don't like the "oh, you have that version already installed? well I'm going to try installing it anyway? oh look, it's already in the download cache! well, I'll try downloading it anyway just to make sure" [23:18] https://code.launchpad.net/~james-w/python-oops-amqp/bson-compat/+merge/92389 is what I was trying to do === wallyworld changed the topic of #launchpad-dev to: https://dev.launchpad.net/ | On call reviewer: wallyworld | Firefighting: - | Critical bugtasks: 4*10^2 [23:47] james_w, hi. looks to me that you need to have a little cider break ;) [23:48] rick_h, thanks for being patient. I have pushed another change to lp:~linaro-infrastructure/launchpad/workitems-model-classes/ and hope that will work better [23:50] mabac, heh, yeah, heading out curling now :-) [23:51] james_w, sounds dangerous ;) take care! [23:52] I will!