[00:04] And another one, http://buildbot.twistedmatrix.com/builders/lucid32-py2.6-select/builds/1091/steps/bzr/logs/stdio [00:26] poolie: hi, exarkun found some bugs [00:28] hi nigelb [00:28] hey lifeless [00:29] hmm, I thought this was #launchpad-dev for some reason. [00:29] nigelb: nup :P [00:30] lifeless: you're everywhere, which confused me :P [02:38] glyph, nigelb, lifeless, hi === krow_ is now known as krow [07:17] hello poolie [07:22] hey poolie, I'm going to try to be back in time for the stand-up, but we're getting a late start at the house today. So don't wait on it for me. [07:37] hi poolie, hi nigelb [07:37] np [07:38] hi jam [07:43] lifeless, hi [07:43] re bug 819604 [07:43] Launchpad bug 819604 in Bazaar "when an idle ssh transport is interrupted, bzrlib errors" [Critical,Confirmed] https://launchpad.net/bugs/819604 [07:45] hi [07:47] your last comment there is correct as far as it goes but it's not really where i thought we ended up [07:48] hm [07:48] maybe i'll just reply [07:51] actually my question is really: why are you unable to deploy to codehosting again? [07:51] how is this any worse than before/ [07:53] lifeless: ^^? [07:56] poolie: we've stopped scheduling monthly downtime [07:57] poolie: and in terms of being worse than before, we had to kill the service every deploy, before. [07:57] so, if there is a disruption, it will disrupt them more frequently? [07:57] if we want to be able to experiment effectively we need a graceful option, otherwise each test is downtime. [07:58] but, for other reasons, every deploy is downtime at the moment [07:58] eg the xmlrpc error that comes back during a deploy [07:58] unless that's recently fixed [07:59] that ticket is in progress with hloeung at the moment [07:59] should be fixed in a day or two I hope [07:59] that's great [08:00] what will happen to connections that are busy for a long time and not idle? [08:00] yeah, it should be done within the next couple of days. [08:00] poolie, I believe the hard timeout is 2mins [08:01] which hard timeout? [08:03] poolie: appserver connections? or bzr connections? [08:03] bzr [08:04] so, I would like to interrupt them at the next verb [08:04] and if we have that, we could set a decent (say 1 hour) period after which we would interrupt anyway [08:04] ok, but what will happen today if we fix this particular bug? [08:04] we'd reset the timeouts [08:05] and have a fallback limit of 30minutes after which we would interrupt anyway [08:06] the lower backend timeouts would cause a timeout after (N, to be determined - say 3) minutes of inactivity, which would likely be low enough to get the CI servers and PQM and the like to migrate over to the new service instance. [08:07] we will need to address both bugs before we can deploy reliably with minimal interruption to users. [08:10] vila, do you have a pointer to the testtools-related failure on babune? [08:11] lifeless: so, you'll leave the old back ends running for up to an hour? [08:12] jelmer: http://babune.ladeuil.net:24842/view/%20High/job/selftest-freebsd/lastFailedBuild/#showFailuresLink [08:12] poolie: something-like, once we've got all the friction sorted [08:13] poolie: at the moment no timeout is high enough - we've still got a back end running from when we cut over to the haproxy setup last week, I believe. [08:21] vila, poolie, jam: http://bugs.python.org/issue12544 [08:29] for which date is 2.4 scheduled? [08:30] gour: we're talking about it right now, it depends on https://bugs.launchpad.net/bzr/+bugs?search=Search&field.importance=Critical&field.status=New&field.status=Incomplete&field.status=Confirmed&field.status=Triaged&field.status=In+Progress&field.status=Fix+Committed [08:30] gour: sometime between this thursday and two weeks after that [08:30] gour: mostly bug #771184 and bug #809048 [08:30] Launchpad bug 771184 in Bazaar "option to disable/enable fetching of all tags" [Critical,Confirmed] https://launchpad.net/bugs/771184 [08:30] Launchpad bug 809048 in Bazaar "bzr crashed with AttributeError in stopTest(): '_TypeEqualityDict' object has no attribute 'clear' - breaks bzr selftest on oneiric" [Critical,Confirmed] https://launchpad.net/bugs/809048 [08:30] cool...it's on time [08:32] what's with colo branches in 2.4? they are in core? [08:34] not in core; very good in bzr-colo [08:36] thank you [08:37] https://bugs.launchpad.net/bzr/+bug/806348 [08:37] Ubuntu bug 806348 in Ubuntu Distributed Development "BzrCheckError: Internal check failed: Cannot add revision(s) to repository: missing chk node(s) for id_to_entry maps" [High,In progress] [08:37] jam, vila: ^ [08:37] thanks to jelmer & {hg,git} plugins, bzr is the only dvcs we use now :-) [08:37] gour: \o/ [08:38] (after spending many years with darcs, played with mtn, fossil, hg...) === med_out is now known as medberry [08:49] i've asked web2py to upgrade repo format to 2a..in order to provide correct info, since when is bzr using 2a as default? [08:49] since 1.19 or something - its in NEWS [08:50] poolie: did you want to talk about these codehosting bugs? [08:50] on the phone at the moment [08:50] which ones? [08:50] lifeless: ta [08:51] Think it only because default in 2.0. It existed in 1.18... [08:52] fullermd: ahh...that's what we need. thanks [08:53] Sounds like 1.17 has support for it too... [09:11] vila: http://babune.ladeuil.net:24842/job/selftest-chroot-oneiric/lastBuild/testReport/junit/bzrlib.tests.test_selftest/TestTestResult/test_verbose_report_known_failure/ [09:12] can't we just do a getattr() to handle different testtools/python versions? [09:15] jam: instead of checking the version you mean? [09:15] vila: is babune perhaps running some other pre-0.9.12 snapshot of testtools? [09:15] jelmer: I think checking versions is going to be hard to get right [09:16] jam: 0.9.12 introduces the fixes, 0.9.11 doesn't have them [09:16] jelmer: that specific bug is in bzrlib because of attribute issues with python's runner, right? [09:16] jam: ah, sorry [09:16] jam: There are multiple issues here [09:17] that one is unrelated to testtools, I've just put up a proposal for it that uses getattr [09:17] jam: https://code.launchpad.net/~jelmer/bzr/2.4-809048-workaround/+merge/70834 [09:33] poolie: the two: gracefully handle tcp disconnects during idle periods of bzr+ssh connetions, and secondly a way to tell the smart server 'disconnect after serving the next verb or now if you are already idle' [09:36] i agree they'd be useful [09:37] i see you mentioned them in bug 795025 but they seem a bit distinct from the main bug in the bzr-sftp front end, so i might file two separate bugs [09:37] Launchpad bug 795025 in Launchpad itself "Codehosting service didn't stop in a timely manner" [Critical,Triaged] https://launchpad.net/bugs/795025 [09:37] i'm going to downgrade bug 819604 because it doesn't meet our definition of critical [09:37] Launchpad bug 819604 in Bazaar "when an idle ssh transport is interrupted, bzrlib errors" [Critical,Confirmed] https://launchpad.net/bugs/819604 [09:37] ie it's not a release blocker [09:37] and create series tasks [09:38] it should be fixed though [09:42] ok [09:42] poolie: so bug 795025 is about not shutting down; the *cause* for that is active sessions having no way to interupt [09:42] Launchpad bug 795025 in Launchpad itself "Codehosting service didn't stop in a timely manner" [Critical,Triaged] https://launchpad.net/bugs/795025 [09:43] poolie: I don't particularly care if you want to split out a bug asking for a way to interrupt, but it seems unnecessary [09:44] poolie: these are critical from LP's side, in the sense of queue jumping bugs; and (as I think we've established) we cannot sensibly move forward on the forking-daemon until we have 795025 addressed in some fashion [09:45] so for 795025 the only change needed to fix it is for bzr to close the connections, either on request or after a 5m timeout? [09:45] (or, whatever idle timeout?) [09:46] on request [09:46] i guess idleness can be done sufficiently well from haproxy,because the socket will just be idle [09:46] well, as long as the timeout is long enough we're sure it's not just that one side is busy [09:47] a timeout in the server would be a nice bonus, but unless the work just fell out while looking at on-request, I would do it separately. [09:47] sure [09:47] i only ask because you suggested a timeout [09:47] so as far as deployments [09:48] ah, I was describing the overall situation before, was probably unclear [09:49] yes, I think haproxy is reasonable for timeouts at least in the medium term. We will have to fiddle the timeout counter in it to find a sweet spot eventually. === gour1 is now known as gour [09:50] the issue is basically that if you upgrade/downgrade one of the back ends, you will need to choose between leaving the back end live indefinitely, or perhaps making long-lived clients error out [09:50] is it intolerable to make them error? [09:50] doesn't tarmac cope with this already? [09:50] no, tarmac errors out, pqm errors out, bzr-eclipse errors, buildbot errors. [09:51] and doesn't recover? [09:51] how has it possibly coped with our monthly roll outs so far? [09:51] well [09:52] I'm sure it does when it opens a new transport [09:52] users seem to find hour long test runs being aborted a bit of a pain [09:54] ok [09:54] i think i understand the situation [09:56] perhaps we can touch base tomorrow; its getting late here :) [09:56] here too [09:56] i don't want to have a long discussion about whether to fix it or not [09:56] we should fix it [09:56] i just want to have clear meanings for bug metadata [09:56] as do you [09:57] yup [09:57] unfortunately the meanings are a bit different :) [09:57] thats a beer topic I think [09:57] is there something one would miss for not using colo-branches when simply using shared repo and each branch in its own dir, iow. should i bother with colo-branches? [09:59] the main difference is that it's more biased towards having one working tree [10:00] you can also also use colo-sync-from and colo-sync-to [10:00] yeah, but i do not get if having one working tree is so important [10:00] it's good if the tree is large relative to your disk (or other resources) or if it takes a long time to compile [10:00] etc [10:01] or if your tools like staying in one location [10:01] or if your working methods do [10:01] i believe that using feature-branches is then good-enough for me [10:02] we'll stay with python for all development (qt & web2py), so compile is not really issue here (except some cython) [10:14] jam, could you maybe give francesco a bit more of a tip where to add the test or where to look for inspiration? [12:36] vila, your recent mp: https://code.launchpad.net/~vila/bzr/migrate-config-options/+merge/70767 [12:37] jam: yes ? [12:37] You mention taking the register key from the option object [12:37] but I don't see that in the patch [12:37] jam: different mp [12:37] vila: read what you wrote on that one [12:37] bullet point 2 [12:38] I guess you were saying these are things that aren't part of this but would be done next? [12:38] I thought it was the list of things in this patch [12:38] jam: read what I wrote in introduction: 'follow-up' [12:38] jam: yup [12:38] vila: I tend to skim and focus on bullet points [12:38] no, the introduction was just one line [12:39] putting things that aren't part of the patch into the MP... can be confusing [12:39] anyway, looks good [12:39] just confused [13:10] vila: I have a big concern about your config stack work [13:10] booleans don't come back as booleans... [13:10] http://paste.ubuntu.com/661920/ [13:11] I'm looking at how Martin is using "repository.fdatasync" and I'm pretty sure it is wrong [13:11] he is doing: self.write_stream.close( [13:11] want_fdatasync=self._pack_collection.config_stack.get('repository.fdatasync')) [13:11] which is sort of what I would expect. But you can't set it to "False" or "F" or ... all of those evaluate to True in python [13:11] (because they are strings) [13:12] I don't see any tests from poolie that the config item can actually disable fdatasync [13:14] jam: I'm working on the boolean config options just now [13:14] jam: you're right that there is a bug right now, my next submission will fix it though [13:16] vila: I'm looking to do something with it for 2.4. Should I just use "get_user_option_as_bool" for now? [13:16] I was also a bit disturbed that just adding that suddenly made an HPSS ratchet test fail because it went from 14 HPSS calls to *34* [13:16] jam: yup [13:16] but that could have been my fault. [13:16] (somehow) [13:16] I haven't worked through all the failing tests yet [13:17] jam: what means "something" ? You're backporting the fdatasync stuff ? [13:18] vila: I was using fdatasync to figure out how to do "no fetch tags" [13:18] only to find out it didn't work [13:18] ha right, adding a new boolean option then ? [13:18] (repository.fdatasync as an example to crib from) [13:18] right [13:18] k [13:18] also, oddly the HPSS verbs are different [13:19] branch.get_config() calls 'Branch.get_config_file' [13:19] BranchStack() calls 'get' [13:19] not even 'get_bytes' from what I can tell [13:21] Store calls get_bytes, Stack doesn't know about which files are involved [13:23] but fdatasync shouldn't care about remote repositories (unless there is a verb for that) or it's a bug no ? [13:23] s/care/apply/ [13:23] s/care about/apply to/ [13:24] jam: be aware that remote branches doesn't obey bazaar.conf nor locations.conf, they need a specific stack (TBD) [13:25] vila: Remote cares about fetch_tags [13:25] I didn't test anything for fdatasync in that area [13:25] jam: hehe, indeed :) [13:27] someone here involved in qbzr or explorer? [13:28] i'm wondering if there is indeed any code to get the window positions and window dimensions from qbzr.conf? [13:31] blackarchon: I haven't poked at that code recently, but ISTR that there are window positions that are remembered [13:31] I think it involves deriving from a particular class [13:32] like QBZRWindow or somethign [13:33] well I'm asking because this doesn't seem to be working at all [13:34] ok, I will try to find the exact place for this restoreSize function which is called [14:23] blackarchon: AFAICS, there are options for *size* but not for *positions* [14:42] hello [14:44] I would like some help: I ve created a branch of a project and I want to commit to it [14:44] (a launchpad branch) [14:45] when I use the command: bzr commit -m "blah blah" [14:45] I get the following error: [14:45] bzr: ERROR: Cannot lock LockDir(lp-86944400:///%2Bbranch/stellarium/.bzr/branchlock): Transport operation not possible: readonly transport [14:46] although i am logged in in launchpad [14:46] do you have any idea on what I am doing wrong? [14:52] hikiko: you probably don't have *write* access on this branch (actually, you're probably using a checkout which is bound to the remote branch instead of a local branch that you then push somewhere else) [14:54] how can I change this? I mean what I did was: checkout the project code [14:54] then create my launchpad user and branch [14:54] and then attempt to commit [14:54] what step is missing? [14:55] oh [14:55] I see [14:55] you mean i have to do sth like: [14:56] bzr branch lp:~ [14:56] and then change that code [14:56] and bzr push lp: ... [14:56] ? [14:57] yes, this would work. [14:57] If you checked out _before_ creating your LP user, the checkout is connected over a _non-logged-in_ transport. [14:57] yes I checked before [15:01] and if I want to update [15:01] bzr up [15:01] will give me the new code from the branch or from the project? [15:04] ok I found it :) [15:04] thank you very very much for your help all! [15:11] um... where can I find the default ignore file, which is installed with bzr? [15:12] ~/.bazaar/ignore perhaps? [15:12] no, I edited it - and left no backup of it :( [15:13] 'm pretty sure bzr writes it if it doesn't exist... [15:13] oh yes? let me try... [15:13] mv it away and start again? [15:15] Semirelatedly, it's probably rather discouraged to be editing it, as a rule. Since it would no longer match what anybody else has, you can wind up with surprises in branches multiple people touch. [15:15] well no, it doesn't get magically recreated :( [15:16] If I mv mine out of the way and run a 'bzr stat', it writes out a new one... [15:17] ah yes, but only if this bzr command succeeds [15:18] my bzr stat wasn't inside a bzr directory, so it hasn't created the file - thx! :) [15:29] I want to fix this bug: https://bugs.launchpad.net/qbzr/+bug/578935 [15:29] Ubuntu bug 578935 in QBzr "qbrowse crashes when run in repository: 'NoneType' object has no attribute 'last_revision_info'" [High,Confirmed] [15:30] so I added a check if the argument isn't a branch or a wt [15:30] https://code.launchpad.net/~andrebachmann-dd/qbzr/fix_qbrowse [15:31] but I still need an idea of how to proceed, I want to disable all objects in qbrowse [15:32] but I don't have the right hought on how to do that [15:35] any hints? === beuno is now known as beuno-lunch [16:44] more than 3 hours to land on pqm, is it me or it's getting worse ? [16:48] vila, it's not just you [16:49] jelmer: :) === deryck is now known as deryck[lunch] [17:08] vila: can you submit my 2.4 branch? I don't seem to have permission to submit to release branches [17:09] jelmer: 2.4-809048-workaround ? [17:10] yep [17:10] done [17:12] vila: thanks [17:13] jelmer: nothing showing up on pqm for now :-/ [17:16] jelmer: ok, here it is [17:17] jelmer: thanks for that ! Merge it into trunk when you can ! [17:18] will do :) [17:21] ok, I'm off, see you online on Thursday ;) === beuno-lunch is now known as beuno === deryck[lunch] is now known as deryck [19:51] jelmer: dots don't work on bzr 2.1, only none works [19:52] jelmer: and I think that for deployment we can't wait for launchpad so we will just tar everything up to the clients === r0bby is now known as robbyoconnor [23:02] morning poolie [23:28] hi thar === yofel_ is now known as yofel