[00:00] So yeah, compatibility would suggest checking whether the class supports that parameter. [00:00] like supports_reverse_cherrypick. [00:01] This is weird. I'm trying to bzr rm a symlink to a directory, but it is complaining about the files in the directory the symlink links to [00:01] An no, I'm not appending a slash to the end of the symlink name [00:02] pickscrape: I believe we have a bug :( [00:02] beuno: wb [00:03] lifeless: passing the --force flag does the right thing (the target directory is not affected) [00:03] Not the correct user experience though. I'll raise it in launchpad. [00:06] Loggerhead! I'm going to see if I can even get CLOSE to setting it up..... ;) [00:07] hrmm, does bzr version symlinks? [00:07] jam: why did you change it from using calltree? [00:07] NfNitLoop: yes [00:07] Huh. cool. [00:07] pickscrape: thanks [00:07] lifeless: you mean callgrind?, because I don't have KCacheGrind on win32, you can change it back if you like [00:07] I noticed a bug the other day re: operation on a case-insensitive filesystem. [00:08] is that known? [00:08] I tried adding file x, when it had already been added as X. [00:08] save('filename.callgrind') should give the same result, btw [00:08] and it committed it a second time. [00:08] and then complained that it was missing. :p [00:10] jam: parameterised it [00:10] NfNitLoop: oh, foo. File a bug ? [00:11] NfNitLoop: thats the sort of thing that makes usage for windows users (or FAT under linux) particularly painful. [00:13] yes. I'm all too familiar with file case issues and still was a bit confused as to what was going on. [00:14] I'll try to create a sample and submit that tonight. [00:17] lifeless: latest version of PyBloom has a bloom.resize() function, so we can poke at it if we want. [00:17] I'm done for the night, though. Off to dance class [00:17] lp:~jameinel/+junk/pybloom [00:18] jam: tchau [00:18] jam: what dance style? [00:18] ballroom, aka latin, waltz, swing, etc [00:18] nice [00:18] I keep meaning to get back into that [00:20] so [00:20] parse_lines and string split make iter_random_one cry [00:21] a C parser would be substantially more efficient [00:25] (I'd like to drive the cost of random access way down, because it its a good total-cost proxy) [00:28] Hm. I can't seem to install Paste 1.1.7 [00:29] Moin. [00:29] Er, 1.71. [00:29] Make that Paste 1.7.1. ImportError: No module named setuptools [00:30] I wonder why all the other "python setup.py install"s I did? [00:30] do easy_install setuptools [00:30] somebody wanna raise their voice on the whole GNOME+DVCS stuff? [00:30] tolstoy: Or, better, use your package management system if you have one. [00:30] hey pygi. I liked your post on the matter. [00:30] which gnome+dvcs stuff? [00:31] Odd_Bloke: This is on Solaris, all inside a user account. [00:31] I don't think I have "easy_install" on there. [00:31] james_w, thanks, I just wrote a big comment addressing concerns Jason raised [00:31] Yes, yes, it feels like I've gone back in time decades by not having root on this box. [00:31] name, the discussion of GNOME migration to DVCS [00:32] tolstoy: http://pypi.python.org/pypi/setuptools/ [00:32] Are setuptools not normally part of the Python? And not required for other versions of setup.py? [00:33] (I'm so far away from Python culture these days, alas. Close as I can get is Groovy.) [00:34] tolstoy: I believe that most setup.pys use just distutils. [00:34] But I'm not sure. [00:34] Okay. Cool. That makes sense. [00:35] Odd_Bloke: yes they do [00:35] pygi: thanks, just read it. Jc2k has raised some similar points to you, and I think it's very important, and not something that's being discussed at the moment. I don't have any voice in GNOME to try and steer the conversation, but I'd be happy to comment on a migration plan for bzr. [00:36] pygi: are you a git fan, or is gitorious just very good for what you want? Would you prefer bzr, or is the abstraction of everything what you would like? [00:36] james_w, as I've written in the post, I'm not a fan of either git or bzr [00:36] I use both where I see fit [00:36] (heck, I've even mentored two bzr students in 2006) [00:37] james_w, currently, something like gitorious doesn't exist for bzr [00:37] and no, you can't count LP, because it is not free [00:37] pygi: ah yeah, I forgot that sorry. [00:37] pygi: why is LP not free? [00:37] james_w, the problem is people keep going in circles, rather then discussing important points [00:38] pygi: yeah, that's what lp is trying to do, but it's obviously not going to work for GNOME. [00:38] that's why I hope my post will bring at least put bug in peoples ear :) [00:38] name, because its not Free by FSF definition :) [00:39] james_w, if you wanna work with me, we could surely write a plan for bzr [00:39] I'm all for it [00:39] james_w, it the end, it all comes down to communication and collaboration, as I've mentioned [00:39] these folks forgot what that is lately [00:40] pygi: are you looking for somewhere to put bzr gnome branches? [00:40] yeah, these tools are supposed to help with collaboration, not cause fights. [00:40] pygi: there's a #gnome-bzr channel on gimpnet that may be more appropriate if we want to have longer discussions on this topic, there's a bunch of people there willing to work to make this happen. [00:41] hey thumper [00:41] james_w: hey [00:41] james_w: getting kinda late where you are? [00:41] pygi: where is your post ? [00:41] lifeless, moment :) [00:41] lifeless, http://pygi.wordpress.com/2008/07/01/broken-by-birth/ [00:41] thumper, it [00:42] thumper, it's a little more complicated then that [00:42] james_w, ok, will come there (You have pm too, btw) [00:42] thumper: aye, just finished a meeting. [00:43] james_w: work style meeting? [00:43] thumper: yup, platform team weekly. [00:43] pygi: are you aware that savannah, and alioth support bzr ? [00:44] pygi: (by which I mean, what is 'like gitorious' mean) ? [00:45] lifeless, yes, I am aware that they support bzr [00:45] ok, cool [00:49] lifeless, it's not only about supporting bzr per-se [00:49] pygi: if you're wanting the ability for people to use whatever DVCS they want, then wouldn't svn as the central repo work best? [00:49] it's about making it easier for everyone involved [00:49] from what I understand the svn integration in bzr, git and hg is better than the integration between the 3 [00:50] night all [00:50] Pilky, that's what people are currently doing mostly [00:50] night james_w [00:50] but that means overhead [00:51] true [00:51] Pilky, anyway, that was just a suggestion on how things *could* be handled [00:51] what is important is for people to finally realize they're not going anywhere with useless arguments sometimes even on personal level [00:51] yeah [00:52] the unfortunately reality about we programmers is that when we start liking a tool we'll fight to the death over it ;) [00:52] I know, but sometimes you have to think above tools [00:53] If I was a new folk in the FOSS community, and read planet GNOME, I'd run away from the project as faster as possible [00:53] because of the recent events [00:53] pygi: I think you make good points [00:53] lifeless, any questions or critics perhaps? Those always help flesh out the idea :) [00:54] pygi: there's a reason I don't do a huge amount of open source stuff (or at least large stuff), it's far easier to get your own way when you're the sole dev ;) [00:54] pygi: well, one thing to note is that bzr itself has an abstraction system; it can natively talk about hg and svn and a number of bzr formats [00:54] pygi: (in a way that none of git/hg/svn try to do) [00:54] lifeless, yep, am aware of that =) [00:55] thanks for pointing it out ;) [00:55] for instance, pqm had a multi-dvcs abstraction [00:55] and now we're ripping it out to replace with bzr plugins [00:56] Pilky, I prefer to stay out, and maintain a lot of stuff outside such projects (the only exception is xfburn (xfce project), but it was initially started there, so can't change it for the time being) [00:56] lifeless, I should really look further into pqm branch then (I guess it's on LP as usual) [00:57] pygi: the only "major" open source project I've worked/am working on is BazaarX, everything else is my shareware apps [00:57] Pilky, ah, no Mac here, sorry :) [00:58] well, pqm is not the cleanest code base. Yes its on lp. But the point there was just that /one/ approach would be to write admin toolchains in bzr, as a lingua franca [00:58] much as is being done with svn today, but being D aware [00:58] pygi: heh [00:59] lifeless, understood [00:59] * pygi is thankful for all the comments ;) [01:00] lets hope I can put them to good use :) [01:01] I'd argue that gnome has a gitorious itself /already/ [01:01] just called 'gnome-svn-orius' [01:02] ehm, well, if you can call the current platform they use that way, yes [01:02] well [01:02] what is it about gitorius as a system that could save admin time [01:03] is it outsourcing - but gnome could have gotten a variety of sites to host the svn migration [01:03] lifeless, ehm, no, its not about that [01:03] lifeless, gnome would have its own instance ofcourse, and not *plain* Gitorious [01:03] right [01:03] I've outlined some of the things that would save admins time in my latest comment [01:03] so why would it be easier for the admins :) [01:04] but there are a lot of stuff we could change to make it even easier for them [01:04] thats why I want the discussion [01:04] which I managed to initiate at KDE (sadly I can't attend Akademy, but oh well) [01:05] pygi: right, so my point is 'what would make the sysadmins life easier' is a good discussion and quite orthogonal to changing vcs -except- insofar as different vcs's have different admin overheads [01:05] lifeless, I definitely agree [01:05] lemme c/p excerpt: [01:05] "Maintainers of specific module could approve users for commit access to “mainline” or main repository, giving administrators less work. New modules/projects could be added by existing GNOME developers after their inclusion has been discussed. New translators and documentation writers could get their commit bits much easier" [01:06] as an example ofcourse, I can't count all the use-cases [01:06] I think that thouse things are actually highly automated at gnome already? [01:07] I was thnking of things like bkor having to script some massive dump+load for all 520 svn modules to handle 1.5 [01:07] some things are, some things aren't [01:07] lifeless, that's the part where I say whats needed prior to migration :) [01:07] K [01:08] lifeless, we could have an interface that would allow upgrading all repos to say new bzr format [01:08] pygi: right; not to mention that bzr can do that it self remotely [01:08] indeed [01:09] I just gave that as an example [01:09] :) [01:10] lifeless, I definitely agree that everything you mentioned should be discussed [01:10] lifeless, but before that, we need to make people collaborate the way they can once again [01:11] we can't make them, we can only try to collaborate with them [01:11] well, yes, and by collaborating with them change the way they think about collaboration =) [01:11] which is a little hard when you get told you're stupid or insane for making an informed design decision :) [01:12] lifeless, eh, even though I doubt anybody will listen to whatever I wrote in the post, they'd listen even less if I actually argued technical reasons behind whatever DVCS they choose [01:12] or if I accidentally mentioned that I prefer one over the others [01:12] pygi: sure, not suggesting you should get technical; I was noting that unpleasantness doesn't help collaboration - and there has been plenty in some of the previous months/years about DVCS [01:13] lifeless, and not only DVCS ... [01:14] * pygi should probably do some bzr hacking this summer if he gets the time [01:15] pygi: that would be cool [01:16] pygi: will you be at GUADEC? I'd be happy to point at internals face-to-face if you are [01:17] lifeless, eh, sadly no :( [01:17] ah well [01:18] but I'd be happy to learn more about internals over irc/mail/jabber/whatever :) [01:33] we've done a lot of scalability work since 2007 [01:33] many more things that only look at small subsets of history [01:34] still need to fix inventories to not be full-tree-objects [01:34] lifeless: Looks like someone beat me to it (by a year or so) reporting case insensitive fs issues. https://bugs.launchpad.net/bzr/+bug/77744 [01:34] Launchpad bug 77744 in bzr "win32 bzr is confused by filename case changes" [High,Confirmed] [01:34] I pasted my test as well. [01:34] NfNitLoop: thanks [01:37] *cough* bzr log *cough* [01:37] mwhudson: eh? [01:38] just carping at lifeless's "many more things that only look at small subsets of history" [01:38] aaah. [01:45] mwhudson: *many* not all [01:45] mwhudson: if you compare what we _used_ to do [01:46] yeah, it's much better, i'm just being annoying [01:46] go merge the loggerhead search integration branch or something [01:56] Boy, that's sad. missing took 26 seconds. pull took 9. [01:56] heh. [02:03] Hm. Are these location-based aliases documented somewhere I can't find? [02:06] fullermd: which version of bzr ? [02:06] i know missing got some love recently [02:07] bzr.dev local, 1.5 remote. [02:08] hm [02:08] should be new enough :/ [02:19] bkor: ping [02:20] spiv: ping [02:20] lifeless: pong [02:21] did you end up having time to look at the gnome hooks? [02:21] lifeless: (I suspect Olav will be asleep) [02:21] AfC: so do I but it never hurts to try :) [02:22] gnome hooks? [02:23] Sure. You don't want 'em just wandering around the garden unsupervised... [02:24] spiv: yes - the ones they run on svn, which we'd need to support [02:27] Ah, no, I hadn't looked. [02:28] could I entice you to look? You've done some server-hook stuff before.. [02:28] morning [02:30] hi igc [02:31] * spiv looks [02:31] lifeless: Apologies if I just spammed you with LP merge requests, I was playing around with the interface and didn't realise it generated emails. [02:32] staging.launchpad.net is a handy sandbox for experimenting with. [02:32] I am trying to misuse bzr as a versioned backup program, I have created the remote and local branches, and am running a server for the remote. Is there a tutorial for this type of misuse or do I have to RT(Entire)FM? ps using bzr send I get the error 'bzr: ERROR: No mail-to address specified.' how do I make it go away? [02:33] spiv: Well, the merge requests are valid, so it doesn't matter if they're in LP proper. I just didn't intend to send lifeless a number of emails (which largely mirror on-list discussion anyway). [02:33] Odd_Bloke: :) [02:34] JamesB192: well bzr send is used to create emails, you probably want bzr push [02:35] Ah, OK. [02:37] lifeless: Did you get a merge request (not via LP) that I sent a little while ago. I CC'd the ML but haven't seen any sign of it and am wondering if it got through. [02:37] yesterday? [02:37] Hold on, I'll find a timestamp. [02:37] or earlier today, cause I saw one yesterday and one earlier today [02:38] lifeless: This one was sent, according to GMail, on the last hour (so around 40 minutes ago). [02:38] But I'm using GMail via IMAP, which can occasionally be a bit funky. [02:39] Subject was "[PQM/MERGE] Remove Arch-style naming from tests". [02:40] yes [02:40] uhm, I think its fine [02:40] OK, good. [02:40] The reason I asked was because I was going to reply to it to mention that it depends on my 'fix-tests' branch, but haven't had the chance to do so yet. [02:41] Then I started playing with LP. :p [02:41] :) [02:43] lifeless: well, I've looked. The post_change_branch_tip hook should be able to do the post-commit actions fairly straightforwardly I think. [02:43] lifeless: (and thanks to my push work it turns out that that hook *does* get run server-side now!) [02:44] lifeless: but we probably need to add a pre_change_branch_tip hook as well. [02:45] lifeless: because there's a fair bit of checking of the commit before it is accepted (there must be a log message, PO files should be syntactically correct, etc). [02:45] spiv: I'm really sorry that I hadn't drawn it more visibly to your attention earlier; I think you were in protocol-3 space [02:45] Yeah, I probably was at that time. [02:46] I don't think adding infrastructure for a pre_change_branch_tip hook should be too hard. [02:48] Also, pre-1.6 clients won't trigger that hook, because they will be using VFS RPCs to update the last revision info. I'm not sure how much that will matter. [02:48] spiv: any chance for GUADEC? :P [02:48] spiv: well, we could add a version check policy on the server? [02:48] If it does matter, we could probably hack up a plugin for the server that traps that VFS call and invokes the right hook(s). [02:50] Hmm, we could do that, I suppose, by e.g. writing a plugin so that the server will reject non-protocol-3. [02:50] spiv: don't client give their verison now ? [02:51] Probably no easier than hooking VFS calls that change */.bzr/branch/last-revision, though. [02:51] They do, but as a string. [02:51] That header was intended for logging rather than greater-than/less-than version comparisons. [02:51] But protocol-3 happens to be new in 1.6, as is that verb. [02:53] I'd rather not get into comparing strings like "1.6b3" and "1.7" and "1.10". [02:53] right [02:54] but isn't the string taken from the string made from the tuple ? [02:54] It is bzrlib.__version__, yeah. [02:54] I'd rather introduce a new header if we do that, though. [02:55] So we keep the concepts of user-agent-label and protocol-level-I-speak separate. [02:56] The current header is "Software version", and is intended to be the former. [02:56] Maybe I should just rename it to "User agent" :) [02:56] right [02:57] but - [02:57] there are two levels of protocol [02:57] there is 'how I encode' [02:57] and there is 'what verbs I will try' [02:57] We can add another header pretty painlessly, but we should think about how we want it to work and what use cases we want it to satisfy. [02:57] anyhow [02:58] I would like to be able to say to bkor: 'bzr X on server, bzr X or > on client, plugin Y on server' [02:58] Ideally, we shouldn't need to rely on the client to indicate what verbs it will try, we should just DTRT based on the verbs they do try :) [02:58] and have : ported checks from svn run reasonably fast and appropriately, and reliably. [02:59] Porting the checks from SVN will take a bit of effort. Not gargantuan, but not trivial either. [02:59] yah [03:00] I'm sure bkor would be happy to do that bit [03:00] Although "poke the viewcvs so that it knows about the new revision" bit will hopefully be unnecessary with loggerhead :) [03:00] (though perhaps not as happy as if we did it :P) [03:01] now for the argh! bit. the DVCS talk is on monday :) [03:01] Well, I'd be happy to help port them. Realistically it'll probably be easier for us to work with them than for either side to try port them alone. [03:01] Ok, so you'd like the pre-commit capability done ASAP? [03:01] if its doable this week, then yes. otherwise no - we can say we know what it will take and its coming [03:02] Well, I don't think it'll be hard. I'll spend an hour or so on it to see if it really is feasible that quickly. [03:03] sweet [03:04] I wouldn't be totally shocked if there's a couple of unexpected bumps between here and having it done, but I wouldn't be shocked if it's plain sailing either. We'll see :) [04:05] lifeless: any updates on btree? I haven't seen any commits or posts since I left :) [04:16] jam: just doing C parsing for _leaf_nodes [04:16] jam: 60% of that 110 seconds is claimed to be in the parsing [04:16] lifeless: sure [04:17] If you consider... it takes 1s to read all nodes (iter_all) [04:17] and we get maybe 100-200 nodes per leaf [04:17] and we have a 50% cache rate [04:17] 200 * .5 * 1 = 100s [04:17] lifeless: pyrex or raw C? [04:18] pyrex [04:18] adapting _knit_load_data [04:19] :) [04:19] I'm not sure its wrong [04:19] but profilers - meh [04:19] anyhow, finding out is relatively cheap [04:20] what dance did you do tonight? [04:21] its disconcerting that the three-level test kicks in before the progress bar debounce [04:36] we did rumba and waltz tonight, focusing on underarm turn and breakaway 5-something [04:36] 5position whatever, where your foot is crossed behind the other [04:37] lifeless: yeah, the "nothing is happening" is why I generally run with -v [04:37] lifeless: So I'm poking around at our bloom stuff, and I was noticing that our bottom level rows have more nodes than I think you give them credit for [04:38] I'm seeing bloom filters with 9,000 entries [04:38] I think the bottom level is < 100, but you have at least that many in the internal node [04:38] So to be useful, we would need bloom filters of ... 9000 bytes [04:39] Now, this is for the biggest pack in all packs [04:39] wheee thats quite some compression [04:39] oh right [04:39] yes, I was saying that only the bottom level of filter is really useful [04:39] lifeless: that *is* the bottom [04:40] leaf nodes + 1 internal = 9000 records per internal node [04:40] Otherwise you only have leaf nodes [04:40] oh [04:40] and if you've already read them ... [04:40] so the *only* time these are useful is when you have 1 root node, because you didn't fill it all the way === lamont` is now known as lamont [04:40] eh, this doesn't make sense to me [04:41] lifeless: each internal node can map to 100 other nodes [04:41] right? [04:41] how many keys are in each leaf?, and how many leaves per internal node [04:41] And each leaf node can hold 90 [04:41] so, 9K yes [04:41] but - we *are* seeing IO avoidance already [04:41] I don't know what our FPR is in practice [04:41] I get approximately the same values by just using "Num nodes / row_length" manually looking at the headers [04:42] lifeless: we aren't getting any hits on the big packs [04:42] it is only packs with < 9k records total [04:42] so you have a single root node [04:42] jam: ah, interesting theory [04:42] jam: making indexbench tell us this would be good [04:42] lifeless: -Dindex will print out the bloom statistics for every finalize [04:42] also, trying say a 512byte filter - which will push out some nodes from the internal [04:42] Finished node 1 with bloom filter . [04:43] lifeless: the only problem is the finish_node() has no idea what level it is at, so I had to hack in another debug statement before it gets called [04:43] jam: yes, I added that :P. What I mean is at the external level, to tell us what the aggregate usefulness is of the bottom internal row [04:44] so, there is one other case that it slightly helps, which is the 'last added' node. [04:44] because that one is also not always full [04:46] still, minimal [04:46] ok, there is someting for you to play with [04:47] I shelved the pyx code contents pushed the rest [04:49] jam: ^ [04:49] oh, and iix fits more than tix does into each leaf node [04:49] yeah, I saw the commit, [04:50] I bumped up the bloom to 1K, and we still get 7K entries, and 98.8% full [04:50] Wrote: 4710400 with 1K blooms instead of 4706304 with 256 byte blooms [04:51] so only added 1 page in all of that output :) [04:51] I guess we have some room to spare [04:53] still, 98% is not saving much [04:53] 2K? [04:53] (meep) [04:53] well, we have a few other ones that start hitting 45% full, which is "optimal" [04:53] I'll try 2K next [04:53] but I'm running the benchmarks first [04:53] meep? [04:53] lifeless: note the 'gray level' isn't == FPR [04:55] jam: its the number of bits on; 2% is the number that can possibly miss, and we get 5 chances at that per value [04:55] right [04:55] jam: so 10% on average, or 'not saving much' :) [04:55] sure, but 10% >> 2% [04:55] true [04:56] So, I wasn't translating all the way up to the FPR because we both know the maths :) [04:56] I'm going to give a shot at 2K when this test run finishes, I forgot to supply --no-graph [04:56] anyhow, meep was simply expressing dismay at 2K/4K being bloom [04:57] may also find the chunk code is starting to generate more slack space at that fraction too [04:58] yay gcc [04:58] _parse_btree_c.c:91: warning: â defined but not used [04:58] _so_ not helpful [04:58] well, I just hit AssertionError: Somehow we ended up with too much compressed data, 4221 > 4096 [04:58] with 2K bloom filters :) [04:59] jam: I think that my reserved hack is not quite solid enough [04:59] I'm not quite sure why, as I'm using "2*capacity" [04:59] I guess maybe we don't get our 2:1 compression with < 2K bytes to work with [05:00] dropping to 1.5 [05:02] ultimately, I think we need to make blooms too big to really be useful, which was something I felt before [05:02] It might be a bit better with the md5 bloom [05:02] hard to say at these levels ... [05:04] lifeless: I can see that 2k blooms work "better" than 1k blooms for miss torture [05:04] 309k bloom misses, versus 282k bloom misses [05:04] and it saves ... 13.2s => 12.6s [05:05] search_from_revid is *slower* 1.597 => 1.639 [05:05] get_components is faster 15.143 => 14.394 [05:05] with 24.4k hits versus 19.8k hits [05:07] jam: well the bloom gets compressed [05:07] jam: but a grey one won't compress *that* well [05:07] well, a 100% grey compresses very well [05:07] all 1s :) [05:07] but is useless :) [05:08] also, 100% grey == black! [05:08] yeah, unfortunately optimal blooms don't compress well [05:08] lifeless: black ... or white depending on your terminal color [05:08] and whether you consider 1 whiter or blacker than 0 [05:08] I'm seeing 90% grey at the level 1 nodes for the big packs, and we are down 13k => 8k entries for iix, and 9k => 4.6k for tix [05:09] with 75% for tix [05:09] and a corresponding large increase in l1 nodes [05:09] so broadly [05:09] we can - not use a bloom [05:09] - use a bloom for the entire table, and page it in size [05:10] I would say if we want to use a bloom, it needs to be on a page of its own *next to* the node mapping page [05:10] use blooms for the bottom row only (giving use better fan out high up the tree [05:11] 4096 bytes for 9k records would give us ~4 bits / entry [05:11] (3.64) [05:11] which is in the ~20% FPR range [05:12] lifeless: the one advantage of a whole-pack bloom is that you know what pages to read in, as soon as you know how big the bloom is [05:12] for as many keys as you want to check [05:12] so while you are right [05:13] that it would be 3-round-trips to get down to the almost leaf node [05:13] it is only 1 round-trip, but more total data [05:13] jam: actually, I was comparing against no-blooms :) [05:13] and for what we want, doesn't scale terribly well with number of keys [05:14] so aiming for very low FPR is one approach [05:14] another approach to analysing this is to say: [05:15] how much IO do we add (for each scenario) by having the bloom, whether it is a global, or bottom-internal-only, or whatever. And how much do we save? [05:15] 10% unset is around 50% FPR (more or less) [05:16] for halve the pointer count in the internal node [05:16] if we strip the bloom from higher nodes [05:16] we've doubled our bottom internal node row count [05:16] lifeless: but we don't have many intermediate nodes anyway [05:17] if we figure a bloom-less node is 1:100 fan out [05:17] jam: right, b+Tree structure for the win [05:17] and a bloom one is 1:50 [05:17] its still waaay faster than bisection to locate an individual key [05:17] and the tips are... ~100 [05:17] jam: anyhow, my point is to compare IO load, not to compare a specific FPR [05:18] you have to have 500,000 nodes before it starts getting interesting [05:18] lifeless: so... what is saves us is searching for missing keys in the bottom most layer [05:18] which helps 'miss torture' but not much else [05:20] hmm... you might consider using them for small packs [05:20] as you could even aggregate them if you wanted to [05:20] so miss_torture is trying to simulate something I think we see in real world [05:20] which the other tests don't yet show [05:21] anything with CompositeGraphIndex in there will start to show it [05:21] lifeless: I think 'search_from_revid' is a reasonable approximation [05:21] and it benefits more from having fewer internal nodes [05:22] and removing the CPU overhead of computing sha1s to compare against the bloom [05:23] I'm a bit curious about start+end keys at this point [05:23] It would be the same 50% overhead [05:23] but would we get better filtering than bloom [05:23] (10%) [05:24] I don't think so [05:24] also bloom will be getting nearly 50% filtering [05:24] (at 10% unset bits) [05:24] erm, well, 1-0.9^5 or something [05:25] 43% [05:27] lifeless: Well, removing the blooms entirely (page_size = 16 bytes), I get 10.550s for miss torture [05:28] lifeless: can you help me work the glidepath for stacking? [05:28] which is down from 13s at 1k, and 12s at 2k [05:28] afaics there is nothing no more needing review [05:28] so i'd like to do some tweaks/merges [05:29] poolie: oh dang, I fgort to do the annotate performance test [05:29] jam: 16 bytes? [05:29] lifeless: just something to make them ignored [05:30] We don't have a way to just turn them off right now [05:30] the point is that they a) aren't being used during read, and b) aren't taking up much space [05:30] lifeless: i don't mind doing that test [05:30] but i'd like to know the overall map [05:31] if i just work through from the bottom up and merge them will we be done? [05:31] poolie: yes, my shallow branch loom has them all except aarons policy work [05:31] poolie: working from bottom up will bring in the features [05:32] ok [05:32] any tips, or suggestions of anything better to do? [05:32] poolie: and the bug with 'bzr branch not-stacking-format --stacked localpath' is still something I need to fix (by changing the format on the fly' [05:32] poolie: for annotate, if its dog slow, undo the delete and do if(len(knit._fallback_vfs)) else: old_code [05:33] right [05:33] poolie: other than that, no surprises that I know of [05:35] jam: heh, your bzr-service c client looks pretty similar to mine in some ways [05:35] mwhudson: yeah, but i wrote that 2.5 years ago :) [05:35] just never caught on, I guess [05:35] we must have similar levels of c (in)experience [05:35] probably win32 lacking os.fork() didn't help [05:39] i guess doing something fancy with ptys would make some degree of sense [05:42] how light could you make the client in python? [05:43] never mind, I just noticed client.py :-) [05:43] mlh: pretty light :) [05:43] mwhudson: I remember the big off-put [05:43] gpg signing [05:44] It would prompt you for the signature in the original shell [05:44] not the one you were in [05:44] and fixing *that* wasn't worth my effort [05:44] I guess all state has to be passed every request .. and the bze service would have to be changed likewise [05:45] jam: i think doing the stdout/stderr overriding in a more unixy way might help there [05:45] jam: but yeah, effort [05:46] mwhudson: gpg talks to the tty directly, just like ssh [05:47] I *think* you might be able to set GPG_TTY [05:47] but then you have to start passing env vars to the back end, etc. [05:47] jam: forkpty, maybe? [05:48] you can make another file descriptor the controlling terminal for your process *somehow*, i'm sure [05:48] mwhudson: you must close the existing one then the next tty you open will become yours by default iirc [05:49] or something like that [05:49] you might need to change session group? [05:49] i do have a book that describes this if you really care [05:49] i have apue somewhere [05:49] but i'm very unlikely to care enough to work on this [05:49] 'linux application development' also has several bags of this crack [05:49] or whatever crack comes in. tubes? [05:50] i semi-enjoy unix-hackery on this level, but not really enough to put actual work in [05:50] poolie: have you seen pyrepl? [05:51] right, poolie, play innocent [05:51] Anyway, => sleepytime [05:51] have a good evening [05:51] gnight [05:51] night john [05:51] mwhudson: no, i haven't, also python.net seems to be down [05:52] yeah, it's some weird-ass virtual machine slice these days and doesn't work very well [05:52] i guess i could put the bzrlib apidocs on people.ubuntu.com [06:03] ok [06:03] 7 seconds saved [06:03] by parsing just the key in C [06:03] clearly faster, I'll finish the conversion [06:30] Hi all [06:30] My last chance to get Bazaar working the way I need it before I give up and go back to SVN. [06:30] huh [06:31] Does anyone know a way to get Loggerhead working though the fastcgi wrapper for wcgi? [06:32] Or whatever TurboGears uses? [06:32] you tried and it didn't work? [06:32] I've got no idea how to do it. The Loggerhead instructions assume you've got it running as a daemon [06:33] There's no mention of any other way of doing it. [06:33] some coding will be required [06:34] from loggerhead.apps.filesystem import BranchesFromFileServer [06:34] BranchesFromFileServer('/path') <- there's your wsgi app [06:36] morning [06:41] hello pygi [06:47] Whee, I've got automated bisection working. [06:47] hello Odd_Bloke [06:47] in pqm? [06:48] Nope, in the bisect plugin. [06:49] I've been tidying up my development directory and stumbled on some stuff I started at the London sprint. [06:49] Odd_Bloke: can it bisect a graph? [06:50] Odd_Bloke: cool [06:50] jamesh: I dunno, I'll test. [06:51] It's just hooking into existing bisect stuff. [06:51] The majority of this patch is a while loop. :p [06:53] It's not quick on bzr.dev. [06:54] Oh, and it's bombed with an encoding issue. [06:54] Probably Lukas' fault. :p [06:58] jamesh: It'll find a dotted revision, if that's what you meant by 'bisect a graph'. [06:59] Odd_Bloke: I guess what I meant was to use the graph when working out what revisions to test [07:00] jamesh: I don't know. Jeff Licquia wrote most of it, I've just plugged in the necessary bits to get automation. [07:00] Odd_Bloke: if you keep the assumption that "one revision causes the breakage", each successful test indicates that the revision's entire ancestry is clean, and every failed test indicates that every other revision that includes this one in its revision history is broken [07:01] rather than treating it as a simple linear sequence of revisions you can cut in the middle [07:01] Right, I suspect it just cuts it, but I don't actually know either way. [07:01] I think cutting in the middle is fine for left-hand [07:01] hi pygi. Thanks for your blog encouraging more interop between the DVCSs. For large projects, dynamic interop will be slow but I think there's promise in using fastimport/export for keeping high-performing mirrors around and synched [07:02] so bisect (mainline), then bisect(new in branch) [07:02] how does git-bisect handle it? [07:03] not entirely sure [07:04] ok, saved 30/120 ish seconds [07:04] spiv, ping? [07:11] spiv: so, are the hooks boom or bust? [07:12] pygi: my experience is that fastimport fails with darcs... [07:13] lifeless: boom [07:14] cool [07:14] I have iter_random_one down to 78.722, [07:14] just by parsing faster [07:18] It looks like bisect does restart on the branch when it bisects to a merge revision. [08:15] What word can I use to refer to { trees, branches, repositories }? Elements? [08:15] thingies? [08:15] :) === Toksyuryel` is now known as Toksyuryel [08:19] Odd_Bloke: objects? components? [08:25] that's it from me today. Night all [08:42] hmmm [08:42] night igc [08:43] I wonder what the gc does with swapped out pages [08:45] howdy lifeless [08:47] hi beuno [08:48] lh search merged yet? () [08:48] heh, I promise I'll review the current code today, and send it off to the list so mwhudson feels more pressured :) [08:49] I've showered and slept, so today should be more productive [08:49] lol [08:49] where are you? [08:49] Canonical [08:50] London? [08:50] 16 hour on a plane, went straight to the meeting [08:50] yeap [08:50] cool, didn't even know there was a meeting ;) [08:50] you going to GUADEC? [08:50] bzr-unrelated stuff [08:51] sounds fun :) [08:51] I don't think so, I'm here for a few days (may end up staying longer though) [08:51] it is! [08:52] Guadec is in Spain, isn't it? [08:52] turkey [08:52] next week [08:52] monday ! [08:53] ah, then I'm sure I'm not going [08:53] I do still have to make some things pretty for Jc2k [08:53] :) [08:53] I hope I can get to that this weekend [08:57] lol [08:57] BTreeIndex: iter_random_one in 199.008, [08:58] I don't have a backlog, how much is it without your BTreeIndex magic? [08:58] dunno yet [08:58] its new [08:58] its actually iter_random_one_reopen [09:02] and - much longer - still going [09:04] oh, cool. mwhudson go the new version of LH back up again in LP [09:04] and it's fast! [09:06] still using mega ****wads of ram [09:07] but at least that isn't affecting the rest of the codehosting any more [09:07] mwhudson, more or less then before? [09:07] still going [09:10] GraphIndex: iter_random_one in 794.916, [09:10] beuno: well, different [09:10] beuno: it's probably growing more slowly [09:10] beuno: if it was using this much on the old machine it would have been killed by now [09:11] ah, not very encouraging [09:11] mwhudson: kudos [09:11] beuno: still, the sysadmins are much happier :) [09:11] mwhudson, is there some way you can get more information on what's eating up most of the ram? if not, is there anyone in Canonical's office I can stalk to do so? [09:12] beuno: the sysadmins during london's day [09:12] beuno: elmo is the boss sysadmin [09:12] right, but you threw hardware at the problem, which isn't ideal as we can't send free hardware with each LH download :p [09:12] beuno: probably not without adding some code [09:12] beuno: no one else is running a site remotely like LP yet, fortunately [09:13] beuno: being *able* to throw hardware at it is a feature [09:13] mwhudson, well, the guy that opened up the 100% CPU bug maybe [09:13] hmmm pretty things :) [09:14] beuno: the obvious suspect for ram-chewage is all the whole history data we store [09:15] if it's that, and i guess one could add some code to estimate it, it probably wouldn't be hard to store it much more compactly [09:15] Jc2k, :) how's LH running on your server? is it ram-friendly enough? [09:15] indeed, its been fine [09:16] that's good news! also considering that you're using search too [09:16] yes, but search is good :) [09:16] (/me stops wanking) [09:16] lol [09:17] :) [09:18] I tried search with BTree [09:18] massively bigger index [09:18] but faster still [09:18] so there is room I think for having the last page not round to 4K [09:18] jam: ^ [09:19] (search has _many_ very small indices [09:19] igc, ;) [09:19] gour, well, file a bug with them? :) [09:21] pygi: https://bugs.launchpad.net/bzr-fastimport/+bug/232177 [09:21] Launchpad bug 232177 in bzr-fastimport "Better darcs support needed" [Undecided,New] [09:22] lifeless: hooks patch sent to list [09:23] spiv: rocking! [09:23] gour, isn't that a darcs problem with fastexport rather? [09:24] pygi: darcs-to-git and darcs2git, yes [09:26] spiv: I've approved; one thought though [09:26] spiv: perhaps the hook catpure logic could be pulled up [09:26] pygi: the point is darcs --> bzr conversion [09:26] poolie: ping [09:26] lifeless: in the tests? Yeah, I guess so. [09:27] Some of the test methods are also identical. [09:27] spiv: I don't think a common base class is _that_ ugly [09:27] gour, well, I understand. What I'm asking is: Is darcs fast-export implementation correct? [09:29] If the common base class inherits from TestCase, and has tests methods, but is not meant to be run, it's a bit ugly. [09:30] pygi: dunno. tailor is atm the only tool capable of darcs{1,2} --> bzr === brilliantout is now known as brilliantnut [09:31] * spiv goes for a swim [09:32] lifeless: pong, off the phone now [09:33] igc, we will see what happens, I'd certainly like if people would be able to collaborate even if they disagree on some stuff [09:33] poolie: feel like another call ? [09:33] :-} [09:34] let me get a cuppa and i'll call you [09:34] K [09:34] pygi: let'em choose whatever dvcs as long as it's called bzr :-) [09:37] gour, hehe [09:37] spiv: is there a simple recipe for 'give me a 300ms local server' [09:37] spiv: ? [09:46] spiv: what we do there is not give it test methods, but helpers :P [09:46] spiv: add an intermediate class :) [10:03] <\sh> luks: ping qbzr ppa rebuild hardy ;) [10:09] When merging branches, I have some duplicate conflicts where the same file was added to both branches individually, however the files are identical. Is there a merge algorithm that will take into account that the files are the same and not detect a conflict? [10:11] semslie: unfortunately not, as file joins are not implemented yet. [10:11] james_w: what are file joins? [10:11] they wouldn't need to be to not have a conflict, but bzr currently errs on the side of safety. [10:12] semslie: bzr uses file-ids to track file identity. If a file is added independently in two different places it will get two different file ids. When they are merged this causes the conflict. [10:13] a file join would mark the file as having two file-ids in its history, so there would be no need to conflict. [10:13] james_w: thanks === enobrev is now known as [enobrev] [10:14] resolving this conflict currently is just picking one file-id. You could have a merge algorithm that did that, and most users wouldn't care which was picked, so it's probably a safe thing to do. [10:14] however, I'm not sure that there aren't issues with symmetry that could trip that up. [10:14] semslie: there is a bug report open requesting this if you would like to subscribe. [10:15] I'll take a look [10:37] james_w: do you know where I could find a description of the different merge algorithms available? [10:41] semslie: I don't, sorry. [10:47] semslie: jam is working on a plugin that accounts for this [10:47] thanks lifeless [10:47] semslie: it doesn't address the root cause james_w mentions [10:48] semslie: http://bzr.arbash-meinel.com/branches/bzr/1.6-dev/merge3_per_file [10:50] (by plugin, I clearly mean branch) [10:50] http://bzr.arbash-meinel.com/plugins/per_file_remerge may be useful too [10:51] dunno how to make it jump etc, just saw it on the list [11:08] mtaylor: hey [11:08] hi lifeless [11:09] lifeless: I was in the middle of testing something for you, wasn't I... [11:09] mtaylor: dunno, were you ? ;) [11:09] lifeless: ok. good! then no! [11:09] mtaylor: bzr-search has come a long way since your last attempt i think [11:09] :) [11:09] sweet. I'll pull again. [11:10] also there is now a new index layer under development [11:10] should help people with very large indices:). e.g. mysql. [11:10] thanks for the tips. after playing around a bit I seem to be finding that bazaar detects conflicts in places where subversion does not, which is a bit puzzling. I dont think subversion is wrong here, but it would be nice to see a good descriptions of the different merge algorithms and why they might be detecting conflicts [11:11] semslie: well, the code is authoritative, but please do file a bug saying you couldn't get the info you needed [11:11] semslie: also, if you have a particular false-conflict, if you can describe it perhaps we can help you understand why you get it [11:12] mtaylor: lp:~lifeless/+junk/bzr-index2 [11:12] lifeless: should I pull that as a separate plugin then? [11:12] or is that a branch of search? [11:12] mtaylor: adds a btree based format; if you wanted to give it a spin (in a new repository, obviously), and tell me how it goes I'd love the feedback [11:12] mtaylor: index2 is a new repository format [11:13] mtaylor: not related directly to bzr-search at all [11:13] oh. so but will it go in in plugins? or is it a branch of bzr then? [11:13] plugins [11:13] as 'index2' [11:14] lifeless: have you tried index2 on any bzr-svn repos? [11:14] cool [11:14] mwhudson: haven't created a rich root variation yet [11:14] lifeless: oh right [11:14] mwhudson: currently have a to-london test running though [11:14] just thinking that bzr-svn indices tend to be 'rather large' [11:15] ImportError: No module named pybloom.pybloom [11:15] gah [11:15] mtaylor: oh, also pull [11:15] :) [11:15] http://bzr.arbash-meinel.com/plugins/pybloom/ to plugins/pybloom [11:16] lifeless: cool [11:17] mwhudson: its about 4 times faster at getting a single key out from 'go' to 'woah' [11:20] mtaylor: so concretely, in case I was unclear - I'm talking about doing 'bzr init-repo --btree-plain foo'; bzr branch mysql-6.0 foo/6.0; or similar [11:20] then do log etc [11:21] not about bzr-search on the btree format - that will be basically unchanged [11:21] raise KeyError('Key %r already registered' % key) [11:21] KeyError: "Key 'btree-plain' already registered" [11:21] mtaylor: oh, one other thing - optional C extension [11:21] right [11:21] in index2, do ./setup build_ext -i [11:21] mtaylor: did you pull the index2 plugin twice? [11:21] nope. ImportError: cannot import name _KnitGraphIndex [11:22] I get "foo" already registered sometimes when a plugin crashes spectacularly [11:22] mtaylor: ah! this will want bzr.dev [11:22] :) [11:22] do we have running new-package-on-push .debs for bzr.dev yet? [11:22] because we need them [11:23] mtaylor: I don't think we do at the moment, I agree. [11:23] kiko!!! [11:23] not kiko's problem :) [11:25] ah... but how cool would it be to have a PPA auto-generate from a bzr branch, no? [11:25] mtaylor: indeed [11:26] i think this idea has been mentioned once or twice over the years... [11:26] lifeless: cache-hot, on my smallish project, that gives me a ca. 25% speedup on 'bzr log >/dev/null' [11:26] lifeless: nice. [11:26] PPA-from-bzr-branch++ [11:27] Kinnison: without the C extensions? [11:27] lifeless: I built the C extensions [11:27] Kinnison: cool [11:27] * Kinnison tries on something bigger [11:27] Kinnison: should still be faster with [11:27] *without* [11:30] lifeless: testing against my local copy of bzr.dev yields a 25% improvement in bzr log too [11:30] lifeless: really cool work [11:30] does anyone know what generates file ids like "x_Chris_Halls__Fri_Nov__4_10:34:47_2005_22857.16" ? [11:30] Kinnison: thanks [11:30] james_w: tla [11:31] lifeless: ah, thanks. [11:32] Kinnison: is 25% good ? :) [11:33] lifeless: It's a matter of a second on my smallish dev tree [11:33] so 4 to 3 ? [11:34] lifeless: aye [11:34] lifeless: and 8 to 6 on bzr.dev [11:34] Kinnison: try a cold cache comparison [11:34] erm [11:34] by which I mean [11:34] drop caches [11:34] run bzr info [11:34] then time bzr log [11:34] how do I drop the caches? [11:34] $ cat bin/drop-caches [11:34] #!/bin/sh [11:34] # get written data to disk (not that its guaranteed) [11:34] sync [11:34] # (drop the unmodified pages) [11:35] echo 3 | sudo dd of=/proc/sys/vm/drop_caches 2>/dev/null [11:36] man, this network test is up to hours now [11:36] gunna be a long benchmark [11:37] branching launchpad into an index2 repo is taking a good long while too [11:38] lifeless: Irritatingly, the cold-cache time on the index2 tree is stable where it is wildly variable on the non index2 tree [11:38] mwhudson: index writing is a little slower [11:38] Kinnison: likely the non index2 tree is split more on disk [11:38] lifeless: placing index2 on cold-cache anywhere from 25% slower to 50% faster [11:38] Kinnison: new repository single pull -> packed [11:39] Kinnison: strange [11:39] * Kinnison times cold-cache on a new copy of my test branch [11:40] on a fresh pull it's slightly more stable, and puts them on a par [11:40] well within experimental error [11:49] mwhudson: did it finish? [12:00] lifeless: yeah [12:02] mwhudson: is it faster? [12:11] $ bzr log --short -rtag:0.0.1-release.. [12:11] bzr: ERROR: Selected log formatter only supports mainline revisions. [12:12] what does that mean? [12:12] -rtag:0.0.1-release.. was in same branch [12:12] other branches have been merged and such to it of course but why doesn't it work then === emgent_ is now known as emgent [12:14] lifeless: yep, 20s vs 30s for log > /dev/null [12:14] the btree one is better packed though ofc [12:18] mwhudson: 'bzr pack' in both :) [12:19] lifeless: yeah, on that [12:19] lifeless: they're about the same speed after the pack [12:21] mwhudson: hmm :P [12:21] mwhudson: the new index should degrade better, or thats the theory [12:21] mwhudson: you could try upping the cache too [12:22] lifeless: i think i might try going to bed :-p [12:22] mwhudson: ciao! [12:53] mtaylor: did that work for you? [12:53] lifeless: was at lunch... haven't tested yet [12:54] mtaylor: no worries [13:00] lifeless: lp:bzr should get me bzr.dev, right? [13:00] uhm probably :P [13:00] (thats yes, yes it will( [13:00] ok. good [13:02] testing now [13:02] log isn't actually a great test, I think IO to get revisions rapidly outweighs indexing, except where the index blows :P [13:05] well, what is a good test? [13:05] 'doing stuff' I think :) [13:05] log will show up obvious problems :) [13:06] :) [13:11] log -r -400 or something [13:11] that should be faster [13:16] lifeless: still branching in to the new repo [13:16] lifeless: have I mentioned how much I enjoy copying 60k revs around ? [13:16] mtaylor: you enjoy it a lot [13:16] yes. [13:17] I'd really like to do it more [13:17] please sir [13:17] may I have another [13:18] yksi iso olut, olkaa hyvä. [13:18] LarstiQ: *blink* [13:18] lifeless: about the same in Finnish :) [13:19] * LarstiQ practices a bit for his extended stay in Finland from the 11th onwards [13:19] lifeless: that translates "May I have one beer please" [13:19] LarstiQ: cool [13:19] big beer even [13:20] the only sort :P [13:20] but hang on, why are you practicing asking for beer? [13:20] lifeless: ah well, this was the phrase as listed in my dictionary ;) [13:21] would have to add the orangejuice lookup [13:37] lifeless, james_w: turns out bazaar was doing the right thing by correctly finding the last common ancestor, while we were messing it up and checking against the wrong ancestor. problem solved, bazaar ftw :) [13:38] cool, good to know. [13:38] one thing I found was that I preferred the results of a weave merge to the default [13:38] I'm not sure what the implications of that is [13:43] gnigt all [14:10] hey hey folks =) [14:17] Hi all [14:18] I was wondering if anyone knows a good way to "push" a repository to a dev server? At the moment, I have a repository setup but would like to update a live version for testing without manually uploading the files [14:18] I read that using another working copy isn't the best solution, so wondered if there are any other solutions? [14:19] vxnick, I can think od 2 ways [14:19] there's a push-and-update plugin which will do that [14:19] oh, excellent [14:19] and there's the bzr-upload plugin which will *just* upload the working tree [14:20] no other revision information except the last revision uploaded, so next time you only uploaded what has changed [14:21] beuno: thanks a lot, I'll take a look at those :) [14:21] vxnick, you're welcome [14:26] jelmer: I'd be interested to talk about the bzr-gtk Bundle Buggy some time. [14:32] beuno: bzr-upload works great, thanks again! [14:34] abentley: Hi [14:34] abentley: Sure, what about it? [14:34] I've moved trunk to use postgres, so you may be surprised if you just update blindly. OTOH, BB is much more stable on postgres. [14:35] So I would recommend switching at some point. [14:36] abentley: I already updated :-) Apart from the fact that I had to point the configuration at sqlite again it seems to still work fine [14:36] Cool. Sorry about that. The sqlite-ness has always been assumed as part of the configuration. [14:36] Frankly, I'm not sure what the right way to handle this is. [14:37] I could do a local.conf, but a local-prod.conf and a local-dev.conf? [14:38] abentley: No worries, it was easy to deal with [14:39] abentley: Any particular reason pgsql is better than sqlite or just performance? [14:39] jelmer: Stability. TG interacts poorly with the way SQLite does locking. PG and TG play better together. [14:40] abentley: ah, ok [14:40] Also, schema migration is nicer. [14:42] So far locking hasn't been a problem for us, but I'll see if I can migrate it at some point anyway. [14:43] jelmer: BB seems to crash for you occasionally. I assumed it was that. [14:43] locking granularity leading to improved concurrency is why we moved our trac from sqlite to postgres. [14:45] abentley: No, the main problem at the moment is DHCP leases expiring and DNS updating failing for some reason [14:45] Oh, weird. [14:46] jelmer: Okay. So I'll try not to make any changes that require Postgres, but if I goof, please let me know. [14:46] abentley: Will do [14:53] Hi, I am trying to merge some launchpad branches [14:53] https://code.launchpad.net/~vcs-imports/subdownloader/trunk shall go into https://code.launchpad.net/~subdownloader-developers/subdownloader/trunk [14:53] Here is what I did. Check out the second branch and cd into it [14:54] Then "bzr merge lp:~vcs-imports/subdownloader/trunk" which resulted in http://rafb.net/p/rCkYQS78.html [14:56] hi Laibsch [14:56] it sounds like https://bugs.edge.launchpad.net/bzr/+bug/177874 [14:56] Launchpad bug 177874 in bzr "upgrading to rich-root-pack fails" [Critical,Fix released] [15:00] Great [15:00] james_w: thank you for the info [15:04] hello, how can I print the revision number in my sourcecode ? [15:04] tacone: bzr revno? [15:06] not. what shuold I write in my code, to make the current revision number appear inside that file ? [15:07] tacone: that's not possible yet, sorry [15:08] 1.6 might have it, if not then 1.7 I expect [15:09] not a bit problem :-). thank you for your answer. [15:15] james_w: isn't there another command/program to do this? I could've sworn I saw something about this in the manual for printing revision info to source code [15:15] I was looking for it myself last night - it'd be nice to output the revision number to a PHP file [15:16] vxnick: sed ? :-) [15:16] tacone: yeah I've done it with that, but it'd be nice to have a post-commit hook to do it, but I'm not familiar with Python [15:17] bzr version-info [15:17] you can hook that in to your build system to do it. [15:17] Actually, I used sed for subversion revision numbers, with a post-commit hook [15:17] james_w: that's the one :-) [15:17] any ideas for that though? [15:17] i.e. integrating into a build system [15:18] I've never done it, sorry [15:19] fair enough [15:19] thanks anyways [15:21] found a reference: http://doc.bazaar-vcs.org/bzr.0.90/en/user-guide/version_info.html [15:34] I didn't get an answer before [15:35] $ bzr log --short -rtag:0.0.1-release.. [15:35] bzr: ERROR: Selected log formatter only supports mainline revisions. [15:35] what does that mean? [15:35] does anyone around now know? [15:35] the tag was in the same branch [15:35] AnMaster: what does "bzr log -rtag:0.0.1-release" show? [15:36] (no --short, no "..") [15:36] a sec [15:37] http://rafb.net/p/RQruyr60.html [15:38] interesting [15:38] and "bzr log --short" works fine? [15:38] james_w, no it gives the error I gave above [15:38] AnMaster: without the .. [15:38] when done for that tag [15:38] even with no -r [15:39] ? [15:39] oh ok a sec [15:39] * LarstiQ is suspecting the .. includes non-mainline revisions which --short doesn't like. [15:39] without any -r short format works [15:39] sounds like a bug to me [15:39] one more to try "bzr log --short -rtag:0.0.1-release" [15:39] I suspect corrupted repo on some way because the revision number from "bzr log --short -rtag:0.0.1-release" is different [15:40] 449.2.20 Arvid Norlander 2007-11-13 [15:40] Relase of envbot 0.0.1! [15:40] why does revision number differ [15:40] oh wow, that is odd. [15:40] corrupted repo? [15:40] is this a public branch? [15:40] AnMaster: I doubt that. [15:41] james_w, well both branches are public, just sshed in to the server and there it works, server runs FreeBSD 6.3 with bzr 1.5 [15:41] python 2.5 [15:41] AnMaster: what do you use locally? [15:41] my desktop where it gives odd results runs Linux (arch linux currently), with python 2.4 [15:41] bzr 1.5 [15:42] however it is mounted over nfs from a gentoo box [15:42] does that matter? [15:42] shouldn't. [15:43] well let me check on the gentoo box (also bzr 1.5 and python 2.4) [15:44] ok this is probably not a bug in bzr I guess... [15:44] Reason.... the gentoo box now has two keyboard leds blinking (= kernel oops).... could be hardware issues on it I guess [15:44] :/ [15:44] sigh [15:44] ah. [15:45] well got to debug that, when I find out the cause of that I will get back to you (if it wasn't memory corruption simply) [15:45] (or harddrive issues or whatever) === james_w_ is now known as james_w [15:46] AnMaster: good luck [15:46] LarstiQ, I *do* have backups [15:46] daily to tape [15:47] still, faulty hardware sucks [15:47] yes it does [15:47] LarstiQ, anyway it could as well have been something else than hardware that crashed it, it runs X with binary nvidia driver (bleh I know, but I need 3D for various reasons) [15:48] * AnMaster books into memcheck [15:48] AnMaster: we'll still be here when you've sorted it all out :) [15:48] heh [15:57] did someone remember the url announcing mysql switch to bzr ? [15:58] vila, http://elliotmurphy.com/2008/06/19/mysql-converts-to-bazaar-and-why-it-matters/ [15:59] oops... link's on that post, though (got that from planer bazaar) [15:59] enobrev: thks [15:59] http://blogs.mysql.com/kaj/2008/06/19/version-control-thanks-bitkeeper-welcome-bazaar/ === thekorn_ is now known as thekorn [16:20] Hello is somebody using the Bazaar plugin for eclipse? [16:22] kwk, i am [16:22] on win32 [16:22] erm, xp [16:24] enobrev. Ok, did you experience problems when trying to set the bazaar preferences? I'am using the plugin on ubuntu with Eclipse Ganymede and it doesn't work when trying to specify the bzr executable. A bug report is already filed (https://bugs.launchpad.net/bzr-eclipse/+bug/245136). [16:24] Launchpad bug 245136 in bzr-java-lib "Cannot use Preferences page to configure executable if existing settings are in error" [Undecided,New] [16:25] kwk, nope worked ok for me on both granymede and europa. haven't tried it on my ubuntu system yet [16:26] ubottu: exactly. enobrev, can you please send me you configuration file from YOURWORKSPACE/.metadata/.plugins/org.vcs.bazaar.eclipse.core(if such a file exists)? The bug report says that there has to be an initial configuration file that is working. Therefore I ask. [16:26] kwk: Error: I am only a bot, please don't think I'm intelligent :) [16:27] bug 236162 [16:27] Launchpad bug 236162 in bzr-eclipse "branch action is broken for absolute path " [Medium,Fix committed] https://launchpad.net/bugs/236162 [16:27] nice [16:28] kwk, dir's empty [16:28] enobrev, hmm, do have any clue where the settings are located? [16:29] (looking now) [16:33] kwk, not sure... not seeing it anywhere [16:33] enobrev, OK thank you very much [16:34] LarstiQ, still there? bad ram. I guess that caused it [16:36] AnMaster: that is, if you try what you did before now, the problem doesn't surface? [16:36] AnMaster: it is of course possible that the problems were unrelated, and there is a real bug inbzr. [16:37] indeed that is the case [16:37] anyway I shut it down now, still got warranty on this ram at least === brilliantnut is now known as brilliantout [17:45] will "bzr get svn+ssh://rolf@svn.gnucash.org/repo/gnucash" check out all available branches? [17:46] Laibsch: no, you'd need to use "bzr svn-import" for that [17:46] OK [17:46] Thanks [17:48] abentley, is there info about the merge proposal email support somewhere? [17:49] jelmer: Should I even check out all branches? I want trunk plus 2 others. I will likely want to try and merge and later rebase them. [17:49] Laibsch: In that case, I would recommend just retrieving those branches [17:49] I was wondering if that would be impossible if I checked out the branches individually [17:49] OK [17:49] Thanks [17:49] e.g. "bzr get svn+ssh://rolf@svn.gnucash.org/repo/gnucash/trunk" ( I think) [17:50] Yes, correct [17:50] * jelmer wonders if the svn+ssh url means Laibsch is a gnucash developer [17:50] jelmer: http://news.launchpad.net/cool-new-stuff/email-interface-to-code-review [17:50] abentley, thanks [17:50] jelmer: partially [17:50] I have partial rw access [17:51] abentley: What about submitting merge requests? [17:51] jelmer: That's done through the web right now. [17:52] abentley: ok - should I file a wishlist bug for being able to do that over email or is it already pleanned ? [17:52] We plan to support merge directives. Is that what you have in mind? [17:53] abentley: Yeah, being able to "bzr send" to some lp address [17:56] jam: hi, are you around? I have a question about the merge-into plugin. [17:58] jam: I'm just trying it on a dummy branch, and "merge-into" set a merge revision, but didn't seem to import the text changes. [18:04] I am having trouble setting up a dedicated smart-server, ... [18:04] I set it up like this: bzr serve --allow-writes --directory=/home/vgeddes/src/nis ... [18:05] but `bzr log bzr://localhost//home/vgeddes/src/nis/main' [18:05] returns `bzr: ERROR: Not a branch: "bzr://localhost/". [18:05] Maybe // is a problem? [18:05] Also, you do know --allow-writes means there's no auth whatsoever, and anyone can write to your branches? [18:05] Wait. [18:06] vgeddes: try bzr log bzr://localhost/main with that --directory [18:06] yeah, thats not the problem, I tried it without the // [18:06] Since you passed a --directory like that, bzr ser -- yeah, what he said. [18:06] ha, thanks ! [18:08] http://rafb.net/p/iVSTU253.html Is that bug 177874? How can I perform the merge? The branches had completely separate directories, there should be no conflicts [18:09] james_w: Are you testing with a branch that has commits? [18:09] abentley: yeah, one commit in each [18:11] Laibsch, not sure, this looks like a bug [18:11] Laibsch: Can you try merging from a local copy of that branch? [18:12] jelmer: How will that work? [18:12] I'll check this out into $dir [18:12] and then bzr merge $dir? [18:17] Well, in any case, those steps lead to apparently the same failure [18:18] Bazaar (bzr) 1.3.1 from ahrdy [18:18] hardy === blueyed__ is now known as blueyed [19:21] bkor, I think you have your answer now :) [19:29] Laibsch, please file a bug [19:32] Verterok: Aha [19:33] awilkins: hey [19:34] I'm having trouble ; the plugin isn't initializing properly somewhere ; to get use it with a project you have to disconnect it and re-share it [19:36] awilkins: are you using a build of the latest code? [19:37] Verterok: I was just going to pull ; I'm using a slightly patched build [19:37] Verterok: With the xmloutput fix I just mailed in [19:37] awilkins: yes, I meaned that :) [19:39] Verterok: Hmm, might not have revisions 163/162 [19:40] * Verterok checks what changes in 162-163 [19:41] There's also (undecided) [19:41] #245136 [19:41] awilkins: 163 is the BIG integration merge [19:41] :) [19:41] Still the same trouble [19:42] awilkins: revno 163 merge your plugin-dev branch and other improvements (allow selection of xmlrpc service port, etc) [19:43] I think I was on the integration branch ; I just merged it [19:43] Verterok: Alas, still same problem. [19:43] awilkins: steps to reproduce? a eclipse restart? [19:44] I was going to see if being a standalone over a repo branch makes a difference [19:45] Verterok: This is the plugin running in debug from another session which is how I typically run it until I think it's stable enough for normal use [19:45] I have no idea if it's linked to https://bugs.launchpad.net/bzr-eclipse/+bug/245136 at all [19:45] Launchpad bug 245136 in bzr-java-lib "Cannot use Preferences page to configure executable if existing settings are in error" [Undecided,New] [19:46] I just added a chunk of code that sets a reasonable default for Windows (well, reasonable for my personal config....) [19:46] awilkins: maybe it's related [19:48] awilkins: while in debug I encountered some problems when using and oldd workspace, all gone away once I reconfigured usign the new preference page [19:48] It springs the prefs page every time you use the share wizard ; would that be normal? [19:51] not at all [19:52] I'll trash the settings for the workspace [19:52] awilkins: I'll check the preference initializer [19:53] Hm, there are no settings [19:53] Should there be a file in .metadata/.plugins/org...core ? [19:55] awilkins: I don't think so, but let me double check [19:55] Where does it store it then? [19:57] awilkins: ./org.eclipse.core.runtime/.settings/org.vcs.bazaar.eclipse.ui.prefs and ./org.eclipse.core.runtime/.settings/org.vcs.bazaar.eclipse.core.prefs [19:59] Ok, I've trashed those two files. Darn, should've kept them [19:59] I'll do the same here and see if I can reproduce the preference reset issue [20:00] I'm seeing it right now ; I changed the default preference for the UI, not the actual setting. [20:01] So it's displaying the .bat file and saying "cannot run program "bzr" cannot find file specified" [20:01] Or that last merge erased my changes [20:02] awilkins: you chanhed the preference initializer in the UI to point to the bzr.bat? [20:02] and keeps telling about bzr file [20:02] did I understood right? [20:02] Hi. I want to produce a patch for a changeset that removes a binary file and can be applied with the patch program. With no VCS, `diff -Naur' would do this. What do I use with bzr? [20:03] Yes, the preference says the bat file, the error message is as if it was just trying to run "bzr" like you would on posix [20:03] Aha, but if you summon the dialog AGAIN, it doesn't complain [20:04] awilkins: the core plugin also haave a preference initializer :P [20:04] Stills says "no client factory found" though [20:04] fbond: bzr diff ? [20:04] Verterok: I assume you haven't tried it. [20:05] fbond: I used bzr diff but I don't know if it's the same as 'diff -Naur' [20:05] fbond: and I used the generated diff with patch [20:06] Verterok: Then your binary file isn't being removed by patch. [20:06] fbond: oh, I missed the binary file part :P I never used it with binary data [20:07] awilkins: that means the bzr-java-lib can create a IBazaarClient [20:07] awilkins: commandline or xmlrpc [20:08] Verterok: Ok, I patched the core initializer with the same default value ; now the projects are connected but no overlays [20:08] I've tried bzr diff -r X..Y --using diff --diff-options '-Nau' but that produces funny output that I can't blame patch for not accepting. [20:08] Verterok: They were previously in the "Connected/Error" state where only the disconnect operation was available [20:08] This is XMLRPC [20:08] awilkins: if you trashed all the preferences, you should enbale the decorators again [20:09] Verterok: The status decorator is enabled in the defaults [20:09] awilkins: jusst disable/enbale it [20:09] Verterok: But only shows after you visit the property page, it seems.... must be in the UI defaults [20:10] * awilkins restarts workspace again [20:11] Ok, workspace restarted... now the projects are in the same state [20:11] (Connected/Error) [20:11] No stack traces in the console of the calling workspace [20:12] (host workspace) [20:12] Verterok: Suddenly starts working after visiting the main preferences page (and not touching anything) [20:12] awilkins: ohh, I see... [20:12] Verterok: Let me see if that is also true when you cancel it [20:13] awilkins: I think it's only doing the setup of the client when you enter the pref. page [20:14] Verterok: The prefs being a singleton isn't going to work either ; I think that holds on both posix and windows, you just haven't run into it on posix because the default config is valid for posix [20:15] Verterok: If you visit the prefs and cancel, it doesn't show decorators [20:15] Verterok: Not for "OK" either [20:15] Verterok: Or "Apply" ... [20:16] Aha, [20:16] To get decorators, you have to visit the decorators prefs and "OK" [20:17] awilkins: yes [20:17] To get functional team menu I think you have to visit main prefs and OK [20:17] awilkins: I'll try to reproduce that problem [20:17] You don't get decorators if you visit/ok decorators WITHOUT having functional Team menu [20:18] awilkins: actually I don't fully agree with the non-singleton preferences :) but this problem have more priority [20:19] Verterok: How does the prefs dialog check to see whether it holds valid prefs if the prefs are a singleton, and not set until the prefs page is valid? [20:20] awilkins: the bzr-java-lib preferences are setup when the core plugin is started (valid or invalid) [20:20] awilkins: in the prefs. dialog when apply/Ok the prefs are stored with the eclipse preferencestore [20:21] awilkins: and applied to the BazaarClientPreferences singleton [20:21] Verterok: Yes, but the validity check in the prefs page uses the prefs to start the client - the old prefs, not the prefs in the page [20:22] awilkins: ups, I missed that :P [20:22] So if the old prefs are invalid, it's impossible to set new ones [20:23] Verterok: So by all means have a default static prefs instance but you need to be able to create a new one in the prefs page for validation and use it [20:23] awilkins: heh, I forgot about the preference listener [20:23] awilkins: PreferenceStoreChangeListener [20:29] awilkins: that updates the preference store of the core [20:31] Verterok: Is that the bit which loads them in the first place? [20:31] Verterok: Or is it that the UI prefs are never being stored in the core? [20:32] a bit of both, I think I missed the update of the client when the core/ui are updated [20:32] but the core prefs are used in the core start to update the client preferences [20:32] Verterok: The prefs are not being saved either (but they haven't changed from the defaults) [20:33] * awilkins changes them [20:33] If I can iron this out, I'll do some more testing and deploy it to my users (minus the SaveableEditorInput). [20:34] awilkins: basically I missed a call to EclipseBazaarCore.updateClientPreferences() in PreferenceStoreChangeListener [20:34] I think that adding a call to that should make the trick :) [20:34] did anyone ever already suggest bzr diff --color? [20:35] mgedmin: I think there is a plugin for it [20:36] okay, then, did anyone ever suggest bundling it with bzr so that users could get pleasurable experience right out of the box? [20:37] awilkins: I can add a prefs. change listener to the core and fire the client prefs. update when the core preferences changes [20:38] Verterok: I added the core-from-UI-prefs update line [20:39] Verterok: It fixes the prefs page but not the (Connected/Error) state on init [20:39] awilkins: ok, let me know if that fixes the issue [20:40] awilkins: I don't fully understand what "(Connected/Error) state on init" means [20:40] :P [20:41] It means that when you start, the project thinks it's connected to a bzr repo, but is in an error state so only the disconnect menu item is available [20:42] This goes away after visiting the Team/Bazaar p[refs page [20:42] The decorators remain absent until you visit the decorators pref page [20:42] By the way, there is a difference between defaults, and your settings happening to BE the defaults [20:43] awilkins: ok, the problem can be that a client can't be created, let me debug it a bit [20:43] If you explicitly duplicate the default settings for decorators, the prefs file vanishes [20:43] It should not ; an upgrade may change the defaults and you may not want that [20:43] (IMHO) [20:45] awilkins: I'm thinking in remove the executable related prefs from UI, and store it only in the core [20:45] (to avoid this weird errors, and ease the manteinance) [20:45] Verterok: I would agree ; my UI prefs file is empty/absent anyway [20:46] awilkins: I can confirm the error state at init === mw is now known as mw|food [20:47] Verterok: You have a windows box ATM? [20:47] awilkins: nope, testinng on Mac OS X 10.4 [20:50] awilkins: but I borrowed vnc access to a win XP box from beuno office (/me waves beuno) [20:50] awilkins: I still need to configure bzr, Java, eclipse, but having an box with XP is a start :) [20:51] Heh, it's good that not everyone is using Ubuntu for bzr (although I like it, and it's the primary OS of the wife and her sister) [20:52] I'm using Vista ; I ran into a lovely issue with shelve ; apparently, Vista thinks that gnu patch needs admin rights because it's called "patch.exe" [20:52] If you try and run it it either crashes or spams a UAC box [20:53] Until you hand-edit a manifest resource into the file that tells Vista it doesn't really need Admin rights.... [20:55] awilkins: useful trick to know, did you added it on the wiki, your blog? [20:55] awilkins: Indeed, Ubuntu is quite nice, but I primary use OS X 10.4 (my Gentoo box died a few weeks ago) [20:56] Verterok: I added it to the sourceforge bug which already described it but the solution didn't work immediately (I think it caches manifests, I had to rename the file a few times) [20:58] awilkins: great :) [20:58] If nothing else, putting solutions to things online is always a good idea because sooner or later I run into the problem again and forget how I fixed it [20:59] hehe, sure [21:03] awilkins: I think error state at init can be solved by calling EclipseBazaarCore.checkClient() in EclipseBazaarCore.start [21:04] Righto, back in 1 hr [21:04] awilkins: seeya [21:07] So, `bzr replay -r 821.. foo' should replay commits 822, 823, etc., but not revision 821, right? [21:11] If i've moved a directory containing a repository and some checkouts to a different path, what can I do to fix the link from the checkouts to the repository? [21:13] guess i have to manually frob .bzr/branch/location [21:14] it's a little annoying that a trailing newline in that file breaks stuff === mw|food is now known as mw [21:37] indigo: bzr switch NEWPATH === jaypipes is now known as jaypipes-away [21:38] i already mangled location manually; is that a problem? [21:38] it seems to be working.. [21:38] hi bkor [21:38] hey lifeless [21:38] indigo: no problem, just letting you know the easier way :) [21:38] Moin. [21:38] ah, thanks [21:38] too bad there isn't a bzr fix-bug-in-this-old-code :( [21:39] indigo: hmm, I wonder :) [21:39] or even bzr get-this-old-code-to-compile [21:40] jam: poing [21:40] indigo: $ moap code --help [21:40] commands: [21:40] develop develop code [21:46] lifeless: boing boing [21:48] bkor, moap? [21:48] jelmer: http://thomas.apestaart.org/moap/trac [21:48] I use it for ChangeLog generation [21:48] jam: what do you think of running with the new index format? [21:49] jam: I have a network stress test still running overnight :X [21:50] bkor, ah [21:50] does bzr have what Hg calls "named branches"? [21:50] schmichael: all of our branches are named [21:50] hm... i don't think i'm using the right terms [21:50] schmichael: You can't have multiple branches in one directory. [21:51] Peng: ah, thanks [21:51] schmichael: hg has a single directory with multiple heads [21:51] schmichael: and the ability to name some revisions with movable labels which they then call named branches [21:51] schmichael: such things being useful because they reuse the working tree and repository [21:52] schmichael: we allow working tree reuse and repository sharing, but we do it by doing: [21:52] bzr init-repo --no-trees REPO [21:52] bzr branch THING REPO/NAME [21:52] bzr checkout --lightweight REPO/NAME WORKINGTREE [21:53] cd WORKINGTREE; bzr switch NAME2; bzr switch NAME3 etc [21:53] bzr switch? [21:53] i'm not familiar with what that does... [21:53] new this-week: http://jam-bazaar.blogspot.com/2008/07/this-week-in-bazaar.html [21:53] schmichael: it switches a checkout from one branch to another [21:54] lifeless: great! thanks! [21:54] schmichael: no problem :) [21:55] hg's named branch aren't really moving labels.. They're tied into the revision's meta data. [21:58] Peng, lifeless: thanks! [21:59] branches* [21:59] Peng: you can commit to a named branch right? [21:59] (in hg) [21:59] I dunno. I think it might just be that when you commit, if the parent is a named branch, the new revision inherits the name. [22:00] Peng: I thought it was a list of names that pointed at revision hashes [22:00] lifeless: yes [22:00] well actually i'm trying to learn the details of named branches in hg [22:00] Peng: because otherwise you can't delete them [22:00] they're new and kind of awkward [22:01] lifeless: Correct, you can't delete them. [22:01] Peng: oh, I didn't realise that. ouchies. [22:01] I think git's named branches are basically automatically-moving tags. [22:01] way to make the cost of using them permanent [22:01] Peng: they are [22:02] Peng: they are reasonably sensible, except for the all-in-one-place aspect [22:02] (I have a bzr repo with 13K branches; don't really want that in a plain text file thanks( [22:02] Heh. [22:03] Does bzr perform well if you have thousands of tags? [22:03] lifeless: well, at one point hg's named branches was a simple 'tag' that said "and follow decendents until you have no more". And it was actually committed into the repository, I don't know if they have moved away from that yet or not. [22:04] Peng: not terribly well; but tags are per-branch, not per-repository, so they are cappable and trimmable without losing history [22:04] I think our scaling for N branches in a shared repository is quite good. I don't know that we have tuned the tag support as much [22:05] lifeless: as for running with the current btree stuff.... we certainly can [22:05] I was hoping to have one more poke at a whole-pack bloom filter [22:05] Since I worked out resizing, etc. [22:06] It would be a single global one, so the code gets to be a bit simpler as well [22:09] jam: sure, I think we can spend our respective fridays tweaking [22:09] jam: I want to validate that networking doesn't blow [22:09] lifeless: yeah, though we have a pretty good idea that it won't [22:09] did you see iter_random_one_reload ? [22:09] the only thing I might add is speculative reading of extra pages over a network link [22:10] lifeless: not specifically [22:10] I think I saw you performance testing it [22:10] does iter_random_one [22:10] with what 800s => 200s ? [22:10] but reloads the index on each key [22:11] so its testing raw 'key a get' speed [22:11] (e.g. no cache for either index) [22:11] sure [22:11] its been running for 12 hours now over the network :P [22:11] lifeless: but if you really wanted to be mean, you should run "drop_caches()" inbetween [22:12] lifeless: ouch... [22:12] jam: network - what cache :P [22:12] jam: (and I do) [22:12] lifeless: of course, that makes your machine pretty useless while it is running :) [22:12] for key in order: |from bzrlib.repository import format_registry as repo_registry [22:12] index.iter_entries([key]).next() |repo_registry.register_lazy( [22:12] if reload_index: | 'Bazaar development format - btree-rich-root (needs bzr.dev from 1.6)\n', [22:12] index = factory(index._transport, index._name, index._size) | 'bzrlib.plugins.index2.repofmt', [22:12] drop_cache() | 'RepositoryFormatPackBTreeRichRoot', [22:12] um, some weird side-by-side pasting there [22:13] yes, wrong vim copy command [22:13] :) [22:15] hmm, we need 'bzr unpack' to test btrees better :) [22:15] lifeless: so... whose to blame for 12 hours worth of pain? [22:15] well [22:15] 99704 keys [22:15] say 3 IO's per key on btree for the larger index [22:15] Is btree better/worse than graph at that point (I fully expect it to be better) [22:16] 0.3 seconds per IO -> 1 sec per key [22:16] and is this what, you to Chinstrap? [22:16] yes [22:16] that's about 1 day [22:16] yes [22:16] 1.15 days [22:16] I didn't think it out in advance you see :) [22:16] so I'm going to stop it :) [22:17] lifeless: hence why we should be doing speculative reading of extra pages [22:17] and do 20 keys [22:17] a round-trip should never be < 64k, or something like that [22:17] jam: so there are two things I'd like to see [22:17] I'd like the last page to be allowed to be truncated [22:17] so that a small index is, well, small [22:17] sure [22:18] I saw that in the backlog [22:18] what you need is tail recursion, or whatever it was called :) [22:18] and I'd like to read more internal nodes [22:18] Otherwise if they are separate, you're going to get the same disk space regardless [22:18] they are laid out as they are to allow the read-more-than-one optimisation [22:18] lifeless: you mean internal nodes up front ? yeah [22:19] jam: for instance, reading 64K on a > 64K index in the first read [22:19] and then any read covering internal nodes do the same [22:19] but only read 4K for leaf node requests [22:20] I would just try to spread it out, such that if we request 16 4k disjoint pages, we don't actually try to get 16-64k pages [22:20] e.g. do range(node, node+16) [22:20] more of a "if there is some more room, grab these as well" [22:20] jam: sure [22:20] in fact [22:20] the single key workflow is probably best kept at 'minimise IO' [22:21] but when you have two or three keys being asked for you can see graph walking happening, so start being aggressive [22:21] lifeless: interesting thought [22:21] and probably quite reasonable [22:21] jam: next thing I'm going to do though, is to write a in-place upgrader for these repository formats [22:23] lifeless: not very hard, right? Just write the indexes to upload, rename into place? [22:23] jam: roughly yes [22:24] Have you ever considered packing indexes without packing the data? [22:24] jam: create a new indices dir with the right contents, save the old format, remove the format marker, pivot the indices dirs, write the final format marker, remove the saved format marker [22:25] jam: well, we didn't have indices that could be anything other than optimal. or do you mean one index N packs ? [22:25] lifeless: right [22:25] The idea is to get the benefit of 1 index [22:26] without the cost of moving the data around [22:26] Just an idea that came up [22:26] I haven't thought about it in-depth [22:26] one index N packs would make ensuring integrity and so on quite a bit harder [22:27] is citeseer working for you? [22:28] lifeless: http://citeseer.ist.psu.edu/ is down [22:28] dang, was going to point you at LSM trees [22:28] and re-read it myself [22:28] Well, there was also something about CSB+ trees that were supposed to be cache sensitive [22:29] but that was more for in-memory DB [22:29] lifeless: http://www.google.com/search?q=lsm+trees&ie=utf-8&oe=utf-8&aq=t&rls=org.mozilla:en-US:official&client=firefox-a [22:29] First link is a postscript file on www.cs.umb.edu [22:30] lets run some arbitrary postscript over the web [22:31] yah thats it [22:31] a bunch of similarities with what we do [22:36] well, we run arbitrary pdf all the time, right? [22:37] :) [22:39] lifeless: what key is best for Andrew Bennetts? [22:39] spiv [22:39] he seems to have 3 registered with subkeys.pgp.net [22:39] oh, sorry, that is michael hudson who has multiples [22:39] mwhudson: ^^^?? [22:40] And it seems he doesn't have any of them in *my* web of trust [22:41] jam: ah right, i kept losing hard drives with private keys on them :( [22:41] jam: the one on lp.net is right [22:46] I was thinking of http://www.freenet.org.nz/python/embeddingpyrex/ [22:47] for startup speed [22:47] lifeless: except you still have to "import bzrlib" which is the whole cost [22:48] now if we could get pyrex to compile bzrlib.... [22:48] that would be interesting [22:48] jam: right, thats the point [22:48] I'm pretty sure a single .so can provide > 1 modules [22:48] with a little love [22:49] lifeless: well, you could have 'bzrlibmodule.so' [22:49] and that would clearly be able to have "bzrlib.foo" [22:49] I'm not 100% sure about "from bzrlib.foo.bar.baz import bing" [22:50] loading is left to right [22:50] so it should work [22:51] well, standard attributes would confuse the __import__ stuff, but if you tricked it into thinking they were modules [22:54] jam: types.ModuleType [22:54] jam: by 'trick' I hope you mean 'make them modules' :) [22:55] the fact that 'from os.path import join' works means this isn't too hard [22:55] (os not being a package, and os.path being around from before there _were_ packages) [22:58] mwhudson: it sure does make it hard to track down where to find the *code* for os.path.foo though :) [22:59] jam: not as hard as pyrexing it all up :) [22:59] Especially when the function is "nt.lstat()" but the code for it is found in 'posixmodule.c' [22:59] oh yes, that is a good trick [23:03] jam: design thought [23:04] jam: how does this sound: 'to convert repository instance Y into a Z, we use an InterRepositoryRepositoryFormat instance' [23:04] to simplify all cut&paste you do each time? [23:04] well currently we do a full fetch [23:04] so this is new code [23:05] I'm proposing that as the design/lookup [23:07] I thought we already had upgrade/downgrade infrastructure [23:07] we do [23:07] MetaToMeta does [23:07] if repo._format != desired_format: [23:07] init_new [23:07] new.fetch(repo) [23:07] ouch [23:07] pivot() [23:08] which is fine, it works and is very generic [23:08] and until now we haven't had a meta format which would benefit from special casing [23:08] (except perhaps plain -> richroot). anyhow: [23:08] what do you think of the approach. [23:09] lifeless: well, interestingly enough, you already have: [23:09] # TODO: conversions of Branch and Tree should be done by [23:09] # InterXFormat lookups [23:09] But not a comment for Repository [23:10] so... I'm okay with it, though it is a layer of abstraction, which I don't think will see a lot of benefit [23:10] as we won't have tonnes of Repo<=>Repo converters that benefit [23:10] its pluggable though, which is a win I think [23:10] versus the work that has to be done anyway to make 'fetch()' work well. [23:11] sure [23:11] so, I'm fine with the idea, though I wouldn't work terribly hard to write the code just yet myself [23:11] I like being able to have plugins provide special tools for their customizations [23:12] it is just code to support, and maintain, and etc. [23:12] For 1 plugin which is going to be core RSN [23:12] jam: well, I've written several before that would have benefited [23:13] and I'm expecting we'll keep hacking on index2 for a while; its just going to be a code dump to get the first generation in to core [23:13] lifeless: reasonable enough, if you've seen a need for it, it is reasonable to bring in [23:13] <== is a big fan of plugins [23:13] having written the original import code :) [23:14] and a slew of plugins myself [23:14] :) [23:15] lifeless: so, would it be reasonable for me to strip out the per-internal-node bloom code [23:15] I don't think we'll see much benefit there [23:15] though I can leave it if you want [23:16] jam: I'd kind of like to see network impact of 50% blooms, that sort of thing [23:16] ok [23:16] I'll leave it if you are still poking at it [23:16] jam: so unless its in the way, I'd rather leave it - but o course make it not cost [much] when not using [23:17] I was going to change how blooms get generated, but I can do so compatibly [23:17] don't worry about disk layout compatibility, anyone using this today can convert back :P