/srv/irclogs.ubuntu.com/2012/06/03/#bzr.txt

=== yofel_ is now known as yofel
jimiswhich algorithm does bzr use to compress .pack files?21:52
jimisis it gzip/bzip2 or something?21:57
jimisdoes it spawn an external process to decompress or it uses a C module from within the same process?21:58
jimisit bogs down my CPU and I must find out why...21:59
wgzit's zlib, in process22:04
wgzcompression just takes work, there aren't magic fixes22:04
jimishmmm so I guess it could be related22:05
jimiswgz: there are fixes, not magic ofcourse22:05
jimiswgz: the .pack files I assume the contain many small compressed chunks. About how large is each?22:06
jimisI'm asking because I notice too much seeking too22:06
jimisI've already filed a bug report, I'm just looking a bit more into it22:07
wgzright, most of the obvious issues are much higher level than the comprssion step22:07
jimisprobably true, but if it reads through the *whole* pack files then it would take some time to decompress .5GB22:08
bob2?22:09
bob2there's an index file next to it22:09
jimisbob2: saw it, but still when stracing it I see too many reads, I think it's seeking through most of the file...22:09
jimisI should measure exactly how much it reads from disk22:10
wgzdecompressing an entire pack when only part of it needed may be a bug, that decompressing a big chunk of data takes cpu isn't :)22:10
jimiswgz: that's why I was asking about the comression22:10
jimisfirst, if it was spawning separate gzip processes then it would use a second core22:11
bob2except that if the decompression is the bottleneck like oyu say, that's of no use22:11
jimissecond, there are faster algorithms from gzip, with other tradeoffs of cource22:11
bob2have you read the pack format docs?22:11
jimistrue have to find out how much it reads :-p22:11
wgzI'd suggest that's the wrong end to tackle things from22:12
bob2+122:12
jimisbob2: only a generic doc is what I found22:12
bob2no idea what that means22:12
jimispointer to the pack docs?22:12
wgzfor something like bug 1006194 I'd start by seeing if it's opening the repo twice22:13
ubot5Launchpad bug 1006194 in Bazaar "bzr diff too slow (cpu intensive) on large projects" [Undecided,New] https://launchpad.net/bugs/100619422:13
wgzrather than assuming zlib can be improved on or trying to launch multiple workers22:13
jimiswgz: how can I see it?22:13
jimisand why would it be opening it twice?22:14
jimisbob2: http://doc.bazaar.canonical.com/latest/developers/packrepo.html is the doc I found22:14
wgzbecause the bug states that it's an issue when there are two different branches are involved22:15
jimisbut didn't mention anything about compression algorithm22:15
jimiswgz: even with same branch but -r-2 overhead is the same22:16
wgzalso see the mailing list archives about log performance22:16
jimislatest quarter?22:17
wgzjimis: eg. <https://lists.ubuntu.com/archives/bazaar/2011q4/073729.html>22:20
jimisthanks wgz, seems relevant22:22
jimisEven though these points may apply, bzr log -r-2 is instantaneous in my case.22:27

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!