=== niko is now known as evilniko === evilniko is now known as niko [13:48] Who manages http://irclogs.ubuntu.com? [13:48] fahadsadah: are you going to ask retroactive permission to wget it? :P [13:49] No. [13:49] fahadsadah: /whois ubuntulog [13:49] Actually, was going to ask for monolithic files [13:49] Or at least a tarbal [13:49] *tarball [13:49] rt [13:49] That's something [13:50] fahadsadah: i doubt you will have much luck with that, tbh [13:50] Can't find any rts in here [13:50] fahadsadah: it's not a single person [13:51] A team? [13:51] something like that [13:51] basically it's canonical [13:51] what I wonder is why you need such thing? [13:52] Statistics === DJones_ is now known as DJones [13:52] hmm, what kind of statistics? [13:52] pisg [13:52] Similar to http://digit.cluenet.org/clueirc.html [13:52] for some particular channel or all? [13:52] fahadsadah: it doesn't really take too long to wget the whole thing, anyway. i've done it, it's manageable [13:53] #ubuntu [13:53] fahadsadah: BTW pisg is known to be heavy, really heavy [13:53] compared to many others that is [13:53] also, no fun with a channel like #ubuntu :P [13:53] It's also shiny [13:53] indeed [13:53] really shiny [13:53] =p [13:54] fahadsadah: not shinier than any others in my experience [13:55] Please can you suggest one? [13:55] actually, /me goes to download the latest to see what the karmic spike was like [13:56] LjL: You say the wget was manageable? [13:56] fahadsadah: fisg, irssistats, ircstats ... there's others [13:56] It's been going for two hours, and is still in 2006 [13:56] Tm_T: Thanks [13:56] fahadsadah: uh. i don't really remember just how long it took for me, but that seems way too long. [13:56] fahadsadah: you're very sure it's downloading only the .txt, and only for #u? [13:56] (also, what's your connection like?) [13:57] 100Mbps [13:57] well mine is 10... [13:57] And it's downloading everything, then discarding everything that isn't index.html or #ubuntu.txt [13:57] oh. ouch. [13:57] I know [13:57] Stupid wget [13:57] :-P [13:57] I wouldn't blame wget [13:58] OK, stupid options passed to wget [13:58] Tantamount to stupid user [13:58] eh, i'm pretty sure wget can be made to not download the rest in the first place... also, you could just tell it which files to download in advance [13:58] wget -rA "#ubuntu.txt,index.html" http://irclogs.ubuntu.com [14:00] I Ctrl+Ced it [14:00] Seeing as I know all the filenames I want, I'll make a file containing them, with ruby, then use wget -i [14:00] Thanks for your help [14:01] fahadsadah: yeah, i've done the same thing with php [14:01] for($Date=mktime(12, 0, 0, 2, 16, 2008); $Date That's only since 2008? [14:02] fahadsadah: because i already had the ones before that [14:03] Great, thanks [14:03] * fahadsadah rubyfies [14:03] if i actually had the logs i'd just give you a tarball, but my php script processes them and then discards them, so i don't have them [14:09] LjL: Fast. [14:10] Hasn't been half a minute, and I'm in 2005 [14:10] fahadsadah: well, 2004 has very few things [14:10] but indeed, most of the time spent will be requesting files, rather than actually downloading them... and when you requested them all instead of just the #ubuntu one, that's death [14:10] I wonder how much disk space five years of #ubuntu takes up? [14:11] i don't remember. too much for me to keep on my drive :P but that doesn't mean much, i hardly have have one gb free [14:11] I'm on a linode 360 [14:11] So 16GB disk [14:12] fahadsadah: i made a quick calculation, it should take about 1.5gb, perhaps less [14:14] In 2006 [14:15] I'll probably do #ubuntu-offtopic too [14:18] I'll make a cron job [14:19] Every day, it will download the previous day's [14:19] And do a pisg regeneration [14:20] fahadsadah: -offtopic is not logged [14:22] Oh. [14:22] =( [14:31] Wow [14:31] It's done [14:31] =D [22:01] Enter text here...test [22:03] oO