/srv/irclogs.ubuntu.com/2009/11/21/#ubuntu-irc.txt

=== niko is now known as evilniko
=== evilniko is now known as niko
fahadsadahWho manages http://irclogs.ubuntu.com?13:48
LjLfahadsadah: are you going to ask retroactive permission to wget it? :P13:48
fahadsadahNo.13:49
LjLfahadsadah: /whois ubuntulog13:49
fahadsadahActually, was going to ask for monolithic files13:49
fahadsadahOr at least a tarbal13:49
fahadsadah*tarball13:49
fahadsadahrt13:49
fahadsadahThat's something13:49
LjLfahadsadah: i doubt you will have much luck with that, tbh13:50
fahadsadahCan't find any rts in here13:50
LjLfahadsadah: it's not a single person13:50
fahadsadahA team?13:51
LjLsomething like that13:51
LjLbasically it's canonical13:51
Tm_Twhat I wonder is why you need such thing?13:51
fahadsadahStatistics13:52
=== DJones_ is now known as DJones
Tm_Thmm, what kind of statistics?13:52
fahadsadahpisg13:52
fahadsadahSimilar to http://digit.cluenet.org/clueirc.html13:52
Tm_Tfor some particular channel or all?13:52
LjLfahadsadah: it doesn't really take too long to wget the whole thing, anyway. i've done it, it's manageable13:52
fahadsadah#ubuntu13:53
Tm_Tfahadsadah: BTW pisg is known to be heavy, really heavy13:53
Tm_Tcompared to many others that is13:53
LjLalso, no fun with a channel like #ubuntu :P13:53
fahadsadahIt's also shiny13:53
Tm_Tindeed13:53
fahadsadahreally shiny13:53
fahadsadah=p13:53
Tm_Tfahadsadah: not shinier than any others in my experience13:54
fahadsadahPlease can you suggest one?13:55
LjLactually, /me goes to download the latest to see what the karmic spike was like13:55
fahadsadahLjL: You say the wget was manageable?13:56
Tm_Tfahadsadah: fisg, irssistats, ircstats ... there's others13:56
fahadsadahIt's been going for two hours, and is still in 200613:56
fahadsadahTm_T: Thanks13:56
LjLfahadsadah: uh. i don't really remember just how long it took for me, but that seems way too long.13:56
LjLfahadsadah: you're very sure it's downloading only the .txt, and only for #u?13:56
LjL(also, what's your connection like?)13:56
fahadsadah100Mbps13:57
LjLwell mine is 10...13:57
fahadsadahAnd it's downloading everything, then discarding everything that isn't index.html or #ubuntu.txt13:57
LjLoh. ouch.13:57
fahadsadahI know13:57
fahadsadahStupid wget13:57
Tm_T:-P13:57
Tm_TI wouldn't blame wget13:57
fahadsadahOK, stupid options passed to wget13:58
fahadsadahTantamount to stupid user13:58
LjLeh, i'm pretty sure wget can be made to not download the rest in the first place... also, you could just tell it which files to download in advance13:58
fahadsadahwget -rA "#ubuntu.txt,index.html" http://irclogs.ubuntu.com13:58
fahadsadahI Ctrl+Ced it14:00
fahadsadahSeeing as I know all the filenames I want, I'll make a file containing them, with ruby, then use wget -i14:00
fahadsadahThanks for your help14:00
LjLfahadsadah: yeah, i've done the same thing with php14:01
LjLfor($Date=mktime(12, 0, 0, 2, 16, 2008); $Date<time()-3600*48; $Date+=3600*24) {          $Filename="http://irclogs.ubuntu.com/".date("Y", $Date)."/".date("m", $Date)."/".date("d", $Date)."/%23ubuntu.txt";14:01
fahadsadahThat's only since 2008?14:02
LjLfahadsadah: because i already had the ones before that14:02
fahadsadahGreat, thanks14:03
* fahadsadah rubyfies14:03
LjLif i actually had the logs i'd just give you a tarball, but my php script processes them and then discards them, so i don't have them14:03
fahadsadahLjL: Fast.14:09
fahadsadahHasn't been half a minute, and I'm in 200514:10
LjLfahadsadah: well, 2004 has very few things14:10
LjLbut indeed, most of the time spent will be requesting files, rather than actually downloading them... and when you requested them all instead of just the #ubuntu one, that's death14:10
fahadsadahI wonder how much disk space five years of #ubuntu takes up?14:10
LjLi don't remember. too much for me to keep on my drive :P but that doesn't mean much, i hardly have have one gb free14:11
fahadsadahI'm on a linode 36014:11
fahadsadahSo 16GB disk14:11
LjLfahadsadah: i made a quick calculation, it should take about 1.5gb, perhaps less14:12
fahadsadahIn 200614:14
fahadsadahI'll probably do #ubuntu-offtopic too14:15
fahadsadahI'll make a cron job14:18
fahadsadahEvery day, it will download the previous day's14:19
fahadsadahAnd do a pisg regeneration14:19
LjLfahadsadah: -offtopic is not logged14:20
fahadsadahOh.14:22
fahadsadah=(14:22
fahadsadahWow14:31
fahadsadahIt's done14:31
fahadsadah=D14:31
neodirtchiefEnter text here...test22:01
McPeteroO22:03

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!