/srv/irclogs.ubuntu.com/2012/04/29/#ubuntu-nz.txt

=== ojwb is now known as Guest55901
=== Guest55901 is now known as ojwb
=== ojwb is now known as Guest74151
=== Guest74151 is now known as ojwb
snailmōrena19:50
ojwbmorning20:01
ibeardsleemorning20:02
chiltsmorning20:58
ajmitchmorning21:09
snailgot bitten badly by google last week. turns out that their RSS web crawler ignores robots.txt files. asking for an RSS feed of 5000 epubs every 3 minutes without caching makes my server sad when someone leaves their google homepage open for a couple of days21:10
ojwbeep21:10
* ojwb can see an argument for ignoring robots.txt there, but hitting every 3 minutes isn't sane21:11
snailthe answer, naturally, is caching21:17
snail:)21:17
mwhudsonmorning21:39
hadsIt's interesting to find weaknesses due to third parties activities. I discovered that I wasn't caching nicegear's rather heavy 404 page and just about brought the server down when some PCI-DSS company ran a stupid vulnerability scan.22:45
hadsAnd, morning.22:45
lifelesssnail: well, maybe make epub cheaper to deliver ?22:46
snaillifeless: the issue was that the RSS feed was based on a solr search. the ePubs are already stored as flat files on the disk22:58
lifelessah23:05
Atamiramorning23:13
kcjMorning.23:22

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!