=== ojwb is now known as Guest55901 | ||
=== Guest55901 is now known as ojwb | ||
=== ojwb is now known as Guest74151 | ||
=== Guest74151 is now known as ojwb | ||
snail | mōrena | 19:50 |
---|---|---|
ojwb | morning | 20:01 |
ibeardslee | morning | 20:02 |
chilts | morning | 20:58 |
ajmitch | morning | 21:09 |
snail | got bitten badly by google last week. turns out that their RSS web crawler ignores robots.txt files. asking for an RSS feed of 5000 epubs every 3 minutes without caching makes my server sad when someone leaves their google homepage open for a couple of days | 21:10 |
ojwb | eep | 21:10 |
* ojwb can see an argument for ignoring robots.txt there, but hitting every 3 minutes isn't sane | 21:11 | |
snail | the answer, naturally, is caching | 21:17 |
snail | :) | 21:17 |
mwhudson | morning | 21:39 |
hads | It's interesting to find weaknesses due to third parties' activities. I discovered that I wasn't caching nicegear's rather heavy 404 page and just about brought the server down when some PCI-DSS company ran a stupid vulnerability scan. | 22:45 |
hads | And, morning. | 22:45 |
lifeless | snail: well, maybe make epub cheaper to deliver ? | 22:46 |
snail | lifeless: the issue was that the RSS feed was based on a solr search. the ePubs are already stored as flat files on the disk | 22:58 |
lifeless | ah | 23:05 |
Atamira | morning | 23:13 |
kcj | Morning. | 23:22 |
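
The caching fix snail mentions could look something like the sketch below: a small time-based cache in front of an expensive feed build, so that a client polling every 3 minutes only triggers one real search per expiry window. This is an illustration under assumptions, not snail's actual setup. `TTLCache`, `build_feed_from_search`, and the 15-minute lifetime are all hypothetical names and values, and the real feed came from a solr query that is not shown in the log.

```python
import time
from typing import Callable, Dict, Tuple


class TTLCache:
    """Keep the result of an expensive render call for a fixed number of seconds."""

    def __init__(self, ttl_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.ttl_seconds = ttl_seconds
        self.clock = clock  # injectable so expiry can be tested without sleeping
        self._entries: Dict[str, Tuple[float, str]] = {}

    def get_or_render(self, key: str, render: Callable[[], str]) -> str:
        now = self.clock()
        entry = self._entries.get(key)
        if entry is not None and now - entry[0] < self.ttl_seconds:
            return entry[1]  # still fresh: skip the expensive call
        body = render()  # missing or expired: rebuild and remember when
        self._entries[key] = (now, body)
        return body


def build_feed_from_search() -> str:
    """Hypothetical stand-in for the expensive search-backed feed build."""
    build_feed_from_search.calls += 1
    return "<rss><!-- feed built %d time(s) --></rss>" % build_feed_from_search.calls


build_feed_from_search.calls = 0


if __name__ == "__main__":
    cache = TTLCache(ttl_seconds=15 * 60)  # assumed lifetime, tune to taste
    for _ in range(3):
        # Three back-to-back requests, but only the first one runs the build.
        cache.get_or_render("epub-feed", build_feed_from_search)
    print(build_feed_from_search.calls)
```

The same wrapper would also cover the heavy 404 page hads describes, keyed on something like `"404"` instead of the feed name.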