Archive for September, 2008

What not to GET. Limiting what robots will request.

Monday, September 8th, 2008

I tested the spider at spider.my a few times recently. It was previously restricted to just a few sites that their respective admins had kindly volunteered. One of the immediate problems I noticed with releasing the spider in the wild was the number of pages I was mangling between storage ...