[OGDev] URGENT Re: [OpenGuides-Dev] OG performance

Christopher Schmidt crschmidt at crschmidt.net
Thu Jul 26 02:03:46 BST 2007


On Thu, Jul 26, 2007 at 01:24:50AM +0100, Rev Simon Rumble wrote:
> Indeed, this should most certainly be done for all revisions of a 
> page except the current, so that if someone reverts spam without the 
> admin password, it's not indexed by the crawlers.
> 
> PS: I strongly doubt it's Google causing problems.  Google is a very 
> well behaved bot.  Others like the MSN one are much less well behaved.

I'm not convinced of that.

Google routinely and regularly fetches *large* pages on the Open Guide
to Boston that almost never change. Think Category Restaurant page --
1MB page, changes maybe once a week, Google fetches it daily.

Granted, OG Boston is particularly poorly optimized for this because we
use index_list in our category pages. The actual index_value, etc. mode
in wiki.cgi is significantly more lightweight. (Bad decision on my
part.) But you don't have to have someone fetching much data to hurt a
site, and even if Google is only requesting things slowly, they can
still exceed the return rate of the server.

Regards,
-- 
Christopher Schmidt
Web Developer



More information about the OpenGuides-Dev mailing list