Robot Identification Algorithm

Current Identification Algorithm

The following algorithm is used to determine whether a user is an indexing (or other) robot. If the remote machine's identification string (usually the User-Agent header of the HTTP request) includes any of the following, it is considered a robot:
In addition, several individual IP addresses have been blocked due to past behavior. This is hardly a secure method, since the identification string is supplied by the client, but it does allow robot-friendly pages to be served and other behavior to be blocked. Finally, for diagnostic and demonstration purposes, appending "?robot=yes" to the end of the URL lets a page be viewed as if it were requested by a search engine robot. However, this only works while you are logged out of the system.
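
The checks described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the site's actual code: the function name classify_request, the substrings in ROBOT_SUBSTRINGS, and the address in BLOCKED_IPS are hypothetical placeholders, and both the case-insensitive matching and the order of the checks are assumptions.

```python
from urllib.parse import urlsplit, parse_qs

# Hypothetical placeholders: the real list of identifying substrings is not shown here.
ROBOT_SUBSTRINGS = ("examplebot", "samplecrawler")

# Hypothetical blocked address, taken from a range reserved for documentation.
BLOCKED_IPS = {"192.0.2.10"}


def classify_request(user_agent, remote_ip, url, logged_in):
    """Return "blocked", "robot", or "human" for a single request."""
    # Individually blocked addresses are rejected first (assumed ordering).
    if remote_ip in BLOCKED_IPS:
        return "blocked"

    # Substring match against the client-supplied identification string.
    # Case-insensitive comparison is an assumption of this sketch.
    agent = (user_agent or "").lower()
    if any(token in agent for token in ROBOT_SUBSTRINGS):
        return "robot"

    # Diagnostic override: "?robot=yes" is honored only for logged-out users.
    query = parse_qs(urlsplit(url).query)
    if not logged_in and "yes" in query.get("robot", []):
        return "robot"

    return "human"
```

A caller would then pick the robot-friendly page when the result is "robot", refuse the request when it is "blocked", and serve the normal page otherwise. Because every input except the remote address comes from the client, the result should be treated as a hint for presentation rather than as access control.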