ACAP - Automated Content Access Protocol
- http://www.the-acap.org/
- Standard being developed on behalf of content publishers to communicate permissions information in more detail than robots.txt allows. Includes project documents, implementation details, and background information.
Robotstxt.org
- http://www.robotstxt.org/
- Information on the Robots Exclusion Standard (robots.txt), plus articles about writing well-behaved Web robots.
User Agent String
- http://user-agent-string.info/
- Tool from ASAP Consulting s.r.o. for detailed user agent string analysis using an online form. Includes databases of browsers and robots.
User-Agents.org
- http://www.user-agents.org/
- Large list of search engine spiders, similar Web robots, and Web browsers, with their web-log identification and links to their originators.