# Based roughly on http://en.wikipedia.org/robots.txt, but more lenient # since I don't have any real issues with traffic yet. # If you want a complete copy of the site, please don't use a webcrawler - # instead, e-mail me at webmaster@moonflare.com and ask me to produce an # offline copy for you. There are no official drops yet. # advertising-related bots: User-agent: Mediapartners-Google* Disallow: / User-agent: sitecheck.internetseer.com Disallow: / User-agent: grub-client Disallow: / User-agent: Gigabot Disallow: / # # Hits many times per second, not acceptable # http://www.nameprotect.com/botinfo.html User-agent: NPBot Disallow: / User-agent: WebReaper Disallow: / User-agent: * # Disallowed because they don't serve the same content with each fetch Disallow: /Special:Randompage Disallow: /Special%3ARandompage Disallow: /Special:Random Disallow: /Special%3ARandom Disallow: /Special:Search Disallow: /Special%3ASearch # Disallowed because they contain lots of irrelevant search keywords Disallow: /LiteratePrograms:Articles_found_using_search_engines Disallow: /LiteratePrograms:Articles_people_Google_for Disallow: /LP:Articles_found_using_search_engines Disallow: /LP:Articles_people_Google_for # Disallowed because people looking for "How to write an article" # for a literature class too often end up here. Disallow: /LiteratePrograms:How_to_write_an_article Disallow: /LP:How_to_write_an_article Crawl-delay: 1