PmWiki versions 0.6.0 and later already have built-in handling for most robots. This information is provided for earlier releases of PmWiki.
One good day Googlebot will visit your newly created PmWiki-web and will index the whole set-up from A to Z, literally. "That's fine", I hear you say, "let Googlebot index the whole lot and make my site the number one site in the Universe." Well, think again. What you do want is for Googlebot to index the 'regular' pages, like

However, what you don't want is a complete archive of every

There are two ways to prevent this scenario from happening. The first one is easy and consists of creating a robots.txt file in the root of your website (i.e.

The robots.txt approach

In 1993 and 1994 there were occasions where robots visited WWW servers where they weren't welcome for various reasons. Sometimes these reasons were robot-specific, e.g. certain robots swamped servers with rapid-fire requests, or retrieved the same files repeatedly. In other situations robots traversed parts of WWW servers that weren't suitable, e.g. very deep virtual trees, duplicated information, temporary information, or cgi-scripts with side-effects (such as voting).

The method used to exclude robots from a server is to create a file on the server which specifies an access policy for robots. This file must be accessible via HTTP on the local URL "/robots.txt".

To prevent Googlebot from accessing some of the unwanted page links, you would put the following statements in the robots.txt file:

User-agent: Googlebot
Disallow: */main/allrecentchanges$
Disallow: */pmwiki*
Disallow: */search*
Disallow: *recentchanges*
Disallow: *action=*
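
To see which URLs the rules above would catch, here is a minimal Python sketch. It assumes Google-style wildcard matching ('*' matches any run of characters, a trailing '$' anchors the end of the URL, and matching is case-sensitive); other robots may interpret these lines differently or ignore the wildcards. The names DISALLOW_RULES, rule_to_regex and is_blocked, as well as the sample paths in the usage note, are made up for illustration and are not part of PmWiki.

```python
import re

# Rules copied from the robots.txt fragment above.
DISALLOW_RULES = [
    "*/main/allrecentchanges$",
    "*/pmwiki*",
    "*/search*",
    "*recentchanges*",
    "*action=*",
]


def rule_to_regex(rule):
    """Translate one wildcard rule into a compiled regular expression.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the URL; otherwise the rule matches as a prefix.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_blocked(url_path, rules=DISALLOW_RULES):
    """Return True if any rule matches url_path from its first character."""
    return any(rule_to_regex(rule).match(url_path) for rule in rules)
```

With these rules, a hypothetical path such as "/wiki/Main/HomePage" stays crawlable, while "/wiki/Main/HomePage?action=edit" and "/wiki/main/allrecentchanges" are blocked. Because the comparison is case-sensitive in this sketch, "/wiki/Main/RecentChanges" is not caught by the lowercase "*recentchanges*" rule.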
However, PmWiki now by default includes special meta-information in the pages it returns for edit and diff actions, which instructs search engines to neither index the page nor follow any links in it. This removes the need for some of the lines in the robots.txt file.
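
One way to check whether a returned page carries such meta-information is sketched below in Python. It assumes the information takes the form of the standard robots meta tag (name "robots" with a comma-separated content value such as "noindex,nofollow"); the exact markup PmWiki emits is not shown here, and the names RobotsMetaParser and robots_directives are hypothetical.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives of any <meta name="robots"> tag in a page."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        # Attribute values can be None when an attribute has no value.
        if (attributes.get("name") or "").lower() != "robots":
            return
        content = attributes.get("content") or ""
        for part in content.split(","):
            if part.strip():
                self.directives.add(part.strip().lower())


def robots_directives(html):
    """Return the set of robots directives found in an HTML string."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives
```

A page whose head contains a robots meta tag with the content "noindex,nofollow" yields both directives; a page without such a tag yields an empty set, meaning a well-behaved robot would index it and follow its links.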
Page last modified on March 19, 2004, at 07:13 AM