Wednesday, November 7, 2012

Tell robots not to index a page, part 1

Robots are little pieces of technology that find links for the search engines. This makes it so the pages discovered by the robots, (also called "spiders.") appear in the search engine's results. This is part one of how to keep Google, Yahoo, Bing, Gigablast, Fireburst and other search engines from indexing a single page or pages of a site. This is done by meta tags inserted between the <head> and </head> tags.
Keep a bot from indexing a page
To keep a "good" bot from adding a page to its index insert this meta tag in the head:

<meta name="robots" content="noindex" />

Keep bots from following links on a page
<meta name="robots" content="nofollow" />


Keep a bot from following a specific link

<a href="URL" rel="nofollow">link text</a>


Other bot restriction Tags

Keep bots from indexing images on your page:

<meta name="robots" content="noimageindex" />


Keep bots from archiving a copy of your page:

<meta name="robots" content="noarchive" />

Keep bots from translating the page in search result
<meta name="robots" content="notranslate" />

No comments:

Post a Comment

Feel free to comment. Keep it nice, polite and on topic. You may link to a site if it contains content relates to the content of whatever post you are commenting on. You may link to a site that only uses, for example, HTML5 video. Remember, on topic!