HTML5 Canvas
Written by: Simon Sarris
Featured Refcardz: Top Refcardz:
  1. Apache Hadoop
  2. Web Driver
  3. MVVM
  4. REST
  5. ADO.NET
  1. HTML5
  2. Ajax
  3. jQuery Selectors
  4. CSS Part 1
  5. Git

Link Details

Link 15650 thumbnail
User 71517 avatar

By Volume4
via robotstxt.org
Published: Mar 11 2007 / 22:12

The Robots Exclusion Protocol is very straightforward. In a nutshell it works like this: When a compliant Web Robot vists a site, it first checks for a "/robots.txt" URL on the site. If this URL exists, the Robot parses its contents for directives that instruct the robot not to visit certain parts of the site. As a Web Server Administrator you can create directives that make sense for your site. This page tells you how.
  • 11
  • 0
  • 1627
  • 0

Comments

Add your comment
User 217760 avatar

entrodus replied ago:

0 votes Vote down Vote up Reply

I dont use a robots.txt file in my site.
I dont know how bad would this be in SEO ways.
But really... i cant find i way to set a valid robots.txt

In my example, i only want robots.txt to have access to 4 specific files.
Nothing else in my whole server.
How can i achieve that, since i dont have an allow method in robots.txt specification?

Add your comment


Html tags not supported. Reply is editable for 5 minutes. Use [code lang="java|ruby|sql|css|xml"][/code] to post code snippets.

Voters For This Link (11)



Voters Against This Link (0)