Search
Recommended Products
Related Links


 

 

Informative Articles

Criteria for Choosing an Ideal Web Hosting Company
Before we talk about what it takes to be a cheap and good quality web host, let’s get to the fundamental first. What is a Web Host? A web host is a service provider that places your web site on a computer which is connected to the internet. The...

How To Track Your Online Ad Response
It is crucial for every marketer to accurately track his advertising results. In the direct marketing industry, marketers track the responses to their direct mail pieces so that they know which ads work, which headlines are winners and which...

What Is A Traffic Exchange?
If you have surfed the internet for very long, you have probably ran into what is referred to as a "traffic exchange". These programs can be designed in many ways, with many different functions. In this article we are going to cover what they are,...

What is the Robot Text File?
What is the Robot Text File? The robot text file is used to disallow specific or all search engine spider’s access to folders or pages that you don't want indexed. Why would you want to do this? You may have created a personnel page for...

Where To Find Free Pictures, Artwork And Animation For Your Website
You've decided to have a website for yourself or your business. Great! You know what you want to say, how you want it laid out, what pages you need, and all the other good stuff that goes into a great website. But what about graphics? Where do you...

 
Google
What is the Robot Text File?

What is the Robot Text File?

The robot text file is used to disallow specific or all search engine spider’s access to folders or pages that you don't want indexed.

Why would you want to do this?

You may have created a personnel page for company employees that you don't want listed. Some webmasters use it to exclude their guest book pages so to avoid people spamming. There are many different reasons to use the robots text file.

How do I use it?

You need to upload it to the root of your web site or it will not work - if you don't have access to the root then you will need to use a Meta tag to disallow access. You need to include both the user agent and a file or folder to disallow.

What does it look like?

It's really nothing more than a "Notepad" type .txt file named "robots.txt"

The basic syntax is

User-agent: spiders name here
Disallow:/ filename here

If you use

User-agent: *

The * acts as a wildcard and disallows all spiders. You may want to use this to stop search engines listing unfinished pages.

To disallow an entire directory use

Disallow:/mydirectory/

To disallow an


individual file use

Disallow:/file.htm

You have to use a separate line for each disallow. You cannot you for example use

Disallow:/file1.htm,file2.html

You should use

Use-agent/*
Disallow:/file1.htm
Disallow:/file2.htm

For a list of spider names visit http://www.robotstxt.org/wc/active/html/

Make sure you use the right syntax if you don't it will not work. You can check you syntax here http://www.searchengineworld.com/cgi-bin/robotcheck.cgi

For help on creating robot text files there is a program call robogen.

There is a free version and an advanced version, which costs $12.99 http://www.rietta.com/robogen/

About The Author

Alan Murray is a Certified Internet Webmaster Professional and Provides SEO Services and Website design.
http://www.designprofessional.co.uk/SEO-Services.htm