|
|
| Search Engines & SEO General search engines and search engine optimization discussion. |

03-19-2009, 10:18 PM
|
|
Senior Member
|
|
Join Date: Jan 2009
Location: Newark, Delaware, USA
Posts: 369
|
|
What is the use of Robot.txt file?
What is robot.txt? how it works and what is the exact use of it?
|

03-20-2009, 11:37 AM
|
|
Member
|
|
Join Date: Jul 2008
Posts: 67
|
|
|
Use it you can choose which search engine could crawled your site.
|

03-20-2009, 12:56 PM
|
|
Junior Member
|
|
Join Date: Mar 2009
Posts: 14
|
|
its like a record of your site that was indexed by google..  im not really sure... 
|

03-20-2009, 09:58 PM
|
|
Senior Member
|
|
Join Date: Jan 2009
Location: Newark, Delaware, USA
Posts: 369
|
|
|
Thanks guys.... hoping for some more clear answers.
|

03-21-2009, 01:00 PM
|
|
Junior Member
|
|
Join Date: Mar 2009
Posts: 4
|
|
All search engines, or at least all the important ones, now look for a robots.txt file as soon their spiders or bots arrive on your site. So, even if you currently do not need to exclude the spiders from any part of your site, having a robots.txt file is still a good idea, it can act as a sort of invitation into your site.

|

03-21-2009, 10:11 PM
|
|
Senior Member
|
|
Join Date: Jan 2009
Location: Newark, Delaware, USA
Posts: 369
|
|
|
Thanks sam... that was something different. Thanks for your reply.
|

03-23-2009, 09:29 AM
|
|
Senior Member
|
|
Join Date: Dec 2008
Posts: 1,666
|
|
|
As I understand this file should be written in the sourse code of the page?
|

03-23-2009, 10:07 PM
|
|
Senior Member
|
|
Join Date: Jan 2009
Location: Newark, Delaware, USA
Posts: 369
|
|
Quote:
Originally Posted by tanja
As I understand this file should be written in the sourse code of the page?
|
What do you mean by that? Please elaborate your sentence.
|

03-24-2009, 08:32 AM
|
|
Junior Member
|
|
Join Date: Mar 2009
Posts: 6
|
|
|
I think I am sure That, Its Used for crowler crawled your site.
|

03-24-2009, 02:25 PM
|
|
Junior Member
|
|
Join Date: Feb 2009
Posts: 16
|
|
|
Robots.txt file is a text file with a couple of instructions to Search Engines that respect the file as to which links of the site they can crawl and index and which they should not.
An example is: My Wordpress Robots.txt file below:
------------------------------------------
# This is your robots.txt file. Visit Options->Robots.txt to change this text.
User-agent: *
Disallow: /cgi-bin
Disallow: /wp-admin
Disallow: /wp-includes
Disallow: /wp-content/plugins
Disallow: /wp-content/cache
Disallow: /wp-content/themes
Disallow: /category
Disallow: /tag
Disallow: /author
Disallow: /trackback
Disallow: /*trackback
Disallow: /*trackback*
Disallow: /*/trackback
Disallow: /*?*
Disallow: /*.html/$
Disallow: /*feed*
# Google Image
User-agent: Googlebot-Image
Disallow:
Allow: /*
# Google AdSense
User-agent: Mediapartners-Google*
Disallow:
Allow: /*
------------------------------------------------
By using robots.txt, you can prevent any personal pages on your site from getting indexed.
|

03-26-2009, 06:38 PM
|
|
Senior Member
|
|
Join Date: Jan 2009
Location: Newark, Delaware, USA
Posts: 369
|
|
|
I have noticed that many of the major sites disallow's everything in robot.txt except the content. Why?
|

04-09-2009, 04:20 PM
|
 |
Member
|
|
Join Date: Feb 2009
Posts: 95
|
|
|
Robot text are most used by large websites to avoid unnecessary pages to get crawled by bots and getting indexed.
|

04-18-2009, 06:12 AM
|
|
Member
|
|
Join Date: Apr 2009
Posts: 71
|
|
|
It provides restriction to search engine robots that crawl the web. You can get a lot of information on sitemaps.blogspot.com/2006/02/using-robotstxt-file.html
|

04-23-2009, 12:58 AM
|
|
Junior Member
|
|
Join Date: Apr 2009
Posts: 25
|
|
|
Robots.txt is used for giving some instruction to search engine.
It’s basically having the list of links of the site they can crawl and index and which they should not. Our site is crawl quickly if it contains sitemap and robots.txt
|

05-07-2009, 10:18 AM
|
|
Junior Member
|
|
Join Date: Sep 2008
Posts: 26
|
|
|
Robots.txt is a file which let you to restrict or let you to decide which search engine you want to crawl your website, or the crawler of search engine which you don't want to crawl your website you can restrict that crawler by disallowing that search engine in search engine.
Also you can also decide which page of your website is to be crawled or not...
|

05-28-2009, 06:34 AM
|
|
Senior Member
|
|
Join Date: May 2009
Posts: 352
|
|
|
Robots.txt is a text file which teaches the search engines about pages or directories to keep out from its database
|

06-01-2009, 10:49 AM
|
|
Member
|
|
Join Date: May 2009
Posts: 51
|
|
|
Robot.txt is very important in terms of stop any robot to crawl any data or page of websites.
|
| Thread Tools |
|
|
| Display Modes |
Linear Mode
|
Posting Rules
|
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is Off
|
|
|
All times are GMT +1. The time now is 05:04 AM.
|