| | #2 |
| HyperActive Warrior Join Date: Oct 2011
Posts: 180
Thanks: 19
Thanked 30 Times in 13 Posts
|
Robots.txt is a file placed in the root folder of your domain. It contains instructions that tell search engine bots which content on your website they may crawl. It is very helpful when you have a more complex site structure and don't want search engines to pick up all of your content. Here is more about it: The Web Robots Pages |
| Looking for Joint Venture Partner! If you want to have multiple niche or authority websites - team up with me! I can: do keyword research, build websites, create backlinking strategy and more. We can work together and: split profits, swap services, etc. Just send me a message. | |
| | |
| | #3 |
| Peaceful Warrior War Room Member Join Date: May 2011 Location: Europe
Posts: 178
Thanks: 67
Thanked 48 Times in 32 Posts
|
You can find a detailed explanation of robots.txt at The Web Robots Pages |
| | |
| | #4 |
| Active Warrior Join Date: Feb 2012
Posts: 41
Thanks: 0
Thanked 4 Times in 4 Posts
|
You can go to robotstxt.org or check Wikipedia for an exact definition.
|
| | |
| | |
| | #5 |
| Warrior Member Join Date: Feb 2012
Posts: 10
Thanks: 0
Thanked 0 Times in 0 Posts
|
Basically, robots.txt is a file located on your web server. Its main purpose is to tell bots/crawlers which of your web pages they may crawl. You can also use it to keep crawlers away from specific pages!
|
| | |
| | |
| | #6 |
| HyperActive Warrior Join Date: May 2011
Posts: 231
Thanks: 0
Thanked 6 Times in 6 Posts
|
Hi, robots.txt is a plain text file (not software) that tells search engines what to crawl and what not to crawl. |
| | |
| | #7 |
| Warrior Member Join Date: Aug 2011
Posts: 6
Thanks: 0
Thanked 0 Times in 0 Posts
|
You can also find a very similar robots.txt discussion in this thread: warriorforum.com/adsense-ppc-seo-discussion-forum/534078-robots-txt.html
|
|
Are you getting the most out of your website? Conduct an SEO Audit, and know for sure!
| |
| | |
| | #8 |
| HyperActive Warrior Join Date: Jan 2012
Posts: 226
Thanks: 0
Thanked 8 Times in 8 Posts
|
If you don’t want a specific page to be crawled, you can use robots.txt. In fact, if you don’t want to be visited by a specific search engine, you can direct it not to visit you.
|
| | |
| | |
| | #9 |
| HyperActive Warrior Join Date: Feb 2011 Location: WarriorForum
Posts: 222
Thanks: 16
Thanked 15 Times in 15 Posts
|
If I restrict a page from crawling so that Google won't crawl it, can I then use black hat techniques on that page?
|
| Make your blog a MMM - Money Making Machine!!!!! | |
| | |
| | #10 |
| Active Warrior Join Date: Dec 2011
Posts: 87
Thanks: 1
Thanked 4 Times in 4 Posts
|
Robots.txt is a .txt file that tells search engines what to crawl or not crawl on a particular website. Here is one example, which blocks all crawlers from the whole site (each directive goes on its own line): User-agent: * Disallow: / |
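One way to see what those two directives do is to feed them to Python's standard-library `urllib.robotparser`. This is only a sketch: example.com and the bot names are placeholders chosen for illustration, and real crawlers may differ from this parser in the details.

```python
from urllib.robotparser import RobotFileParser

# The two-line example from the post: applies to every crawler, blocks the whole site.
rules = """\
User-agent: *
Disallow: /
"""

parser = RobotFileParser()
parser.parse(rules.splitlines())

# example.com and the bot names are placeholders, not taken from the thread.
for bot in ("Googlebot", "Bingbot"):
    for url in ("http://example.com/", "http://example.com/blog/post.html"):
        print(bot, url, parser.can_fetch(bot, url))
```

Every combination should come back as not fetchable, because `Disallow: /` matches every path on the site.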
| | |
| | |
| | #11 |
| HyperActive Warrior Join Date: Feb 2011 Location: WarriorForum
Posts: 222
Thanks: 16
Thanked 15 Times in 15 Posts
|
So how can I create this robots.txt file? Should I open a new text document and just rename it to robots?
|
| | |
| | #12 |
| Active Warrior Join Date: Feb 2012
Posts: 32
Thanks: 0
Thanked 2 Times in 2 Posts
|
Open Notepad and save the file as 'robots.txt', then upload it to the root folder of your site.
|
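If you would rather script it than use Notepad, a minimal sketch in Python could look like the following. The rule content and the `/private/` folder are hypothetical, and a temporary folder stands in for a local copy of your site root; you would still upload the file to the real root yourself.

```python
import tempfile
from pathlib import Path

# Hypothetical rules: let every crawler in, except for one example folder.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

def write_robots_txt(site_root: Path) -> Path:
    """Write robots.txt into the given folder and return its path."""
    target = site_root / "robots.txt"  # the file name must be exactly robots.txt
    target.write_text(ROBOTS_TXT, encoding="utf-8")
    return target

# A temporary folder stands in for the local copy of the site root.
with tempfile.TemporaryDirectory() as tmp:
    path = write_robots_txt(Path(tmp))
    print(path.name)
    print(path.read_text(encoding="utf-8"))
```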
| | |
| | #13 |
| Warrior Fitness/Diet Guru Join Date: Nov 2008
Posts: 421
Thanks: 9
Thanked 11 Times in 9 Posts
|
How effective or helpful is robots.txt for SEO? There are a few warriors selling it in their SEO packages, saying it will improve rankings.
|
| | |
| | |
| | #15 |
| Warrior Member Join Date: Mar 2011
Posts: 6
Thanks: 0
Thanked 3 Times in 2 Posts
|
Go to Google's Webmaster Tools site (do a Google search for it) and add your site. Then click on "site configuration" on the left-hand side, then "crawler access", and it will let you create, edit, and test a robots.txt file that you can then save and upload to your site, all for free! Don't pay for this file; it's very simple to make and can be very good for controlling what the search engines do with your site and what they show in search results. |
| | |
| | #16 | |
| HyperActive Warrior Join Date: Feb 2011 Location: WarriorForum
Posts: 222
Thanks: 16
Thanked 15 Times in 15 Posts
| Quote:
| |
| | |
| | #17 |
| www.MonkSEO.com War Room Member Join Date: Jan 2012 Location: The Windy City
Posts: 41
Thanks: 14
Thanked 8 Times in 6 Posts
|
Robots.txt is a de facto standard, which means it is followed by people who stick to web SEO conventions but is not mandated by any search engine or web standards authority. In other words, it is not necessary, but it is good to have. For more info visit: The Web Robots Pages |
| | |
| | |
| | #18 |
| Active Warrior Join Date: Mar 2012
Posts: 66
Thanks: 0
Thanked 1 Time in 1 Post
|
I have my robots.txt placed in the root of my subdomain, i.e. 'subdomain.mywebsite.com/robots.txt'. I also have the rule 'Disallow: /' in it, and it is not helping me at all. However, a member on another forum told me the following. Instead of: User-agent: * Disallow: / I must have: User-agent: * Disallow: /http://subdomain.mywebsite.com |
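For what it's worth, the two variants can be compared with Python's standard-library `urllib.robotparser`. This is a sketch of how that one parser reads the rules, not a statement about how any particular search engine behaves, and it does not explain why the original rule appeared not to work. The `allowed` helper and `page.html` are made up for the illustration.

```python
from urllib.robotparser import RobotFileParser

def allowed(robots_txt: str, url: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `url` under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)

# page.html is a hypothetical page on the subdomain from the post.
page = "http://subdomain.mywebsite.com/page.html"

block_all = "User-agent: *\nDisallow: /\n"
with_full_url = "User-agent: *\nDisallow: /http://subdomain.mywebsite.com\n"

print("Disallow: /                  ->", allowed(block_all, page))
print("Disallow: /http://subdomain… ->", allowed(with_full_url, page))
```

Under this parser, Disallow values are compared against the path part of the URL only, so the plain `Disallow: /` blocks the page, while the version with the full URL after the slash only matches paths that literally start with that text and leaves the page crawlable.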
| | |
| | |
| | #19 |
| Active Warrior Join Date: Feb 2012
Posts: 97
Thanks: 1
Thanked 2 Times in 2 Posts
|
Robots.txt is a text file used to keep content away from search engine spiders when they crawl. Here is how the robots.txt file is used. User-agent: this field states which spider the rules that follow apply to; * is a wildcard that means all spiders, or you can name one, such as Googlebot for Google. Disallow: this field states which folders are off limits. An empty value means nothing is excluded, / means the whole site is excluded, and a folder name excludes just that folder. |
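The three Disallow forms described above can be tried out with Python's standard-library `urllib.robotparser`. Treat this as a rough sketch: example.com, the `/images/` folder, and the `can_crawl` helper are placeholders invented for the illustration.

```python
from urllib.robotparser import RobotFileParser

def can_crawl(disallow_value: str, url: str) -> bool:
    """Build a one-rule robots.txt for all spiders and test a URL against it."""
    parser = RobotFileParser()
    parser.parse(["User-agent: *", f"Disallow: {disallow_value}"])
    return parser.can_fetch("Googlebot", url)

# example.com and /images/ are placeholders, not taken from the thread.
urls = [
    "http://example.com/index.html",
    "http://example.com/images/logo.png",
]

# Empty value, whole site, and a single folder.
for value in ("", "/", "/images/"):
    results = [can_crawl(value, u) for u in urls]
    print(repr(value), results)
```

The empty value should leave both URLs crawlable, `/` should block both, and `/images/` should block only the URL inside that folder.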
| | |
| | |
| | #20 |
| Active Warrior Join Date: Feb 2012
Posts: 54
Thanks: 0
Thanked 0 Times in 0 Posts
|
Robots.txt is placed in the root folder of the domain. It allows or disallows specific search engines from crawling particular parts of your site.
|
| | |
| | |
| | #21 |
| Active Warrior Join Date: Jul 2011 Location: CA, USA
Posts: 87
Thanks: 0
Thanked 4 Times in 4 Posts
|
Robots.txt gives search engine bots instructions about which pages you do or don't want crawled, using directives. If you don't want a page crawled, add Disallow: followed by the page path; if you do want a page crawled, add Allow: followed by the page path. |
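Here is a small sketch of Allow and Disallow used together, checked with Python's standard-library `urllib.robotparser`. The `/members/` paths and example.com are hypothetical. Note that this parser applies the first rule that matches, so the Allow line is listed first; some crawlers resolve such conflicts differently (Google documents using the most specific matching rule), so test against the crawler you care about.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical paths: block a folder but allow one page inside it.
rules = [
    "User-agent: *",
    "Allow: /members/welcome.html",
    "Disallow: /members/",
]

parser = RobotFileParser()
parser.parse(rules)

for path in ("/members/welcome.html", "/members/account.html", "/about.html"):
    print(path, parser.can_fetch("*", "http://example.com" + path))
```

The welcome page and the about page should be crawlable, while the other page in `/members/` should be blocked.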
| | |
| | #22 |
| Warrior Member Join Date: Feb 2010
Posts: 3
Thanks: 0
Thanked 0 Times in 0 Posts
|
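To allow the Google AdSense bot using robots.txt, add the following two lines: User-agent: Mediapartners-Google Disallow: (the empty Disallow lets that bot crawl everything). To allow only that bot, you also need a second group that keeps the others out: User-agent: * Disallow: / |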
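The difference between the two-line group on its own and the same group followed by a catch-all block can be checked with Python's standard-library `urllib.robotparser`. This is a sketch under assumptions: example.com and the `check` helper are made up, and Googlebot is used only as an example of "some other bot".

```python
from urllib.robotparser import RobotFileParser

def check(rules, agent: str) -> bool:
    """Return True if `agent` may fetch a sample page under `rules`."""
    parser = RobotFileParser()
    parser.parse(rules)
    # example.com/page.html is a placeholder URL.
    return parser.can_fetch(agent, "http://example.com/page.html")

# The two lines from the post: an empty Disallow for the AdSense bot.
adsense_group = ["User-agent: Mediapartners-Google", "Disallow:"]

# Same group, plus a catch-all group that blocks every other bot.
with_catch_all = adsense_group + ["", "User-agent: *", "Disallow: /"]

for label, rules in (("two lines only", adsense_group),
                     ("with catch-all", with_catch_all)):
    for agent in ("Mediapartners-Google", "Googlebot"):
        print(label, agent, check(rules, agent))
```

With the two lines alone, both bots are allowed, since a bot with no matching group is not restricted. Only the version with the catch-all group keeps other bots out while still letting the AdSense bot in.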
| | |
| | |
| | #23 |
| Active Warrior Join Date: Feb 2012
Posts: 61
Thanks: 5
Thanked 1 Time in 1 Post
|
Does blogspot have it?
|
| | |
| | |
| | #24 |
| Warrior Member Join Date: Mar 2012
Posts: 9
Thanks: 0
Thanked 1 Time in 1 Post
|
Robots.txt is a text file (not a meta tag) that you use to keep search engines from crawling your site.
|
| | |
| | #26 |
| Warrior Member Join Date: Apr 2012
Posts: 27
Thanks: 0
Thanked 0 Times in 0 Posts
|
Some companies don't want search engines crawling their personal pages or account pages, so they list those pages in a robots.txt file. You can't do that on free blogging sites.
|
| | |
| | |
| | #27 |
| Digital Marketing Agency Join Date: Apr 2012 Location: Mumbai, India
Posts: 23
Thanks: 0
Thanked 0 Times in 0 Posts
|
Robots.txt is used to tell search engine spiders which pages may be crawled and which may not. Suppose your website has some pages that are necessary for the site but irrelevant to search. When Google's spiders come and find those irrelevant pages, it can be harmful for your website. The best way to avoid this is to use robots.txt.
|
| | |
| | |
| Tags |
| robotstxt |