Go Back   WarriorForum - Internet Marketing Forums > The Warrior Forum > Adsense / PPC / SEO Discussion Forum
Old 02-15-2012, 01:55 AM   #1
HyperActive Warrior
 
Join Date: Feb 2011
Location: WarriorForum
Posts: 222
Thanks: 16
Thanked 15 Times in 15 Posts
Default Robots.txt

What is robots.txt, and how is it used?

Make your blog a MMM - Money Making Machine!!!!!
hilarious89 is offline   Reply With Quote
Old 02-15-2012, 02:00 AM   #2
HyperActive Warrior
 
Kris79's Avatar
 
Join Date: Oct 2011
Posts: 180
Thanks: 19
Thanked 30 Times in 13 Posts
Default Re: Robots.txt

Robots.txt is a file placed in the root folder of your domain.
It contains instructions that tell search engine bots which content on your website they may crawl and index.

It is especially helpful when you have a more complex site structure and you don't want search engines to pick up all of your content. Keep in mind that it only gives instructions to well-behaved bots; it does not hide pages from visitors.

Here is more about it:
The Web Robots Pages
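
To make "placed in the root folder" concrete, here is a minimal Python sketch that works out where a crawler would look for the file, given any page address. The function name robots_url and the example.com addresses are made up for illustration.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url):
    """Return the robots.txt address for the site that serves page_url."""
    parts = urlsplit(page_url)
    # Crawlers look for the file at the root of the host,
    # never inside a subfolder, so the page's own path is dropped.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# Hypothetical addresses, used only to show the pattern.
print(robots_url("http://www.example.com/blog/2012/post.html"))
print(robots_url("https://shop.example.com/cart?item=42"))
```

Note that each host name gets its own file, so a subdomain such as shop.example.com is checked separately from www.example.com.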

Looking for Joint Venture Partner!
If you want to have multiple niche or authority websites - team up with me!
I can: do keyword research, build websites, create backlinking strategy and more.
We can work together and: split profits, swap services, etc. Just send me a message.
Kris79 is offline   Reply With Quote
Old 02-15-2012, 02:06 AM   #3
Peaceful Warrior
War Room Member
 
Ferenc Makar's Avatar
 
Join Date: May 2011
Location: Europe
Posts: 178
Thanks: 67
Thanked 48 Times in 32 Posts
Default Re: Robots.txt

You can get a detailed explanation of robots.txt at The Web Robots Pages
Ferenc Makar is online now   Reply With Quote
Old 02-15-2012, 02:49 AM   #4
Active Warrior
 
Join Date: Feb 2012
Posts: 41
Thanks: 0
Thanked 4 Times in 4 Posts
Default Re: Robots.txt

You can go to robotstxt.org or check Wikipedia to get an exact definition.

richardfranklin is offline   Reply With Quote
Old 02-15-2012, 02:56 AM   #5
Warrior Member
 
Join Date: Feb 2012
Posts: 10
Thanks: 0
Thanked 0 Times in 0 Posts
Default Re: Robots.txt

Basically, robots.txt is a file located on your web server. Its main purpose is to tell bots/crawlers which of your web pages they may crawl and index. You can also use the robots.txt file to keep specific pages from being crawled!

stevejhon is offline   Reply With Quote
Old 02-15-2012, 03:17 AM   #6
HyperActive Warrior
 
ashleysmith12's Avatar
 
Join Date: May 2011
Posts: 231
Thanks: 0
Thanked 6 Times in 6 Posts
Default Re: Robots.txt

Hi,
Robots.txt is a plain text file that tells search engine crawlers what to crawl and what not to.
ashleysmith12 is offline   Reply With Quote
Old 02-16-2012, 11:09 PM   #7
Warrior Member
 
Join Date: Aug 2011
Posts: 6
Thanks: 0
Thanked 0 Times in 0 Posts
Default Re: Robots.txt

You can also find a very similar robots.txt discussion in this thread: warriorforum.com/adsense-ppc-seo-discussion-forum/534078-robots-txt.html

Are you getting the most out of your website? Conduct an SEO Audit, and know for sure!
webgnomes is offline   Reply With Quote
Old 02-17-2012, 02:34 AM   #8
HyperActive Warrior
 
Join Date: Jan 2012
Posts: 226
Thanks: 0
Thanked 8 Times in 8 Posts
Default Re: Robots.txt

If you don’t want a specific page to be crawled, you can use robots.txt. In fact, if you don’t want to be visited by a specific search engine at all, you can direct its crawler not to visit your site.

warriorsaroj is offline   Reply With Quote
Old 02-23-2012, 08:41 AM   #9
HyperActive Warrior
 
Join Date: Feb 2011
Location: WarriorForum
Posts: 222
Thanks: 16
Thanked 15 Times in 15 Posts
Default Re: Robots.txt

If I restrict a page from crawling so that Google won't crawl it, can I then use black hat techniques on that page?

hilarious89 is offline   Reply With Quote
Old 02-23-2012, 10:43 AM   #10
Active Warrior
 
Join Date: Dec 2011
Posts: 87
Thanks: 1
Thanked 4 Times in 4 Posts
Default Re: Robots.txt

Robots.txt is a .txt file that tells search engines what to crawl, and what not to, on a particular website.

Here is one example, which blocks all crawlers from the entire site:

User-agent: *
Disallow: /
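
If you want to see what those two lines do without touching a live site, Python's standard urllib.robotparser module can evaluate them locally. This is only a sketch: the helper name is_allowed and the example.com addresses are assumptions, and real crawlers may interpret edge cases differently from this parser.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_lines, user_agent, url):
    """Return True if user_agent may fetch url under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(user_agent, url)


# The two-line example from the post above.
BLOCK_ALL = ["User-agent: *", "Disallow: /"]

# Hypothetical addresses; every path on the site is off limits.
print(is_allowed(BLOCK_ALL, "Googlebot", "http://www.example.com/"))           # False
print(is_allowed(BLOCK_ALL, "SomeOtherBot", "http://www.example.com/a/b.html"))  # False
```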

John Conner is offline   Reply With Quote
Old 02-25-2012, 09:26 AM   #11
HyperActive Warrior
 
Join Date: Feb 2011
Location: WarriorForum
Posts: 222
Thanks: 16
Thanked 15 Times in 15 Posts
Default Re: Robots.txt

So how can I create this robots.txt? Should I open a new text document and just rename it to robots?

hilarious89 is offline   Reply With Quote
Old 02-25-2012, 09:47 AM   #12
Active Warrior
 
ktonline's Avatar
 
Join Date: Feb 2012
Posts: 32
Thanks: 0
Thanked 2 Times in 2 Posts
Default Re: Robots.txt

Open Notepad and save the file as 'robots.txt', then upload it to the root folder of your site.
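
If you would rather script it than use Notepad, a few lines of Python can write the same file. This is a rough sketch; the function name write_robots_txt and the starter rules are assumptions chosen for illustration, and you would still upload the result to your site's root folder yourself.

```python
import tempfile
from pathlib import Path


def write_robots_txt(site_root, rules):
    """Write the rule lines to <site_root>/robots.txt and return the path."""
    target = Path(site_root) / "robots.txt"
    target.write_text("\n".join(rules) + "\n", encoding="utf-8")
    return target


# Hypothetical starter rules: let every crawler visit everything.
ALLOW_ALL = ["User-agent: *", "Disallow:"]

# Demo in a throwaway directory so no real file gets overwritten.
with tempfile.TemporaryDirectory() as site_root:
    created = write_robots_txt(site_root, ALLOW_ALL)
    print(created.name)
    print(created.read_text(encoding="utf-8"))
```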
ktonline is offline   Reply With Quote
Old 02-25-2012, 12:38 PM   #13
Warrior Fitness/Diet Guru
 
Join Date: Nov 2008
Posts: 421
Thanks: 9
Thanked 11 Times in 9 Posts
Default Re: Robots.txt

How effective or helpful is robots.txt for SEO? There are a few Warriors selling it in their SEO packages, saying it will improve rankings.

jsmith2482 is offline   Reply With Quote
Old 02-29-2012, 04:30 AM   #14
HyperActive Warrior
 
Join Date: Feb 2011
Location: WarriorForum
Posts: 222
Thanks: 16
Thanked 15 Times in 15 Posts
Default Re: Robots.txt

Quote:
Originally Posted by jsmith2482 View Post
How effective or helpful is robots.txt for SEO? There are a few Warriors selling it in their SEO packages, saying it will improve rankings.
We can create it all by ourselves, so why buy it?

hilarious89 is offline   Reply With Quote
Old 02-29-2012, 05:12 AM   #15
Warrior Member
 
Join Date: Mar 2011
Posts: 6
Thanks: 0
Thanked 3 Times in 2 Posts
Default Re: Robots.txt

Go to Google's Webmaster Tools site (do a Google search) and add your site.

Then click on "site configuration" on the left-hand side, then "crawler access", and it will allow you to create, edit and test a robots.txt file that you can then save and upload to your site, all for free!

Don't pay for this file. It's very simple to create and can be very useful for controlling what the search engines crawl on your site and what they show in search results.
Rob Ainge is offline   Reply With Quote
Old 03-08-2012, 09:26 PM   #16
HyperActive Warrior
 
Join Date: Feb 2011
Location: WarriorForum
Posts: 222
Thanks: 16
Thanked 15 Times in 15 Posts
Default Re: Robots.txt

Quote:
Originally Posted by Rob Ainge View Post
Go to Google's Webmaster Tools site (do a Google search) and add your site.

Then click on "site configuration" on the left-hand side, then "crawler access", and it will allow you to create, edit and test a robots.txt file that you can then save and upload to your site, all for free!

Don't pay for this file. It's very simple to create and can be very useful for controlling what the search engines crawl on your site and what they show in search results.
You are so right, because paying for this kind of easy work would be rather foolish.

hilarious89 is offline   Reply With Quote
Old 03-08-2012, 10:22 PM   #17
www.MonkSEO.com
War Room Member
 
monkseo's Avatar
 
Join Date: Jan 2012
Location: The Windy City
Posts: 41
Thanks: 14
Thanked 8 Times in 6 Posts
Default Re: Robots.txt

Robots.txt is a de facto standard: the major search engine crawlers honor it, but no search engine or web standards authority makes it mandatory.

In other words, it is not necessary, but it is good to have.

For more info visit: The Web Robots Pages

monkseo is online now   Reply With Quote
Old 03-08-2012, 11:01 PM   #18
Active Warrior
 
Join Date: Mar 2012
Posts: 66
Thanks: 0
Thanked 1 Time in 1 Post
Default Re: Robots.txt

I have my robots.txt placed in the root of my subdomain, i.e. 'subdomain.mywebsite.com/robots.txt'. I also have the rule 'Disallow: /' in it, and it is not helping me at all. However, a member on another forum told me the following:

Instead of:
User-agent: *
Disallow: /

I must have
User-agent: *
Disallow: /http://subdomain.mywebsite.com
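
Before switching, it may be worth checking both variants locally. The sketch below uses Python's standard urllib.robotparser; the helper name is_blocked and the page address are assumptions. A Disallow value is compared against the path part of each address on the host that serves the file, so a value that embeds a full address only matches paths that literally begin with that text.

```python
from urllib.robotparser import RobotFileParser


def is_blocked(robots_lines, url, user_agent="AnyBot"):
    """Return True if the rules stop user_agent from fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return not parser.can_fetch(user_agent, url)


# Variant already in use: blocks every path on the subdomain.
CURRENT = ["User-agent: *", "Disallow: /"]
# Variant suggested on the other forum.
SUGGESTED = ["User-agent: *", "Disallow: /http://subdomain.mywebsite.com"]

# Hypothetical page on the subdomain from the post.
page = "http://subdomain.mywebsite.com/some/page.html"

print(is_blocked(CURRENT, page))    # True
print(is_blocked(SUGGESTED, page))  # False
```

By this parser's reading, the original 'Disallow: /' already covers the whole subdomain, and the suggested line covers nothing a normal page would match. If pages still show up in search results, the cause is probably something other than the rule itself, for example pages that were indexed before the file was added.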

simona86 is offline   Reply With Quote
Old 03-09-2012, 12:07 AM   #19
Active Warrior
 
Join Date: Feb 2012
Posts: 97
Thanks: 1
Thanked 2 Times in 2 Posts
Default Re: Robots.txt

Robots.txt is a text file used to keep content out of the crawl of search engine spiders. Here is how the robots.txt directives are used.

User-agent: this directive states which spider the rules that follow apply to. * is a wildcard that means all spiders; a name such as Googlebot targets Google's spider only.

Disallow: this directive states which folders are off limits. An empty value means nothing is excluded, / means the whole site is excluded, and a folder name excludes that folder.
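
The User-agent and Disallow values described above can be tried out with Python's standard urllib.robotparser. This is a small sketch under assumptions: the helper name may_crawl, the /private/ folder, the bot names and the example.com host are all invented for the demonstration.

```python
from urllib.robotparser import RobotFileParser


def may_crawl(rules, agent, path):
    """Return True if agent may fetch path on a hypothetical example host."""
    parser = RobotFileParser()
    parser.parse(rules)
    return parser.can_fetch(agent, "http://www.example.com" + path)


# Empty Disallow: nothing is excluded.
print(may_crawl(["User-agent: *", "Disallow:"], "AnyBot", "/private/a.html"))   # True
# Slash: the whole site is excluded.
print(may_crawl(["User-agent: *", "Disallow: /"], "AnyBot", "/index.html"))     # False
# Folder name: only that folder is excluded.
folder = ["User-agent: *", "Disallow: /private/"]
print(may_crawl(folder, "AnyBot", "/private/a.html"))                           # False
print(may_crawl(folder, "AnyBot", "/public/a.html"))                            # True
# Named spider: the rule applies to Googlebot only.
google_only = ["User-agent: Googlebot", "Disallow: /private/"]
print(may_crawl(google_only, "Googlebot", "/private/a.html"))                   # False
print(may_crawl(google_only, "OtherBot", "/private/a.html"))                    # True
```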

shophia is offline   Reply With Quote
Old 03-09-2012, 12:11 AM   #20
Active Warrior
 
Join Date: Feb 2012
Posts: 54
Thanks: 0
Thanked 0 Times in 0 Posts
Default Re: Robots.txt

Robots.txt is placed in the root folder of the domain. It allows or disallows some search engines from crawling particular parts of your site.

northtrans is offline   Reply With Quote
Old 03-09-2012, 03:23 AM   #21
Active Warrior
 
Join Date: Jul 2011
Location: CA, USA
Posts: 87
Thanks: 0
Thanked 4 Times in 4 Posts
Default Re: Robots.txt

Robots.txt gives instructions to search engine bots about which pages you would or would not like crawled, using directives.

If you don't want a page crawled, add Disallow: followed by the page path; if you do want a page crawled, add Allow: followed by the page path.
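
Here is a rough illustration of combining the two directives, using Python's standard urllib.robotparser. The helper name can_crawl and the paths are made up. One caveat: this parser applies the first rule that matches, so the Allow line is listed before the broader Disallow line; other crawlers may resolve such conflicts differently (Google documents that it prefers the most specific matching rule).

```python
from urllib.robotparser import RobotFileParser


def can_crawl(rules, path, agent="AnyBot"):
    """Return True if agent may fetch path on a hypothetical example host."""
    parser = RobotFileParser()
    parser.parse(rules)
    return parser.can_fetch(agent, "http://www.example.com" + path)


# Block a folder but make an exception for one page inside it.
RULES = [
    "User-agent: *",
    "Allow: /private/public-page.html",
    "Disallow: /private/",
]

print(can_crawl(RULES, "/private/public-page.html"))  # True
print(can_crawl(RULES, "/private/secret.html"))       # False
print(can_crawl(RULES, "/index.html"))                # True
```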

johnasthlon is offline   Reply With Quote
Old 03-09-2012, 08:20 AM   #22
Warrior Member
 
Join Date: Feb 2010
Posts: 3
Thanks: 0
Thanked 0 Times in 0 Posts
Default Re: Robots.txt

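To explicitly allow the Google AdSense bot using robots.txt, add the following two lines. On their own they do not block any other crawler; for that you would also need a separate User-agent: * group with Disallow: /.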

User-agent: Mediapartners-Google
Disallow:
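
A quick local check of the two lines above, again with Python's standard urllib.robotparser. The helper name fetch_ok, the variable names and the example.com address are assumptions. The first rule set is the two-line version; the second adds a catch-all group, which is one way to let in the AdSense crawler only.

```python
from urllib.robotparser import RobotFileParser


def fetch_ok(rules, agent, url="http://www.example.com/page.html"):
    """Return True if agent may fetch url under the given rules."""
    parser = RobotFileParser()
    parser.parse(rules)
    return parser.can_fetch(agent, url)


# The two lines from the post: AdSense is allowed, nobody is blocked.
TWO_LINES = ["User-agent: Mediapartners-Google", "Disallow:"]

# With a catch-all group added, every other crawler is shut out.
ADSENSE_ONLY = [
    "User-agent: Mediapartners-Google",
    "Disallow:",
    "",
    "User-agent: *",
    "Disallow: /",
]

print(fetch_ok(TWO_LINES, "Mediapartners-Google"))     # True
print(fetch_ok(TWO_LINES, "Googlebot"))                # True
print(fetch_ok(ADSENSE_ONLY, "Mediapartners-Google"))  # True
print(fetch_ok(ADSENSE_ONLY, "Googlebot"))             # False
```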

haiwasnm is offline   Reply With Quote
Old 03-09-2012, 08:23 AM   #23
Active Warrior
 
Join Date: Feb 2012
Posts: 61
Thanks: 5
Thanked 1 Time in 1 Post
Default Re: Robots.txt

Does blogspot have it?

TheeBook is offline   Reply With Quote
Old 03-09-2012, 08:41 AM   #24
Warrior Member
 
Join Date: Mar 2012
Posts: 9
Thanks: 0
Thanked 1 Time in 1 Post
Default Re: Robots.txt

Robots.txt is a text file (not a meta tag) that you use to keep search engines from crawling your site.
chriscyan is offline   Reply With Quote
Old 04-10-2012, 09:26 PM   #25
HyperActive Warrior
 
Join Date: Feb 2011
Location: WarriorForum
Posts: 222
Thanks: 16
Thanked 15 Times in 15 Posts
Default Re: Robots.txt

Quote:
Originally Posted by TheeBook View Post
Does blogspot have it?
You can't upload your own robots.txt file on free blogging platforms such as Blogger or WordPress. These free blogging platforms don't give you cPanel access; they only give you subdomains.

hilarious89 is offline   Reply With Quote
Old 04-10-2012, 11:46 PM   #26
Warrior Member
 
Join Date: Apr 2012
Posts: 27
Thanks: 0
Thanked 0 Times in 0 Posts
Default Re: Robots.txt

Some companies don't want search engines to crawl their personal or account pages, so they list those pages in a robots.txt file. You can't do that on free blogging sites.

onlinemegastore is offline   Reply With Quote
Old 04-11-2012, 05:40 AM   #27
Digital Marketing Agency
 
Join Date: Apr 2012
Location: Mumbai, India
Posts: 23
Thanks: 0
Thanked 0 Times in 0 Posts
Default Re: Robots.txt

Robots.txt is used to tell search engine spiders which pages they may crawl and which they may not. Suppose your website has some pages that are irrelevant to searchers but necessary for the site to work. If Google's spiders crawl those irrelevant pages, it can hurt your website. The best way to avoid this is to use robots.txt.

heenakapoor is offline   Reply With Quote