| | #1 |
| Active Warrior Join Date: Nov 2011
Posts: 38
Thanks: 0
Thanked 2 Times in 2 Posts
|
What is robots.txt, and how is it used?
|
| | |
| | |
| | #2 |
| Warrior Member Join Date: Dec 2011
Posts: 10
Thanks: 0
Thanked 1 Time in 1 Post
|
robots.txt is a file that tells search engine crawlers which pages of your website they may and may not crawl.
|
| | |
| | |
| | #3 |
| Active Warrior Join Date: Nov 2011 Location: Sussex, UK
Posts: 64
Thanks: 10
Thanked 9 Times in 9 Posts
|
You can use a robots.txt file to tell search engine spiders not to crawl certain pages, so their content is not read and their links are not followed. You don't have to use it, but some people like the spiders to ignore parts of their site that are not relevant to what the site is about.
|
| | |
| | |
| | #4 |
| Warrior Member Join Date: Sep 2011
Posts: 18
Thanks: 0
Thanked 1 Time in 1 Post
|
A robots.txt file is used to tell search engine spiders which pages you do and do not want them to crawl. It helps make sure the right pages of a site get indexed. |
|
| | |
| | #5 |
| HyperActive Warrior Join Date: Jan 2012
Posts: 146
Thanks: 0
Thanked 4 Times in 4 Posts
|
Robots.txt is a text file that you upload to the root folder of your site. Sometimes you don't want robots roaming anywhere they like on your site. You can use robots.txt to block crawlers from crawling some of your pages. Visit this link to learn how to use robots.txt effectively: How to Write a robots.txt File |
| | |
| | |
| | #6 |
| Warrior Member Join Date: Jan 2012
Posts: 9
Thanks: 0
Thanked 0 Times in 0 Posts
| A robots.txt file is a set of instructions that tells search engine robots which pages of your site may be crawled and indexed. In most cases, your site consists of many files and folders (e.g. admin folders, cgi-bin, image folders) that are not relevant to search engines. Robots.txt tells spiders what is useful and public for the search engine indexes and what is not. |
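To make the folder example concrete, here is a minimal sketch using Python's standard `urllib.robotparser` module. The folder names (`/admin/`, `/cgi-bin/`, `/images/`) and the helper `is_allowed` are assumptions made up for illustration; your own site's paths will differ.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: the folder names are examples only, not a recommendation.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Disallow: /cgi-bin/
Disallow: /images/
"""


def is_allowed(path: str, agent: str = "*") -> bool:
    """Return True if the rules above let `agent` crawl `path`."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so split the text first.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for path in ("/index.html", "/admin/login.php", "/cgi-bin/script.cgi"):
        print(path, "->", "allowed" if is_allowed(path) else "blocked")
```

Keep in mind that these rules are only a request. Well-behaved spiders follow them, but the file does not password-protect anything.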
| | |
| | |
| | #7 |
| Active Warrior Join Date: Oct 2011
Posts: 48
Thanks: 0
Thanked 2 Times in 2 Posts
|
robots.txt is a file used to restrict which pages of your website Google's crawler may crawl.
|
| | |
| | |
| | #8 |
| Active Warrior Join Date: Dec 2011
Posts: 87
Thanks: 1
Thanked 4 Times in 4 Posts
|
Robots.txt is a text file that tells search engines what to crawl and what not to crawl on a website. Each directive goes on its own line. For example, the line User-agent: * followed by the line Disallow: / tells all crawlers to stay out of the whole site. |
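Be careful with that exact example, since `Disallow: /` blocks the entire site. A rough sketch with Python's standard `urllib.robotparser` shows the difference between that and an empty `Disallow:` line, which allows everything. The names `can_crawl`, `BLOCK_ALL` and `ALLOW_ALL` are made up for this illustration.

```python
from urllib.robotparser import RobotFileParser

# Two tiny rule sets, one directive per line.
BLOCK_ALL = "User-agent: *\nDisallow: /\n"   # every path is off limits
ALLOW_ALL = "User-agent: *\nDisallow:\n"     # empty value: nothing is blocked


def can_crawl(robots_txt: str, path: str, agent: str = "*") -> bool:
    """Check `path` against the given robots.txt text for `agent`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    print("block-all:", can_crawl(BLOCK_ALL, "/any/page.html"))
    print("allow-all:", can_crawl(ALLOW_ALL, "/any/page.html"))
```

So if you only want to keep crawlers out of part of the site, list those paths instead of a bare slash.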
| | |
| | |
| | #9 |
| Active Warrior Join Date: Mar 2011
Posts: 65
Thanks: 0
Thanked 2 Times in 2 Posts
|
It is a file that asks crawlers such as Google's to skip the parts of a site that the owner doesn't want crawled.
|
| | |
| | |
| | #10 |
| Warrior Member Join Date: Jan 2012 Location: Mumbai
Posts: 18
Thanks: 0
Thanked 1 Time in 1 Post
|
Hi, robots.txt is a file that allows or disallows search engines from crawling pages of your website. So if your page is still in progress, disallow it. |
| | |
| | #11 |
| Active Warrior Join Date: Nov 2011
Posts: 38
Thanks: 0
Thanked 2 Times in 2 Posts
|
Thanks for the info, but how do I create it?
|
| | |
| | |
| | #12 |
| HyperActive Warrior Join Date: Dec 2011
Posts: 109
Thanks: 3
Thanked 12 Times in 12 Posts
|
I think you need to read a very detailed manual: The Web Robots Pages |
|
| | |
| | #13 |
| Active Warrior Join Date: Dec 2011
Posts: 87
Thanks: 1
Thanked 4 Times in 4 Posts
| |
| | |
| | |
| | #14 |
| Active Warrior Join Date: Aug 2011 Location: Donegal, Ireland
Posts: 59
Thanks: 8
Thanked 5 Times in 5 Posts
|
Here's a handy robots.txt generator: Robots.txt Generator - McAnerin International Inc. |
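If you would rather not rely on an online tool, a generator is simple enough to sketch yourself. This is not how the linked generator works, just a minimal illustration; the function name `build_robots_txt` and the example paths are assumptions.

```python
def build_robots_txt(disallowed, agent="*"):
    """Build robots.txt text that blocks `disallowed` paths for `agent`.

    An empty list produces an empty "Disallow:" line, which allows everything.
    """
    lines = [f"User-agent: {agent}"]
    if disallowed:
        lines += [f"Disallow: {path}" for path in disallowed]
    else:
        lines.append("Disallow:")
    # One directive per line, with a trailing newline at the end of the file.
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical paths, purely for illustration.
    print(build_robots_txt(["/admin/", "/tmp/"]), end="")
```

Save the result as a plain text file named robots.txt and upload it to the root folder of your site. Any plain text editor does the same job by hand.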
|
| | |
| | #15 |
| Active Warrior Join Date: Nov 2011
Posts: 38
Thanks: 0
Thanked 2 Times in 2 Posts
|
Thanks to all of you for helping me understand something new.
|
| | |
| | |
| | #16 |
| Warrior Member Join Date: Dec 2011
Posts: 16
Thanks: 0
Thanked 0 Times in 0 Posts
|
A robots.txt is a permissions file that can be used to control which pages of a website a search engine crawls. The file must be located in the root directory of the website so that a search engine's website-indexing program (spider) can find it.
|
| | |
| | #17 |
| Active Warrior Join Date: Nov 2011 Location: uk
Posts: 38
Thanks: 4
Thanked 1 Time in 1 Post
|
The robots.txt file is a set of instructions for visiting robots (spiders) that index the content of your website's pages. For those spiders that obey the file, it provides a map of what they can and cannot index. The file must reside in the root directory of your website. The URL path (web address) of your robots.txt file should look like this... /robots.txt |
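Since the file always sits at /robots.txt in the root, you can work out its address from any page URL on the same site. A small sketch with Python's standard `urllib.parse`; the function name `robots_url` and the example.com address are placeholders for illustration.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the robots.txt address for the site that `page_url` belongs to."""
    parts = urlsplit(page_url)
    # Keep the scheme and host, replace the path, drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


if __name__ == "__main__":
    # example.com is a placeholder domain.
    print(robots_url("https://www.example.com/blog/post.html?x=1"))
```

A robots.txt placed in a subfolder, such as /blog/robots.txt, is not where spiders look, so it has no effect there.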
| | |
| | |
| | #18 |
| Active Warrior Join Date: Mar 2011
Posts: 65
Thanks: 0
Thanked 2 Times in 2 Posts
|
What kind of information do site owners not want to be crawled?
|
| | |
| | |