| | #1 |
| Active Warrior War Room Member Join Date: Mar 2009 Location: Tampa Bay, FL
Posts: 87
Thanks: 32
Thanked 5 Times in 5 Posts
|
I really need some help here. I have a WordPress site that has been ranked #1 in Google for the last year or so. Just yesterday it dropped to nowhere to be found. I checked Google Webmaster Tools and it says robots.txt unreachable. I have no clue how to fix this. My other sites, all WordPress and all hosted with HostGator, do not have this issue. Help? |
| | |
| | |
| | #2 | |
| Warrior Member Join Date: Apr 2011
Posts: 3
Thanks: 0
Thanked 0 Times in 0 Posts
| Quote:
| |
| | |
| | #3 | |
| HyperActive Warrior Join Date: Apr 2010
Posts: 129
Thanks: 33
Thanked 26 Times in 22 Posts
| Quote:
There are tons of robots.txt tutorials online that should help you figure out what to put in your file. | |
| | |
| | #4 |
| Active Warrior War Room Member Join Date: Mar 2009 Location: Tampa Bay, FL
Posts: 87
Thanks: 32
Thanked 5 Times in 5 Posts
|
OK, I've checked everything I can. The site is still indexed in Google, just nowhere to be found in the results. The robots.txt file looks OK. If I type in mysite.com/robots.txt, it shows:

User-agent: *
Disallow:
Sitemap: http://mysite.com/sitemap.xml.gz

Googlebot still can't find the robots.txt file. Am I missing something? |
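For anyone wondering whether the robots.txt contents quoted above could themselves be blocking Googlebot, here is a minimal sketch using Python's standard `urllib.robotparser`. The helper name `is_allowed` and the test URL are made up for illustration, and `mysite.com` is the poster's placeholder. It only checks what the rules say; it says nothing about whether Googlebot can actually reach the file.

```python
from urllib.robotparser import RobotFileParser

# The file contents as quoted in the post (mysite.com is a placeholder).
ROBOTS_TXT = """\
User-agent: *
Disallow:
Sitemap: http://mysite.com/sitemap.xml.gz
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Parse robots.txt text and report whether user_agent may fetch url.

    Hypothetical helper for this sketch, not part of any library.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# An empty "Disallow:" value places no restriction on the crawler.
print(is_allowed(ROBOTS_TXT, "Googlebot", "http://mysite.com/any/page/"))

# For contrast, "Disallow: /" asks crawlers to stay away from everything.
blocked = "User-agent: *\nDisallow: /\n"
print(is_allowed(blocked, "Googlebot", "http://mysite.com/any/page/"))
```

So the rules in the file look harmless; the "unreachable" message points at the fetch itself failing rather than at the file's contents.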
| | |
| | |
| | #5 |
| Active Warrior War Room Member Join Date: Mar 2009 Location: Tampa Bay, FL
Posts: 87
Thanks: 32
Thanked 5 Times in 5 Posts
|
OK, I resubmitted the sitemap in my Google Webmaster Tools account and checked and double-checked everything. Googlebot still can't find the robots.txt. Any ideas? |
| | |
| | |
| | #6 |
| HyperActive Warrior Join Date: Apr 2010
Posts: 129
Thanks: 33
Thanked 26 Times in 22 Posts
|
It could be that your web host's firewall is blocking IPs that make "too many" requests within a short period of time. Maybe you should contact your host and inquire about this issue. More info: http://www.homewithandrew.com/index.php/debugging-the-network-unreachable-robots-txt-unreachable-error/ |
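One rough way to probe for this from your own machine is to request robots.txt twice, once with a browser-like User-Agent and once with a Googlebot-like one, and compare the results. This is only a sketch: the User-Agent strings, function names, and the `fetch` parameter are assumptions for illustration. It can only reveal filtering by User-Agent; a firewall that blocks by IP address or request rate, as described above, cannot be reproduced this way and still needs a call to the host.

```python
import urllib.error
import urllib.request
from typing import Callable, Dict

# Illustrative User-Agent strings (assumptions for this sketch).
USER_AGENTS = {
    "browser": "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 "
               "(KHTML, like Gecko) Chrome/120.0 Safari/537.36",
    "googlebot": "Mozilla/5.0 (compatible; Googlebot/2.1; "
                 "+http://www.google.com/bot.html)",
}


def fetch_status(url: str, user_agent: str, timeout: float = 10.0) -> str:
    """Return the HTTP status code as a string, or a short error label."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return str(response.status)
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 403 or 503.
        return str(err.code)
    except (urllib.error.URLError, OSError) as err:
        # DNS failure, refused connection, timeout, and similar.
        return f"unreachable ({err})"


def compare_agents(
    url: str, fetch: Callable[[str, str], str] = fetch_status
) -> Dict[str, str]:
    """Fetch url once per User-Agent and collect the outcomes by name."""
    return {name: fetch(url, agent) for name, agent in USER_AGENTS.items()}


def looks_agent_filtered(results: Dict[str, str]) -> bool:
    """True when the browser request succeeds but the Googlebot one does not."""
    return results["browser"] == "200" and results["googlebot"] != "200"
```

Calling `compare_agents("http://mysite.com/robots.txt")` with your own domain performs the two real requests; passing a different `fetch` function lets you try the comparison logic without touching the network.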
| | |
| | #7 |
| Plundering the Web War Room Member Join Date: Feb 2007 Location: , , .
Posts: 4,849
Thanks: 804
Thanked 1,199 Times in 886 Posts
|
robots.txt has nothing to do with SERPs. If it's not found, no big deal. Googlebot will try and find it first. I have never had one. No need. It will show a not found error. But again, unless it just creeps you out to have a not found error, it's no big deal. Well, I guess it may if you had some stuff that you did not want indexed. But then that would be a different matter. Paul |
| |
| | |
| | #8 |
| SEO Strategist War Room Member Join Date: Jun 2010
Posts: 6,532
Thanks: 355
Thanked 1,992 Times in 1,273 Posts
|
When you have a GWT robots.txt error, don't use the resubmit option.
1) Double-check that your URL is good by searching for the exact robots.txt URL.
2) If the robots.txt URL is all good, delete the old robots.txt link inside GWT.
3) Paste the good robots.txt URL into GWT again (submit, not resubmit) and refresh your browser a couple of times.
I'm not sure why this happens, but I've found that the GWT resubmit option doesn't work very often. When I've just deleted it and started over (submit), it works just fine. |
| | |
| | |
| | #9 |
| Active Warrior Join Date: Apr 2011
Posts: 35
Thanks: 4
Thanked 2 Times in 2 Posts
|
I don't understand how to use this.
|
| | |
| | #10 |
| Active Warrior Join Date: Dec 2010
Posts: 40
Thanks: 0
Thanked 0 Times in 0 Posts
|
The Robot Exclusion Standard, also known as the Robots Exclusion Protocol or robots.txt protocol, is a convention to prevent cooperating web spiders and other web robots from accessing all or part of a website which is otherwise publicly viewable. Robots are often used by search engines to categorize and archive web sites, or by webmasters to proofread source code. The standard is unrelated to, but can be used in conjunction with, Sitemaps, a robot inclusion standard for websites. A robots.txt file on a website will function as a request that specified robots ignore specified files or directories in their search. This might be, for example, out of a preference for privacy from search engine results, or the belief that the content of the selected directories might be misleading or irrelevant to the categorization of the site as a whole, or out of a desire that an application only operate on certain data. For websites with multiple subdomains, each subdomain must have its own robots.txt file. If example.com had a robots.txt file but a.example.com did not, the rules that would apply for example.com would not apply to a.example.com. |
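To make the per-subdomain point concrete, here is a small sketch of how a crawler might work out which robots.txt governs a given page: it keeps the scheme and host of the page URL and swaps the path for `/robots.txt`. The function name `robots_url` is hypothetical, and the URLs are the example.com placeholders used above.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the robots.txt URL for the host that serves page_url.

    Hypothetical helper: the query string and fragment are dropped, and the
    host is kept exactly as given, so each subdomain maps to its own file.
    """
    parts = urlsplit(page_url)
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


for page in (
    "http://example.com/blog/post?id=1",
    "http://a.example.com/",
    "http://www.example.com/about",
):
    print(page, "->", robots_url(page))
```

Note that `www.example.com` and `example.com` come out as different hosts too, so each one needs to serve the file.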
| | |
| | |
| | #11 |
| Active Warrior War Room Member Join Date: Mar 2009 Location: Tampa Bay, FL
Posts: 87
Thanks: 32
Thanked 5 Times in 5 Posts
|
@ Remington Steel, thank you, I contacted my Host. I could not find any errors on my site.
|
| | |
| | |
| | #12 |
| Active Warrior War Room Member Join Date: Mar 2009 Location: Tampa Bay, FL
Posts: 87
Thanks: 32
Thanked 5 Times in 5 Posts
|
@ Paul, are you sure it has nothing to do with my site dropping in the SERPs? It started dropping shortly after Googlebot could no longer access my robots.txt.
|
| | |
| | |
| | #13 |
| Active Warrior War Room Member Join Date: Mar 2009 Location: Tampa Bay, FL
Posts: 87
Thanks: 32
Thanked 5 Times in 5 Posts
|
@ yukon, I have tried this, no luck, but thanks
|
| | |
| | |
| | #14 | |
| HyperActive Warrior Join Date: Dec 2010
Posts: 475
Thanks: 42
Thanked 68 Times in 47 Posts
| Quote:
If you can't solve the issue with the robots.txt file, you're better off deleting it. Then it becomes "no big deal". Otherwise, your site will simply no longer be crawled, and that's a pretty big deal. Source: robots.txt unreachable - Webmaster Central Help | |
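The difference between a missing robots.txt and an unreachable one can be sketched as a simple decision on the fetch result. This reflects my reading of Google's published guidance, not Google's actual code, and the function name and outcome labels are invented for illustration: a "not found" style answer is treated as "no rules", while a server error or a failed connection makes the crawler hold off.

```python
from typing import Optional


def crawl_outcome(status: Optional[int]) -> str:
    """Map a robots.txt fetch result to a rough crawler reaction.

    status is the HTTP status code, or None when the request failed before
    any response arrived (timeout, refused connection, DNS error).
    Hypothetical sketch; the labels are not official terminology.
    """
    if status is None:
        return "postpone crawl"  # unreachable: the case in this thread
    if 200 <= status < 300:
        return "obey rules in file"
    if 300 <= status < 400:
        return "follow redirect and re-evaluate"
    if status == 429 or status >= 500:
        return "postpone crawl"  # server trouble or rate limiting
    if 400 <= status < 500:
        return "crawl without restrictions"  # e.g. 404: no file, no rules
    return "unexpected status"


for result in (200, 404, 503, None):
    print(result, "->", crawl_outcome(result))
```

That is why deleting the file, so that requests for it return a 404, can turn the problem into "no big deal", provided the server can be reached at all.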
| | |
| | #15 | |
| Senior Warrior Member War Room Member Join Date: Jul 2008 Location: Vancouver, WA, USA.
Posts: 3,500
Thanks: 327
Thanked 584 Times in 408 Posts
| Quote:
| |
|
Tim Pears | ||
| | |
| | #16 |
| Warrior Member Join Date: Apr 2011
Posts: 17
Thanks: 0
Thanked 0 Times in 0 Posts
|
Hello everyone. I know nothing about how to recover from a drop like this, but after reading the queries and the responses, I now understand some of the basics. Well done, all. |
| | |
| | |
| | #17 | |
| HyperActive Warrior Join Date: Dec 2010
Posts: 475
Thanks: 42
Thanked 68 Times in 47 Posts
| Quote:
User-agent: *
Disallow:
Sitemap: http://mysite.com/sitemap.xml.gz | |
| | |
| | #18 |
| Active Warrior Join Date: Apr 2011
Posts: 52
Thanks: 0
Thanked 2 Times in 2 Posts
|
First of all, check whether the robots.txt file is working: just type domainname/robots.txt into your browser. If it loads, there is no problem with the file. If it doesn't, check where the file is located; it should be in the site root.
|
| | |
| | |
| | #19 | |
| Senior Warrior Member War Room Member Join Date: Jul 2008 Location: Vancouver, WA, USA.
Posts: 3,500
Thanks: 327
Thanked 584 Times in 408 Posts
| Quote:
| |
|
Tim Pears | ||
| | |
| | #20 | |
| HyperActive Warrior Join Date: Dec 2010
Posts: 181
Thanks: 9
Thanked 14 Times in 13 Posts
| Quote:
| |
| | |
| | #21 |
| Active Warrior War Room Member Join Date: Mar 2009 Location: Tampa Bay, FL
Posts: 87
Thanks: 32
Thanked 5 Times in 5 Posts
|
OK, I checked with HostGator: they were blocking Googlebot. Meanwhile, I entered all of my sites in my Webmaster Tools account twice, once as mysite.com and once as www.mysite.com. According to Google, they treat those two as different websites. I set the www version as preferred, also per Google's advice, since otherwise I might get penalized for duplicate content.

Here comes the fun part: for some of my sites, Google can now find the robots.txt for both www.mysite.com and mysite.com. For other sites, it can only find www.mysite.com/robots.txt but not the other one. I do, however, still have one site where Googlebot can't find the robots.txt on either. It does not make sense at all, I know.

@guitarjosh - I would like to try removing the robots.txt for the one site I mentioned, but I have WordPress installed and it somehow adds the file automatically. I searched through the WordPress forums, but no luck. I need to know how to remove the robots.txt from my WordPress site. |
| | |
| | |
| | #22 | |
| HyperActive Warrior Join Date: Dec 2010
Posts: 475
Thanks: 42
Thanked 68 Times in 47 Posts
| Quote:
How did you set the www as preferred in HostGator? I haven't looked into how mine is set, but thought maybe I should. | |
| | |