Old 04-11-2011, 09:49 AM   #1
Active Warrior
War Room Member
 
BigG95's Avatar
 
Join Date: Mar 2009
Location: Tampa Bay, FL
Posts: 87
Thanks: 32
Thanked 5 Times in 5 Posts
Default Robots.txt issue

I really need some help here. I have a WordPress site that has been ranked #1 in Google for the last year or so.

Just yesterday it dropped to nowhere to be found.

I checked Google Webmaster Tools and it says robots.txt unreachable.

I have no clue how to fix this.

My other sites, all WordPress and all hosted with HostGator, do not have issues.

Help???

BigG95 is offline   Reply With Quote
Old 04-11-2011, 10:22 AM   #2
Warrior Member
 
Join Date: Apr 2011
Posts: 3
Thanks: 0
Thanked 0 Times in 0 Posts
Default Re: Robots.txt issue

Quote:
Originally Posted by BigG95 View Post
I really need some help here. I have a wordpress site, which has been ranked # 1 in google for the last year or so.

Just yesterday it dropped to nowhere to be found.

I checked google webmaster tools and its says robots.txt unreachable.

I have no clue, how to fix this.

My other sites, all wordpress all hosted with hostgator, do not have issues.

Help???
Check with your hosting provider. If your host offers a website builder, the problem can sometimes be in the meta data section. I use Joomla with M2host. To me, M2host is the best host on the net, because you pay 12 dollars a year for hosting and you can choose between different website builders such as Joomla, Mambo, and many more. But anyway, check your hosting.
tom johnson is offline   Reply With Quote
Old 04-11-2011, 10:58 AM   #3
HyperActive Warrior
 
RemingtonSteele's Avatar
 
Join Date: Apr 2010
Posts: 129
Thanks: 33
Thanked 26 Times in 22 Posts
Default Re: Robots.txt issue

Quote:
Originally Posted by BigG95 View Post
I checked google webmaster tools and its says robots.txt unreachable.

I have no clue, how to fix this.
If Google is saying that the file is unreachable, then you likely don't have one, or it's in the wrong place. The robots.txt file is supposed to go in the root of your site (e.g., http://www.yoursite.com/robots.txt). What happens when you go to the URL above (edited for your own domain, of course)? If you get a 404 error, then your robots.txt file is missing. To add it, just create one and upload it. Simple as that.

There are tons of robots.txt tutorials online that should help you figure out what to put in your file.
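
For anyone who would rather check this from a script than a browser, here is a minimal Python sketch. The helper names `robots_txt_url` and `fetch_status` are made up for illustration, and www.yoursite.com is just the placeholder domain from above. It only reports what your own machine sees. Googlebot connects from different IP addresses, so a 200 here does not prove that Google can reach the file.

```python
import urllib.error
import urllib.request


def robots_txt_url(domain: str, scheme: str = "http") -> str:
    """Build the root-level robots.txt URL for a bare domain name."""
    return f"{scheme}://{domain.strip().strip('/')}/robots.txt"


def fetch_status(url: str, timeout: float = 10.0):
    """Return the HTTP status code for url, or None if no response arrived."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error code such as 404 or 503.
        return err.code
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, or timeout: nothing came back.
        return None
```

Calling `fetch_status(robots_txt_url("www.yoursite.com"))` should give 200 if the file is served, 404 if it is missing, and `None` if the server could not be reached at all.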
RemingtonSteele is offline   Reply With Quote
Old 04-11-2011, 11:41 AM   #4
Default Re: Robots.txt issue

OK, I checked everything I can. The site is still indexed in Google, just nowhere to be found in the rankings. The robots.txt file looks OK.

If I type in mysite.com/robots.txt, it shows:
User-agent: * Disallow: Sitemap: http://mysite.com/sitemap.xml.gz

Still, Googlebot can't find the robots.txt file.

Am I missing something?

BigG95 is offline   Reply With Quote
Old 04-11-2011, 02:18 PM   #5
Default Re: Robots.txt issue

OK, I resubmitted the sitemap in my Google Webmaster Tools account and checked and double-checked everything.

Googlebot still can't find the robots.txt.

Any ideas?

BigG95 is offline   Reply With Quote
Old 04-12-2011, 12:15 PM   #6
Default Re: Robots.txt issue

It could be that your web host's firewall is blocking IPs that make "too many" requests within a short period of time. Maybe you should contact your host and inquire about this issue.

More info: http://www.homewithandrew.com/index.php/debugging-the-network-unreachable-robots-txt-unreachable-error/
RemingtonSteele is offline   Reply With Quote
Old 04-12-2011, 01:02 PM   #7
Plundering the Web
War Room Member
 
paulgl's Avatar
 
Join Date: Feb 2007
Location: , , .
Posts: 4,849
Thanks: 804
Thanked 1,199 Times in 886 Posts
Default Re: Robots.txt issue

robots.txt has nothing to do with SERPs.
If it's not found, no big deal. Googlebot will try
and find it first. I have never had one. No need.
It will show a not found error. But again, unless
it just creeps you out to have a not found error,
it's no big deal.

Well, I guess it may if you had some stuff that you
did not want indexed. But then that would be a different
matter.

Paul

paulgl is offline   Reply With Quote
Old 04-12-2011, 04:02 PM   #8
SEO Strategist
War Room Member
 
yukon's Avatar
 
Join Date: Jun 2010
Posts: 6,532
Thanks: 355
Thanked 1,992 Times in 1,273 Posts
Default Re: Robots.txt issue

When you have a GWT robots.txt error, don't use the resubmit option.

1) Double-check that your URL is good by searching for the exact robots.txt URL

2) If the robots.txt URL is all good, delete the old robots.txt link inside GWT

3) Paste the good robots.txt URL into GWT again (submit, not resubmit), then refresh your browser a couple of times.

I'm not sure why this happens, but I've found that the GWT resubmit option doesn't work very often. When I've just deleted it and started over (submit), it works just fine.

yukon is offline   Reply With Quote
Old 04-12-2011, 06:55 PM   #9
Active Warrior
 
Join Date: Apr 2011
Posts: 35
Thanks: 4
Thanked 2 Times in 2 Posts
Default Re: Robots.txt issue

I don't understand how to use this.
syahbiz is offline   Reply With Quote
Old 04-14-2011, 07:54 AM   #10
Active Warrior
 
Join Date: Dec 2010
Posts: 40
Thanks: 0
Thanked 0 Times in 0 Posts
Default Re: Robots.txt issue

The Robot Exclusion Standard, also known as the Robots Exclusion Protocol or robots.txt protocol, is a convention to prevent cooperating web spiders and other web robots from accessing all or part of a website which is otherwise publicly viewable. Robots are often used by search engines to categorize and archive web sites, or by webmasters to proofread source code. The standard is unrelated to, but can be used in conjunction with, Sitemaps, a robot inclusion standard for websites.

A robots.txt file on a website will function as a request that specified robots ignore specified files or directories in their search. This might be, for example, out of a preference for privacy from search engine results, or the belief that the content of the selected directories might be misleading or irrelevant to the categorization of the site as a whole, or out of a desire that an application only operate on certain data.
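
As a small illustration of how a cooperating robot reads such a request, here is a sketch using Python's standard `urllib.robotparser` module. The rules in `EXAMPLE_RULES`, the `/private/` directory, and the helper name `allowed` are assumptions invented for this example, not anything taken from the site discussed in this thread.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: ask every robot to stay out of /private/.
EXAMPLE_RULES = """\
User-agent: *
Disallow: /private/
"""


def allowed(rules: str, user_agent: str, url: str) -> bool:
    """Return True if the given rules let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return parser.can_fetch(user_agent, url)
```

With these rules, a page under /private/ is refused while the rest of the site stays open. An empty `Disallow:` line, as in the file posted earlier in this thread, blocks nothing.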

For websites with multiple subdomains, each subdomain must have its own robots.txt file. If example.com had a robots.txt file but a.example.com did not, the rules that would apply for example.com would not apply to a.example.com.
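
The per-subdomain rule can be sketched in a few lines of Python. The function name `robots_txt_for` is hypothetical, and the sketch assumes the page URL already includes its scheme. It simply shows that the robots.txt location is derived from the host name, so each host gets its own file.

```python
from urllib.parse import urlsplit


def robots_txt_for(page_url: str) -> str:
    """Return the robots.txt URL that governs the given page URL."""
    parts = urlsplit(page_url)
    # Only the scheme and host matter; the page's path is discarded.
    return f"{parts.scheme}://{parts.netloc}/robots.txt"
```

A page on a.example.com maps to http://a.example.com/robots.txt, never to the file on example.com.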

ankitsharma is offline   Reply With Quote
Old 04-15-2011, 06:59 PM   #11
Default Re: Robots.txt issue

@RemingtonSteele, thank you. I contacted my host. I could not find any errors on my site.

BigG95 is offline   Reply With Quote
Old 04-15-2011, 07:00 PM   #12
Default Re: Robots.txt issue

@Paul, are you sure it has nothing to do with my site dropping in the SERPs? It started dropping shortly after Googlebot could no longer access my robots.txt.

BigG95 is offline   Reply With Quote
Old 04-15-2011, 07:11 PM   #13
Default Re: Robots.txt issue

@yukon, I have tried this with no luck, but thanks.

BigG95 is offline   Reply With Quote
Old 04-15-2011, 10:18 PM   #14
HyperActive Warrior
 
Join Date: Dec 2010
Posts: 475
Thanks: 42
Thanked 68 Times in 47 Posts
Default Re: Robots.txt issue

Quote:
Originally Posted by paulgl View Post
robots.txt has nothing to do with SERPs.
If it's not found, no big deal. Googlebot will try
and find it first. I have never had one. No need.
It will show a not found error. But again, unless
it just creeps you out to have a not found error,
it's no big deal.

Well, I guess it may if you had some stuff that you
did not want indexed. But then that would be a different
matter.

Paul
Kind of right, but in this case.. completely wrong. If you have a robots.txt file and it returns an "ambiguous" error as it's seemingly doing in this case, this is the problem. If Google sees there's a robots.txt but can't read it, they wisely assume your content is blocked (it wouldn't be a good practice to say "we can't read it, so let's index the whole thing").

If you can't solve the issue with the robots.txt file, you're better off to delete it. Then... it becomes "no big deal". Otherwise, your site will simply no longer be crawled... and that's a pretty big deal.
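
To make the distinction concrete, here is a rough Python sketch of the rule described above. It is a simplified model of this post's description, not Google's actual crawler logic, and the function name `crawl_decision` and its return strings are invented for illustration. Status codes other than success, 404, and server errors are deliberately left unmodeled.

```python
from typing import Optional


def crawl_decision(status: Optional[int]) -> str:
    """Map a robots.txt fetch result to the behavior described in this post.

    status is the HTTP status code, or None if no response arrived at all.
    """
    if status is None or 500 <= status < 600:
        # The file may exist but cannot be read: assume content is blocked.
        return "hold off crawling"
    if status == 404:
        # No robots.txt at all: nothing is restricted.
        return "crawl everything"
    if 200 <= status < 300:
        return "follow the rules in the file"
    return "not modeled here"
```

In this model, a missing file (404) is harmless, while a timeout or server error is what stops the crawl.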

Source: robots.txt unreachable - Webmaster Central Help
guitarjosh is offline   Reply With Quote
Old 04-15-2011, 10:50 PM   #15
Senior Warrior Member
War Room Member
 
timpears's Avatar
 
Join Date: Jul 2008
Location: Vancouver, WA, USA.
Posts: 3,500
Thanks: 327
Thanked 584 Times in 408 Posts
Default Re: Robots.txt issue

Quote:
Originally Posted by BigG95 View Post
if i type in mysite.com/robots.txt it shows
User-agent: * Disallow: Sitemap: http://mysite.com/sitemap.xml.gz
Why would you want to disallow the site map? You want Google to read your site map.

Tim Pears

timpears is offline   Reply With Quote
Old 04-15-2011, 11:40 PM   #16
Warrior Member
 
Join Date: Apr 2011
Posts: 17
Thanks: 0
Thanked 0 Times in 0 Posts
Default Re: Robots.txt issue

hello everyone,

I know nothing about how to recover such a position, but after reading the queries and their responses I now understand some of the basics. So, a job well done by all.

rvitgroup is offline   Reply With Quote
Old 04-16-2011, 12:22 AM   #17
Default Re: Robots.txt issue

Quote:
Originally Posted by timpears View Post
Why would you want to disallow the site map? You want Google to read your site map.
He's not disallowing it. His robots file is correct. It reads (as it should):

User-agent: *
Disallow:
Sitemap: http://mysite.com/sitemap.xml.gz
guitarjosh is offline   Reply With Quote
Old 04-16-2011, 12:57 AM   #18
Active Warrior
 
Join Date: Apr 2011
Posts: 52
Thanks: 0
Thanked 2 Times in 2 Posts
Default Re: Robots.txt issue

First of all, you need to check whether the robots file is working. Just type domainname/robots.txt into your browser. If it loads, there is no problem. If it does not, check where the file is located. It should be in the root of the site.

ianspencer is offline   Reply With Quote
Old 04-16-2011, 01:23 AM   #19
Default Re: Robots.txt issue

Quote:
Originally Posted by guitarjosh View Post
He's not disallowing it. His robots file is correct. It reads (as it should):

User-agent: *
Disallow:
Sitemap: http://mysite.com/sitemap.xml.gz
I guess I misread it. I don't really understand robots.txt that well.

Tim Pears

timpears is offline   Reply With Quote
Old 04-16-2011, 03:00 AM   #20
HyperActive Warrior
 
rising_sun's Avatar
 
Join Date: Dec 2010
Posts: 181
Thanks: 9
Thanked 14 Times in 13 Posts
Default Re: Robots.txt issue

Quote:
Originally Posted by BigG95 View Post
I really need some help here. I have a wordpress site, which has been ranked # 1 in google for the last year or so.

Just yesterday it dropped to nowhere to be found.

I checked google webmaster tools and its says robots.txt unreachable.

I have no clue, how to fix this.

My other sites, all wordpress all hosted with hostgator, do not have issues.

Help???
I would say the same thing: check with your hosting provider. I think they will know what is happening on their end. Please contact them quickly. Hopefully you will get indexed again very soon.


rising_sun is offline   Reply With Quote
Old 04-16-2011, 09:31 AM   #21
Default Re: Robots.txt issue

OK, I checked with HostGator, and they were blocking Googlebot. Meanwhile, I entered all of my sites in my Webmaster Tools account twice:

once as mysite.com and once as www.mysite.com. According to Google, they treat those two as different websites.

I set the www. version as preferred, because according to Google I might otherwise get punished for duplicate content.

Here comes the fun part now:

For some of my sites, Google can now find the robots.txt for both www.mysite.com and mysite.com.

For other sites they can only find www.mysite.com/robots.txt but not the other one.

I do, however, still have one site where Googlebot can't find the robots.txt on either.

It does not make sense at all, I know.

@guitarjosh - I would like to try removing the robots.txt for the one site I mentioned, but I have WordPress installed and it somehow adds the file automatically.
I searched through the WordPress forums, but no luck. I need to know how to remove the robots.txt from my WordPress site.

BigG95 is offline   Reply With Quote
Old 04-23-2011, 02:23 PM   #22
Default Re: Robots.txt issue

Quote:
Originally Posted by BigG95 View Post
Ok, I checked with Hostgator, they were blocking google bot, I meanwhile entered all of my sites in Webmaster account twice.

Once as mysite.com and once as www.mysite.com. According to google, they treat those 2 as different websites.

I set the www. as preferred, also according to google, otherwise I might get punished for duplicate content.

Here comes the fun part now:

For some of my sites, google can find the robots.txt now for both the www.mysite.com and the mysite.com.

For other sites they can only find the www.mysite.com/robots.txt but not the other one.

I do however still have one site, where google bot can't find the robots.txt on neither.

Does not make sense at all, I know.

@guitarjosh - I would like to try to remove the robots.txt for the one site, I mentioned, but I have wordpress installed and they somehow automatically add it.
Searched through Wordpress Forums, but no luck. I need to know, how to remove the robots.txt from my wordpress site.
Wordpress's "virtual" robots.txt file is a pain in the arse. Just manually create a robots.txt file and put it in the root of your site. That should override the virtual.
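
If you would rather script that than upload by hand, here is a minimal Python sketch. The function names `render_robots_txt` and `write_robots_txt` are made up for illustration, and the web root path is something you would have to look up for your own hosting account. The file content is the same three lines quoted earlier in this thread.

```python
from pathlib import Path


def render_robots_txt(sitemap_url: str) -> str:
    """Return an allow-everything robots.txt that also names the sitemap."""
    lines = ["User-agent: *", "Disallow:", f"Sitemap: {sitemap_url}"]
    return "\n".join(lines) + "\n"


def write_robots_txt(web_root: str, sitemap_url: str) -> Path:
    """Write robots.txt into web_root (assumed to be the site's root folder)."""
    target = Path(web_root) / "robots.txt"
    target.write_text(render_robots_txt(sitemap_url), encoding="utf-8")
    return target
```

After writing the file, load yoursite.com/robots.txt in a browser to confirm that the physical file is the one being served.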

How did you set the www as preferred in HostGator? I haven't checked how mine is set, but I thought maybe I should.
guitarjosh is offline   Reply With Quote
All times are GMT -6.