Googlebot Can't Access My Website?

by aadi14
14 replies
  • SEO
I have a travel website with about 13,000 destination pages: A to B, B to A, C to B, C to A, and so on. The sitemap isn't getting all of the site's pages indexed, and it is also showing the message that Googlebot can't access the site. What can I do to resolve the problem? Not even half of the pages are indexed. Also, for such a big website, should I add several sitemaps? I have already added 10, since I understand that only 8,000 entries are allowed per sitemap. How can I fix this error? I am getting good traffic, but I am scared that my website will get banned. Is that likely? The website itself has only 6 pages; a journey page only shows up once the destinations are entered.
#access #googlebot #website
  • Profile picture of the author promo87
    Banned
    Well, I think there is a problem with the robots.txt file. Check it for errors, and once it is fixed, create the sitemap again and submit it in Webmaster Tools. Then wait a while to see whether Googlebot can access your site.
    • Profile picture of the author aadi14
      I checked the robots.txt. It looks like this:
      #robots.txt for <http://www.abc.com>
      User-agent:*
      Disallow: /backoffice/

      Sitemap: <http://www.abc.com/sitemap1.xml>
      Sitemap: <http://www.abc.com/sitemap2.xml>

      In the same way, there are 228 sitemap entries in total (excluding the <> signs).

      Is this the right way? The website provides more than 13,000 journeys, so I have to submit this many sitemaps.
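
One common way to manage many sitemap files is a sitemap index that lists them. The following is a minimal Python sketch under stated assumptions: it uses the sitemaps.org protocol limit of 50,000 URLs per sitemap file (rather than the 8,000 mentioned in the question), and the journey URL pattern and all helper names are hypothetical, since the site's real URLs are not given in the thread.

```python
from xml.sax.saxutils import escape

MAX_URLS_PER_SITEMAP = 50_000  # per-file limit in the sitemaps.org protocol
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def chunk(urls, size=MAX_URLS_PER_SITEMAP):
    """Split the URL list into pieces of at most `size` URLs."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def build_sitemap(urls):
    """Return the XML text of one sitemap file listing `urls`."""
    entries = "\n".join(f"  <url><loc>{escape(u)}</loc></url>" for u in urls)
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<urlset xmlns="{SITEMAP_NS}">\n{entries}\n</urlset>\n'
    )


def build_sitemap_index(sitemap_urls):
    """Return the XML text of a sitemap index pointing at each sitemap file."""
    entries = "\n".join(
        f"  <sitemap><loc>{escape(u)}</loc></sitemap>" for u in sitemap_urls
    )
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<sitemapindex xmlns="{SITEMAP_NS}">\n{entries}\n</sitemapindex>\n'
    )


# Hypothetical journey URLs; the real site's URL pattern is not known.
journey_urls = [f"http://www.abc.com/journey/{n}" for n in range(13_000)]

parts = chunk(journey_urls)
sitemaps = [build_sitemap(part) for part in parts]
index = build_sitemap_index(
    [f"http://www.abc.com/sitemap{i}.xml" for i in range(1, len(parts) + 1)]
)
```

Under that limit, 13,000 URLs fit in a single sitemap file, and the index only becomes necessary once the list grows past 50,000 entries. A robots.txt `Sitemap:` line can point at the index file instead of listing every sitemap separately.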
  • Profile picture of the author sanusense
    Checking the robots.txt file will help, and so will internal linking. Bots can't crawl all of your pages if there is a programming mistake somewhere, so take precautions and link your pages to each other.

    For example, link A to B, B to C, and C to A.
    When a search engine bot comes to index page A, it sees the link to B and crawls it.

    When it reaches B, it sees the link to C and crawls that too. Problem solved!
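
The chain described above can be pictured as a walk over a link graph: starting from one page, a crawler can only discover pages that an already-known page links to (or that a sitemap lists). Here is a minimal Python sketch of that idea. The page names and the `reachable_pages` helper are made up for illustration, and this is a simplified model, not a description of how Googlebot is implemented.

```python
from collections import deque


def reachable_pages(links, start):
    """Breadth-first walk over `links`, a dict mapping a page to the pages it links to."""
    seen = {start}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return seen


# Hypothetical link graph: A -> B -> C -> A, plus page D that nothing links to.
links = {"A": ["B"], "B": ["C"], "C": ["A"], "D": ["A"]}

found = reachable_pages(links, "A")
orphans = set(links) - found  # pages that exist but are never discovered
```

In this toy graph, `D` is never found because no page links to it. That is roughly the situation of journey pages that only appear after destinations are entered in a form.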
  • Profile picture of the author Scott016
    It has to be the robots.txt file. Make sure that file is written correctly. After you fix the robots.txt file, resubmit the sitemap and also use the "Fetch as Google" option in Webmaster Tools to fetch your website and certain subdomains. I hope that works for you.
  • Profile picture of the author hieuvu899
    Your problem is in the robots.txt file. Edit that file so that Google can read the information from your sitemap.
  • Profile picture of the author alexjames212
    When did you submit the sitemap? What date?
  • Profile picture of the author SEO Power
    Extensive internal linking and authority are very important for deeper indexing. Internal linking helps Googlebot find new pages faster, and authority (determined by the number and PageRank of backlinks) helps with deeper and more frequent crawling.

    Make sure the pages that haven't been indexed yet aren't blocked by your robots.txt file, then build some backlinks to them and give Googlebot some time to visit and crawl them.

    Another potential culprit could be the reliability of your host. If your website is frequently down whenever Google tries to index your pages, most of your pages won't be indexed.
  • Profile picture of the author mkgg
    It's not his robots.txt. Jeez, do you guys even read?

    Use Fetch as Googlebot in WMT and note the response code you receive: is it 404, 200, or 403? What response does Googlebot get? If Googlebot is fetching fine, then it's your server being crappy, probably because of downtime.
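
As a rough local counterpart to the fetch test described above, the status code a server returns can also be checked from a script. This is a minimal Python sketch, not a reproduction of what Google's own tool does: the Googlebot-like User-Agent string is sent purely for illustration, and the `describe_status` and `fetch_status` helpers are hypothetical names.

```python
import urllib.error
import urllib.request

# Assumption: a Googlebot-like User-Agent string, used only for illustration.
# Servers may treat real Googlebot traffic differently from this request.
USER_AGENT = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def describe_status(code):
    """Map an HTTP status code to a rough reading for crawl debugging."""
    if 200 <= code < 300:
        return "ok: the page was served"
    if code in (401, 403):
        return "blocked: the server refused the request"
    if code == 404:
        return "missing: the URL does not exist on the server"
    if 500 <= code < 600:
        return "server error: the host may be overloaded or down"
    return "other: inspect the response by hand"


def fetch_status(url, timeout=10):
    """Return the HTTP status code the server sends for `url`."""
    request = urllib.request.Request(url, headers={"User-Agent": USER_AGENT})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        return error.code  # 4xx and 5xx responses arrive as exceptions
```

A call such as `describe_status(fetch_status("http://www.abc.com/"))` would then give a first hint. Note that `urlopen` follows redirects, and a host that is completely unreachable raises `urllib.error.URLError`, which this sketch deliberately does not catch.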

    You can also check your robots.txt in WMT; Google will tell you whether it's good or not. But robots.txt is not the issue. The only thing your robots.txt is blocking right now is the backoffice folder, which I suppose is what you want, right?

    Also, if robots.txt were the problem, you could google your site and the result would show a message saying that Google can't show the page because it is not allowed access.
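
The point that this robots.txt only blocks the backoffice folder can be checked offline with Python's standard `urllib.robotparser`. This is a minimal sketch that feeds the parser the rules posted earlier in the thread; the two test URLs are hypothetical, since the site's real paths are not given.

```python
from urllib.robotparser import RobotFileParser

# The rules as posted earlier in the thread, without the <> signs.
ROBOTS_TXT = """\
# robots.txt for http://www.abc.com
User-agent: *
Disallow: /backoffice/

Sitemap: http://www.abc.com/sitemap1.xml
Sitemap: http://www.abc.com/sitemap2.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Hypothetical URLs; the site's real paths are not given in the thread.
journey_allowed = parser.can_fetch("Googlebot", "http://www.abc.com/journey/a-to-b")
admin_allowed = parser.can_fetch("Googlebot", "http://www.abc.com/backoffice/login")
```

With these rules, the journey URL is allowed and the backoffice URL is not, which matches the reading that robots.txt is not what keeps the journey pages out of the index.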
    • Profile picture of the author aadi14
      I will try the fetch instruction in WMT first. And yes, I do want to block the backoffice, as it's the admin panel link.
  • Profile picture of the author hirithk
    Hi
    Add a robots.txt file and follow links, and it will crawl the website. You can add it at the home (root) level of the site and upload it using FileZilla.
  • Profile picture of the author Masondavis
    Add a robots.txt file and follow links, and it will crawl the website.
