Erasing everything from Google index

9 replies
  • SEO
  • |
Hi,

During the first run, Google indexed some stuff that it should not have - plugins directories, etc.

How can one get rid of all this information in a batch-like mode? I know one can remove (or try to remove) individual links from Google Webmaster - but - how can you do a lot at one time?

TIA
#erasing #google #index
  • Profile picture of the author nik0
    Banned
    Originally Posted by dxmufasa View Post

    Hi,

    During the first run, Google indexed some stuff that it should not have - plugins directories, etc.

    How can one get rid of all this information in a batch-like mode? I know one can remove (or try to remove) individual links from Google Webmaster - but - how can you do a lot at one time?

    TIA
    Block it in robots.txt indeed to prevent it getting reindexed after you use the URL removal tool in Google webmasters.

    The URL removal tool suppors the option to deindex (sub) folders, there's an option for that.
    {{ DiscussionBoard.errors[9262257].message }}
  • Profile picture of the author dxmufasa
    Thanks - out of curiosity, would it be best to just remove the site and start from scratch again? I just saw an option that would remove everything.

    I am only asking this because the page titles being used in the SERP are wrong. I have made changes and checked the source HTML code to see that the <title> </title> tags are set correctly.

    Google reports that the page has been indexed. And - the site title is ~still wrong~. For example, I have an old site title of : Login To The SiteCompany Name A

    This was because I had 2 Wordpress plugins activated at one time: SEO Yoast and another (can't remember the name) - I had forgotten I had added it.

    I deactivated one and then the titles started showing up correctly - i.e.
    Login To The Site - Company Name A

    But again, Google does not recognize this change. I have been submitting sitemaps and Google reports the pages have been indexed.

    That was the reason for perhaps considering the "scorched earth" policy was to erase everything and try again - since Google refuses to recognize the changes.
    {{ DiscussionBoard.errors[9262286].message }}
  • Profile picture of the author dxmufasa
    Thanks but my understanding is that this is more of a preventative measure than a corrective one. It will work for any crawling in the future but does not address the wrong pages being indexed by Google in the present.

    Originally Posted by SEO rockzz View Post

    You should use robots.txt file on your public.html folder to indicate google crawler on which page should index and which not .
    Thanks
    {{ DiscussionBoard.errors[9262292].message }}
    • Profile picture of the author mkg
      Banned
      [DELETED]
      {{ DiscussionBoard.errors[9262352].message }}
      • Profile picture of the author dxmufasa
        Originally Posted by mkg View Post

        User-agent: *
        Disallow: /

        in your robots.txt and everything about the site is gone on google. You can also use the meta robots tag to stop google from showing the "A description for this result is not available because of this site's robots.txt - learn more." message when you search for the exact url.

        I don't recommend doing this, Google will update the site/page's title the next time it caches your site so why not wait a few days and let google do its thing automatically.
        Is there a link that would support how to do this?

        Thanks for the input I guess I was freaking out because I was submitting maps, they were getting indexed but the title was not changing - so - this is due to Google caching the page(?)

        Google also indexed a directory full of plugins as well I entered http://mydomain.com/wp-content/plugins/
        into the Webmaster removal tool - it is pending. This should remove the literally ~tons~ of pages (OK, I'm exaggerating a little) that have been indexed under /wp-content/plugins/ - right?

        I was getting ready to do another site as well. Is there a way to fix it so that Google (and others) ~do no caching or crawling at all~ until the site is completed? Something like an on/off button of sorts.
        {{ DiscussionBoard.errors[9262393].message }}
      • Profile picture of the author dxmufasa
        Originally Posted by mkg View Post

        User-agent: *
        Disallow: /

        in your robots.txt and everything about the site is gone on google.

        Thanks again for your responses. So - this will do the erasing in Google?
        I found some info here:
        The Web Robots Pages

        I thought that this would affect future crawling and not really the crawling that has already taken place. In other if Google had indexes "cached", (before adding robots.txt) then they would stay there.

        I though I read this somewhere - although I can't remember the link ...
        {{ DiscussionBoard.errors[9262408].message }}
        • Profile picture of the author paulgl
          There is no reason to do anything like you are thinking about.

          Complete nonsense.

          Google indexes tons of crap. Doesn't mean it will ever need to be shown
          in SERPs. And if it is, so what?

          Makes no sense to worry about this.

          Why are there so many threads like this? Are you people that new to net?
          Do you not know how the web works? Do you know anything about html,
          php, javascript, uploading files, changing files, etc?

          I can't believe I'm the only one that ever gives a voice of reason.

          Dump the whole site? Man you people are just crazy with a capital C.

          Paul
          Signature

          If you were disappointed in your results today, lower your standards tomorrow.

          {{ DiscussionBoard.errors[9262432].message }}
  • Profile picture of the author Earl Gray
    Yes, robots.txt should affect on google index in mater of days.
    {{ DiscussionBoard.errors[9262391].message }}
  • Profile picture of the author molliefr
    One of the way is to stop indexing is to have this meta tag -
    <meta name="robots" content="noindex">

    but you need it on every page.

    Better go for webmaster tool and remove all url and also edit the robot.txt

    more information can be found here
    https://support.google.com/webmaster...r/156449?hl=en
    {{ DiscussionBoard.errors[9262581].message }}
  • Profile picture of the author dxmufasa
    Thanks for all of the hints and help!
    {{ DiscussionBoard.errors[9263131].message }}

Trending Topics