![]() | | ||||||||
| | #1 |
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
|
If you do NOT want Google to index a certain URL, or if Google already indexed a certain URL of your site and you do not want it to be listed in the search results, you simply include the following piece of code on the page: <meta name="robots" content="noindex,nofollow"/> Okay, sounds simple enough... But what if the file that is in Google's index that you want removed just happens to be a .SWF file?! You can't just "put" that code into a file like that. Catch my drift? |
| | |
| | |
| | #2 |
| Active Warrior Join Date: Dec 2010 Location: Wales
Posts: 69
Thanks: 0
Thanked 6 Times in 6 Posts
|
If it has already been indexed you will have to contact Google directly. Ask them to remove the file as it breaches your copyright.
|
| | |
| | #3 |
| HyperActive Warrior Join Date: Jan 2010 Location: United Kingdom
Posts: 176
Thanks: 54
Thanked 48 Times in 36 Posts
|
For files that have not been indexed could you not put them in a password protected directory?
|
|
Have a problem with migraine? Find help here: Natural Migraine Relief.
| |
| | |
| | #4 |
| HyperActive Warrior Join Date: May 2009 Location: Midwest
Posts: 234
Thanks: 27
Thanked 37 Times in 28 Posts
|
This one has always helped me get great position in the SERPs: <meta name="ROBOTS" content="position,#1"> I told myself I wasn't going to show anyone that black hat trick, but I wanted to give something back. LastWarrior ![]() Crowd roars and gives thanks! |
| | |
| | #5 | |
| Jordan K War Room Member Join Date: Aug 2010 Location: Canada
Posts: 833
Thanks: 18
Thanked 92 Times in 80 Posts
| Quote:
Thanks for that. It will save me a ton of time, and help my clients. | |
| | ||
| | |
| | #6 |
| Senior Warrior Member Join Date: Jun 2009 Location: Scotland, SD
Posts: 1,251
Thanks: 42
Thanked 290 Times in 191 Posts
|
I think you could use robots.txt to do this. Google it and see. That's what I'd try.
|
| | |
| | |
| | #7 |
| Bill Platt War Room Member Join Date: Jul 2004 Location: Stillwater, Oklahoma, USA.
Posts: 7,990
Blog Entries: 4 Thanks: 6,529
Thanked 6,059 Times in 2,729 Posts
|
tell the robots.txt to ignore it, and on all links pointing to it, use the noindex, nofollow.
|
| Bill Platt --> Writing Puzzle --> Redneck Marketers --> Biz Magi --> How I Became a Redneck Marketer <-- Keep Ken Strong | |
| | |
| | #8 | |
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| Quote:
Hmmm...based on my extensive research that command can only be used within a META tag on a webpage, but it would be great if I could achieve the same thing (deindexing an already indexed page) by simply controlling it from the robots.txt - but here's the HUGE problem with doing that...even if it was possible: Everyone else can view your robots.txt file - it's always recommended to never have any type of private information on your robots.txt file, so if what you want DEindexed from Google is something you don't want anyone else to have access to, by putting it in your robots.txt file you have good intentions, but then the directory/URL will be able to be viewed by the public with a simple web browser by simply seeing it listed in your robots.txt file. Hmmm... | |
| | ||
| | |
| | #9 |
| Super HyperActive Warrior War Room Member Join Date: Sep 2010 Location: Quebec, Canada
Posts: 473
Thanks: 21
Thanked 74 Times in 55 Posts
|
You can tell Google to not index your file by using the file robots.txt in your root folder. You put that inside: PHP Code: |
|
Benoit Tremblay
| |
| | |
| | #10 |
| Senior Warrior Member War Room Member Join Date: Jan 2008 Location: Wisconsin, USA.
Posts: 4,140
Blog Entries: 2 Thanks: 2,820
Thanked 3,878 Times in 1,891 Posts
|
I haven't been in my Google Webmaster Tools Account for a while, but I think there's a way to request delisting in there on a per page basis. I'd look at that first. If that's not there, perhaps the easiest thing to do is move the content you don't want Google to index and put up a different page in it's place that you do want Google to index. |
| | |
| | |
| | #11 |
| Bill Platt War Room Member Join Date: Jul 2004 Location: Stillwater, Oklahoma, USA.
Posts: 7,990
Blog Entries: 4 Thanks: 6,529
Thanked 6,059 Times in 2,729 Posts
|
I honestly think you can go into Google Webmaster Tools and tell them to de-index that page.
|
| Bill Platt --> Writing Puzzle --> Redneck Marketers --> Biz Magi --> How I Became a Redneck Marketer <-- Keep Ken Strong | |
| | |
| | #13 |
| Bill Platt War Room Member Join Date: Jul 2004 Location: Stillwater, Oklahoma, USA.
Posts: 7,990
Blog Entries: 4 Thanks: 6,529
Thanked 6,059 Times in 2,729 Posts
| |
| Bill Platt --> Writing Puzzle --> Redneck Marketers --> Biz Magi --> How I Became a Redneck Marketer <-- Keep Ken Strong | |
| | |
| | #14 |
| SEO Strategist War Room Member Join Date: Jun 2010
Posts: 6,532
Thanks: 355
Thanked 1,992 Times in 1,273 Posts
|
Forget robots.txt, try this code in your .htaccess file. Code: RewriteRule .*.(swf|mov|avi|flv)$ hxxp://redirect_here.com/ [R,NC] I use this code on my sites to block google from adding certain image formats (jpg, gif) to Google Images. It works perfect for images, so it should also work for video (swf). BTW, this thread would be better in the SEO forum. |
| | |
| | |
| | #15 |
| SEO Strategist War Room Member Join Date: Jun 2010
Posts: 6,532
Thanks: 355
Thanked 1,992 Times in 1,273 Posts
|
Also If your host is Hostgator, go to your Cpanel -> Hotlink Protection, & use that page to block your specific file formats (swf, zip, jpg, etc...) If you do this (Cpanel) it will update your .htaccess file on it's own, so you wouldn't need to do the manual update code in my last comment above. |
| | |
| | |
| | #16 | |
| Senior Warrior Member War Room Member Join Date: Jan 2008 Location: Wisconsin, USA.
Posts: 4,140
Blog Entries: 2 Thanks: 2,820
Thanked 3,878 Times in 1,891 Posts
| Quote:
![]() For the second part, I've got that puppy on my lap, making it hard to type. I'd call our obstacles about even. | |
| | ||
| | |
| | #17 | |
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| Quote:
that code! It happened to me! And if you do some research you'll find that to be true. Really! That command does not guarantee the link will not be indexed! It merely states to not allow the spider to crawl the page and gather information about the link (like the description tag) when displaying it in the SERPs. If other pages are linking to the URL it can still be indexed (it happened to me so I found out firsthand from experience on this one) & it will just appear in the SERP as a filename only with no description below it (like what happened to me). If you really want to prevent a URL from being indexed, or if you want an already-indexed URL to be removed from the search index, you must insert the following code within the heading of your page: <meta name="robots" content="noindex,nofollow" /> ...and then of course wait until the spider comes around to visit the page again to see the updated META tag & remove the page from it's index appropriately... The only problem about my situation here is that the file I want DEindexed is actually a .SWF file and not an HTML file that I can add that code to. (So I added it to the INDEX page of the directory pathway that the SWF file is contained in to see, as a test, if Google will DEindex only the INDEX page or the INDEX page as well as all the associated files within that directory path - time will tell) | |
| | ||
| | |
| | #18 | |||
| HyperActive Warrior War Room Member Join Date: Nov 2010 Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
| Quote:
Quote:
If you own the site Verify your ownership of the site in Webmaster Tools. On the Webmaster Tools home page, click the site you want. On the Dashboard, click Site configuration in the left-hand navigation. Click Crawler access, and then click Remove URL. Click New removal request. Type the URL of the page you want removed from search results (not the Google search results URL or cached page URL), and then click Continue. How to find the right URL. Note that the URL is case-sensitive—you will need to submit the URL using exactly the same characters and the same capitalization that the site uses. Click Remove page from search results and cache. Select the checkbox to confirm that you have completed the requirements listed in this article, and then click Submit Request. Make sure you choose "Remove page and cache" in the dropdown. Quote:
Regarding the public nature of the robots.txt: You could create a folder for only those files that you don't want indexed, then disallow that folder in the robots.txt, so the file names are not shown in the robots.txt file, although, I don't know whether the filenames could be viewed by typing in the domainname.com/yourdisallowedfoldername. You could also create a subdomain, and put all files under that you don't want indexed. Google handles subdomains separately from the main domain. You would put a robots.txt file under the subdomain that disallows indexing the whole subdomain. That way you don't have to state any folder names or file names in the robots.txt, so even though the robots.txt is public, nobody will know the folder names or file names under the subdomain. The big thing about the noindex meta tag, the robots.txt, and restricting access or indexing through the .htaccess file is that they are advisory--not obligatory. Search engines are NOT obligated or required to obey the instructions in either of those files or the meta tag. Google respects, and obeys those instructions, and Yahoo does too. But there are other, and many smaller search engines that don't. So, even if you have your file removed from Google search results, it doesn't mean that it's not in the search results of other search engines. If you want a file protected from indexing, why not put it in a different folder in your cpanel that's not under public_html | |||
| | |
| | #19 | |
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| Quote:
Right... And it's here: https://www.google.com/webmasters/tools/removals But the only problem with that is after putting in the URL and clicking Continue... ![]() ...it doesn't seem like any of those options seem to fit. It seems as if those options are all geared towards the non-website-owner (I accessed this tools after already being logged in from my Webmaster Tools account). | |
| | ||
| | |
| | #20 | |
| HyperActive Warrior War Room Member Join Date: Nov 2010 Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
| Quote:
I did this a week ago, and Google removed the page within hours. I posted instructions right before your last post. See there. You should be able to get the page removed. | |
| | |
| | #21 | |
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| Quote:
Exactly. And this is another thing I learned today. Those files actually CAN be viewed IF you do not have an INDEX page for that directory AND in your cPanel (in HostGator anyway) in the "Index Manager" by default the setting is to display all the files! You must manually change this for the domain and select "No Indexing" which will THEN generate the "forbidden no access" page anytime someone accesses the directory with no index page. | |
| | ||
| | |
| | #22 | |
| HyperActive Warrior War Room Member Join Date: Nov 2010 Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
| Quote:
| |
| | |
| | #23 | |
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| Quote:
Thanks. I started to do that but then the checkbox below it... ![]() ...it sounds like I need to either place a block on robots.txt (which I'm NOT going to do since I don't want it listed there) or place a NOINDEX META tag in the file (which I mentioned earlier...this is a SWF file and I can not place a META tag in a SWF file!)... So since I can't do either one of the two, I can't check the box, thus can't move forward with this type of DEindexing option. Thankyou for the explanation though! It's very useful to know about in the future if I ever need to DEindex an actual HTML page, but as for a SWF page...perhaps I'll wait it out and see if the NOINDEX META tag I placed in the INDEX page of the directory the SWF is contained in will work. It'd be good to know that anyway. | |
| | ||
| | |
| | #24 |
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| |
| | |
| | |
| | #25 | |
| HyperActive Warrior War Room Member Join Date: Nov 2010 Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
| Quote:
I have to say, I am not familiar with SWF files.. Didn't Google give another option to block it? I thought there were three (you mentioned two), but I might not remember right. | |
| | |
| | #26 |
| HyperActive Warrior War Room Member Join Date: Nov 2010 Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
| |
| | |
| | #27 | |
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| Quote:
Yes, there is a third option (see the checkbox image above) and that is that it returns a "404 error", which doesn't apply either since it does NOT return a 404 error (it loads fine). | |
| | ||
| | |
| | #28 | |
| HyperActive Warrior War Room Member Join Date: Nov 2010 Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
| Quote:
| |
| | |
| | #29 | ||
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| Quote:
Yes, I did, and thankyou for the suggestion, however, as mentioned earlier in this thread, the DISALLOW command in robots.txt does NOT prevent files from becoming indexed (only crawled). I'll paste it again: Quote:
| ||
| | |||
| | |
| | #30 | |
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| Quote:
Yes, that will work, but what if I needed to keep the same location of the file? I'm curious as to how else one would get a SWF URL removed from Google's index without having to change the filename (which is why I'm going to wait this out a few days to see if the META tag I put in the INDEX page of the directory the SWF file is contained in works to remove the URL of the SWF - if it doesn't then I'll just end up using the 404 option). Thanks for the input! | |
| | ||
| | |
| | #31 | |
| HyperActive Warrior War Room Member Join Date: Nov 2010 Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
| Quote:
But you are correct, none of these will CERTAINLY prevent indexing because, as I said, these instruction are not obligatory for the search engines. | |
| | |
| | #32 | ||
| HyperActive Warrior War Room Member Join Date: Nov 2010 Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
| Quote:
Then you put the file back after it gets deindexed. (But as long as it's under public_html, it might be indexed again even if you "restrict" it with robots.txt or any of the other methods because search engines are NOT obligated to obey it (as stated before). You might also want to check if Yahoo indexed it. I don't know how you would remove it from there. You could put the file in a frame. Search engines can't read (or index) frames (as far as I know. I don't know the specifics, but you might want to do a search about it). Again, I don't know anything about SWF files, so I don't know if the frame would be appropriate. Quote:
| ||
| | |
| | #33 | |
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| Quote:
...a page does not have to be crawled in order to be indexed! If Google sees a URL being referenced by anchor text, it can still include the URL in the index. It will not be crawled (the page will not be fetched) so it will not have the description tag with it, and will appear as a URL only (as it did in my case). | |
| | ||
| | |
| | #34 | ||
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| Quote:
of what other option I have for a SWF file. Quote:
It's primarily because I'd like to figure out a way to remove a SWF URL from Google's index without having to delete it's original location...just to know how it's done in case it ever gets asked in the future or if I'm ever in a similar situation in the future. | ||
| | |||
| | |
| | #35 | |
| HyperActive Warrior War Room Member Join Date: Nov 2010 Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
| Quote:
| |
| | |
| | #36 | |
| HyperActive Warrior War Room Member Join Date: Nov 2010 Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
| Quote:
User-agent: * Disallow: /directory 1/ I can't think of any other way. Hope you will succeed. Interestingly, Google just deindexed a page on my site that I didn't want deindexed. I have the follow tag in the file, and the robots.txt doesn't restrict it either. I would like to know why they did that. | |
| | |
| | #37 |
| HyperActive Warrior Join Date: May 2010
Posts: 123
Thanks: 3
Thanked 5 Times in 5 Posts
|
Sometimes the robot disobeys the rule and as a result page gets indexed.For better results contact Google directly.
|
| | |
| | #38 | |
| Senior Warrior Member War Room Member Join Date: Oct 2006 Location: San Francisco, CA
Posts: 3,446
Blog Entries: 11 Thanks: 749
Thanked 835 Times in 520 Posts
| Quote:
Sounds like common sense now but I hadn't really thought about it until Emmanuel let me know. So if you have links like your download page or thank you page that you don't want crawled there are thieves who will look at your robots.txt file so be careful. | |
| | ||
| | |
| | #39 |
| Hear me CLear Join Date: Sep 2010 Location: Washington
Posts: 124
Thanks: 12
Thanked 3 Times in 3 Posts
|
I have learned alot and just working on some of my website portion not to be indexed by google and other search engines.
|
| | |
| | |
| | #40 | |
| SEO Strategist War Room Member Join Date: Jun 2010
Posts: 6,532
Thanks: 355
Thanked 1,992 Times in 1,273 Posts
| Quote:
| |
| | ||
| | |
| | #41 | |
| --> SEOHelpVideos.com <-- War Room Member Join Date: Apr 2008 Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
| Quote:
Yukon, thankyou for your willingness to help, and yes, I did see the post you made a while back about using the .htaccess file, but it seems as if that code you're suggesting prevents Google from indexing certain files...not removes already- indexed URLs. | |
| | ||
| | |
![]() |
|
| Tags |
| indexed, meta, nofollow, noindex, remove |
| Thread Tools | |
| |
![]() |