Go Back   WarriorForum - Internet Marketing Forums > The Warrior Forum > Adsense / PPC / SEO Discussion Forum
Register Blogs FAQ Social Groups CalendarHelp Desk

Reply
 
LinkBack Thread Tools
Old 03-09-2011, 04:22 PM   #1
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Default I Said I DON'T Want To Be Indexed, OKAY?!

If you do NOT want Google to index a certain URL, or if Google
already indexed a certain URL of your site and you do not want
it to be listed in the search results, you simply
include the following piece of code on the page:

<meta name="robots" content="noindex,nofollow"/>



Okay, sounds simple enough...

But what if the file that is in Google's index that you
want removed just happens to be a .SWF file?!

You can't just "put" that code into a file like that.

Catch my drift?


Emmanuel Betinis is offline   Reply With Quote
Old 03-09-2011, 04:30 PM   #2
Active Warrior
 
Join Date: Dec 2010
Location: Wales
Posts: 69
Thanks: 0
Thanked 6 Times in 6 Posts
Contact Info
Send a message via Skype™ to tfos4941
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

If it has already been indexed you will have to contact Google directly. Ask them to remove the file as it breaches your copyright.
tfos4941 is offline   Reply With Quote
Old 03-09-2011, 04:48 PM   #3
HyperActive Warrior
 
Join Date: Jan 2010
Location: United Kingdom
Posts: 176
Thanks: 54
Thanked 48 Times in 36 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

For files that have not been indexed could you not put them in a password protected directory?

Have a problem with migraine? Find help here: Natural Migraine Relief.
wtatlas is offline   Reply With Quote
Old 03-09-2011, 05:09 PM   #4
HyperActive Warrior
 
Join Date: May 2009
Location: Midwest
Posts: 234
Thanks: 27
Thanked 37 Times in 28 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

This one has always helped me get great position in the SERPs:

<meta name="ROBOTS" content="position,#1">

I told myself I wasn't going to show anyone that black hat trick, but I wanted to give something back.

LastWarrior



Crowd roars and gives thanks!
LastWarrior is offline   Reply With Quote
Old 03-09-2011, 05:11 PM   #5
Jordan K
War Room Member
 
theseoguys's Avatar
 
Join Date: Aug 2010
Location: Canada
Posts: 833
Thanks: 18
Thanked 92 Times in 80 Posts
Social Networking View Member's FaceBook Profile  View Member's Twitter Profile  View Member's YouTube Profile
Contact Info
Send a message via Skype™ to theseoguys
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by LastWarrior View Post
This one has always helped me get great position in the SERPs:

<meta name="ROBOTS" content="position,#1">

I told myself I wasn't going to show anyone that black hat trick, but I wanted to give something back.

LastWarrior

Thanks for that. It will save me a ton of time, and help my clients.

theseoguys is offline   Reply With Quote
Old 03-09-2011, 07:27 PM   #6
Senior Warrior Member
 
Join Date: Jun 2009
Location: Scotland, SD
Posts: 1,251
Thanks: 42
Thanked 290 Times in 191 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

I think you could use robots.txt to do this. Google it and see. That's what I'd try.

Rich Struck is online now   Reply With Quote
Old 03-09-2011, 07:33 PM   #7
tpw
Bill Platt
War Room Member
 
tpw's Avatar
 
Join Date: Jul 2004
Location: Stillwater, Oklahoma, USA.
Posts: 7,990
Blog Entries: 4
Thanks: 6,529
Thanked 6,059 Times in 2,729 Posts
Social Networking View Member's Twitter Profile  View Member's YouTube Profile
Contact Info
Send a message via ICQ to tpw
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

tell the robots.txt to ignore it, and on all links pointing to it, use the noindex, nofollow.

tpw is online now   Reply With Quote
Old 03-09-2011, 07:46 PM   #8
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by tpw View Post
tell the robots.txt to ignore it, and on all links pointing to it, use the noindex, nofollow.

Hmmm...based on my extensive research that command can only be
used within a META tag on a webpage, but it would be great if I could
achieve the same thing (deindexing an already indexed page) by simply
controlling it from the robots.txt - but here's the HUGE problem with
doing that...even if it was possible:

Everyone else can view your robots.txt file - it's always recommended
to never have any type of private information on your robots.txt file,
so if what you want DEindexed from Google is something you don't
want anyone else to have access to, by putting it in your robots.txt
file you have good intentions, but then the directory/URL will be
able to be viewed by the public with a simple web browser by simply
seeing it listed in your robots.txt file.

Hmmm...

Emmanuel Betinis is offline   Reply With Quote
Old 03-09-2011, 07:54 PM   #9
Super HyperActive Warrior
War Room Member
 
BenoitT's Avatar
 
Join Date: Sep 2010
Location: Quebec, Canada
Posts: 473
Thanks: 21
Thanked 74 Times in 55 Posts
Social Networking View Member's Twitter Profile 
Contact Info
Send a message via Skype™ to BenoitT
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

You can tell Google to not index your file by using the file robots.txt in your root folder. You put that inside:

PHP Code:
User-Agent: * 
Disallow: /folder/file.php
Disallow
: /folder2/file2.swf 
And yes, it is safe to do it this way. Obviously if every crawlers can see it, everybody can see it. If you want something private for real, you need to put a password otherwise someone can find your page somewhere or just randomly. If you only want crawlers to not index it, it's the perfect way.

Benoit Tremblay
BenoitT is offline   Reply With Quote
Old 03-09-2011, 07:58 PM   #10
Senior Warrior Member
War Room Member
 
Dennis Gaskill's Avatar
 
Join Date: Jan 2008
Location: Wisconsin, USA.
Posts: 4,140
Blog Entries: 2
Thanks: 2,820
Thanked 3,878 Times in 1,891 Posts
Social Networking View Member's Twitter Profile 
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

I haven't been in my Google Webmaster Tools Account for a while, but I think there's a way to request delisting in there on a per page basis. I'd look at that first.

If that's not there, perhaps the easiest thing to do is move the content you don't want Google to index and put up a different page in it's place that you do want Google to index.

Dennis Gaskill is online now   Reply With Quote
Old 03-09-2011, 07:58 PM   #11
tpw
Bill Platt
War Room Member
 
tpw's Avatar
 
Join Date: Jul 2004
Location: Stillwater, Oklahoma, USA.
Posts: 7,990
Blog Entries: 4
Thanks: 6,529
Thanked 6,059 Times in 2,729 Posts
Social Networking View Member's Twitter Profile  View Member's YouTube Profile
Contact Info
Send a message via ICQ to tpw
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

I honestly think you can go into Google Webmaster Tools and tell them to de-index that page.

tpw is online now   Reply With Quote
Old 03-09-2011, 07:59 PM   #12
Senior Warrior Member
War Room Member
 
Dennis Gaskill's Avatar
 
Join Date: Jan 2008
Location: Wisconsin, USA.
Posts: 4,140
Blog Entries: 2
Thanks: 2,820
Thanked 3,878 Times in 1,891 Posts
Social Networking View Member's Twitter Profile 
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by tpw View Post
I honestly think you can go into Google Webmaster Tools and tell them to de-index that page.
Now why didn't I think of that?

Dennis Gaskill is online now   Reply With Quote
Old 03-09-2011, 08:03 PM   #13
tpw
Bill Platt
War Room Member
 
tpw's Avatar
 
Join Date: Jul 2004
Location: Stillwater, Oklahoma, USA.
Posts: 7,990
Blog Entries: 4
Thanks: 6,529
Thanked 6,059 Times in 2,729 Posts
Social Networking View Member's Twitter Profile  View Member's YouTube Profile
Contact Info
Send a message via ICQ to tpw
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Dennis Gaskill View Post
Now why didn't I think of that?
We were posting same at same time. The time stamp is the same.

I bet if the 3yo were not wallering on me, I could have beat you... LOL

tpw is online now   Reply With Quote
Old 03-09-2011, 08:06 PM   #14
SEO Strategist
War Room Member
 
yukon's Avatar
 
Join Date: Jun 2010
Posts: 6,532
Thanks: 355
Thanked 1,992 Times in 1,273 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Forget robots.txt, try this code in your .htaccess file.


Code:
RewriteRule .*.(swf|mov|avi|flv)$ hxxp://redirect_here.com/ [R,NC]
You need to change the url (hxxp://redirect_here.com) to the location on the net that you want to send the person/bot that tries to hotlink your image.

I use this code on my sites to block google from adding certain image formats (jpg, gif) to Google Images. It works perfect for images, so it should also work for video (swf).

BTW, this thread would be better in the SEO forum.

yukon is offline   Reply With Quote
Old 03-09-2011, 08:10 PM   #15
SEO Strategist
War Room Member
 
yukon's Avatar
 
Join Date: Jun 2010
Posts: 6,532
Thanks: 355
Thanked 1,992 Times in 1,273 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Also If your host is Hostgator, go to your Cpanel -> Hotlink Protection, & use that page to block your specific file formats (swf, zip, jpg, etc...)

If you do this (Cpanel) it will update your .htaccess file on it's own, so you wouldn't need to do the manual update code in my last comment above.

yukon is offline   Reply With Quote
Old 03-09-2011, 08:13 PM   #16
Senior Warrior Member
War Room Member
 
Dennis Gaskill's Avatar
 
Join Date: Jan 2008
Location: Wisconsin, USA.
Posts: 4,140
Blog Entries: 2
Thanks: 2,820
Thanked 3,878 Times in 1,891 Posts
Social Networking View Member's Twitter Profile 
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by tpw View Post
We were posting same at same time. The time stamp is the same.

I bet if the 3yo were not wallering on me, I could have beat you... LOL
For the first part of your comment...I know, but I couldn't help myself.

For the second part, I've got that puppy on my lap, making it hard to type. I'd call our obstacles about even.

Dennis Gaskill is online now   Reply With Quote
Old 03-09-2011, 09:42 PM   #17
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Lightbulb Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by BenoitT View Post
You can tell Google to not index your file by using the file robots.txt in your root folder. You put that inside:

PHP Code:
User-Agent: * 
Disallow: /folder/file.php
Disallow
: /folder2/file2.swf 
And yes, it is safe to do it this way. Obviously if every crawlers can see it, everybody can see it. If you want something private for real, you need to put a password otherwise someone can find your page somewhere or just randomly. If you only want crawlers to not index it, it's the perfect way.
Actually, your URL can still become indexed by using
that code!
It happened to me! And if you do some
research you'll find that to be true.

Really! That command does not guarantee the link
will not be indexed! It merely states to not allow the
spider to crawl the page and gather information about
the link (like the description tag) when displaying it
in the SERPs. If other pages are linking to the URL
it can still be indexed (it happened to me so I found out
firsthand from experience on this one) & it will just
appear in the SERP as a filename only with no
description below it (like what happened to me).

If you really want to prevent a URL from being indexed,
or if you want an already-indexed URL to be removed from
the search index, you must insert the following code within
the heading of your page:

<meta name="robots" content="noindex,nofollow" />

...and then of course wait until the spider comes around to
visit the page again to see the updated META tag & remove
the page from it's index appropriately...

The only problem about my situation here is that
the file I want DEindexed is actually a .SWF file
and not an HTML file that I can add that code to.

(So I added it to the INDEX page of the directory pathway that
the SWF file is contained in to see, as a test, if Google will
DEindex only the INDEX page or the INDEX page as well as
all the associated files within that directory path - time will
tell)

Emmanuel Betinis is offline   Reply With Quote
Old 03-09-2011, 09:43 PM   #18
HyperActive Warrior
War Room Member
 
Join Date: Nov 2010
Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Emmanuel Betinis View Post
If you do NOT want Google to index a certain URL, or if Google
already indexed a certain URL of your site and you do not want
it to be listed in the search results, you simply
include the following piece of code on the page:

<meta name="robots" content="noindex,nofollow"/>



Okay, sounds simple enough...

But what if the file that is in Google's index that you
want removed just happens to be a .SWF file?!

You can't just "put" that code into a file like that.

Catch my drift?


Quote:
Originally Posted by Dennis Gaskill View Post
I haven't been in my Google Webmaster Tools Account for a while, but I think there's a way to request delisting in there on a per page basis. I'd look at that first.
Yes, you can request the page removal in Google Webmaster Tools. I just did that a week ago, and Google removed the page within a few hours. Here are the instructions:

If you own the site

Verify your ownership of the site in Webmaster Tools.
On the Webmaster Tools home page, click the site you want.
On the Dashboard, click Site configuration in the left-hand navigation.
Click Crawler access, and then click Remove URL.
Click New removal request.
Type the URL of the page you want removed from search results (not the Google search results URL or cached page URL), and then click Continue. How to find the right URL. Note that the URL is case-sensitive—you will need to submit the URL using exactly the same characters and the same capitalization that the site uses.
Click Remove page from search results and cache.
Select the checkbox to confirm that you have completed the requirements listed in this article, and then click Submit Request.

Make sure you choose "Remove page and cache" in the dropdown.


Quote:
Originally Posted by Emmanuel Betinis View Post
Hmmm...based on my extensive research that command can only be
used within a META tag on a webpage, but it would be great if I could
achieve the same thing (deindexing an already indexed page) by simply
controlling it from the robots.txt - but here's the HUGE problem with
doing that...even if it was possible:

Everyone else can view your robots.txt file - it's always recommended
to never have any type of private information on your robots.txt file,
so if what you want DEindexed from Google is something you don't
want anyone else to have access to, by putting it in your robots.txt
file you have good intentions, but then the directory/URL will be
able to be viewed by the public with a simple web browser by simply
seeing it listed in your robots.txt file.

Hmmm...
Trying to de-index it with a robots.txt might be a challenge. Google might or might not do that.

Regarding the public nature of the robots.txt: You could create a folder for only those files that you don't want indexed, then disallow that folder in the robots.txt, so the file names are not shown in the robots.txt file, although, I don't know whether the filenames could be viewed by typing in the domainname.com/yourdisallowedfoldername.

You could also create a subdomain, and put all files under that you don't want indexed. Google handles subdomains separately from the main domain. You would put a robots.txt file under the subdomain that disallows indexing the whole subdomain. That way you don't have to state any folder names or file names in the robots.txt, so even though the robots.txt is public, nobody will know the folder names or file names under the subdomain.

The big thing about the noindex meta tag, the robots.txt, and restricting access or indexing through the .htaccess file is that they are advisory--not obligatory. Search engines are NOT obligated or required to obey the instructions in either of those files or the meta tag. Google respects, and obeys those instructions, and Yahoo does too. But there are other, and many smaller search engines that don't.

So, even if you have your file removed from Google search results, it doesn't mean that it's not in the search results of other search engines.

If you want a file protected from indexing, why not put it in a different folder in your cpanel that's not under public_html
halmo is offline   Reply With Quote
Old 03-09-2011, 09:51 PM   #19
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by tpw View Post
I honestly think you can go into Google Webmaster Tools and tell them to de-index that page.

Right...

And it's here: https://www.google.com/webmasters/tools/removals

But the only problem with that is after putting in the URL and
clicking Continue...




...it doesn't seem like any of those options seem to fit. It seems as if
those options are all geared towards the non-website-owner (I accessed
this tools after already being logged in from my Webmaster Tools account).

Emmanuel Betinis is offline   Reply With Quote
Old 03-09-2011, 09:57 PM   #20
HyperActive Warrior
War Room Member
 
Join Date: Nov 2010
Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Emmanuel Betinis View Post
Right...

And it's here: https://www.google.com/webmasters/tools/removals

But the only problem with that is after putting in the URL and
clicking Continue...




...it doesn't seem like any of those options seem to fit. It seems as if
those options are all geared towards the non-website-owner (I accessed
this tools after already being logged in from my Webmaster Tools account).
Emmanual, did you verify your website in Google Webmaster Tools? If not, Google might think that you are not the owner, and that's the reason you are getting that dropdown you posted here.

I did this a week ago, and Google removed the page within hours. I posted instructions right before your last post. See there. You should be able to get the page removed.
halmo is offline   Reply With Quote
Old 03-09-2011, 09:57 PM   #21
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by halmo View Post
Regarding the public nature of the robots.txt: You could create a folder for only those files that you don't want indexed, then disallow that folder in the robots.txt, so the file names are not shown in the robots.txt file, although, I don't know whether the filenames could be viewed by typing in the domainname.com/yourdisallowedfoldername.

Exactly. And this is another thing I learned today. Those files actually
CAN be viewed IF you do not have an INDEX page for that directory
AND in your cPanel (in HostGator anyway) in the "Index Manager"
by default the setting is to display all the files! You must manually
change this for the domain and select "No Indexing" which will
THEN generate the "forbidden no access" page anytime someone
accesses the directory with no index page.




Emmanuel Betinis is offline   Reply With Quote
Old 03-09-2011, 10:02 PM   #22
HyperActive Warrior
War Room Member
 
Join Date: Nov 2010
Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Emmanuel Betinis View Post
Exactly. And this is another thing I learned today. Those files actually
CAN be viewed IF you do not have an INDEX page for that directory
AND in your cPanel (in HostGator anyway) in the "Index Manager"
by default the setting is to display all the files! You must manually
change this for the domain and select "No Indexing" which will
THEN generate the "forbidden no access" page anytime someone
accesses the directory with no index page.



Didn't know you could do this. Thanks. You could also put a blank index file under the directory--I have done that.
halmo is offline   Reply With Quote
Old 03-09-2011, 10:09 PM   #23
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Lightbulb Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by halmo View Post
Verify your ownership of the site in Webmaster Tools.
On the Webmaster Tools home page, click the site you want.
On the Dashboard, click Site configuration in the left-hand navigation.
Click Crawler access, and then click Remove URL.
Click New removal request.
Type the URL of the page you want removed from search results (not the Google search results URL or cached page URL), and then click Continue. How to find the right URL. Note that the URL is case-sensitive—you will need to submit the URL using exactly the same characters and the same capitalization that the site uses.
Click Remove page from search results and cache.
Select the checkbox to confirm that you have completed the requirements listed in this article, and then click Submit Request.

Make sure you choose "Remove page and cache" in the dropdown.

Thanks. I started to do that but then the checkbox below it...



...it sounds like I need to either place a block on robots.txt (which I'm
NOT going to do since I don't want it listed there) or place a NOINDEX
META tag in the file (which I mentioned earlier...this is a SWF file and I
can not place a META tag in a SWF file!)...

So since I can't do either one of the two, I can't check the box, thus
can't move forward with this type of DEindexing option.

Thankyou for the explanation though! It's very useful to know about
in the future if I ever need to DEindex an actual HTML page, but as for
a SWF page...perhaps I'll wait it out and see if the NOINDEX META tag
I placed in the INDEX page of the directory the SWF is contained in
will work. It'd be good to know that anyway.

Emmanuel Betinis is offline   Reply With Quote
Old 03-09-2011, 10:09 PM   #24
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by halmo View Post
Didn't know you could do this. Thanks. You could also put a blank index file under the directory--I have done that.

Yep exactly. Or you could be "super safe" and do both.

(blank index file AND change 'Index Manager' settings to 'No Indexing')

Emmanuel Betinis is offline   Reply With Quote
Old 03-09-2011, 10:32 PM   #25
HyperActive Warrior
War Room Member
 
Join Date: Nov 2010
Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Emmanuel Betinis View Post
Thanks. I started to do that but then the checkbox below it...



...it sounds like I need to either place a block on robots.txt (which I'm
NOT going to do since I don't want it listed there) or place a NOINDEX
META tag in the file (which I mentioned earlier...this is a SWF file and I
can not place a META tag in a SWF file!)...

So since I can't do either one of the two, I can't check the box, thus
can't move forward with this type of DEindexing option.

Thankyou for the explanation though! It's very useful to know about
in the future if I ever need to DEindex an actual HTML page, but as for
a SWF page...perhaps I'll wait it out and see if the NOINDEX META tag
I placed in the INDEX page of the directory the SWF is contained in
will work. It'd be good to know that anyway.
Yes, you need to do that to check the box. Could you password protect your file, and then put in in the robots.txt to disallow it? It would be there for only a few hours until Google de-indexed the page, then you could remove the reference from the robots.txt. I would rather do that than having it indexed, but that's only my opinion. What are the chances that someone looks up your robots.txt in the next few hours (hopefully Google wouldnt take long), and what are the chances that someone sees the file if you leave it indexed (without any time limit)? I am not trying to talk you into it, but that's how I see it (even without password protecting it).

I have to say, I am not familiar with SWF files..

Didn't Google give another option to block it? I thought there were three (you mentioned two), but I might not remember right.
halmo is offline   Reply With Quote
Old 03-09-2011, 10:34 PM   #26
HyperActive Warrior
War Room Member
 
Join Date: Nov 2010
Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Emmanuel Betinis View Post
Yep exactly. Or you could be "super safe" and do both.

(blank index file AND change 'Index Manager' settings to 'No Indexing')
Yes, that's what I will do. Thanks.

And I added something about subdomains to my original post. Did you see that?
halmo is offline   Reply With Quote
Old 03-09-2011, 10:38 PM   #27
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by halmo View Post
Didn't Google give another option to block it? I thought there were three (you mentioned two), but I might not remember right.

Yes, there is a third option (see the checkbox image above) and that
is that it returns a "404 error", which doesn't apply either since it does
NOT return a 404 error (it loads fine).

Emmanuel Betinis is offline   Reply With Quote
Old 03-09-2011, 10:40 PM   #28
HyperActive Warrior
War Room Member
 
Join Date: Nov 2010
Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Emmanuel Betinis View Post
Thanks. I started to do that but then the checkbox below it...



...it sounds like I need to either place a block on robots.txt (which I'm
NOT going to do since I don't want it listed there) or place a NOINDEX
META tag in the file (which I mentioned earlier...this is a SWF file and I
can not place a META tag in a SWF file!)...

So since I can't do either one of the two, I can't check the box, thus
can't move forward with this type of DEindexing option.
Just realized, I think the third option I was thinking is the 404 error. (as Google states by the checkbox) If you remove the file from its current location, typing in its current URL will return a 404 error, won't it? That might just solve your problem. Easy.
halmo is offline   Reply With Quote
Old 03-09-2011, 10:44 PM   #29
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by halmo View Post
Yes, that's what I will do. Thanks.

And I added something about subdomains to my original post. Did you see that?

Yes, I did, and thankyou for the suggestion, however, as mentioned
earlier in this thread, the DISALLOW command in robots.txt does NOT
prevent files from becoming indexed (only crawled).

I'll paste it again:

Quote:
Originally Posted by Emmanuel Betinis

Actually, your URL can still become indexed by using
that code!
It happened to me! And if you do some
research you'll find that to be true.

Really! That command does not guarantee the link
will not be indexed! It merely states to not allow the
spider to crawl the page and gather information about
the link (like the description tag) when displaying it
in the SERPs. If other pages are linking to the URL
it can still be indexed (it happened to me so I found out
firsthand from experience on this one) & it will just
appear in the SERP as a filename only with no
description below it (like what happened to me).

If you really want to prevent a URL from being indexed,
or if you want an already-indexed URL to be removed from
the search index, you must insert the following code within
the heading of your page:

<meta name="robots" content="noindex,nofollow" />

...and then of course wait until the spider comes around to
visit the page again to see the updated META tag & remove
the page from it's index appropriately...

The only problem about my situation here is that
the file I want DEindexed is actually a .SWF file
and not an HTML file that I can add that code to.

(So I added it to the INDEX page of the directory pathway that
the SWF file is contained in to see, as a test, if Google will
DEindex only the INDEX page or the INDEX page as well as
all the associated files within that directory path - time will
tell)

Emmanuel Betinis is offline   Reply With Quote
Old 03-09-2011, 10:45 PM   #30
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by halmo View Post
Just realized, I think the third option I was thinking is the 404 error. (as Google states by the checkbox) If you remove the file from its current location, typing in its current URL will return a 404 error, won't it? That might just solve your problem. Easy.

Yes, that will work, but what if I needed to keep the same location of the file?

I'm curious as to how else one would get a SWF URL removed from Google's index
without having to change the filename (which is why I'm going to wait this out a few
days to see if the META tag I put in the INDEX page of the directory the SWF file
is contained in works to remove the URL of the SWF - if it doesn't then I'll just
end up using the 404 option).

Thanks for the input!

Emmanuel Betinis is offline   Reply With Quote
Old 03-09-2011, 11:16 PM   #31
HyperActive Warrior
War Room Member
 
Join Date: Nov 2010
Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Emmanuel Betinis View Post
Yes, I did, and thankyou for the suggestion, however, as mentioned
earlier in this thread, the DISALLOW command in robots.txt does NOT
prevent files from becoming indexed (only crawled).

I'll paste it again:
I would think that if it's not crawled, it's not indexed either (how could the bot get the content of the page if it's not crawled?), but I don't know. However, it has to crawl the site to see if there are any restrictions to be crawled. First goes to see if there is a sitemap, if not, it goes to see if there is a robots.txt, if not, it goes to see if there is a meta tag on each page.

But you are correct, none of these will CERTAINLY prevent indexing because, as I said, these instruction are not obligatory for the search engines.
halmo is offline   Reply With Quote
Old 03-09-2011, 11:27 PM   #32
HyperActive Warrior
War Room Member
 
Join Date: Nov 2010
Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Emmanuel Betinis View Post
Yes, that will work, but what if I needed to keep the same location of the file?

Thanks for the input!
You are welcome!

Then you put the file back after it gets deindexed. (But as long as it's under public_html, it might be indexed again even if you "restrict" it with robots.txt or any of the other methods because search engines are NOT obligated to obey it (as stated before). You might also want to check if Yahoo indexed it. I don't know how you would remove it from there.

You could put the file in a frame. Search engines can't read (or index) frames (as far as I know. I don't know the specifics, but you might want to do a search about it). Again, I don't know anything about SWF files, so I don't know if the frame would be appropriate.


Quote:
Originally Posted by Emmanuel Betinis View Post

I'm curious as to how else one would get a SWF URL removed from Google's index
without having to change the filename (which is why I'm going to wait this out a few
days to see if the META tag I put in the INDEX page of the directory the SWF file
is contained in works to remove the URL of the SWF - if it doesn't then I'll just
end up using the 404 option).

Thanks for the input!
I wouldn't think that the meta tag in the index page would do any good for another file, but I don't know. I thought the meta tags in a file refer only to that file. Why don't you want to remove the file from it's current location for a few hours? Are your customers accessing it?
halmo is offline   Reply With Quote
Old 03-10-2011, 12:28 AM   #33
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Lightbulb Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by halmo View Post
I would think that if it's not crawled, it's not indexed either (how could the bot get the content of the page if it's not crawled?), but I don't know.
I thought the same thing at first...but...

...a page does not have to be crawled in order to be indexed! If Google
sees a URL being referenced by anchor text, it can still include the URL
in the index. It will not be crawled (the page will not be fetched) so it
will not have the description tag with it, and will appear as a URL only
(as it did in my case).


Emmanuel Betinis is offline   Reply With Quote
Old 03-10-2011, 12:33 AM   #34
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by halmo View Post
I wouldn't think that the meta tag in the index page would do any good for another file, but I don't know. I thought the meta tags in a file refer only to that file.
Right...but other than the Google URL Removal Form I'm not sure
of what other option I have for a SWF file.


Quote:
Originally Posted by halmo View Post
Why don't you want to remove the file from it's current location for a few hours? Are your customers accessing it?
Yes, customers are accessing it...but that'd be easy enough to change.
It's primarily because I'd like to figure out a way to remove a SWF URL
from Google's index without having to delete it's original location...just
to know how it's done in case it ever gets asked in the future or if I'm
ever in a similar situation in the future.

Emmanuel Betinis is offline   Reply With Quote
Old 03-10-2011, 01:31 AM   #35
HyperActive Warrior
War Room Member
 
Join Date: Nov 2010
Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Emmanuel Betinis View Post
I thought the same thing at first...but...

...a page does not have to be crawled in order to be indexed! If Google
sees a URL being referenced by anchor text, it can still include the URL
in the index. It will not be crawled (the page will not be fetched) so it
will not have the description tag with it, and will appear as a URL only
(as it did in my case).

Good point.
halmo is offline   Reply With Quote
Old 03-10-2011, 01:42 AM   #36
HyperActive Warrior
War Room Member
 
Join Date: Nov 2010
Location: USA
Posts: 173
Thanks: 52
Thanked 22 Times in 15 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Emmanuel Betinis View Post
Right...but other than the Google URL Removal Form I'm not sure
of what other option I have for a SWF file.




Yes, customers are accessing it...but that'd be easy enough to change.
It's primarily because I'd like to figure out a way to remove a SWF URL
from Google's index without having to delete it's original location...just
to know how it's done in case it ever gets asked in the future or if I'm
ever in a similar situation in the future.
If you have the file under a directory (not directly under the main domain), you could put this in the robots.txt to exclude everything under that directory (maybe Google would deindex it this way). Change "directory 1" to your directory name.:

User-agent: *
Disallow: /directory 1/

I can't think of any other way. Hope you will succeed.



Interestingly, Google just deindexed a page on my site that I didn't want deindexed. I have the follow tag in the file, and the robots.txt doesn't restrict it either. I would like to know why they did that.
halmo is offline   Reply With Quote
Old 03-10-2011, 01:48 AM   #37
HyperActive Warrior
 
Join Date: May 2010
Posts: 123
Thanks: 3
Thanked 5 Times in 5 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Sometimes the robot disobeys the rule and as a result page gets indexed.For better results contact Google directly.
seoindiaforu is offline   Reply With Quote
Old 03-12-2011, 09:40 AM   #38
Senior Warrior Member
War Room Member
 
Alan Petersen's Avatar
 
Join Date: Oct 2006
Location: San Francisco, CA
Posts: 3,446
Blog Entries: 11
Thanks: 749
Thanked 835 Times in 520 Posts
Social Networking View Member's Twitter Profile  View Member's YouTube Profile
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by BenoitT View Post
You can tell Google to not index your file by using the file robots.txt in your root folder. You put that inside:

PHP Code:
User-Agent: * 
Disallow: /folder/file.php
Disallow
: /folder2/file2.swf 
And yes, it is safe to do it this way. Obviously if every crawlers can see it, everybody can see it. If you want something private for real, you need to put a password otherwise someone can find your page somewhere or just randomly. If you only want crawlers to not index it, it's the perfect way.
And the big warning here is that there are shoddy characters out there who will type in YourURL.com/robots.txt to try and access that kind of stuff.

Sounds like common sense now but I hadn't really thought about it until Emmanuel let me know.

So if you have links like your download page or thank you page that you don't want crawled there are thieves who will look at your robots.txt file so be careful.

Alan Petersen is offline   Reply With Quote
Old 03-12-2011, 09:51 AM   #39
Hear me CLear
 
Join Date: Sep 2010
Location: Washington
Posts: 124
Thanks: 12
Thanked 3 Times in 3 Posts
Contact Info
Send a message via MSN to hearme
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

I have learned alot and just working on some of my website portion not to be indexed by google and other search engines.

hearme is offline   Reply With Quote
Old 03-12-2011, 12:18 PM   #40
SEO Strategist
War Room Member
 
yukon's Avatar
 
Join Date: Jun 2010
Posts: 6,532
Thanks: 355
Thanked 1,992 Times in 1,273 Posts
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by Alan Petersen View Post
And the big warning here is that there are shoddy characters out there who will type in YourURL.com/robots.txt to try and access that kind of stuff.

Sounds like common sense now but I hadn't really thought about it until Emmanuel let me know.

So if you have links like your download page or thank you page that you don't want crawled there are thieves who will look at your robots.txt file so be careful.
That's why you use the .htaccess file, instead of robots.txt like I posted above.

yukon is offline   Reply With Quote
Old 03-12-2011, 09:20 PM   #41
--> SEOHelpVideos.com <--
War Room Member
 
Emmanuel Betinis's Avatar
 
Join Date: Apr 2008
Location: Minnesota, USA
Posts: 730
Thanks: 74
Thanked 46 Times in 39 Posts
Social Networking View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Emmanuel Betinis
Default Re: I Said I DON'T Want To Be Indexed, OKAY?!

Quote:
Originally Posted by yukon View Post
That's why you use the .htaccess file, instead of robots.txt like I posted above.

Yukon,

thankyou for your willingness to help, and yes, I did see the
post you made a while back about using the .htaccess file,
but it seems as if that code you're suggesting prevents
Google from indexing
certain files...not removes already-
indexed URLs.

Emmanuel Betinis is offline   Reply With Quote
Reply

  WarriorForum - Internet Marketing Forums > The Warrior Forum > Adsense / PPC / SEO Discussion Forum

Tags
indexed, meta, nofollow, noindex, remove

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off



All times are GMT -6. The time now is 05:04 AM.