Go Back   WarriorForum - Internet Marketing Forums > The Warrior Forum > Adsense / PPC / SEO Discussion Forum
Register Blogs FAQ Social Groups CalendarHelp Desk

Reply
 
LinkBack Thread Tools
Old 04-30-2011, 09:56 PM   #1
HyperActive Warrior
 
Join Date: Mar 2011
Posts: 122
Thanks: 6
Thanked 9 Times in 9 Posts
Default Robots.txt

So I've been using wordpress for much of my work lately and just noticed that the robots.txt is by default
User-agent: * Disallow:

Wouldn't this prevent search engines from indexing my site? Also I can't find where this file is?

Any help would be greatly appreciated.

tyang is offline   Reply With Quote
Old 04-30-2011, 09:59 PM   #2
.
War Room Member
 
Join Date: Sep 2007
Posts: 1,167
Thanks: 697
Thanked 595 Times in 421 Posts
Default Re: Robots.txt

I believe it's in your dashboard settings where you can find
the little check boxes that take care of that for you.

You can block all search engines, allow, etc.


Ken
KenThompson is offline   Reply With Quote
Old 04-30-2011, 10:05 PM   #3
HyperActive Warrior
 
Join Date: Mar 2011
Posts: 122
Thanks: 6
Thanked 9 Times in 9 Posts
Default Re: Robots.txt

Thanks Ken, I do have that option checked under the privacy settings. Just not sure why when I type robots.txt after my domain is says Disallow. Maybe not something to worry about?

tyang is offline   Reply With Quote
Old 04-30-2011, 10:09 PM   #4
.
War Room Member
 
Join Date: Sep 2007
Posts: 1,167
Thanks: 697
Thanked 595 Times in 421 Posts
Default Re: Robots.txt

Well that's interesting. I've dabbled with robots.txt files but not
too often.

Maybe Istvan will spot this and lend some words of wisdom. He's
pretty "up" on WP and blogs.

There is a robots.txt plugin you can get, set up and see what effect
that has. But, Istvan will yell at me for suggesting you get yet another
plugin. lol


Ken
KenThompson is offline   Reply With Quote
Old 04-30-2011, 10:15 PM   #5
HyperActive Warrior
 
Join Date: Mar 2011
Posts: 122
Thanks: 6
Thanked 9 Times in 9 Posts
Default Re: Robots.txt

Appreciate your help, from digging around some more, it looks like

User-agent: * Disallow:

actually allows search engine. If you want to disallow a search engine you would put it after the ":" Go figure

tyang is offline   Reply With Quote
Old 04-30-2011, 10:17 PM   #6
.
War Room Member
 
Join Date: Sep 2007
Posts: 1,167
Thanks: 697
Thanked 595 Times in 421 Posts
Default Re: Robots.txt

Quote:
Originally Posted by tyang View Post
Appreciate your help, from digging around some more, it looks like

User-agent: * Disallow:

actually allows search engine. If you want to disallow a search engine you would put it after the ":" Go figure
Oh well of course! LOL

I knew that, was just testing you.


Ken
KenThompson is offline   Reply With Quote
Old 04-30-2011, 10:29 PM   #7
Watching you...
War Room Member
 
Istvan Horvath's Avatar
 
Join Date: Dec 2008
Location: Waterdown, Ontario, Canada
Posts: 6,498
Blog Entries: 2
Thanks: 1,757
Thanked 3,047 Times in 1,816 Posts
Social Networking View Member's FaceBook Profile  View Member's Twitter Profile  View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Istvan Horvath
Default Re: Robots.txt

Quote:
Originally Posted by tyang View Post
just noticed that the robots.txt is by default
User-agent: * Disallow:
Quote:
Originally Posted by tyang View Post
Thanks Ken, I do have that option checked under the privacy settings.
So, then WHY are you surprized that it says: *Disallow???

If this is checked - I would like to block search engines, but allow normal visitors AND you add /robots.txt after your blogs URL - the disallow text will appear.

If you change the Privacy settings to - I would like my site to be visible to everyone, including search engines (like Google, Bing, Technorati) and archivers AND you add the /robots.txt after your URL... you will get a 404.

Provided your blog is set up normally...

Everything else is false

Istvan Horvath is offline   Reply With Quote
Old 04-30-2011, 10:36 PM   #8
Need a Website?Contact Me
 
Join Date: Feb 2011
Posts: 102
Thanks: 2
Thanked 15 Times in 11 Posts
Default Re: Robots.txt

Don't Trust the Robots. You've seen Terminator right?

no no - keyword your site is all you need.

If you need a website, something cool, slick, and affordable - something with built in aweber, hosting included, extremely easy to use. Unlimited pages, Drag and drop functionality. E-mail me. vanfenix1 at gmail.com. || I'll set you up a site for 15 days to test out || Squeeze pages? no problem. Lightweight E-commerce? easy as pie. My Websites are not like you've ever seen. Try it today FREE!
Vanfenix is offline   Reply With Quote
Old 04-30-2011, 11:02 PM   #9
HyperActive Warrior
 
Join Date: Mar 2011
Posts: 122
Thanks: 6
Thanked 9 Times in 9 Posts
Default Re: Robots.txt

Quote:
Originally Posted by Istvan Horvath View Post
So, then WHY are you surprized that it says: *Disallow???

If this is checked - I would like to block search engines, but allow normal visitors AND you add /robots.txt after your blogs URL - the disallow text will appear.

If you change the Privacy settings to - I would like my site to be visible to everyone, including search engines (like Google, Bing, Technorati) and archivers AND you add the /robots.txt after your URL... you will get a 404.

Provided your blog is set up normally...

Everything else is false
I was surprised by the disallow because there is a option for allow:, I didn't realize at the time that the disallow with the ":" is in fact allowing the search engines to spider my site. I Just have a site sitting for over three weeks without being index and I'm losing my mind checking everything. Thanks a lot for the reassurance, I've been a web designer for 4 years and never used robots.txt

tyang is offline   Reply With Quote
Reply

  WarriorForum - Internet Marketing Forums > The Warrior Forum > Adsense / PPC / SEO Discussion Forum

Tags
robotstxt

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off



All times are GMT -6. The time now is 04:09 AM.