Go Back   WarriorForum - Internet Marketing Forums > Warrior Support Forums > Website Design
Register Blogs FAQ Social Groups CalendarHelp Desk

Reply
 
LinkBack Thread Tools
Old 01-24-2009, 12:35 AM   #1
Active Warrior
War Room Member
 
Join Date: Jul 2008
Location: , , Bahamas.
Posts: 34
Thanks: 1
Thanked 1 Time in 1 Post
Default Unknown bots

Hey everyone just a quick question. Should I let unknown bots crawl my site? There are like 5 different bots with names like unknown robot or bots. What robots besides the obvious ones like google, yahoo and the others should I put in my robots.txt file? Any help would be appreciated.
DArmbrister is offline   Reply With Quote
Old 01-24-2009, 12:52 AM   #2
Advanced Warrior
War Room Member
 
Bruce Hearder's Avatar
 
Join Date: May 2004
Location: Perth, Australia.
Posts: 717
Thanks: 4
Thanked 182 Times in 138 Posts
Social Networking View Member's Twitter Profile  View Member's YouTube Profile
Contact Info
Send a message via Skype™ to Bruce Hearder
Default Re: Unknown bots

I would strongly suggest again blocking these unknown bots are they are most likley part of the major Search Engines(SEs) checking your site.

There is an increasing number of websites that are now using clocking to artifically increase their search engine rankings. The search engines (especially BigG) now have implemented a range of other bots that come from different IP addresses, and don't identify themselves as coming from Google at all.

IP Cloacking works like this :- a bot visits a website, the website determines from it IP address that its a bot, and so it gives it a bunch of keyword rich text to spider and index.
A human visits the site, the site determined that the visitor is not from a search engine, and now redirect the human visitor to another website (usually an affiliate page).

So the SEs are now trying to find these pages, by sending in bots that look and behave as humans, and others that have no distinguishing details at all. They want to see if the content they see is substantially different from their previous visit. If so, then the site may come up for a human review.

So, my recommendation is don't block these bots, is you have nothing to hide..

Hope this helps

Bruce

-----------------
Get Your Backlinks indexed quicker at BackLinks2RSS

Create Full Text Feeds from Partial RSS Feeds at FeedExpander.com. See the WarriorForum post about it here
Bruce Hearder is offline   Reply With Quote
Old 01-24-2009, 12:34 PM   #3
Active Warrior
War Room Member
 
Join Date: Jul 2008
Location: , , Bahamas.
Posts: 34
Thanks: 1
Thanked 1 Time in 1 Post
Default Re: Unknown bots

Thanks for the help Bruce
DArmbrister is offline   Reply With Quote
Old 02-06-2009, 11:04 AM   #4
HyperActive Warrior
 
Join Date: May 2008
Location: USA
Posts: 249
Blog Entries: 22
Thanks: 9
Thanked 29 Times in 27 Posts
Default Re: Unknown bots

What Bruce mentioned is valid, but I just wanted to offer another point of view. In the beginning I didn't care who came by my websites, and I was happy to have the visitors. True a lot of the automated robots (or bots) were related to the search engines, but in the last few years and months I started to see an increasing number of unknown sources. These weren't random, they were hitting the server relentlessly. So I took some advice from a web development company who was fed up with these suspicious connections, and decided to implement a long list for robots.txt, added some rules to .htaccess, and started monitoring everything with a web application firewall called mod_security. Why? Because of the following benefits, which I'm sure you've seen on other websites like botsense.com,
  • Reduced bandwidth costs
  • Reduced server load from illegitimate traffic
  • Stop email scrapers
  • Stop image scrapers
  • Stop copy scrapers
  • Stop snoopers!
So it is entirely your call. I decided to stop allowing connections I did not understand. If we have nothing to hide, then why do we allow bots to hide themselves and their potentially malicious intentions? I make exceptions if I can fully understand who the source is, why they need the connection to me, and exactly what they are doing to my server.

Sorry to sound negative, but with some of the security issues I've dealt with, it becomes an advantage to take a defensive position to protect myself and my client's assets.

awesometbn is offline   Reply With Quote
Old 02-06-2009, 12:06 PM   #5
Active Warrior
 
Join Date: Jan 2009
Posts: 32
Thanks: 1
Thanked 3 Times in 2 Posts
Lightbulb Re: Unknown bots

Just take a look at some recent trends:

Google 50%
Yahoo 23%
MSN 10%
AOL 6%
ASK 2%
Others 9%


So as you can see the another 9% do not make a big difference, with exeption of Alexa. Then is a very good idea to allow the "well know" spiders and block the rest.

netbie is offline   Reply With Quote
Reply

  WarriorForum - Internet Marketing Forums > Warrior Support Forums > Website Design

Tags
bots, unknown

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off



All times are GMT -6. The time now is 05:30 AM.