Only a small percentage of websites get indexed

by JMac
4 replies
I found this snippet and thought it was interesting. I had no idea that so little of what is on the internet is actually indexed. I keep seeing people suggest that you can just make backlinks and the search engines will eventually find them, but in my experience this is false. I purchased 200 forum profile backlinks in December 2009 and then forgot about them. I recently checked: 149 of those links are still active, but only 12 of them (about 8 percent) show up as indexed.

Here it is:
How Much of the Web is Indexed by Search Engines? (Excerpt from Search Engine Marketing Inc)
It sounds easy. Spiders visit the pages and send them to the search index. Those little spiders keep crawling until they index the entire Web, right? Wrong. The truth is that the great majority of Web pages are not indexed in search engines. Over the years, there have been many estimates of the gap between the number of indexed pages and all Web pages. In 1999, Lawrence and Giles found that the (now defunct) Northern Light search engine indexed just 16 percent of the estimated 800 million publicly available Web pages (Searching the World Wide Web by Steve Lawrence and C. Lee Giles, 1999). But the next year, Michael Dahn claimed the problem might be twice as bad as reported, because the "publicly available" Web may underestimate the total Web by half (Counting Angels on a Pinhead: Critically Interpreting Web Size Estimates by Michael Dahn, 2000).
Not to be outdone, in 2001 two studies estimated the total number of Web pages to be far larger than previously reported. Sherman and Price proclaimed the "Invisible Web is between 2 and 50 times larger than the visible Web" (The Invisible Web by Chris Sherman and Gary Price, p. 82, 2001). In Deep Content: Surfacing Hidden Value (BrightPlanet, 2001), Michael Bergman posited the Web contains 550 billion pages and search engines see only 0.03 percent of them.
Regardless of the wildly divergent numbers, the point is that an enormous number of pages are not indexed, and your Web site probably contains some of them. Each page on your site that is not indexed is completely invisible to searchers, which reduces traffic to your site, so your goal is to get as many indexed as possible.
  • King Shiloh
    This is shocking! So, we are wasting our energy and time?
    • Bill Farnham
      Originally posted by King Shiloh:
      This is shocking! So, we are wasting our energy and time?

      Apparently, and quite often it seems... :rolleyes:

      ~Bill
  • Kael41
    No, this isn't shocking. If you're an old-time SEO guy, then you remember the days of following the Inktomi spiders and wondering if your pages were going to be included in Yahoo.

    There is a rhyme and reason to how the bots and spiders see your pages and then include them in one of their indexes. The profile backlinks are a great example. Just because you build something, it doesn't mean the search engines will find it. Google has always touted page and link structure as the way to maximize Googlebot's ability to see your site and thus index it. Ever wonder why sitemaps are so powerful? Because they give the spiders and bots an entry point and a path through your whole site...

    Now, when you build a deep profile link, it's a no-brainer why these little links do NOT get recognized most of the time. Set-and-forget deep links and profile links rarely work anymore. You need to give the bots and spiders a way into those links before they can be registered in the index.
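
As a rough illustration of the sitemap point above, here is a minimal sketch that writes a sitemap in the sitemaps.org XML format for pages on a site you control. The function name `build_sitemap` and the example.com URLs are made up for illustration, and listing a page in a sitemap only tells crawlers that it exists; it does not guarantee the page gets indexed.

```python
from xml.etree import ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemap XML string listing each URL once, in input order.

    Hypothetical helper for illustration only; not part of any library.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    seen = set()
    for url in urls:
        if url in seen:  # skip duplicate entries
            continue
        seen.add(url)
        entry = ET.SubElement(urlset, "url")
        # ElementTree escapes characters such as "&" when serializing.
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


# Placeholder URLs: swap in the pages of your own site.
pages = [
    "https://example.com/",
    "https://example.com/articles/deep-page",
]
sitemap_xml = build_sitemap(pages)
print(sitemap_xml)
```

A sitemap can only list pages on your own site, so it does nothing directly for a profile page hosted on someone else's forum. For those, the "way in" has to be a link from a page the spiders already crawl.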
    • 4morereferrals
      Yes sir indeedy ...

      Another Warrior and I did a pretty good control group study on this very issue, then set about creating the solution, and I feel we have accomplished that mission.

      We found that it's not just the lowly forum profile links that get left to rot by our friendly Google Bot. In fact, our testing indicates that even the vaunted WordPress blog posts on the article networks get tossed aside once they roll off that uber-PR home page of your (or someone else's) blog ...

      think - UAW - SEO Linkvine - Linkvana et al ...

      What's even worse about the blog post issue: your site gets a rapid influx of backlinks from these article-directory type sites, then 15 minutes after your post goes up, hordes of other customers' posts hit the front page and yours rolls off the all-important page one into the blog's archives.

      G Bot or Yahoo Slurp goes back to check the homepage URL where it initially found your links and, oops, they're not there any longer. RAPID LINK DECAY: those links are removed from your backlink profile just as fast as they were added. And that, quite frankly, sucks worse than if Google never saw them to begin with.

      I wrote a 3-part email series on this for my VIPs, and it was pretty popular. People love screenshots and colorful graphs, as well as a "Backlinkers' Funniest Screenshots" section :-) The Good, The Bad and The Ugly. I'm rewriting the report this morning as a less detailed, quick-overview type report. FREE of course ... It's called The UgLY Truth About Your Backlinks.

      We followed two control groups of backlinks on profiles and blogs, 150 links each, and studied their indexation rates and departure from the index for 21 days total. Even though I thought I knew there would be a significant difference in the indexed/cached results between doing nothing and "super-uber-top-secret-automated-processes", we were pretty blown away at what Google Bot is and isn't doing, as well as how G Bot responds to certain "stimuli" ...

      More than 80% of the links tested that were left to be found naturally by G Bot never were (if we gauge that by what's cached in their index).

      Half of these links were on optimized WP blogs too, not just "spammy" forum profiles.
