How To Detect Duplicate Content Within A Website?

6 replies
I am trying to help a client figure out whether his site contains any duplicate content. He has a number of articles on the site and wants to make sure he hasn't posted the same content more than once.

Can anyone guide me towards any tools that can help detect duplicate content within a website?
#content #detect #duplicate #website
  • Alexa Smith
    Banned
    I think it's unlikely to be a problem if he's so unaware of it that he "needs to find it". Problems normally arise only when the same file is duplicated, or when substantial amounts of text are repeated across different files, and even then only if Google decides it was done in a deliberate attempt to "game their algorithm" (as they put it). Having said all that, there may well be an easy software-based answer to your exact question; I just don't know of one.
    • vampiro
      Hi! I'm not sure if this is what you're looking for, but it's worth a try.

      Duplicate Checker

      Signature
      - V - A - M - P - I - R - O -
      • JohnMcCabe
        One thing you may want to look at is the ratio of "content" (the article text, etc.) to "overhead".

        One of the Stompernet guys had a video out a couple of years ago on the subject. Seems he had dupe content issues on one of his ecommerce sites because his navigation, ads, button images, etc. were 80-90% of the content on each page. Because this pseudo-content was repeated on each page, he was having a hard time getting interior pages indexed and ranked. He solved the problem by adding additional original text to each page.

        The algo may be smarter these days, but it's something to watch out for.
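        A rough way to put a number on that ratio, as a sketch only: treat any line of visible text that shows up on most pages as "overhead" and measure how much of each page it accounts for. The function name `overhead_ratios`, the `min_share` cutoff of 0.8, and the assumption that each page has already been reduced to plain text with one block per line are all made up for illustration. It needs several pages to be meaningful.

```python
from collections import Counter


def overhead_ratios(pages, min_share=0.8):
    """Estimate the share of each page taken up by repeated text.

    pages maps a URL to that page's visible text, one block per line.
    A line counts as overhead when it appears on at least min_share
    of all pages (an arbitrary cutoff chosen for this sketch).
    """
    line_pages = Counter()  # line -> number of pages containing it
    page_lines = {}
    for url, text in pages.items():
        lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
        page_lines[url] = lines
        line_pages.update(set(lines))  # count each line once per page

    cutoff = min_share * len(pages)
    ratios = {}
    for url, lines in page_lines.items():
        total = sum(len(ln) for ln in lines)
        overhead = sum(len(ln) for ln in lines if line_pages[ln] >= cutoff)
        ratios[url] = overhead / total if total else 0.0
    return ratios
```

        Pages with a ratio close to 1.0 are mostly navigation and other repeated blocks, which is the situation described above.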
  • sbucciarel
    Banned
    A bigger duplicate content issue with WP blogs is the use of categories and tags. If you use the All-in-One-SEO plugin, it can be set to noindex,nofollow for tags and categories.

    How To Prevent Duplicate Content Issue On Wordpress Blogs
  • yourreviewer
    Thanks for the replies, folks. I've searched online, and so far I haven't found any software that will let me identify duplicate content within a website.
    {{ DiscussionBoard.errors[4474157].message }}
  • SKWeaver
    The Premium service at Copyscape has a "Batch Search" that lets you check your entire site for duplicate content... but I'm not sure whether it checks WITHIN your site. Might be worth a look.
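    If no ready-made tool turns up, checking within a single site can be scripted. The sketch below is one possible approach, not how Copyscape or any other service works: it breaks each article into overlapping five-word "shingles" and compares every pair of articles by Jaccard similarity. The names `shingles`, `jaccard` and `find_duplicates`, the shingle size of 5, and the 0.5 threshold are assumptions chosen for illustration, and it expects the article text to have been extracted from the pages already.

```python
from itertools import combinations
import re


def shingles(text, size=5):
    """Return the set of overlapping word n-grams (shingles) in text."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    if len(words) < size:
        # Very short text: treat the whole thing as a single shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets: shared items / all items."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def find_duplicates(articles, threshold=0.5, size=5):
    """Compare every pair of articles on the site.

    articles maps a URL (or title) to the plain text of that article.
    Returns (url_a, url_b, score) tuples for pairs scoring at or above
    threshold, highest score first.
    """
    sets = {url: shingles(text, size) for url, text in articles.items()}
    hits = []
    for (url_a, set_a), (url_b, set_b) in combinations(sets.items(), 2):
        score = jaccard(set_a, set_b)
        if score >= threshold:
            hits.append((url_a, url_b, score))
    return sorted(hits, key=lambda hit: hit[2], reverse=True)
```

    Comparing every pair is fine for a few hundred articles. A much larger site would probably need something faster, such as MinHash, but the idea is the same.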
