Welcome to WebmasterWorld Guest from

Forum Moderators: ergophobe

Message Too Old, No Replies

looking for similar content

sort of like CopyScape, but just search locally

7:47 pm on Jan 11, 2011 (gmt 0)

Moderator from US 

WebmasterWorld Administrator lifeinasia is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Dec 10, 2005
votes: 48

We have thousands of short articles (usually 1-3 paragraphs) on our site written by a dozen writers. We happened to find some articles that were very similar to each other (maybe 3 words different between paragraphs).

Does anyone know of a tool that we could use to go through our database and find articles that are very similar to each other? We're using MS SQL, although we could probably port the data to MySQL for this project. As an alternative, we could write the text out to text files if the program couldn't directly access the DB.

12:23 am on Jan 12, 2011 (gmt 0)

Junior Member

10+ Year Member

joined:Feb 17, 2005
votes: 0

You may want to contact the AHN news people, they have something similar that they use in their CMS

Copyscape also licenses the technology.