Welcome to WebmasterWorld Guest from 54.166.54.215

Forum Moderators: ergophobe

looking for similar content

sort of like CopyScape, but just search locally

   
7:47 pm on Jan 11, 2011 (gmt 0)

WebmasterWorld Administrator lifeinasia is a WebmasterWorld Top Contributor of All Time 5+ Year Member Top Contributors Of The Month



We have thousands of short articles (usually 1-3 paragraphs) on our site written by a dozen writers. We happened to find some articles that were very similar to each other (maybe 3 words different between paragraphs).

Does anyone know of a tool that we could use to go through our database and find articles that are very similar to each other? We're using MS SQL, although we could probably port the data to MySQL for this project. As an alternative, we could write the text out to text files if the program couldn't directly access the DB.

TIA!
12:23 am on Jan 12, 2011 (gmt 0)

10+ Year Member



You may want to contact the AHN news people, they have something similar that they use in their CMS

Copyscape also licenses the technology.
 

Featured Threads

My Threads

Hot Threads This Week

Hot Threads This Month