homepage Welcome to WebmasterWorld Guest from 54.204.69.92
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Home / Forums Index / Code, Content, and Presentation / Content Management
Forum Library, Charter, Moderators: ergophobe

Content Management Forum

    
looking for similar content
sort of like CopyScape, but just search locally
LifeinAsia




msg:4252057
 7:47 pm on Jan 11, 2011 (gmt 0)

We have thousands of short articles (usually 1-3 paragraphs) on our site written by a dozen writers. We happened to find some articles that were very similar to each other (maybe 3 words different between paragraphs).

Does anyone know of a tool that we could use to go through our database and find articles that are very similar to each other? We're using MS SQL, although we could probably port the data to MySQL for this project. As an alternative, we could write the text out to text files if the program couldn't directly access the DB.

TIA!

 

eventus




msg:4252157
 12:23 am on Jan 12, 2011 (gmt 0)

You may want to contact the AHN news people, they have something similar that they use in their CMS

Copyscape also licenses the technology.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Code, Content, and Presentation / Content Management
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved