Forum Moderators: LifeinAsia


duplicate articles submission

auto check

         

experienced

8:48 am on Sep 4, 2007 (gmt 0)

10+ Year Member



I run an article submission site where I currently get 20-30 articles a day. It is a very new site, and I expect it to receive about 400% more articles per day in the near future. I am looking for a simple solution to the duplicate article checking problem. We have proper guidelines, but people still don't follow them as far as duplication is concerned. At the moment we have to check every article manually, by searching Google and other engines for quoted passages, to see whether it has already been published somewhere or has been copied, edited, and resubmitted.

Is there any way to check submitted articles for duplication automatically, both against the web and against our own existing database? We normally accept duplicate articles if they have been properly edited, by about 70%-75%, with a fair balance.
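One possible starting point for the database half of this check, as a minimal Python sketch rather than a finished tool: split each article into overlapping runs of k words ("shingles") and measure what fraction of the submission's shingles also appear in an existing article. The function names, the `existing` dict standing in for the article database, the shingle length of 5, and the 0.30 cut-off (loosely derived from the 70%-75% figure above) are all assumptions chosen for illustration.

```python
import re


def shingles(text, k=5):
    """Return the set of k-word sequences (shingles) found in text."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    if len(words) < k:
        # Very short text: treat the whole thing as a single shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def overlap(submission, existing_text, k=5):
    """Fraction (0.0 to 1.0) of the submission's shingles that also
    occur in existing_text."""
    sub = shingles(submission, k)
    if not sub:
        return 0.0
    return len(sub & shingles(existing_text, k)) / len(sub)


def find_near_duplicates(submission, existing, threshold=0.30, k=5):
    """Return (article_id, score) pairs at or above threshold, highest first.

    `existing` is a hypothetical dict of article_id -> article text; in
    practice the texts would come from the site's own database.
    """
    hits = []
    for article_id, text in existing.items():
        score = overlap(submission, text, k)
        if score >= threshold:
            hits.append((article_id, score))
    return sorted(hits, key=lambda hit: hit[1], reverse=True)
```

Because whole word sequences are compared rather than individual words, two articles that merely share vocabulary score low. Note that the score is not the same thing as "percent edited": changing one word breaks up to k shingles, so the cut-off would need tuning against real submissions.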

thanks

experienced

7:39 am on Sep 6, 2007 (gmt 0)

10+ Year Member



Anyone, please?

limoshawn

12:03 pm on Sep 6, 2007 (gmt 0)

10+ Year Member



I’m sure there is a way to do what you’re looking for completely automatically; I just don't know how.

If I were going to tackle the situation with my limited knowledge, I would do something in PHP. I assume the articles are being stored in a database. I would set up a script to take a predetermined portion of each article, append it to a Google search query, and open a new page with that query. I would still have to review the results manually, but at least the process leading up to the review would be automated.
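The idea above could look roughly like this. It is a sketch in Python rather than PHP, and the helper names, the choice of three 8-word phrases per article, and the plain `https://www.google.com/search?q=` URL format are assumptions for illustration. It only builds the quoted-search links; a person still opens them and reviews the results.

```python
from urllib.parse import urlencode


def sample_phrases(text, count=3, words_per_phrase=8):
    """Pick up to `count` evenly spaced runs of words from the article."""
    words = text.split()
    if len(words) <= words_per_phrase:
        return [" ".join(words)] if words else []
    last_start = len(words) - words_per_phrase
    if count == 1:
        starts = [0]
    else:
        # Spread the starting positions from the beginning to the end.
        starts = sorted({round(i * last_start / (count - 1)) for i in range(count)})
    return [" ".join(words[s:s + words_per_phrase]) for s in starts]


def search_url(phrase):
    """Build a search URL that looks for the phrase as an exact quote."""
    return "https://www.google.com/search?" + urlencode({"q": f'"{phrase}"'})


def review_links(article_text):
    """Return one exact-phrase search link per sampled phrase."""
    return [search_url(phrase) for phrase in sample_phrases(article_text)]
```

Sampling from the start, middle, and end of the article, instead of only the opening lines, makes it a little harder for a copied article with a rewritten introduction to slip through.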

good luck

experienced

9:44 am on Sep 19, 2007 (gmt 0)

10+ Year Member



Well, checking manually would be the best option, but it is not possible with a huge number of submissions. Software can compare lines of text, but the same words might have been used in a different context or in articles on a different theme.

I need some more comments if possible.

thanks