I'm trying to develop an php script that'll compare two articles and generate a percentage for uniqueness. A good example of software that does this is [
dupecop.com...]
I want a php script that does this that runs on my own server, and although I've tried combinations of similar_text() etc. I haven't found a method that gives a similar percentage to the sites and services out there that are geared to make sure your spun articles are unique enough.
There's a couple more sites that give similar numbers as dupecop for the same comparisons, so I figured there must be something out there already... but alas I can't find it.
Can anyone point me in the right direction?