We run sites that are driven by data that people enter.
All of the data is unique; however, 30 of the pages share similar titles, with one or two keywords changing depending on the subject of the content.
Is this considered duplicate content, i.e. similar titles but completely different content?
It's presenting a bit of a problem really, as there is no other way to title the pages... unless we remove the main keywords! (below they would be "Buy" and "online")
e.g. the titles are along the lines of
Buy Brand1 online
Buy Brand2 online
.......
Buy Brand30 online
So should "Buy" and "online" be removed, leaving only the brand?
Many Thanks...
...adding datestamp to each otherwise identical page. So technically speaking checksums of two pages will be different...
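To illustrate the checksum point, here is a minimal sketch. The page markup and the datestamp comments are made up for the example, and SHA-256 is just one possible checksum, not necessarily what any search engine uses.

```python
import hashlib


def checksum(page: str) -> str:
    """Return a hex digest of the whole page text."""
    return hashlib.sha256(page.encode("utf-8")).hexdigest()


# Hypothetical pages: identical body, only the datestamp differs.
body = "<html><body><h1>Buy Brand1 online</h1><p>Same content.</p>"
page_a = body + "<!-- generated 2004-01-01 --></body></html>"
page_b = body + "<!-- generated 2004-01-02 --></body></html>"

# A single changed character is enough to change the digest.
same_checksum = checksum(page_a) == checksum(page_b)
print(same_checksum)
```

So a plain whole-page checksum only catches byte-for-byte copies; anything looser needs some kind of similarity measure.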
The larger unknown is how you get pages to test against each other (I would assume you group the results of common searches, but even this is a very large problem). Another question is how you catch duplicate content in a significantly different file format: HTML vs PDF, or better yet HTML vs a binary format (a large JPG screenshot of the page, for example).
*I use something similar to this to auto-sort mail into an appropriate IMAP subfolder at work - people create the folders and organize their content as it suits them, then a cron job looks at similarities in the headers and creates filters for procmail. I get around something similar to the changed-lines problem by replacing whitespace with line breaks, which ensures the analysis compares successive words and thus ignores the length and composition of lines.
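A rough sketch of that word-by-word idea, for anyone who wants to try it. This is not the actual cron job: the function names, the three-word window and the Jaccard score are all my own assumptions, chosen only to show why splitting on whitespace makes line length and wrapping irrelevant.

```python
def words(text: str) -> list[str]:
    """Split on any whitespace, so line breaks and spacing no longer matter."""
    return text.lower().split()


def shingles(text: str, size: int = 3) -> set[tuple[str, ...]]:
    """Return the set of runs of `size` successive words (hypothetical helper)."""
    tokens = words(text)
    if len(tokens) < size:
        return {tuple(tokens)} if tokens else set()
    return {tuple(tokens[i:i + size]) for i in range(len(tokens) - size + 1)}


def similarity(a: str, b: str, size: int = 3) -> float:
    """Jaccard overlap of the two shingle sets, between 0.0 and 1.0."""
    sa, sb = shingles(a, size), shingles(b, size)
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


# Same words, different line wrapping.
wrapped_one = "the quick brown fox\njumps over the lazy dog"
wrapped_two = "the quick\nbrown fox jumps\nover the lazy dog"
# Same text with a datestamp-like tail added.
stamped = wrapped_one + " generated on monday"

print(similarity(wrapped_one, wrapped_two))
print(similarity(wrapped_one, stamped))
```

The two differently wrapped versions score as identical, while the stamped version scores high but below 1.0, which is the behaviour a plain checksum cannot give you.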
[webmasterworld.com...]
[webmasterworld.com...] and
[webmasterworld.com...]
These have been strong for years.
The home page still ranks number 1 on one major two-word keyphrase; however, the other two have slipped from 1-4.
There are some very odd results where ours used to be.
The traffic has not suffered much at all, as the site is very well established across all search engines; however, advertisers will soon notice the loss of positions...