Forum Moderators: open

Message Too Old, No Replies

Duplicate Content

Duplicate Content

         

lufc1955

1:24 pm on Nov 30, 2002 (gmt 0)

10+ Year Member



I was wondering whether anyone could tell me how a search engine like Google can detect duplicate pages on a web site. Take for example a site selling flights to Boston, Chicago, New York etc. If you have separate web pages with different key words relating to each destination will a search engine think that these are duplicate pages? Does Google look at the number of words on each page or what?

Macguru

3:01 pm on Nov 30, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Welcome to WebmasterWorld [webmasterworld.com], lufc1955.

All search engines are using an indexer to analyse pages. Once a page is analised, it is quite easy to sort them using different keys. Pages with duplicate or near duplicate content can be automatically flagged and filtered out of the index this way. More on this here : How search engines work. A primer. [webmasterworld.com]

Some people think it is safer to keep at least 8 % to 13 % of difference between pages with near duplicate content. Changing just a city name in some template page is an old spam technique that search engines are fighting by this mean.

lufc1955

6:36 pm on Nov 30, 2002 (gmt 0)

10+ Year Member



Thanks Macguru. Thats a nice answer to my first post.

heini

7:15 pm on Nov 30, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hi lufc1955 - welcome to WebmasterWorld!

Mac is correct, the reason why there are duplicate issues to be concerned about is that search engines might view this as trying to spamming their index, even if in cases like you described it is perfectly legitimate.
And yes - SEs do count words and analyze. After all that is part of the ranking proces anyway.