Search engines don't rank sites, they rank pages.
monkeythumpa, pages interconnect and SEs rank that interconnection. Throw 80 pages of white-link spam with 5 H1s on a site with 20 otherwise good pages and you'll see the 20 good ones fall.
Do you have any specific feedback/proof that this doesn't occur with flagged dupe content?
|Search engines don't rank sites, they rank pages. |
Often quoted and just as often wrong.
It would be absolutely stupid for a search engine to not consider the site factors while ranking the page. And search engine engineers ain't stupid.
As for the dup content, I believe that Google generally only filters out the duplicate pages. But I suppose that there might be some sort of threshold where they might just assume that if 80% of the site is duplicate that the other 20% is too.
Google is forgiving to a degree, dup pages get axed. With Yahoo be very, very careful. MSN is still getting their sealegs, no idea.
Thanks for feedback. I'll go for it and give some feedback in a few months.
Since my new content would actually be useful and mixed with other fresh/public stuff it shouldeasily pass a hand test.
Knowing yahoo with other filters, then tend to use a hand check if an autoreport is "iffy".
(This though assumes that filter is setup to return as a analog dup % and not a binary "dupe/non-dupe" )
I have a site that had good links but was about 90% duplicate content. It barely got any traffic at all from Google (though MSN and Yahoo didn't seem to mind) even though it was out of the sandbox. A few months ago, I decided to start writing some articles, and now it's probably about 50% original content and 50% duplicate content. I have started getting a lot more traffic from Google. I think whatever penalty I might have had is gone.
I know this doesn't exactly answer your question, but just based on my experience, I think it may be a percentage kind of thing. Some duplicate content may be fine as long as you continue adding original content. If you have something like four times as much duplicate content, then you might need to worry about being penalized.
Teshka, thanks a lot. That was a very useful and interesting post.
i have wondered if a site consisting of mostly articles generated by a news feed would do well at G, Y and MSN
or if the duplicated articles would be a problem
"i have wondered if a site consisting of mostly articles generated by a news feed would do well at G, Y and MSN"
I have been testing this a bit -- still to early to tell. Google seems to be indexing the pages (decent site already) -- but no traffic on the news ones yet
Any other comments on the above?
Howiejs, no comment on that quote.
I haven't implemented the original point. That particular site went offline for a few days due to a foolish host. Will try it when things are stable again.
Sorry for lack of update, but even with being down for a week <painfully long story>, that site was treated well by Bourbon so now I'm risk adverse and want to avoid dup filtering the entire site.
Now, I've downloaded a wack of free (and useful) info and humans are editing that offline. I'll supplement each page that goes onto with info from a mix of sources on all pages. Problem is that this takes time (and a bit o money) so it'll only be up in July.
Sorry I can't give a better update.