I have thought of this before. The trouble is if it is not an "e" to the spider, then you won't get a good ranking in the index. If you don't care about that page getting ranked, then why not just tell the spider not to crawl it? It is easier.
BTW there is no "dupe filter". But you are diluting your ranking by having duplicate content.
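For anyone who wants to try the "tell the spider not to crawl it" route, here is a rough sketch of how a robots.txt rule can be checked with Python's standard urllib.robotparser. The /news/ path and the example.com URLs are made up for illustration; substitute whatever page you actually want kept out.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: keep all crawlers out of /news/ only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /news/
"""

def is_crawlable(url, robots_txt=ROBOTS_TXT, agent="*"):
    """Return True if the given robots.txt text lets `agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)

blocked = is_crawlable("http://example.com/news/feed.html")
allowed = is_crawlable("http://example.com/index.html")
```

This only tells you what a well-behaved spider would do with those rules; it says nothing about how any particular engine treats the page afterwards.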
Here's the thing. I have loads of original content on the page, and I have no problem ranking for my keywords. But I have also decided to include an RSS feed of topical news. I don't know how sensitive these filters are, but I certainly don't want my home page to appear under "More results..."
As it happens, I'm doing the XML>XSL>HTML translation myself, so I have the opportunity to alter the content with a script before it gets sent to the client.
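A minimal sketch of that kind of pre-send hook might look like the following. It is only an illustration: it uses Python's xml.etree.ElementTree in place of a real XSL step, and the sample feed, the feed_to_html function and the rewrite parameter are all invented names, not anything from an actual setup.

```python
import xml.etree.ElementTree as ET
from html import escape

# Made-up RSS 2.0 feed with a single item, for illustration only.
SAMPLE_RSS = """<rss version="2.0"><channel>
<title>Widget News</title>
<item>
  <title>Widgets go on Sale Tomorrow</title>
  <link>http://example.com/story/1</link>
</item>
</channel></rss>"""

def feed_to_html(rss_text, rewrite=lambda title: title):
    """Render feed items as an HTML list, passing each title through `rewrite`."""
    root = ET.fromstring(rss_text)
    rows = []
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        link = item.findtext("link", default="")
        rows.append('<li><a href="%s">%s</a></li>'
                    % (escape(link, quote=True), escape(rewrite(title))))
    return "<ul>\n" + "\n".join(rows) + "\n</ul>"

# The default hook leaves titles alone; any callable can be swapped in.
html_out = feed_to_html(SAMPLE_RSS, rewrite=str.upper)
```

The point of the rewrite argument is just that the alteration happens in one place, after the feed is parsed and before the HTML goes out.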
I want the news to be useful for the reader, but I'm hesitant about using a big block of content that appears on hundreds of other sites.
If there were a reliable Babelfish-y kind of paraphraser that would change "Widgets go on Sale Tomorrow" to "Sale of Widgets Tomorrow", I'd use that!
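To show how little a naive version would actually do, here is a toy sketch in Python that handles exactly one headline pattern with a regular expression. It is not a real paraphraser, the paraphrase function name is invented, and anything that doesn't match the pattern comes back unchanged.

```python
import re

# One hand-written pattern: "<thing> go(es) on sale <when>".
SALE_PATTERN = re.compile(
    r"^(?P<thing>.+?) go(?:es)? on sale (?P<when>.+)$", re.IGNORECASE
)

def paraphrase(headline):
    """Reorder a '<thing> go on sale <when>' headline; pass others through."""
    match = SALE_PATTERN.match(headline)
    if match is None:
        return headline  # no rule applies, so leave the headline as it is
    return "Sale of %s %s" % (match.group("thing"), match.group("when"))

reworded = paraphrase("Widgets go on Sale Tomorrow")
untouched = paraphrase("Local Team Wins Championship")
```

Every new headline shape would need its own rule, which is a fair hint at why this is probably too simple to rely on.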
I admit it's a half-baked idea, probably too simple to fool anyone or anything. But I was wondering if anyone had tried this kind of thing and tested the results?
If no one has, maybe I'll just do it and see if it makes a difference.