Forum Moderators: open
I am sure you have all seen the “spidered” content that appears in the Google serps, you know the drill, absolutely useless crap that is ripped from mine and your pages. It is not even as if some decent cloaking has been put in place. Just blatant ripping of specific phrases, probably seeded from the sites that are in a typical Google SERP.
So these “search portals” or whatever you want to call them appear on the scene with absolutely no use apart from making the owners some cash :). So what are Google going to do about this and other serious degradations in search results?
I don’t have the answer by the way :)
Then there are the search engines themselves. By displaying our pages from their site (they call it caching... it is not), they have distorted copyright law for their own advancement without prior permission.
Some plausible courses of action for webmasters:
a) add a no archive tag to all pages that are indexable by search engines.
b) protecting yourself from page jackers by cloaking.
c) running spider monitors to ban any thing that looks like an unacceptable spider on your site.
d) nuke the se's entirely and require cookies to navigate your site.
e) do D above but give the se spiders a cookie free pass.
f) require login and buy traffic from your neighbors site ;-)
The serps getting indexed is nothing new. That's been there since the start of the engines themselves.
Allow people to register their content. A central content database, that people could use to prove the date of creation for their content. So when you write new pages, you just update your listing with them, and then in the future you can use them to verify that you were first to write something.
This could be a resource for you, when you go to the search engines to complain about duped content. Sort of like a subscription model of the Wayback machine.
Allow people to register their content.
Up until we signed on to that stupid Berne Convention, the USA had a wonderful place where you could (and had to) do that. It is called the Library of Congress Copyright Office.
In fact, you can, should and must still register your copyright there if you want enforcement ability.
So if you want a way to prove it is yours, start registering all your pages.
However, I do not do this when searching for something related to one of my own websites. I don't want there to be any question that I'm reporting a competitor. I report only on "real" searches that I did in civilian life.
Yes, I did fill out a spam report. What is puzzling to me, however, is HOW do these "portals" (there is one in particular I am thinking about, I bet you all know who they are) consistenly rank high with no content for these terms? This portal has a PR of 5. I don't get it.