Forum Moderators: open
A few days later a post appeared in Voelspriet, a Dutch site about searching and search engines. I certainly wasn't the only one, a whole bunch of Dutch seo-sites had been ripped and fed into Googlebot!
So far, so good, until this morning when WebWereld (Dutch for WebWorld) placed a news article about it on their site. More publicity and more respondents. To my surprise most readers blame Google for letting this happen and only a few blame the company for stealing other site's content and cloaking.
The company responded with a special page on their web site offering a special discount for their method which "has now been cured of startup problems."
I almost feel ashamed to be a Dutch seo!
my content was 'borrowed' by one of my competitors
I have confidence that Google will do what it can, but this points to a wider issue on which the search engines can presently - at best - only limit damage and even then only if they have the resources to devote to doing so.
I'd like to hear whether Google as the dominant search engine currently has any ideas of how to deter "those with little imagination" from believing that they can copy an original website wholesale and then pass it off as their own.
Any deterrent might be of help... and one endorsed by a major search engine like G might actually make a difference.
I've twice been a victim of this and it's my experience that cease and desist emails have little effect.
I never used to and I'm not happy about it but now I watermark every single image on my site. But how to watermark textual content?
Giving searchers what they want is the first priority of an SE... there's nothing to show that either SEs or the searching public would give much of a flying one about copyright abuse.
But then plagarism had always part of Internet Culture, it's OK if you credit who wrote it orginally.
create a unique signature
I like it! You could do something like having the first sentence and the last sentence on every page containing no words with the letter 'e'.
I imagine the chance of having this configuration by accident is well-nigh impossible.
It doesn't allow you to avoid the legal hassles though, does it? It just puts you in a better position when it comes to the burden of proof.
Yes, your translations are right.
@all others:
The first bit is part of the general introduction to the topic in the Webwerweld article.
"rather ingenious" is Henk's qualification of the tric, with a slightly negative undertone. (Henk is a journalist and founder of Voelspriet. He may be impressed by the technology used but is certainly not approving it).
Alot can get lost in translation, I am glad that the interviewer was overall negative on the end result. I did some more research into the site downloading incident... seems to be a very large corporation with banking etc. I am just going to keep an eye for duplicate content. Hopefully, they just liked something about the site or services and wanted it to reference. I never have had to block any of the netherlands from my servers... other parts of the ripe and apnic blocks yes. I am going to hold off that part. Some other sites I have get regular and good ecommerce orders from the netherlands. Thanks for the heads up.
I highly recommend that you document the infringement. This involves copying the entire content of the infringing site, and saving it. Let the infringer know that you did this.
Next, file a DMCA complaint with Google. Google will remove all the infringing pages from their index (it takes a few days... and if the site is taken down first, the pages don't get removed from the index, or the cache... which is annoying).
Do a WhoIs lookup to find the domain registrant.
If the domain is registered anonymously, you don't know who the infringer is... but you can still go after them.
Do a traceroute, or a DNS search, and find the IP address. Do a reverse IP lookup to find out who is hosting the site, or providing the Internet connection. Contact this company, and let them know their customer is infringing your copyrights. The DMCA provides web hosts and ISPs (and search engines) protection from copyright claims if they take reasonable steps to resolve the matter. Tell the ISP or web host that if they don't want to be sued for vicarious liability or contributory infringement, they need to take a look at the situation (compare the copied content with your original), contact their customer, or suspend the customer's Internet access immediately. Be aggressive in going to the right person, and letting them know you mean business. Some ISPs and web hosts are clueless about this at first, and you may have to educate them. I have found some to be familiar with this situation, and they take immediate action.
When my content was successfully taken off a US based webhost, only to appear a few days later on a server in China, it took a phone call to the infringer to let him know that I was going to take him to Federal court. Even though the infringer was in Canada, his domains were registered in the US, and domains are considered real property (they can be awarded in a judgement). Let the infringer know he stands to lose his domain or domains, and you may get his attention.
One more thing... if you actually file a copyright with the government, you can be awarded statutory damages, and not just actual damages.
You can use whois to find the domain owner and contact them with a notice of copyright and notify them to remove the offending materials first. Then follow it up with a written notice, a lawyers letter hits harder. Then, if needed you should be able to document a DMCA notice to google and the web hosting company for the pages, and if they host their own then their telecom provider. You can likely get their site off the internet this way if better judgement doesn't tell them to mess with someone more passive in the meantime.
Best of luck.
I guess there is nothing you can do against such a mass practice. But if you focus exclusively on your business and try to put out YOUR unfair competitor who stole YOUR content, then there are good chances to succeed one way or another. What is really important is the attitude. Your business depends first on you, not on Google or some other imminent impediments which are out of your control. The advises given by DVDBurning are of a real value. Let the Google do his job, do yours too. If Google caches stolen content, that's his business and his market will react sooner or latter. But if it's something you can do, than don't wait the big guys to solve your puzzle.
I personally interacted with such an awful situation and let me tell you that Google didn't help me at all even with an useful advice to my inquiry, not to say with some SERPs fittings. So I tried the hard way because there was no easy way.
The content patent still needs time to improve itself.
In our case it is an actual SEO company that is highjacking pages and content to get unrelated search results. They were contacted too, btw, and told us that we should be flattered. They only use the information and pages of companies they think are doing well. Such a comfort...
They are taking her information and using them to get results for competing companies. It's not a great feeling to do a test search and find your direct competitor coming up above or right below you for the same pages, especially when they're YOUR pages.
It is extremely frustrating to those who have worked for many months to develop unique and valuable content to see someone simply copy every bit of it word for word, posting it as their own. It is even more frustrating to see Google regard the copied version as the original, apparently applying a duplicate content penalty on the original pages. I can't be 100% certain that this is what happened in my case, but I saw my PR go from a 6 to a 3 (now back to a 4). I also saw the copied version of my pages show up higher in the SERPs for every single search.
It is easy for Google or anyone else to say that they are an innocent 3rd party to this ugly situation. Do they have any obligation to manually investigate and make a judgement as to who owns the content? Probably not. They are just a search engine, and they don't want to get embroiled in disputes between 2 different websites... even if one is clearly committing a criminal act.
From a business perspective, and a moral perspective, Google SHOULD take the time to look at these complaints, evaluate them on their merits, and make a determination. Google has old cached content from many sites, and could easily determine who posted the content first (in most cases). Anyone who took a look at my case, for instance, would easily see that the copied content included numerous references to my site, and even a press release about my site starting up! Google could have removed the offending site from their index, and given every page of the site a PR0. Even after DMCA complaints, the SERPs still showed every one of the 420 copied pages in the same place for the offending site.
It is also unfair if the original site receives an automated penalty (I can't prove this... but I'm fairly sure that this is the case). I feel that Google should set up some sort of special complaint system for these cases, and manually review and correct them.
I have a handful of specific pages where direct content from them is now all over the internet. Funny thing in one instance in a page I used a comparison (item1) was (x times) as effective as compared to (item2)... I miskeyed by accident in the html adding an extra zero to the number and didn't catch to fix it for about a month. All that time I was #1 for (item1) on google. Well, a year later(item1) is now all the rage and I can still search for my sentence with the extra zero on the figure and it is all over the internet now along with the rest of the related pages content. Personally, I liken these folks to "human roaches" that bottom feed of others work. It stinks, but still I don't expect google to police the internet.
As long as (item 1) is also related to the other sites it's mostly a dispute between you and those other companies.
It becomes a search engine's problem when (item 1) has no relation with the stealing web site and is only used for cloaking to get higher rankings (on irrelevant terms). This is misleading the search engines users. Can you see the difference?
Absolutely, cloaking is a major problem that google should care about becase it undermines the validity of their serps. Really there are two issues here, one the copyright issues and second the cloaking or spam report issue.
I think google could do better to devote more effort to removing cloaked redirect pages. Alot of the internet content now is this junk... so long that it is more cost effective than adwords or other ppc it will remain a problem. Maybe google could turn that carrot into a hammer?
This issue of stolen content seems to be growing more of a problem. Something needs to be done.
Is it feasible that we independent webmasters form some kind of group or association (or does one already exist) that would advance and protect our interests with respect to content? For example, would it be feasible that we each pay a reasonable membership fee that would finance the retainer of a web savvy attorney who, when one of the association is victimized, could immediately and agressively go after the offenders and shut them down? It appears hard for individual webmaster victims who get cut out of the herd to take on something like this.
Somebody educate me-- my proposal seems like a reasonable solution. If anybody knows of an existing alternative, let us know. And if anybody has a candidate attorney, let us know.
Does anybody want to run with this?