Forum Moderators: open
[webmasterworld.com...]
The URLs pertaining to my website that all point to my index page take the following form.
www.mydomain.com/?S=AC3%26Document=document
www.mydomain.com/?S=AC3%26Document=document
www.mydomain.com/?SID=xRSUNVW8R9P44HSYQ6UWED&
www.mydomain.com/?S=AC3%26Document=document
www.mydomain.com/default.asp?S=AC3&am
www.some-other-URL.com/go.php?id=aHR0cDovL3d3dy5jcmVkaXRjaGFtcGlvbi5jb20v
www.some-other-URL-2.com/go.php?id=aHR0cDovL3d3dy5jcmVkaXRjaGFtcGlvbi5jb20v
www.some-other-URL-3.com/file/callink.php?linkid=3
I have emailed google, but have received no reply. I am unsure what I can do to A) eliminate the incorrect URL's that appear to originate from my site and B) eliminate the mirror URLs that originate from unrelated websites.
Any help would be greatly appreciated.
Mostly it was from google india, malasia or other asian, but today I got a single hit from co.uk.
[edited by: walkman at 12:37 am (utc) on Jan. 12, 2005]
Just wait until next update you will see more of this hijacking/redirecting, because it is so easy and legal.
I also was a Google fan, but I always was afried it would change as soon they went on the market and yes it changed now I use yahoo and wisenut.
P.s still have the adsense active
Zeus, I agree with you... lets wait to see if a new update comes soon.
C
what do you mean? Got yourself penalized or?
One of my sites has been hijacked by several spam websites. These sites use a variation of the tracker/go.php redirect script. Many have also scraped a snippet of text from the page content of my website.
Now I'm no lawyer - but based on what I know of intellectual property law this use of my content constitutes illegal use of copyrighted material. Under the DMCA Safe Harbor Provisions online service providers (Google) are free from liability when their customers engage in copyright violation only if:
- They set up an agent to deal with copyright complaints
- After being notified of copyright infringement they expeditiously remove or disable access to the material
What this means is that Google must respond to notifications of copyright infringement and, if positive proof is given of the infringement, must act to immediately remove the infringing material from their index.
This is not something they can write an algorithm for - this is manual work done by real, breathing, salary collecting Googleplex employees. I've got a list of several dozen scraped/redirected sites infringing on my material and I bet the other WebmasterWorld hijackees can come up with several thousand more.
All those Safe Harbor notices would probably get Google's attention pretty quick:
[google.com...]
BTW - there are serious penalties for filing a fraudulent DMCA notice so do your own research and make certain that infringement is in fact occurring before filing a notice.
Here's some additional info on the DMCA Safe Harbor Provisions:
[chillingeffects.org...]
After being notified of copyright infringement they expeditiously remove or disable access to the material
I've reported several 302 redirects where google is caching my page against the the redirecting url.
I've reported these as DMCA violations and requested they remove the cached pages but so far after 3 weeks they are still showing.
But google's cache of my page showing against the jackers url is on google's server.
As I understand it that situation is indeed google's problem and I've asked google to deal with it
but so far without success
The second special case is the "Location:" header. Not only does it send this header back to the browser, but it also returns a REDIRECT (302) status code to the browser unless some 3xx status code has already been set.
Looking at the rfc (HTTP/1.1, update to RFC 2068), it also seems to be the best, general default response (see 302 Found [salemioche.com]):
Status Code 302 Found: The requested resource resides temporarily under a different URI. Since the redirection might be altered on occasion, the client SHOULD continue to use the Request-URI for future requests.
The above may well be old hat to many, but for me answered for the first time why so many of these re-direction sites send 302 status codes. It is not some kind of evil conspiracy, simply that they do not alter PHP's default behaviour when using a
Location:redirect header.
Everything is becoming stable again like it used to be before the 302 redirect problem. It hasn't totally went back to the way it was before but my traffic is slowly coming back.
I think by the next update it should come back alot more and I will let you know what happens.
Since reading this thread and looking at my linking practices, I've noticed a few things.#1. Two sites with a fairly high amount of php redirect/tracking links (over a 100) no longer show the php redirect link when I do:
site:www.mydomain.com in Google
Previously the links would show in the results. No title, no description, just the php redirect script. Although I noticed the links showing in the results, I never thought much of it or paid much attention to it. I don't think the links ever showed up in Yahoo or MSN. They don't now if they did. Sites that only have a handful of php redirect links still show in the results for that query.
One more site that had a handful of php redirect links no longer shows them in the results for a site:www.mydomain.com in Google. A 4th site I'm watching still has them. It seems this 'fix', if that's what it is, isn't being applied straight across the board. All four sites use the same php script.
I decided to wait and watch the site linking to me with a cgi scripted link - it still shows up as a result in site:www.mysite.com
One of my sites has 126 legitimate php-powered redirects (allows tracking of outbound traffic which is sent to partner sites)
The problem is that my robots file forbids all robots from any php file AND specifically lists every url that is used as a link tracker, this was done to stop them indexing what they think is a page.
Guess how many of these 'pages' are indexed by google?
...ALL of them, it does better at indexing them than the rest of the site!
Now it's really easy to find these pages as they are uniquely named as uniquebutdescriptivestring.php
MSN and Yahoo manage to not have a single one in their indices, plus they do a better job of finding more pages on the site. MSN has 100% of pages and Yahoo has 25% more than Google.
Looks as though I'll be getting emails from unhappy partners if the indexed redirects start acting like the spam 302s
If it does effect my partners I will consider a legal route but will probably get nowhere