Forum Moderators: Robert Charlton & goodroi
Sometimes, an HTTP status 302 redirect or an HTML META refresh causes Google to replace the redirect's destination URL with the redirect URL. The word "hijack" is commonly used to describe this problem, but redirects and refreshes are often implemented for click counting, and in some cases lead to a webmaster "hijacking" his or her own URLs.
Normally in these cases, a search for cache:[destination URL] in Google shows "This is G o o g l e's cache of [redirect URL]" and oftentimes site:[destination domain] lists the redirect URL as one of the pages in the domain.
Also link:[redirect URL] will show links to the destination URL, but this can happen for reasons other than "hijacking".
Searching Google for the destination URL will show the title and description from the destination URL, but the title will normally link to the redirect URL.
There has been much discussion on the topic, as can be seen from the links below.
How to Remove Hijacker Page Using Google Removal Tool [webmasterworld.com]
Google's response to 302 Hijacking [webmasterworld.com]
302 Redirects continues to be an issue [webmasterworld.com]
Hijackers & 302 Redirects [webmasterworld.com]
Solutions to 302 Hijacking [webmasterworld.com]
302 Redirects to/from Alexa? [webmasterworld.com]
The Redirect Problem - What Have You Tried? [webmasterworld.com]
I've been hijacked, what to do now? [webmasterworld.com]
The meta refresh bug and the URL removal tool [webmasterworld.com]
Dealing with hijacked sites [webmasterworld.com]
Are these two "bugs" related? [webmasterworld.com]
site:www.example.com Brings Up Other Domains [webmasterworld.com]
Incorrect URLs and Mirror URLs [webmasterworld.com]
302's - Page Jacking Revisited [webmasterworld.com]
Dupe content checker - 302's - Page Jacking - Meta Refreshes [webmasterworld.com]
Can site with a meta refresh hurt our ranking? [webmasterworld.com]
Google's response to: Redirected URL [webmasterworld.com]
Is there a new filter? [webmasterworld.com]
What about those redirects, copies and mirrors? [webmasterworld.com]
PR 7 - 0 and Address Nightmare [webmasterworld.com]
Meta Refresh leads to ... Replacement of the target URL! [webmasterworld.com]
302 redirects showing ultimate domain [webmasterworld.com]
Strange result in allinurl [webmasterworld.com]
Domain name mixup [webmasterworld.com]
Using redirects [webmasterworld.com]
redesigns, redirects, & google -- oh my [webmasterworld.com]
Not sure but I think it is Page Jacking [webmasterworld.com]
Duplicate content - a google bug? [webmasterworld.com]
How to nuke your opposition on Google? [webmasterworld.com] (January 2002 - when Google's treatment of redirects and META refreshes were worse than they are now)
Hijacked website [webmasterworld.com]
Serious help needed: Is there a rewrite solution to 302 hijackings? [webmasterworld.com]
How do you stop meta refresh hijackers? [webmasterworld.com]
Page hijacking: Beta can't handle simple redirects [webmasterworld.com] (MSN)
302 Hijacking solution [webmasterworld.com] (Supporters' Forum)
Location: versus hijacking [webmasterworld.com] (Supporters' Forum)
A way to end PageJacking? [webmasterworld.com] (Supporters' Forum)
Just got google-jacked [webmasterworld.com] (Supporters' Forum)
Our company Lisiting is being redirected [webmasterworld.com]
This thread is for further discussion of problems due to Google's 'canonicalisation' of URLs, when faced with HTTP redirects and HTML META refreshes. Note that each new idea for Google or webmasters to solve or help with this problem should be posted once to the Google 302 Redirect Ideas [webmasterworld.com] thread.
<Extra links added from the excellent post by Claus [webmasterworld.com]. Extra link added thanks to crobb305.>
[edited by: ciml at 11:45 am (utc) on Mar. 28, 2005]
For the person who asked about the url removal tool: its removal for six months, not 90 days. I understand how someone thought it might help to try the url removal tool, but please don't use it on one's own site. arubicus, did you say you saw weird behavior with www vs. non-www or trailing slashes vs. without? Could you submit something to google.com/support so I can try to get someone to check it out? Use arubicus in the form somewhere. I'm going to be gone Friday and this weekend, but I'd be curious to hear of any remaining canonicalization issues.
Ugh. Very bleary. Going to bed now..
I have sent mine with my handle.
I admit I might have a problem with duplicate content - but trying to add lots of reviews etc.
But the whole site has gone - but the way it has gone I am wondering if it is a canonical url non-www problem.
Okay, it's not a 302 one, but I've been thinking of it as a similar "canonicalization issue"...
Google's database is overflowing with URL listings like:
www.site.com/directory
where there is also a normal listing for
www.site.com/directory/
These occur from the trifecta of the unfortunate Google policy of URLs-are-pages combined with the bajillion puke scraper sites that scrape search results, where both Yahoo and MSN display results without the trailing slash
It's my experience that when a page gets a second URL only listing, it drops in the results, which would then end up penalizing pages without a file extension, particularly if they are popular and get scraped often.
These URL only links fade fairly quickly, but still it would be nice to see Google recognize and combine these with the canonical page, rather than seemingly demerit the canonical page.
I believe I may have been confused about the re-inclusion thing. My site does show up doing a site command and it does show up when I do a www.my-site.com with no commands. However, it shows up with no title or description.
I presume then that a re-inclusion will not do any good since the site is in the index. The problem I have is that it is no longer in the ranking which is the real underlying problem. And it is a site which was in the top 5 for a 3 million page plus keyword for years.
And yes, we did see other domains listed using the site command. Now they are gone, but everything is still listed as supplemental.
Do I wait? Do a re-inclusion? Can it hurt to do a re-inclusion?
GuinnessGuy
<Very few people used the url removal tool to take out their own sites, so I can try to gather some people into one group and ask someone if we can do anything on our end.>
Though I´m not among them who took out their sites by mistake, I wish to thank you on their behalf for taking care of this matter.
Very kind of you GG. Much appreciated.
Although this is not about redirection, I used it to remove some duplicate domain.com versions of my pages via noindex tags, including my home page. Unfortunately, as I now know, it removes the www versions as well.
It's been 20 days or so and despite being re-spidered the www pages have not reappeared.
The 'Remove Individual pages' section of the google help page does not have a footer, similar to 'Remove your website', indicating a 90 day period.
Does anybody know whether the 90 days / 6 months applies to individual pages removed using noindex tags?
Hard to define (as I am not an expert)
Following thread may help - read GG posts:-
[webmasterworld.com...]
But basically as I understand it Google finds the main url of the site (which is normally the page with the highest page rank) and perhaps where the site crawl starts.
However, I think, sometimes the wrong url can be picked. (Eg if you have the site on the non-www aswell - or your homepage is something like www.domain.com/home.php?sid=122323 - or all your links point to another page and therefore that page is seen as the most important and hence the canonical url)
Not 100% sure though