Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Google's 302 Redirect Problem

         

ciml

4:17 pm on Mar 25, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



(Continuing from Google's response to 302 Hijacking [webmasterworld.com] and 302 Redirects continues to be an issue [webmasterworld.com])

Sometimes, an HTTP status 302 redirect or an HTML META refresh causes Google to replace the redirect's destination URL with the redirect URL. The word "hijack" is commonly used to describe this problem, but redirects and refreshes are often implemented for click counting, and in some cases lead to a webmaster "hijacking" his or her own URLs.

Normally in these cases, a search for cache:[destination URL] in Google shows "This is G o o g l e's cache of [redirect URL]" and oftentimes site:[destination domain] lists the redirect URL as one of the pages in the domain.

Also link:[redirect URL] will show links to the destination URL, but this can happen for reasons other than "hijacking".

Searching Google for the destination URL will show the title and description from the destination URL, but the title will normally link to the redirect URL.

There has been much discussion on the topic, as can be seen from the links below.

How to Remove Hijacker Page Using Google Removal Tool [webmasterworld.com]
Google's response to 302 Hijacking [webmasterworld.com]
302 Redirects continues to be an issue [webmasterworld.com]
Hijackers & 302 Redirects [webmasterworld.com]
Solutions to 302 Hijacking [webmasterworld.com]
302 Redirects to/from Alexa? [webmasterworld.com]
The Redirect Problem - What Have You Tried? [webmasterworld.com]
I've been hijacked, what to do now? [webmasterworld.com]
The meta refresh bug and the URL removal tool [webmasterworld.com]
Dealing with hijacked sites [webmasterworld.com]
Are these two "bugs" related? [webmasterworld.com]
site:www.example.com Brings Up Other Domains [webmasterworld.com]
Incorrect URLs and Mirror URLs [webmasterworld.com]
302's - Page Jacking Revisited [webmasterworld.com]
Dupe content checker - 302's - Page Jacking - Meta Refreshes [webmasterworld.com]
Can site with a meta refresh hurt our ranking? [webmasterworld.com]
Google's response to: Redirected URL [webmasterworld.com]
Is there a new filter? [webmasterworld.com]
What about those redirects, copies and mirrors? [webmasterworld.com]
PR 7 - 0 and Address Nightmare [webmasterworld.com]
Meta Refresh leads to ... Replacement of the target URL! [webmasterworld.com]
302 redirects showing ultimate domain [webmasterworld.com]
Strange result in allinurl [webmasterworld.com]
Domain name mixup [webmasterworld.com]
Using redirects [webmasterworld.com]
redesigns, redirects, & google -- oh my [webmasterworld.com]
Not sure but I think it is Page Jacking [webmasterworld.com]
Duplicate content - a google bug? [webmasterworld.com]
How to nuke your opposition on Google? [webmasterworld.com] (January 2002 - when Google's treatment of redirects and META refreshes were worse than they are now)

Hijacked website [webmasterworld.com]
Serious help needed: Is there a rewrite solution to 302 hijackings? [webmasterworld.com]
How do you stop meta refresh hijackers? [webmasterworld.com]
Page hijacking: Beta can't handle simple redirects [webmasterworld.com] (MSN)

302 Hijacking solution [webmasterworld.com] (Supporters' Forum)
Location: versus hijacking [webmasterworld.com] (Supporters' Forum)
A way to end PageJacking? [webmasterworld.com] (Supporters' Forum)
Just got google-jacked [webmasterworld.com] (Supporters' Forum)
Our company Lisiting is being redirected [webmasterworld.com]

This thread is for further discussion of problems due to Google's 'canonicalisation' of URLs, when faced with HTTP redirects and HTML META refreshes. Note that each new idea for Google or webmasters to solve or help with this problem should be posted once to the Google 302 Redirect Ideas [webmasterworld.com] thread.

<Extra links added from the excellent post by Claus [webmasterworld.com]. Extra link added thanks to crobb305.>

[edited by: ciml at 11:45 am (utc) on Mar. 28, 2005]

arubicus

1:35 am on Apr 21, 2005 (gmt 0)

10+ Year Member



We are trying to figure this one out ourselves and waiting on a response from google. If it were a spam penalty, I'll be darned if I know where to look especially through the thousands of pages we have! I hate to do anything to the site unless I had a real good idea of what it was that was causing it. We rank very well in both msn and yahoo and don't want to risk those and loose everything (unless I do something that is easily reversable). Never exchange links, never cross link any of our own sites, on page factors are most generally typical, seeing others far worse. Internal linking is simple pyramid. Large amount of 302 hijacking from one particular search engine which had 80% of our pages. Old 301 pages popping back in (probably a side effect).

GoogleGuy

8:20 am on Apr 21, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Sorry, I got pulled into this thread: [webmasterworld.com...]
I should have known better, but oh well. bucaro/illusionist, I searched for your nicks in our user support tracking database but didn't see any emails. If you do a reinclusion request, please include your handles so I can try to get someone to find you. Very few people used the url removal tool to take out their own sites, so I can try to gather some people into one group and ask someone if we can do anything on our end.

For the person who asked about the url removal tool: its removal for six months, not 90 days. I understand how someone thought it might help to try the url removal tool, but please don't use it on one's own site. arubicus, did you say you saw weird behavior with www vs. non-www or trailing slashes vs. without? Could you submit something to google.com/support so I can try to get someone to check it out? Use arubicus in the form somewhere. I'm going to be gone Friday and this weekend, but I'd be curious to hear of any remaining canonicalization issues.

Ugh. Very bleary. Going to bed now..

Dayo_UK

8:37 am on Apr 21, 2005 (gmt 0)



Night Night GG

I have sent mine with my handle.

I admit I might have a problem with duplicate content - but trying to add lots of reviews etc.

But the whole site has gone - but the way it has gone I am wondering if it is a canonical url non-www problem.

GoogleGuy

8:38 am on Apr 21, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Thanks, Dayo_UK. Goodnight..

steveb

8:49 am on Apr 21, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



"I'd be curious to hear of any remaining canonicalization issues"

Okay, it's not a 302 one, but I've been thinking of it as a similar "canonicalization issue"...
Google's database is overflowing with URL listings like:
www.site.com/directory
where there is also a normal listing for
www.site.com/directory/
These occur from the trifecta of the unfortunate Google policy of URLs-are-pages combined with the bajillion puke scraper sites that scrape search results, where both Yahoo and MSN display results without the trailing slash
It's my experience that when a page gets a second URL only listing, it drops in the results, which would then end up penalizing pages without a file extension, particularly if they are popular and get scraped often.

These URL only links fade fairly quickly, but still it would be nice to see Google recognize and combine these with the canonical page, rather than seemingly demerit the canonical page.

GuinnessGuy

9:29 am on Apr 21, 2005 (gmt 0)

10+ Year Member



Greetings,

I believe I may have been confused about the re-inclusion thing. My site does show up doing a site command and it does show up when I do a www.my-site.com with no commands. However, it shows up with no title or description.

I presume then that a re-inclusion will not do any good since the site is in the index. The problem I have is that it is no longer in the ranking which is the real underlying problem. And it is a site which was in the top 5 for a 3 million page plus keyword for years.

And yes, we did see other domains listed using the site command. Now they are gone, but everything is still listed as supplemental.

Do I wait? Do a re-inclusion? Can it hurt to do a re-inclusion?

GuinnessGuy

reseller

10:12 am on Apr 21, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



GoogleGuy

<Very few people used the url removal tool to take out their own sites, so I can try to gather some people into one group and ask someone if we can do anything on our end.>

Though I´m not among them who took out their sites by mistake, I wish to thank you on their behalf for taking care of this matter.

Very kind of you GG. Much appreciated.

union_jack

12:07 pm on Apr 21, 2005 (gmt 0)

10+ Year Member



can someone tell me what canonicalization stands for, sorry to be so dumb.

prieshach

12:15 pm on Apr 21, 2005 (gmt 0)

10+ Year Member



<Very few people used the url removal tool to take out their own sites, so I can try to gather some people into one group and ask someone if we can do anything on our end.>

Although this is not about redirection, I used it to remove some duplicate domain.com versions of my pages via noindex tags, including my home page. Unfortunately, as I now know, it removes the www versions as well.

It's been 20 days or so and despite being re-spidered the www pages have not reappeared.

The 'Remove Individual pages' section of the google help page does not have a footer, similar to 'Remove your website', indicating a 90 day period.

Does anybody know whether the 90 days / 6 months applies to individual pages removed using noindex tags?

Dayo_UK

12:24 pm on Apr 21, 2005 (gmt 0)



Hi Union Jack

Hard to define (as I am not an expert)

Following thread may help - read GG posts:-

[webmasterworld.com...]

But basically as I understand it Google finds the main url of the site (which is normally the page with the highest page rank) and perhaps where the site crawl starts.

However, I think, sometimes the wrong url can be picked. (Eg if you have the site on the non-www aswell - or your homepage is something like www.domain.com/home.php?sid=122323 - or all your links point to another page and therefore that page is seen as the most important and hence the canonical url)

Not 100% sure though

This 467 message thread spans 47 pages: 467