|
Google's 302 Redirect Problem
|
steveb
#:733069
| 1:36 am on May 7, 2005 (utc 0) |
add.cgi displays all of eight pages, but 885,000 are claimed apply.cgi shows seven pages, but 1.38 million are claimed. update.cgi 341,000 editcat.cgi 100k Double everything for lowercase and capital letters, add a dash of Supplementals for deleted categories, and you start getting close to how Google's numbers are eight times off what is the reality of real indexed pages.
|
Vec_One
#:733070
| 1:50 am on May 7, 2005 (utc 0) |
For months, I have been carefully cleaning up all the redirects to my site that I could find in Google. I started with site: and now I'm finishing up the last in the inurl:. Yesterday, I snooped around MSN a little. Wow, there must be dozens, if not hundreds, of other people's pages with my pages in their caches. A typical one is like www.theirsite/fwd.php?url=http://www.my site.com. I assume that Google also has a lot of these pages in its index but just doesn't display them all in searches. Am I assuming correctly?
|
Reid
#:733071
| 3:21 am on May 7, 2005 (utc 0) |
| I assume that Google also has a lot of these pages in its index but just doesn't display them all in searches. Am I assuming correctly? |
| nothing ever completely disappears on the web. especially with google. 1.8 billion served. But even they want to filter out bad results don't they? The google search algo is one thing but their database is quite another. my philosophy is write good content with clean code (not html soup) follow Brett's 29 guidelines as much as possible, make sure googlebot works and you end up with a stable site that will weather the googlestorm or at least be ready for reinclusion. googlestorm 1 302 problem in index 2 algo change 3 debug (tweaking at the plex) 4 algo change 5 debug 6 algo change 7 debug .....
|
claus
#:733072
| 8:46 am on May 7, 2005 (utc 0) |
>> Google also has a lot of these pages in its index but just >> doesn't display them all in searches. Am I assuming correctly? Yes Vec_One, sofar Google's "fix" has been hiding the URLs in stead of deleting them. They have also made other changes, but they have not removed those URLs that are not pages - you are just not allowed to see them. And MSN... Well, it's still Hijackers Paradise there.
|
cat5
#:733073
| 10:18 am on May 7, 2005 (utc 0) |
How can i contact Google about webmaster and url issues?
|
joeduck
#:733074
| 5:51 pm on May 7, 2005 (utc 0) |
Cat5 - Google support is here: www.google.com/support/
|
joeduck
#:733075
| 3:43 am on May 8, 2005 (utc 0) |
claus - what do you mean about google hiding pages?
|
Marval
#:733076
| 4:00 am on May 8, 2005 (utc 0) |
joeduck - read back in the beginning of this thread and you will find Googleguy basically stating that they are removing the ability to see these pages in a site: command search - while the pages still exist in the database you just wont be able to see them using the normal tools
|
joeduck
#:733077
| 5:06 am on May 8, 2005 (utc 0) |
Marval - Wow - I read that GG comment differently but now you have me wondering if things have changed in the way the index treats the 302s. Claus is saying it's not fixed and we have seen some odd site:oursite results recently. GG said this: "We changed things so that site: won't return results from other sites in the supplemental results." I thought this meant they had fixed the 302 redirect problem that was treating _external_ pages as pages located in the site. If these bogus pages are still in the index, but not shown, it's creating an invisible problem rather than a solution. If they show you can often find a way to zap them with Google exclusion tool. GG - this issue about what site:site.com tells us about 302 problems would be very helpful to clarify. "O where O where has our GoogleGuy gone?"
|
eyezshine
#:733078
| 7:03 am on May 8, 2005 (utc 0) |
Thanks for leaving us hanging again Googleguy. This is how rumors get started. From lack of information. I have done everything I can think of to comply with the google guidelines and my site is still messed up in the serps?
|
claus
#:733079
| 9:35 am on May 8, 2005 (utc 0) |
Okay, sorry about the confusion. It's like Marval says. And then some. 2005-05-08: Status: Problem STILL NOT solved The 302 redirects are not removed from the index, they just don't come up when you search using the "site:" command. So, they're still there, but they're hidden in that command. I also misunderstood GoogleGuy here. I thought that they would be hidden in general. They're not. The only place you can't see the hijackers now is when you use the "site:" command. Now, who will most likely use the "site:" command? Yourself. Not the searchers who are the real targets of this hijacking. In regular searches 302 redirects are not hidden - they still take over the snipppet, cache, and title of the target page. The problem has not been solved. I still see 302 redirect URLs in the SERPs for regular queries, like, say the #1 for "african darfur crimes". (just to pick a hijack of BBC - that is, 100% non-profit and non-competitive). So, the only one that can no longer see these hijackers is the webmaster that has been hijacked, when using the "site:" command. Nice? Not!
|
g1smd
#:733080
| 4:31 pm on May 8, 2005 (utc 0) |
>> The problem has not been solved. That's what I said, like, 2 weeks ago (and I then had a nice technical sounding answer from GoogleGuy using the latest obfusciated buzzwords - something about heuristics and matching and supplemental results {why did he mention supplemental results anyway? they weren't in the original question} - that was supposed to make me think it was fixed, but which actually said nothing of the sort).
|
Dayo_UK
#:733081
| 6:46 pm on May 8, 2005 (utc 0) |
g1smd Have you got a link to that post? Things are strangely static in the index for me at the moment. (I am hoping this means they are testing a new index/algo/fix prior to launch - but then I have been hoping along these lines for weeks/months.)
|
Shurik
#:733082
| 6:56 am on May 9, 2005 (utc 0) |
They are not testing/fixing new algo, they are busy trying to sort out the mess created by their brand new Web Accelerator.
|
Reid
#:733083
| 7:09 am on May 9, 2005 (utc 0) |
| They are not testing/fixing new algo, they are busy trying to sort out the mess created by their brand new Web Accelerator. |
| Or there are 2 teams one working on each, and neither team knows what the other is doing because 'it's a secret'.
|
zeus
#:733084
| 11:14 am on May 9, 2005 (utc 0) |
Now we have to us inurl: and look for same title and description or take a look at the cache of each site. The worst thing, if you have no more redirecting/hijacking problem then you will still not be listed in the serps, because you have got some kind of supplemental ban and I also wonder this emails we had to sent for a reinclusion, with a special title text, is maybe just so they can easy trash those, because they fill up the indbox at google.
|
Panacea
#:733085
| 1:52 pm on May 9, 2005 (utc 0) |
It must have been about a month now since GooglyGuy told us how to file for a re-inclusion request. All I got back was the automated response. My supplemental listed sites have not been spidered for 4 months! [url=http://www.webmasterworld.com/forum30/29391.htm]Continued...[/url] [edited by: ciml at 4:33 pm (utc) on May 9, 2005]
|
| This 467 message thread spans 16 pages: < < 467 ( 1 ... 6 7 8 9 10 11 12 13 14 15 [16] ) |