Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Used URL removal tool with no success

Another sign Google is broken

         

textex

11:20 am on Sep 7, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



We had some issues of duplicate content being indexed from two different sites. Our robots.txt files was not set up right, allowing Google to spider a site we did not want it to.

As a result our main site was hit with some ranking/filter issues back in Feb/March?. We requested url removal of the inferior site.

To the present....
Our main site was performing well, with constant FRESH tags, up until three weeks ago. At that time, we bumped into the 30s for most of our kw's. Our cache for our index is from early August (not consistent with cache dates prior) and the rest of the site is from January (it was previously from July).

Was doing some poking around and noticed that the site we requested to be removed is listed in Google again with the same exact cache of our index page from our main site!

Anyone else experience anything like this?

lammert

6:21 pm on Sep 7, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



This is normal behaviour of the URL removal tool. It does not remove the URLs from the index, but only hides them for a period of three or six months. Then they reappear with all the attributes they had, like cache, PR etc.

Having a robots.txt does often not help. It blocks the Googlebot to respider the unwanted URLs and these URLs will get stuck in the index for years, eventually getting the supplemental status.

According to my experience, it is better to let Googlebot spider the unwanted site, but add the following meta tag to the head section of each page:

<meta name="robots" content="noindex,follow">

This will cause all unwanted duplicate copies to be removed from the index and your other site should get back in the index at the old position. The total process of spidering and deleting the old sites may take a few weeks to a few months, dependent on the number of pages and the frequency Googlebot visits that site.

stinkfoot

10:37 pm on Sep 7, 2005 (gmt 0)

10+ Year Member



Broken? Google? Naaaaaa .. it has been sucky like this for 4 months.

Try asking google whats up and they respond,

"We are sorry to hear, there is a problem, our robots failed, good luck we are not going to help you!"

textex

11:13 am on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Here is my update....

Requested a url removal again. Placed noindex tags on the same site. Site is out of Google.

However, my good site, which is 5 years old and was an authority in the industry (top three for years out of 523 million results) is now not even being returned in results with and without filter=0.

Site is still in index. Still has PR.

What is the deal?

zeus

11:27 am on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



hmm some of that you are talking about reminds me of my site, which got hit by the googlebug 302, it also had PR for some time and the ranking was bad to non and only 266 pages in the index out of 3000, my problem started the 3 nov.2004 no changes since and I will not see any changes until they update there supplemental results database, all the old googlebug 302 are still cached, so Im filtered out and now have a PR0 from before PR6

textex

11:59 am on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Obviously not the answer I wanted to hear.

Hopefully, the data from my site is in the middle of being implemented.

I also noticed that that when I search for the url of my site, the cache date is old, but the listing is new (description is updated).

When I examine 'pages from the site', there are old pages listed removed last year listed as supplemental. I am also missing several pages from the index.

What is the deal?

zeus

12:08 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



once again you mention things simular to my site, many supplemental results from as early 2004, try a site:your domain.com here 72.14.207.106 and you will even see more supplemental results.

Also check for non www results I bet you got some there also, but that is not what caused the problem, first later it will give you troubles.

textex

12:18 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



You are right. Not good.

I can not understand why now, I am having this issue. I have had a 302 redirect set-up for almost 9 months. The cache of all the supplemental results are from January 05!

What is the deal?

You expect the entire site will eventually get PR 0 and booted from the index?

zeus

12:23 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



textex - I hope not, I hope you will make it, but I have lost almost all income from that site and trust me its a clean site and has been online for 4 years and no.1 on google of 10mill results, but then I got hit.

sticky me your site I will have a look, but please only if its nothing kinky/wierd :)

textex

12:50 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Thank you for all of your help.
Nothing personal, I'll keep the site to myself.

Should I just sit back and see what happens, or is there any initiative you suggest.

sublime1

1:28 pm on Sep 9, 2005 (gmt 0)

10+ Year Member



We are facing a similar situation. We have a site that lets users list our widgets and sort them on various attributes. We also recently decided that there was little value in having content for about half of our widgets (in fact, thought it might be hurting us in Google).

So we:
* Return HTTP Status 410 for pages that are truly gone, not a 302, and not 404,
* Added a ref="nofollow" attribute to the anchor tag links to these pages
* Changed our robots.txt to exclude the set of pages we wanted to get rid of
* Used the Google removal tool as well

Yet we still see pages in the index, pages we don't want showing in the SERPs (not high, but there), and Googlebot still keeps trying to get the page. allinurl shows ~250 pages (something like 10 pages of search results sorted or filtered 25 different ways).

BUT ...

We did not add meta noindex,nofollow tags. Do I read correctly that recent experience suggests this is the only thing that actually works?

Thanks!

Import Export

1:47 pm on Sep 9, 2005 (gmt 0)

10+ Year Member




I've used the removal tool on a regular basis for some time now and have never once had a problem with it. I wish I could say the same for the rest of G :-/

zeus

2:08 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



textex - try a inurl:yoursite.com see if there are other domains with your title and a copy of your page in the cache

textex

2:27 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Zeus,
Tried the inurl thing and I noticed two significant (I think) issues.

1: There are 200+ different sites like so:
/www.mysite.com/ URL moved, Please ...
/www.mysite.com/ URL moved, Please visit wierdsite.com.
wierdsite.com/www.mysite.com/ - Supplemental Result - Similar pages

2: When I repeat the seach including duplicates the non-www version of my site comes back as well.

Seems like a 302 issue. I still can not understand why I would be plagued with this now, 9 months after setting up a 301 redirect.

SFReader

3:39 pm on Sep 9, 2005 (gmt 0)

10+ Year Member



You have muliple suggestions on what to do, and I expect you will do more than one of them.

Why not also make some changes in the content of your duplicate site so that if it continues to be listed anyway, it may not look like a duplicate?

textex

8:26 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Zeus,
Should I sit tight for now and hope the issue will correct itself?

-Tex

zeus

8:57 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hmm Tex - If you have a googlebug 302 problem and I say if, you can not do anything els then sit and wait a year or redesign your whole site, with content.

Still I say if you are in that situation, I got hit by the googlebug 302, tha means that another site has a link to you (302 link) which google then thinks its unique page/site then you at once have a dublicated situation, google will not corect this problem, my hope is that once they will get back to have a good search engine they will update there supplemental results DB and then we could MAYBE get out of this dublicated situation, because many of those old 302 links are in there caches.

Still look for other domains that has your title and a copy of your page in the cache, also try to take a part of a unique text from your site and past in to a "your text" search in google.

What I have tried:
contact google, googleguy, matt...
used the removal tool
contact the hijackers and the sites with a 302 link
changed some of my pages.
made 301 from non www to www for 5 month ago, the non www is still in the index.

IM SURE i dont have a ban, the site has NEVER been hit badly in ANY update and its as clean as it gets.

textex

9:06 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



This site has had issues in the past with people scraping content from my index page. Site has ranked well for very competetive terms for quite some time, resulting in the scraping.

I update the content on the index about once every few months so that I avoid the scraping issue.

As mentioned, I see alot of this:

/www.mysite.com/ URL moved, Please ...
/www.mysite.com/ URL moved, Please visit wierdsite.com.
wierdsite.com/www.mysite.com/ - Supplemental Result - Similar pages

Would this be a cause of problem?

textex

9:09 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Also, is there a way to search on Yahoo! for the sites that may cause the 302 bug?

zeus

9:19 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Yahoo had this problem for 1 month then they fixed it. It seems google is not so fare in that SE section.

If you mean seaching for possible googlebug 302 on yahoo, no you can not do that.

About the site urls, I would not say its a problem if they dont have a cache of your site, but I also have such sites as you mentioned pointing/copying the domain.

textex

9:55 pm on Sep 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Trying hard to get to the bottom of this....and I thank you for your input.

Do you think I should do more that just check the sites I link to see if they are penalized in Google? Or do you think I should see if they are suffering any sort of 302 bugs?

I have always been very careful with who I link to, but it seems as though there is more checking that needs to be done than just checking to see if a site has PR still and/or is still showing in the index.

Am I reading too deep into this whole thing?
<b>Is it merely something out of my hands?</b>

textex

10:12 pm on Sep 26, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



A little update....

Site has been gone from SERPS since my last post.
However, site is being returned in results when you search for KWs in quotes.

I hope this is the first step for good things to come.

jd01

10:24 pm on Sep 26, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I have had a 302 redirect set-up for almost 9 months.

If this is true, you are the cause of your problems... The proper handling of a 302 (undefined) redirect is to request the page from the original location, not the location being redirected to. You are effectively 'hi-jacking' your own (good) site with the site you would like to be removed -- If you are using and continue to use a 302, the site you are redirecting (would like removed) will not ever be removed permanently (unless Google decides to throw HTTP Standards out the window), and you will quite possibly continue to harm the site you would like indexed.

For more information on redirects, I highly recommend reviewing what the W3C has to say about the proper handling of redirects by user-agents.

Justin

textex

10:41 pm on Sep 26, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Huh?

I do not have a 302 on my 'bad' site, the one I want removed.

I have a 302 on the 'good' site to avoid the www and non-www indexing issues.

I have come to the conslusion that the re-listing of the 'bad' site and my negative effects on the good site was coincidence and I was hit with a canonical url issue instead.

jd01

10:55 pm on Sep 26, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Sorry, misunderstood your post -- I do recommend changing the 302 to a 301 on the www - non-www, if you are set on redirecting. The 'good' site dropping may be totally unrelated to the other site... You might look at the '301 Club' and other recent threads before deciding to redirect from one version of your domain to another -- It appears these are causing some issues, and your 'good' site may have dropped because of the redirect, and not the other site being shown in the index again.

Justin

BTW Agree with the coincidence summary of your last post...