homepage Welcome to WebmasterWorld Guest from 54.205.254.108
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Google / Google SEO News and Discussion
Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

This 277 message thread spans 10 pages: < < 277 ( 1 2 3 [4] 5 6 7 8 9 10 > >     
How to Remove Hijacker Page Using Google Removal Tool
8,058,044,651 page indexed (now minus 1)
Idaho




msg:756514
 6:19 pm on Mar 17, 2005 (gmt 0)

Continued from: [webmasterworld.com...]


With the help of posts from crobb305 and others, I was able to remove a hijacker's page from the Google index.

My site was doing very well in the SERPs. For over 2 years it had been on the first page for a competitive term (1.2 million listings). Then during the first week in January my site disappeared and traffic tanked for no obvious reason.

When searching for "site:www.mydomain.com" I noticed that my index page often wasn't listed or it appeared on about page 3 or 4 of the results after all my supplimental pages.

A search for "allinurl:mysite.com" often didn't show my index page at all but instead showed somebody else's domain (located in Turkey). When I clicked on this link, my site came up. When I clicked on the cached version of the site, it showed a very old cache of the page. This same site also showed up after all my results when doing a "site:www.mydomain.com"

Using a header checker tool on the site's URL I was able to see it was using a 302 link to my site.

Last night after reading some posts by crobb305 and others I went to Google.com and clicked on "About Google." Then I clicked on "Webmaster Info." Then I clicked on "I need my site information removed." Then I clicked on "remove individual pages." Where I found instructions on how to remove the page.

(Here's the exact page where I ended up. If mod needs to remove then snip away:) [google.com...]

I then clicked on the "urgent" link.

Then:
1. I signed up for an account with Google and replied back to them from an email they sent me;
2. I added the "noindex" meta tag according to their instructions and uploaded it to my site;
3. Using the instructions to remove a single page from the Google index, I added the hijacker's URL that was pointing to my site. (copy and paste from the result found on "allinurl" search)

This didn't work the first time because I had to remove a space from the url to get it to work.

4. I got a message back saying that the request would be taken care of within 24 hours. The URL that I entered showed on the uppper right hand part of the screen saying "removal of (hijacker's url)pending."
5. I then removed the "noindex" meta tag from my page and re-uploaded it to my site.

This morning the google account still shows the url removal as "pending" but when I do "site:" and "allinurl" searches the offending URL is gone and my index URL is back.

Conclusions and Speculations:
At some point last September, Google cached the hijack page's url pointing to my site. In January, Google penalized my site for duplicate content because it found both URL's and compared them. Mine got penalized because it was the only page that really existed. The hijacker's page didn't get penalized because it only existed as a re-direct to my site.

Because my index page was now penalized, it dropped almost completely from the SERPs. (Some of my suppliement pages showed up for obscure searches) but none of my money terms.

Because I haven't been able to get a response from the hijacker's webmaster, the 302 is still in place but it is buried deep in his site and the last Google cache of the page was sometime in September. Therefore with some luck Google won't re-index it any time soon.

Will my site return to the SERPs? I don't know. Any thoughts?

 

illusionist




msg:756604
 3:18 pm on Mar 19, 2005 (gmt 0)

I removed that noindex as soon as i submitted the 302 redirect url and got a pending statement.

I was searching for my unique site name and it came up in the third page before i used the url remove tool it came up as the first result.

Reid




msg:756605
 5:24 pm on Mar 19, 2005 (gmt 0)

Illusionist

Search site:yourdomain.com If your homepage is there (doesn't need to be top) and the other one is not, then it's fixed. done.

Reid




msg:756606
 5:45 pm on Mar 19, 2005 (gmt 0)

For you guys like Claus and others who had a bunch of pagejackers and got rid of them but your site won't recover since. I'm wondering if somehow google has associated you with a 'bad neighborhood' and that is why googlebot wont find you.
Tried resubmitting, maybe what you need to concentrate on is getting 'linked to' from some fresh 'good neighborhoods' googlebot will follow their links to you. The other thing is I would take a look at your site through a Lynx browser, may be surprised at what you find. Do a search for poodle predictor, check that out.

zeus




msg:756607
 10:07 pm on Mar 19, 2005 (gmt 0)

Reid - Yes it is a problem, I got rid of mine also, but still have 3 that are now redirecting to there homepage, but cache is still my site. To get googlebot back I tried, submitting/toolbar and links, nothing has changed. I did at one day get hit 120 time by googlebot, they did also get index, but slowly they also fall out of the index.

crobb305




msg:756608
 10:48 pm on Mar 19, 2005 (gmt 0)

Right now, Google likes having those stale caches in it's index. I have urls last cached Nov 2. Google is content with this new business model. And nothing is going to change. Hijacking urls are here to stay until they decide they want to provide the quality of search that they used to.

BTW, a new redirect just showed up for my site in the form of a subdomain like mysite.jm8.net

Using removal tool, it is now gone. But, for a while, it was ranking #1 for my company name and for snippets of text from my home page. As I have said before, absolutely pathetic.

crobb305




msg:756609
 1:12 am on Mar 20, 2005 (gmt 0)

walkman,

What is really funny (and still pathetic) is that Google is listing a bunch of bad urls in the form http/http://example.com/%20

And even though they don't exist, you can't submit them to Google for removal because the removal tool doesn't recognize the format. If the removal tool knows that the url is not correct, how does it get indexed in the first place? Why is Google loaded down with crap like this, fake urls, etc.

Sad.

g1smd




msg:756610
 1:16 am on Mar 20, 2005 (gmt 0)

I tried the URL Removal Tool to try to remove pages that are on a site where the domain has expired and some others on a domain that is password protected.

The action was refused because Google could not fetch the page to read the meta tags, nor could it fetch any robots.txt file either.

How can I add a temporary robots.txt file to a domain that does not exist? Google hangs onto those old cached pages like glue.

theBear




msg:756611
 1:57 am on Mar 20, 2005 (gmt 0)

crobb305,

Databases get all kinds of trash in them if you don't use some real edits right up front.

The devil is in the details, plenty of folks can dream up things.

It takes someone very picky to write good edits.

Unfortunely a lot of wetware malfunctions when doing this. You see examples of the failure all the time. That is why there are things like buffer offerflow exploits, executable code insertion exploits, etc ... etc ...

Just look at IE, XP, Moz, Firefox, Linux, or you name it.

There are failure modes because of not checking validity of data prior to acting on the data.

These are seperate from doing the wrong thing with the data.

Reid




msg:756612
 2:29 am on Mar 20, 2005 (gmt 0)

I just removed one from my clients site. Tried robots.txt - didn't work, tried META tag, didn't work "the page appears to exist" tried renaming the homepage temporarily to get a 404, didn't work, the virtual host he is using substituted their placeholder. So I did a mod rewrite to redirect the whole domain to a bogus page - got a 404 when I tried to view the page, URL removal worked.

(he only gets 20-30 uniques per day and the site was only down for a min, he was happy the link is gone)

Vec_One




msg:756613
 2:31 am on Mar 20, 2005 (gmt 0)

I don't understand why Googlebot stubbornly refuses to crawl all of these old pages that haven't been cached since September or November. I've tried submitting them and linking to them. It's almost as if Google is purposely refusing to solve this problem.

Could there be a rational purpose for this that isn't obvious? Nah, probably not. Most likely, it's just screwed up and nobody cares. At this point Alta Vista is looking pretty sophisticated.

hdpt00




msg:756614
 3:17 am on Mar 20, 2005 (gmt 0)

Here is what one a** is doing to hijack me. They are link through initially to yahoo, then to their domain, then to a redirect. The link from inurl: shows their domain, the link on their site is actually the whole yahoo 302 mess.

When I treid removal of the url in google (w/o the yahoo mess) it said page not found or something, when I copied the actual url link with yahoo stuff in it, it seemed to work. However, of the 4 hijackers, this page still shows up while the others are gone.

Dang I thoguht I might finally get these penalties off of me. Also, one site that is now a 404 did a 302 hijack to me and no site exists so google now gives me a no URL found error but still shows as a supplemental result.

I love you google.

thnkfst




msg:756615
 3:41 am on Mar 20, 2005 (gmt 0)

I've certainly had enough. The only remove page I can get is a Google Japan site and I'm not exactly inclined to read anything but French or English.

Anyone get this when they click on the automatic URL removal system?

Is there a page that is in English? or is it just Google punishing MAC users?

Emmett




msg:756616
 3:55 am on Mar 20, 2005 (gmt 0)


I don't understand why Googlebot stubbornly refuses to crawl all of these old pages that haven't been cached since September or November. I've tried submitting them and linking to them. It's almost as if Google is purposely refusing to solve this problem.

Could there be a rational purpose for this that isn't obvious? Nah, probably not. Most likely, it's just screwed up and nobody cares. At this point Alta Vista is looking pretty sophisticated.

Vec_One,

I have a theory. When I submitted some split site urls to google, no matter what it would never update the cache of the pages, then it dawned on me. Everytime googlebot visits my server was returning a "not modified" code so google wouldn't see a need to re-fetch the pages and update the index. I went through and did a global search and replace on those pages to re-save them with a new date. Now I've noticed double entries in my logs where the files get hit, then 301'd to the proper location.

Now I'm thinking thats why google isn't updating those php redirects that used to go to my site but are now directed to their homepage. Anybody know if a php script like that would return "not modified"?

walkman




msg:756617
 4:23 am on Mar 20, 2005 (gmt 0)

"What is really funny (and still pathetic) is that Google is listing a bunch of bad urls in the form http/http://example.com/%20"

they need 8 billion pages :)
I doubt they matter though...unless Google can index, them I don't think that they cause problems. I hope so anyway

nakulgoyal




msg:756618
 6:08 am on Mar 20, 2005 (gmt 0)

In my case, emailing the webmaster worked. But the information above is useful. Thanks everybody.

walkman




msg:756619
 6:33 am on Mar 20, 2005 (gmt 0)

"In my case, emailing the webmaster worked"

it worked for my 4 first ones. One guy couldn't care less. Sent him 3 emails. Maybe I should return the favor and link him ;)

Reid




msg:756620
 7:52 am on Mar 20, 2005 (gmt 0)

As long as that link is still active to your page you should be able to remove it.
Just rename the page or whatever you need to do to make that link return a 404 page not found.
Then submit the hijackers link to be removed. Once you get a 'successful' then your done.

the problem is removing inactive links or if someone has a copy of your page on another server.

I suspect that these guys like having these dead links layin around, maybe because the cache is never updated they still count somehow, I don't know.

I have one in my site: that was removed from the directory that hosted it but Googles URL tool won't remove it because the php file is still there but does not recognize the id# for that link, returns an empty header. They won't talk to me either, got banned for poking around investigating my link (and others). Joke is on them though because they banned my home IP (not my site).

Emmett




msg:756621
 8:06 pm on Mar 20, 2005 (gmt 0)

Has anyone figured out a solid way to get rid of a 302 redirect thats been updated to point to their home page but still cached in Google as the old redirect? I've tried submitting the link to Google's addurl and put the link on my home page but it hasn't updated yet.

Vec_One




msg:756622
 8:25 pm on Mar 20, 2005 (gmt 0)

Emmett, although other people have tried submitting and linking to such a page, I haven't heard any success stories. AFAIK, the only solution is to convince the webmaster to completely remove the link, and then submit it to the Google URL removal tool.

I am currently trying to have two of these links removed. If anyone knows a better way, *please* let me know!

walkman




msg:756623
 8:29 pm on Mar 20, 2005 (gmt 0)

success story...in removing the links via this method. I can only hope the serps will reflect that. Not holding my breath though

"Emmett, although other people have tried submitting and linking to such a page, I haven't heard any success stories. AFAIK, the only solution is to convince the webmaster to completely remove the link, and then submit it to the Google URL removal tool."

Emmett




msg:756624
 9:02 pm on Mar 20, 2005 (gmt 0)

Only problem with removing the link is their go.php script will return to their home page for any invalid id #. I doubt I could get them to change the programming of their script.

Would filing a DMCA on the google cache work in this case?

trimmer80




msg:756625
 9:09 pm on Mar 20, 2005 (gmt 0)

I have the same problem Emmett
, i am requesting the webmaster add a line to their .htaccess file that will force a 302 redirect for that exact url and thus i can remove it. :D

zeus




msg:756626
 9:55 pm on Mar 20, 2005 (gmt 0)

We all know that the only one that can solve 100.000s of websites problem is google, its there problem, so still the only solution is to go public with this 302 googlejacking to the press/internet sites, if you hit them so they can feel it in the stock price you will get a solution within a month.

Since there has been NO comments on this topic in the press or forums from google, they are not interested.

Lorel




msg:756627
 10:22 pm on Mar 20, 2005 (gmt 0)

"What is really funny (and still pathetic) is that Google is listing a bunch of bad urls in the form http/http://example.com/%20"

they need 8 billion pages :)
I doubt they matter though...unless Google can index, them I don't think that they cause problems. I hope so anyway

I agree. This whole mess is because if google fixes it then they loose a lot of pages indexed-and the amount they loose will tell us how many pages were affected too.

re the example.com%20

I've seen it under the site command. So Google is indexing them

g1smd




msg:756628
 12:38 am on Mar 21, 2005 (gmt 0)

I recently read an article about the very small number of websites using the domain of a small island nation in the South Atlantic, one which Britain fought a war over about 20 years ago.

To see for myself I did a site: search on the Top Level country domain. Even in a couple of thousand results I was amazed at the amount of malformed rubbish that has got into Google's database. Those sort of URLs shouldn't be in their database at all, never mind actually turning up in search results.

Reid




msg:756629
 8:35 am on Mar 21, 2005 (gmt 0)

Only problem with removing the link is their go.php script will return to their home page for any invalid id #. I doubt I could get them to change the programming of their script.

How about do a DOS attack to bring their server down - that way you'll get a 404 so you can remove your link.

Just a thought...

Dayo_UK




msg:756630
 10:23 am on Mar 21, 2005 (gmt 0)

This is getting ridiculous.

Cant Google just ask Yahoo if they can use Yahoo results until this is fixed - or perhaps ask - Ask Jeeves to use Teoma results (might be a little less embarrasing and they have a contract re Adwords.)

Well Google have certainly confused SEOs and Webmasters if that was the plan - confused us by asking why so many problems are not getting fixed.

entropicus




msg:756631
 11:04 am on Mar 21, 2005 (gmt 0)

this problem, assuming people are using it to gain PR or SEP, could also be done with 301s possibly.

What if i have a PR 6 site, then created a blank directory with a htaccess 301 redirect to the most popular file/page on a site with a PR 2?
eg:
redirect /mydir/mysite.html h**p://somesite.com/somefile.html

The googlebot would also see this as duplicate content, and also think that the content it directs to belongs to my site.

claus




msg:756632
 1:29 pm on Mar 21, 2005 (gmt 0)

entropicus, no 301 would not work. 301's and 302's work in opposite directions - see my answer to your post in the other thread :-)

vfilip




msg:756633
 3:39 pm on Mar 21, 2005 (gmt 0)

this is a google bug and google just doesn't care to resolve it . It indexes both my site and another site ( maybe scrapper site ) , but what about if i have paid an advertisement and they have linked me in that way ( 302 redirect)? Have i paid to kill my self? I mean this the usual way of an advertiser to link to sites! 302 redirect is a common practice in the web . what can we do?

Emmett




msg:756634
 12:12 am on Mar 22, 2005 (gmt 0)

OMG! I'm back in 2 datacenters!

64.233.167.99 and
64.233.167.104

The 302 link is still there but I don't seem to be getting a penalty for it.

Ranked #2 for my company name :D

This 277 message thread spans 10 pages: < < 277 ( 1 2 3 [4] 5 6 7 8 9 10 > >
Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google SEO News and Discussion
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved