Forum Moderators: Robert Charlton & goodroi


302 Redirects continue to be an issue

         

japanese

6:23 pm on Feb 27, 2005 (gmt 0)

10+ Year Member



recent related threads:
[webmasterworld.com...]
[webmasterworld.com...]
[webmasterworld.com...]



It is now 100% certain that any site can destroy low to midrange PageRank sites by causing Googlebot to snap up a 302 redirect served by a script (PHP, ASP, CGI, etc.) and backed by an unseen, randomly generated meta refresh page pointing to an unsuspecting site. In many cases the encroaching site actually writes your website's URL into a 302 redirect on its own server. This is a flagrant violation of copyright and a manipulation of search engine robots, geared to exploit and destroy websites and to artificially inflate the ranking of the offending sites.

Many unethical webmasters and site owners are already creating thousands of TEMPLATED (ready to go) SKYSCRAPER sites fed by affiliate companies' immense databases. The companies that hold your website info in their databases feed your page snippets, without your permission, to vast numbers of these skyscraper sites. A carefully adjusted PHP-based redirection script, which issues a 302 redirect to your site and includes an affiliate click checker, then goes to work. What is very sneaky is the randomly generated meta refresh page, which can only be detected with a good header interrogation tool.
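As a rough illustration of what a "header interrogation tool" looks at, here is a minimal Python sketch. It is not any particular tool mentioned in this thread. The function names, the `tracker.example` and `yoursite.example` hosts, and the simple meta refresh pattern (which expects `http-equiv` before `content`) are all assumptions made for the example.

```python
import http.client
import re
from urllib.parse import urlsplit

# Finds the URL in <meta http-equiv="refresh" content="0; url=...">.
META_REFRESH = re.compile(
    r'<meta[^>]+http-equiv\s*=\s*["\']?refresh["\']?[^>]*'
    r'content\s*=\s*["\']?\s*\d+\s*;\s*url\s*=\s*([^"\'>\s]+)',
    re.IGNORECASE,
)


def inspect_response(status, headers, body=""):
    """Return (kind, target): how a response forwards the visitor, if at all."""
    lowered = {name.lower(): value for name, value in headers.items()}
    location = lowered.get("location")
    if status in (301, 302) and location:
        return (f"http-{status}", location)
    match = META_REFRESH.search(body)
    if match:
        return ("meta-refresh", match.group(1))
    return ("none", None)


def bare_host(host):
    """Lower-case a host name and drop a leading www."""
    host = host.lower()
    return host[4:] if host.startswith("www.") else host


def points_at(target, my_host):
    """True if a redirect target lands on my_host, with or without www."""
    if not target:
        return False
    if not target.startswith("//") and "://" not in target.split("?", 1)[0]:
        target = "//" + target  # the scripts often drop the scheme
    return bare_host(urlsplit(target).hostname or "") == bare_host(my_host)


def fetch_raw(host, path="/"):
    """Fetch one page over plain HTTP without following any redirect."""
    conn = http.client.HTTPConnection(host, timeout=10)
    try:
        conn.request("GET", path)
        resp = conn.getresponse()
        body = resp.read().decode("utf-8", "replace")
        return resp.status, dict(resp.getheaders()), body
    finally:
        conn.close()
```

To check a suspect link, pass the three values returned by `fetch_raw("tracker.example", "/goto.php?path=yoursite.example%2F")` to `inspect_response`, then hand the reported target to `points_at` together with your own host name.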

Googlebot and MSNBot follow these PHP scripts to the 302 redirect, held either on an internal sub-domain or server-side, and “BANG”, down goes your site if its PageRank is below the offending site's. Your index page is crippled because Googlebot and MSNBot now consider your home page at best a supplemental page of the offending site. The offending site's URL that contains your URL is indexed as belonging to the offending site. The offending site knows that Google does not reveal all links pointing to your site and takes a couple of months to update, so an INURL:YOURSITE.COM search will not be of much help in tracing it for a long time. Note that these scripts mostly use your URL stripped, without the WWW, which makes detection harder. This also causes Googlebot to generate another URL listing for your site, which can be seen as duplicate content. A 301 redirect resolves at least the short URL problem, relieving Google of deciding which of your site's two URLs to index higher (more often the one with the higher linked PageRank).

Your only hope is that your PageRank is higher than the offending site's. This alone is no guarantee, because the offending site will have targeted many higher PageRank sites within its system on the off chance that it strips at least one of them. This is reinforced by hundreds of other hidden 301 permanent redirects to sites of PageRank 7 or above, again in the hope of stripping a high PageRank site, which would then empower their scripts to hijack more efficiently. Sadly, supposedly ethical big-name affiliates are involved in this scam. They know it is going on, and Google AdWords is probably the main target of revenue, though I am sure Google does not approve of its AdSense program being used in such a manner.

Many such offending sites have no e-mail contact, a hidden WHOIS record and no telephone number. Even if you were to contact them, you would find in most cases that the owner or webmaster cannot remove your links from their site because the feeds come from affiliate databases.

There is no point in contacting GOOGLE or MSN, because this problem has been around for at least 9 months; only now is it escalating at an alarming rate. All sites with a PageRank of 5 or below are susceptible. If your site is 3 or 4, be very alarmed. A skyscraper site need only create child-page linking to get PageRank 4 or 5, without the need to strip other sites.

Caution: trying to exclude them via robots.txt will not help, because these scripts are able to convert almost daily.

Trying to remove a link through Google that looks like
new.searc**verywhere.co.uk/goto.php?path=yoursite.com%2F will result in your entire website being removed from Google’s index for an indefinite period of time, at least 90 days, and you cannot get re-indexed within this timeline.

I am working on an automated 302 REBOUND SCRIPT to trace and counteract an offending site. This script will spider and detect all pages, including sub-domains, within an offending site and blast all of its pages, including dynamic pages, with a 302 or 301 redirect. Hopefully it will detect the feeding database and blast it with as many 302 redirects as it contains URLs. So in essence it is a programme in perpetual motion, creating millions of 302 redirects for as long as it stays on. As every page is a unique URL, the script will hopefully continue to create redirects and bombard any site that dynamically generates pages with PHP, ASP or CGI redirecting scripts. A SKYSCRAPER site that is fed can have its server totally occupied by a single efficient spider that requests pages at split-second intervals throughout the day and week.

If the repeatedly spidered site is depleted of its bandwidth, it may then be possible to remove it via Google's URL removal tool. You only need a few seconds of a 404 or a 403 from the offending site for Google’s URL console to detect what it needs, whether that is the site or the damaging link.

I hope I have been informative and helped anybody with a hijacked site whose natural revenue has been unfairly hit. Also note that your site may never regain its rank, even after the removal of the offending links. Talking to offending site owners often results in their denying that they are causing problems and saying that they are only counting outbound clicks. And they seem reluctant to remove your links....Yeah, pull the other one.

[edited by: Brett_Tabke at 9:49 pm (utc) on Mar. 16, 2005]

yankee

4:48 am on Mar 12, 2005 (gmt 0)

10+ Year Member



If I add a base href tag, change all links to absolute, and add dynamically changing content daily, will it help get my traffic back? Has anyone tried this on a site that's already been hijacked?

deanril

4:50 am on Mar 12, 2005 (gmt 0)

10+ Year Member



Ok, Hmmm.

I did an allinurl:[my domain] (no-www) and there is my site first then 5 other sites not mine, with my cache!

#1 The first one has this
url - www.[domain]/d/p.php?url=http://www.[my domain]%2F
But it doesn't have a cache, probably because it is using no-cache. If you click on the link it goes to my site.

#2 url - www.[domain]/multi/translate.asp?lp=en_de&url=http://www.[my domain]%2F

Cache is a translation of my site into some other language. It goes to my site when clicked, but then my site is translated into this language.

#3 url - lang.[domain]/av_translate.php?lp=en_es&url=http://www.[my domain]%2F

Cache is a translation of my site into some other language. It goes to my site when clicked, but then my site is translated into this language.

#4 url - [domain]/c/15294
This one has a Title from one of my standard products, the title my ecom gives all products

Cache is of my site (one of the products) but some pictures are missing. If you click on the link with my title it goes to my site, and the URL seems normal once at my site.

#4 I think is deliberate; the other ones I think may be non-deliberate and something to do with their software.

At the bottom I have the "repeat the search with the omitted results included" when I do that, I see mostly my stuff with these 5 towards the end.

Am I freaking out because of this thread or is this a legit attempt on my site?

edit: the same ones show up in the allinurl:[my domain] except #4
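Listings like #1 to #3 above carry the target site inside a query parameter, so they can be picked out mechanically. The Python sketch below shows one way to do that under stated assumptions: the function name and the `tracker.example` and `yoursite.example` hosts are made up for illustration. It cannot flag a listing like #4, whose URL does not contain the target domain at all.

```python
from urllib.parse import parse_qsl, urlsplit


def bare_host(host):
    """Lower-case a host name and drop a leading www."""
    host = host.lower()
    return host[4:] if host.startswith("www.") else host


def embedded_target(listing_url, my_domain):
    """Return the name of the query parameter that carries my_domain, or None."""
    head = listing_url.split("?", 1)[0]
    if not listing_url.startswith("//") and "://" not in head:
        listing_url = "//" + listing_url  # search listings often omit the scheme
    parts = urlsplit(listing_url)
    mine = bare_host(my_domain)
    host = (parts.hostname or "").lower()
    if host == mine or host.endswith("." + mine):
        return None  # the listing is on my own site
    # parse_qsl decodes percent-escapes such as %2F before we compare
    for name, value in parse_qsl(parts.query, keep_blank_values=True):
        if mine in value.lower():
            return name
    return None
```

Running every URL from an allinurl: result through `embedded_target` gives a quick list of which third-party scripts wrap your address, and under which parameter.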

[edited by: deanril at 4:57 am (utc) on Mar. 12, 2005]

[edited by: ciml at 12:54 pm (utc) on Mar. 12, 2005]
[edit reason] Examplified [/edit]

surfgatinho

4:53 am on Mar 12, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I keep seeing my site appearing and then it's gone.
When it's not there I see a list of 302 style sites.

It seems to depend on data centres - does this mean Google are trying to fix it?
216.239.59.104 - bad
66.102.11.99 - good

Emmett

5:13 am on Mar 12, 2005 (gmt 0)

10+ Year Member




I keep seeing my site appearing and then it's gone.
When it's not there I see a list of 302 style sites.

I've seen some fluctuation. There was a period yesterday when the 302 site directing to mine was gone for a few minutes. Then it started showing an older index. My client's site, which disappeared when mine did, is back in the results for his name. My lost site was still gone without filter=0 though.

I've added the base href to my new site, along with absolute URLs and a 301 from mysite.com to www.mysite.com. I haven't done the dynamic content yet though. One of its pages got replaced by a 302 a few days ago. It's so deep in my site that I don't think it will cause major trouble. It's new, so it's got 0 PageRank, and I'm a little worried about it.
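The 301 from mysite.com to www.mysite.com boils down to one decision per request. A minimal Python sketch of that decision follows. The function name is hypothetical and the `http://` scheme is assumed. On Apache the same thing would normally be done with a rewrite rule, not application code.

```python
def canonical_redirect(host_header, path, canonical_host="www.mysite.com"):
    """Return (301, location) for the bare domain, or None if no redirect is needed."""
    host = host_header.lower().split(":", 1)[0]  # ignore any :port suffix
    if canonical_host.startswith("www."):
        bare = canonical_host[4:]
    else:
        bare = canonical_host
    if host == bare:
        # Permanent redirect, so crawlers keep a single URL for each page
        return (301, "http://" + canonical_host + path)
    return None
```

A request for mysite.com/page.html gets a permanent redirect to the www address, while requests that already use www.mysite.com pass through untouched.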

larryhatch

5:32 am on Mar 12, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hello Emmett:

You may want to visit this page: [greenbuilder.com...]
That site is saying BASE HREF is being deprecated, and may be
disallowed in the future.

Unless I misunderstand, it simply takes all your relative urls
like /somepage.html , and converts them to an absolute URL like
www.yoursite.com/somepage.html.
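To make that resolution step concrete, here is a small Python sketch using the standard library's `urljoin`. The function name and the `tracker.example` address are assumptions for illustration. It only shows how relative links resolve with and without a base href, and makes no claim about how any search engine treats the result.

```python
from urllib.parse import urljoin


def resolve_links(document_url, links, base_href=None):
    """Resolve links the way a browser would, honouring an optional base href."""
    # Without a base href, relative links resolve against the document's own URL
    base = urljoin(document_url, base_href) if base_href else document_url
    return [urljoin(base, link) for link in links]
```

If the page is reached as `http://www.tracker.example/goto.php?path=yoursite.com%2F`, then `/somepage.html` resolves to a URL on www.tracker.example when there is no base href, and to `http://www.yoursite.com/somepage.html` when `base_href` is `http://www.yoursite.com/`.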

Personally, ALL my internal and external links are the full length absolute kind.
Only my image (.gif/.jpg) file URLs are relative, for ease in programming.

- Larry

john316

5:53 am on Mar 12, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I did this to one ripper:

It is two steps, and you need their IP, .htaccess and PHP:

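RewriteEngine on
# Match only requests coming from the ripper's IP address
RewriteCond %{REMOTE_ADDR} ^111\.111\.111\.111$
# Skip redirect.php itself so the rule cannot loop
RewriteCond %{REQUEST_URI} !^/redirect\.php$ [NC]
# Send everything else from that IP to redirect.php
RewriteRule ^(.*) /redirect.php [R,L]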

That will send requests from their IP to your redirect.php file that contains:

<?php
// Forward the request from the matched IP to a destination of your choosing
header("Location: [where*you*send*them.com...]");
exit;
?>

That will send the request coming from their IP to la-la land, so when doofusbot requests h*tp://www.example.com/gotosite.asp?ccxxyyourstuff.com it gets where*ever*you*send*the*stupid*thing.com. I would recommend a few destinations, but that is up to you (bad neighborhoods come to mind).

It takes a few weeks for the interloper's cache of your stuff to disappear.

Emmett

6:10 am on Mar 12, 2005 (gmt 0)

10+ Year Member




Personally, ALL my internal and external links are the full length absolute kind.
Only my image (.gif/.jpg) file URLs are relative, for ease in programming.

That's what I'm doing also. I know base href is going away but I figured it couldn't hurt.

deanril

6:27 am on Mar 12, 2005 (gmt 0)

10+ Year Member



An add-on post to my msg #347

When I did the check for allinurl:www.mysite.com it must have been a different datacenter, because I do it now on both www and non-www and the #4 one is completely gone.

Again, the #4 site looks highly suspect as a wrongdoer; going to its home page, it looks like a site that's up to no good.

But like I say I check now, and #4 the highly suspect one is gone. A possible indication of a fix on some datacenters?

Reid

7:19 am on Mar 12, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



base href is deprecated.

So what? I'm writing HTML 4.01, and my DTD header says 4.01.
When they get to 4.02 let me know, but do you think 4.01 will disappear from the internet?

Reid

7:33 am on Mar 12, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I got the same thing too.
When I do a site: I get 4 links. Apparently my linktous page is my home page, because it has a
"see more results" link underneath it which contains the rest of my web site (except my photo gallery).
2 php type redirects I already got rid of but the third cgi type redirect is a site linking to the index page of my photo gallery (and displaying my page within a frame).
I found my entire photo gallery under links: associated with the same cgi type link. So apparently my photo gallery is inbound links to my website.

I've been debating with myself whether I should get this cgi type link gone or whether I should spam my photo gallery with more inbound links.

MikeNoLastName

7:39 am on Mar 12, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Just a suggestion:
Someone who has been severely damaged by this problem needs to have their attorney contact Google's attorneys under the auspices of "discovery" and force them to reveal in a documented court proceeding, HOW this could possibly be allowed to happen under their own noses, when it has been brought to their attention numerous times (I think it's called 'criminal negligence' in that case), or prove once and for all that there is NO way that it is possible. Simple as that.

MLHmptn

7:41 am on Mar 12, 2005 (gmt 0)

10+ Year Member



Glad to see all you people still love GOOGLE!

NOT!

What's even more disturbing is to hear people praise Google! Google is broke! Bottom line!

And really they could give a rats a$$ about any of us webmasters!

Bring the heat MSN and Yahoo!

kwngian

7:50 am on Mar 12, 2005 (gmt 0)

10+ Year Member



It seems to depend on data centres - does this mean Google are trying to fix it?
216.239.59.104 - bad
66.102.11.99 - good

Surfgatinho

I am seeing the opposite. All the bad ones are in 66.102.11.99.

Must be quite difficult to clean them up. Very soon all the subdomains will be scrutinized heavily by google.

stargeek

8:55 am on Mar 12, 2005 (gmt 0)

10+ Year Member



Someone who has been severely damaged by this problem needs to have their attorney contact Google's attorneys under the auspices of "discovery" and force them to reveal in a documented court proceeding, HOW this could possibly be allowed to happen under their own noses, when it has been brought to their attention numerous times (I think it's called 'criminal negligence' in that case), or prove once and for all that there is NO way that it is possible. Simple as that.

I'm willing to spend $$$ on this. I'll contact a lawyer to see if this could actually hold any water.

ugnius

8:57 am on Mar 12, 2005 (gmt 0)

10+ Year Member



MLHmptn, don't you have anything better to do than shout in almost every thread that Google is broken?
Go and use Yahoo or MSN.

Google never was broken. Had some technical difficulties and still has some.

You don't like it, go somewhere else. Easy as that.

This 713 message thread spans 48 pages.