kila_m

msg:749007 | 7:25 pm on Mar 16, 2005 (gmt 0) |
Well at least some of the press have picked up on it.. any one want to post on slashdot as well? [theinquirer.net...]
|
Reid

msg:749008 | 7:35 pm on Mar 16, 2005 (gmt 0) |
want to know how bad this problem is? google 'intitle:302 found' Only 17,700 results. Not all are jackers but most of em are. [edited by: Reid at 8:03 pm (utc) on Mar. 16, 2005]
|
incrediBILL

msg:749009 | 7:54 pm on Mar 16, 2005 (gmt 0) |
It was a nice article until they just started making junk up.... | sites began masking outbound links to prevent PR "leaking" from their pages and "reducing" their pages' importance to the Google algorithm. <snip> The main ways they did this was by cloaking the link in javascript (which Google wasn't parsing), but larger and more sophisticated sites used cgi and php scripts to redirect users to the final destination. Big sites like the Yahoo directory are good examples. |
| Yes, Yahoo is a serious cloaker, it's called CLICK THRU TRACKING. You can teach someone to write, just not write right.
|
ciml

msg:749010 | 8:01 pm on Mar 16, 2005 (gmt 0) |
Reid, are you suggesting that any of those listings could be 'hijacking' another URL?
|
zeus

msg:749011 | 8:04 pm on Mar 16, 2005 (gmt 0) |
I do think at least 100.000 sites are affected by some kind og googlejacking.
|
Reid

msg:749012 | 8:06 pm on Mar 16, 2005 (gmt 0) |
sorry CIML I edited it to narrow down the results. take a look intitle:302 found tell me what you think.
|
Reid

msg:749013 | 8:13 pm on Mar 16, 2005 (gmt 0) |
Ha I was just browsing through them - looking at the cached pages. One url from a geocities page has a META refresh pointing at Google's homepage. Your right cmil A LOT OF them seem to be legit but others...
|
ciml

msg:749014 | 8:18 pm on Mar 16, 2005 (gmt 0) |
I think that about 70 of them are WebmasterWorld threads. :-) Seriously though, a URL that has 'hijacked' another URL doesn't show in Google with '302 Found' in the title. So we're seeing pages that: a) have 302 in the title, because they are information or discussion on the subject or: b) have 302 in the title, because they were intended to be redirects but for some reason they return 200 or: c) have 302 in the title, are 302 redirects but Google interpreted them as normal pages Note that a page that redirects now, may not have done when it was crawled. By the way, allintitle: is sometimes more useful than intitle: <added> | GOOGLE KILLED BY 302 REDIRECTS |
| I'm very keen that Google should should treat a link to a 302 redirecting URL as a link to its destination, but Google still has quite a lot of Web sites so I don't think it's dead just yet.
|
AndyA

msg:749015 | 8:27 pm on Mar 16, 2005 (gmt 0) |
If you're referring to the fact that we redirect before the site leaves Alexa.com, but still deliver the visitors to the site in question, it is our right to use redirects to track where people go on our site and that behavior will not be changed. |
| Message to Alexa: You can do whatever you want on your own site, AS LONG AS IT DOESN'T CAUSE DAMAGE TO ANOTHER SITE. As the owner of my domain name, I determine how it is used, who uses it, and how they use it. If I feel anyone is unfairly using my domain to their advantage and my detriment, you MUST remove it from your site if requested. Got this bit of info from my Attorney...
|
theBear

msg:749016 | 8:33 pm on Mar 16, 2005 (gmt 0) |
ciml, I sent Brett a sticky. That may have the proper button to push in it. And to all folks who think that all inurl: entries are hijackers if they aren't from your own site should go back to school. ie: inurl:example.com returns: Free Resume Example Learn how to write your own resume. Welcome to Free Resume Example. "For with God nothing shall be impossible." - Luke ... www.domain.com/job-seeker-guide/ free-resume-example.htm - 12k - Which has a standard html static href to a site that has example.com as part of its name. example.com has not been hijacked by www.domain.com nor does it have any actual links of ANY type to example.com.
|
Reid

msg:749017 | 8:41 pm on Mar 16, 2005 (gmt 0) |
And to all folks who think that all inurl: entries are hijackers if they aren't from your own site should go back to school. |
| I agree with that. I think the test to see if you are being hijacked is site:mysite hijackers will show up here. These should ONLY be pages from mysite with myurl. also link:mysite none of my own pages should show here as inbound links. This is the site splitting phenomena.
|
theBear

msg:749018 | 8:44 pm on Mar 16, 2005 (gmt 0) |
Ried link:mysite.com should even show your internal site pages if they carry enough weight to be worthy. The site split will like the page insertion show up in a site: search and maybe in a link search depending on the weight of the page.
|
theBear

msg:749019 | 8:46 pm on Mar 16, 2005 (gmt 0) |
Now this desn't take into account other means of copyright infringment which may be taking place.
|
Reid

msg:749020 | 8:49 pm on Mar 16, 2005 (gmt 0) |
There is googles answer right there. They already can tell me in site: which pages are indexed from mysite. all urls here should be from the same domain.
|
walkman

msg:749021 | 8:55 pm on Mar 16, 2005 (gmt 0) |
"I personally know a website owner who for 2 years ranked number 1 for is keywords in google, yahoo, msn etc etc. His site went into "total oblivion" in google and msn simultaneously after 6 hijackings of his site appeared. 2 in google and 4 in msn 4 months ago." woudl he go on the record if a reporter wanted to talk to him? We need real names, real stories to make this work
|
theBear

msg:749022 | 8:55 pm on Mar 16, 2005 (gmt 0) |
Yes from your domain and your domain is: everything but someone else's domain so if you are www.example.com you are also example.com and by extension the ip addy and port on the server .... so now you see a so called split site. In addtion you can park a domain which is what folks normally do when they buy the .com .net .darnx's huh Brett versions of their site name. This can lead to many sets of duplicate content if the site is built using relative hrefs. [edited by: theBear at 9:00 pm (utc) on Mar. 16, 2005]
|
crobb305

msg:749023 | 8:56 pm on Mar 16, 2005 (gmt 0) |
I would love to know what has happened to all the information Googleguy requested. All of the examples that were mailed to the special Google Groups location at his request. Why has nearly a year gone by since my site was originally hijacked by tracker2.php urls? Why do several UNRLEATED urls still show in site:mysite.com search? When are they going to do anything? Will it take a major news story to get the attention of their investors for Google to start listening to webmasters -- the very group of people who make them money? Is there no hope for any resolution to this problem? C [edited by: crobb305 at 8:59 pm (utc) on Mar. 16, 2005]
|
Reid

msg:749024 | 8:58 pm on Mar 16, 2005 (gmt 0) |
thanks for the clarification about link: Bear but am I right about site:?
|
crobb305

msg:749025 | 9:01 pm on Mar 16, 2005 (gmt 0) |
Reid, Yes, you are correct about site:mysite.com. That command should ONLY show pages that are truely part of your site. In my case, that search (for my homepage) still shows 3 unrelated urls (redirect urls); at one point, there were as many as 20 redirect urls showing. My site was hijacked by malicious tracker2 scripted urls last May and subsequently disappeared from Google serps 2 weeks later. It had previously disappaered in Yahoo because of the same problem. But, thanks to their quick action, the problem was resolved within 3 months. [webmasterworld.com...] Chris
|
theBear

msg:749026 | 9:15 pm on Mar 16, 2005 (gmt 0) |
Reid site: Should show pages that belong to your site but I suspect this shows what google thinks belongs to your domain. Domain and site aren't really the same thing.
|
surfgatinho

msg:749027 | 9:35 pm on Mar 16, 2005 (gmt 0) |
I think I might have hijacked myself! Anybody familiar with PostNuke/PHPNuke open source content management/portal system? Well in it's links module it uses, guess what 302 redirects. When I look at site:myurl I find one of my nuke sites listed in with the other site and sure enough the cached page brings up the page from my (hijacked) site. When I look in the code the redirect is done using the php header location:... method. So, this shows how you don't have to be a nasty scraper site to **** things up for someone you're actually trying to do a favour. Does anyone know how to create a 301 redirect using the PHP header?
|
Reid

msg:749028 | 9:36 pm on Mar 16, 2005 (gmt 0) |
| I suspect this shows what google thinks belongs to your domain. |
| That is exactly the problem - when an unrelated domain shows up in site:mysite google 'thinks' this page belongs to my domain ie. hijacked
|
Reid

msg:749029 | 9:42 pm on Mar 16, 2005 (gmt 0) |
what gets me is why on these unrelated pages in site:mysite are not updated. The cache shows an old outdated page (my homepage) from last November but my real homepage is updated in the cache last week.
|
Brett_Tabke

msg:749030 | 9:45 pm on Mar 16, 2005 (gmt 0) |
Again - read claus [webmasterworld.com...] Claus Wrote: Related Threads - topic: Redirect bug I've collected a few related threads, please contribute if you see one that i haven't mentioned. Do list name of thread and starting date: - Is there a new filter? [webmasterworld.com] (May 4, 2004)
- Big problem with Yahoo [webmasterworld.com] (Apr 30, 2004)
- PR 7 - 0 and Address Nightmare [webmasterworld.com] (Apr 28, 2004)
- Problem with Googlebot and robots.txt? [webmasterworld.com] (Apr 12, 2004)
- Meta Refresh leads to ... [webmasterworld.com] (Mar 18, 2004)
- weird link showing up for my site in Web results [webmasterworld.com] (Feb 10, 2004)
- Google indexing redirect pages [webmasterworld.com] (Jan 31, 2004)
- free hosting sites banned from google? [webmasterworld.com] (Jan 31, 2004)
- Is using a redirect to track outward bound links bad? [webmasterworld.com] (Jan 27, 2004)
- Our company Lisiting is being redirected. [webmasterworld.com] (Jan 5, 2004)
- 302 Redirects showing ultimate domain [webmasterworld.com] (Dec 21, 2003)
- Strange results in Allinurl [webmasterworld.com] (Dec 20, 2003)
- Domain name mixup [webmasterworld.com] (Dec 9, 2003)
- Using Redirects [webmasterworld.com] (Nov 17, 2003)
- redesigns, redirects, & google -- oh my! [webmasterworld.com] (Oct 22, 2003)
- Google Partial Indexing? [webmasterworld.com] (Oct 21, 2003)
- Not sure but I think it is Page Jacking [webmasterworld.com] (Oct 9, 2003)
- Unindexed URL Google Ranking Trick [webmasterworld.com] (Oct 9, 2003)
- http://click.fastsearch.com.... [webmasterworld.com] (Oct 8, 2003)
- Duplicate content - a google bug? [webmasterworld.com] (Sept 26, 2003)
- Banner ad redirect-page indexed as mirror site by Google [webmasterworld.com] (Aug 13, 2003)
- Indexed AlltheWeb pages causing Google duplicates [webmasterworld.com] (Aug 14, 2003)
- Banner ad redirect-page indexed as mirror site by Google [webmasterworld.com] (Aug 13, 2003)
- DeepFreshBot's 301 Handling [webmasterworld.com] (June 16, 2003)
Okay, i admit, the last one might be a litlle too old by now, but it seems the problem goes back to August 2003 at least. |
|
|
japanese

msg:749031 | 9:54 pm on Mar 16, 2005 (gmt 0) |
<snop> [edited by: Brett_Tabke at 2:21 am (utc) on Mar. 17, 2005] [edit reason] no specifics please. [/edit]
|
japanese

msg:749032 | 9:59 pm on Mar 16, 2005 (gmt 0) |
AND A NICE META REFRESH ============================== Sending request: GET /links/link.php?id=622 HTTP/1.0 Host: www.example.net User-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1) • Finding host IP address... • Finding TCP protocol... • Binding to local socket... • Connecting to host... • Sending request... • Receiving response... Total bytes received = 659 Elapsed time so far: 0 seconds Header (Length = 555): HTTP/1.1·302·Found(CR) (LF) Date:·Wed,·16·Mar·2005·21:57:19·GMT(CR) (LF) Server:·Apache/1.3.33·(Unix)·mod_gzip/1.3.26.1a·mod_auth_passthrough/1.8·mod_log_bytes/1.2·mod_bwlimited/ 1.4·PHP/4.3.10·FrontPage/5.0.2.2634a·mod_ssl/2.8.22·OpenSSL/0.9.7a(CR) (LF) X-Powered-By:·PHP/4.3.10(CR) (LF) P3P:·CP="CAO·DSP·COR·CURa·ADMa·DEVa·OUR·IND·PHY·ONL·UNI·COM·NAV·INT·DEM·PRE" ·policyref="www.somesite.com/w3c/p3p.xml"(CR) (LF) Set-Cookie:·hits=++622+;·expires=Wed,·16-Mar-05·21:58:19·GMT(CR) (LF) Location:·http://www.example.com/index.cfm/ action/cat/tp/cost(CR) (LF) Connection:·close(CR) (LF) Content-Type:·text/html(CR) (LF) (CR) (LF) Content (Length = 104): <meta·http-equiv="refresh"·content="0;url=http://www.example.com/index.cfm/action/cat/tp/cost"> Done Elapsed time so far: 0 seconds [edited by: lawman at 11:46 pm (utc) on Mar. 16, 2005] [edited by: ciml at 10:15 am (utc) on Mar. 17, 2005] [edit reason] Examplified [/edit]
|
surfgatinho

msg:749033 | 10:00 pm on Mar 16, 2005 (gmt 0) |
In answer to my question about Postnuke websites, the web links module can be changed to not use 302s. If you add: Header('HTTP/1.1 301 Moved Permanently'); before the: Header('Location: '.$url); at around line 285 this does the trick. Does anyone know if this is a good solution?
|
crobb305

msg:749034 | 10:35 pm on Mar 16, 2005 (gmt 0) |
| what gets me is why on these unrelated pages in site:mysite are not updated. The cache shows an old outdated page (my homepage) from last November but my real homepage is updated in the cache last week. |
| Yes! I don't get this either. There are three urls that once pointed to my homepage via 302 that are STILL showing up under a site:mysite.com. These urls are NOT mine, and they have NOT redirected to my home page in 5 months! They were last cached on November 2. Until Google revisits the site, the bot will never know they no longer redirect to me. Google IS aware of this because I have emailed them 10 or 12 times with the same information. The fact that they let their cache go this stale is just pathetic. It has an influence on all other aspects of the search engine.
|
stargeek

msg:749035 | 10:41 pm on Mar 16, 2005 (gmt 0) |
has anyone else been seeing the serps on google.com switching between an index with at least a few unrelated instances of redirect hijacking fixed in a new index (not pre-allegra) and then back to allegra?
|
Hanu

msg:749036 | 10:58 pm on Mar 16, 2005 (gmt 0) |
> has anyone else been seeing the serps on google.com switching between an index with at least a few unrelated instances of redirect hijacking fixed in a new index (not pre-allegra) and then back to allegra? I see that something is going on. I'm not sure I like the new stuff better than Allegra. I'm doing well in both except that the new stuff has the DMOZ description instead of my own meta description. Also most of my competitors are missing. Good for me but bad for the users.
|
surfgatinho

msg:749037 | 11:11 pm on Mar 16, 2005 (gmt 0) |
This has been going on for over a week. It's mentioned a while back in this thread and here: [webmasterworld.com...]
|
|