How to safely block nph-proxy type access?

To stop illegal duplicating in SEs

         

Angonasec

12:58 pm on Oct 17, 2005 (gmt 0)



If you run a popular site it has probably already been duplicated in the SEs by somebody using an nph-proxy type of script.

I recently found this has happened to me, and managed to get G to pull 20,000 copied pages (only a few of which were mine).

If your G PR is below 7, the duplicates have a serious negative impact on your listing in G SERPs (despite G's claim that competitors can't harm your listing).

I know that blocking all future access to our site by visitors using nph-proxy.cgi (and related scripts) means we will lose a few legitimate visitors, but I think it is worth it. (Say if you disagree.)

So I'd like a safe piece of mod_rewrite code to do this.

Here's my clumsy stab:

I'd like it to cover other proxy scripts too (go.php, nph-proxy.py, etc.). Can you help me refine it, please?

I'm using the chunk of code from WebmasterWorld that blocks nasty bots (working well in my virtual host root, Ta!). Is it OK to insert this line as well?

RewriteCond %{REQUEST_URI} ^/nph-proxy$ [NC,OR]

as a line in my nasty bot blocks... which ends...

RewriteRule .* - [F]
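A note on the condition above: in mod_rewrite, %{REQUEST_URI} holds the path requested on your own server, so ^/nph-proxy$ would only match a request for /nph-proxy on your own site, not a visit that arrives through somebody else's proxy script. One possible alternative is to test the Referer header instead. The sketch below assumes the proxy script passes along a Referer containing its own URL, which is not guaranteed (the header can be suppressed or forged), so treat it as a starting point rather than a complete block. The script names are the ones mentioned in this post.

```apache
RewriteEngine On

# Assumption: the proxy leaks its own URL in the Referer header.
# Match the proxy script names anywhere in the referring URL, case-insensitively.
RewriteCond %{HTTP_REFERER} nph-proxy [NC,OR]
RewriteCond %{HTTP_REFERER} /go\.php [NC]
# Refuse the request with 403 Forbidden.
RewriteRule .* - [F]
```

If these conditions are inserted into an existing list of bot-blocking conditions instead, each one keeps its [OR] flag as long as another RewriteCond follows it; only the last condition before the RewriteRule should go without [OR].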

In passing: This is the kind of URL we found in G resulting from a hacker site using such a tool to harm a legitimate site's ranking. (NB: Now removed from G, and hopefully the penalties it inflicted will be lifted soon.)

[secure.*******lic.com...]

When you click the link in G you go straight to the legitimate site, so G would automatically impose a duplicate penalty.

This thread is related, but I was advised to ask here too:

nph-proxy pages in G: How to safely block access
[webmasterworld.com...]

Ta!

[edited by: jdMorgan at 1:05 pm (utc) on Oct. 17, 2005]
[edit reason] Corrected referenced thread link [/edit]

Angonasec

11:01 pm on Oct 17, 2005 (gmt 0)



In relation to the above, I'd like to block the following IP range from accessing our site:

216.12.192.0 - 216.12.223.255

Along the lines of this code I found here at WebmasterWorld that blocks NPbot:

RewriteCond %{REMOTE_ADDR} ^12\.148\.196\.(12[8-9]|1[3-9][0-9]|2[0-4][0-9]|25[0-5])$ [OR]
RewriteCond %{REMOTE_ADDR} ^12\.148\.209\.(19[2-9]|2[0-4][0-9]|25[0-5])$ [OR]
RewriteCond %{REMOTE_ADDR} ^12\.175\.0\.(3[2-9]|4[0-7])$ [OR]
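For the range 216.12.192.0 - 216.12.223.255, a minimal sketch in the same style might look like the following, assuming the whole range is to be refused. Only the third octet needs a range test (192 to 223); the fourth octet can be anything, so the pattern stops after the third dot and is not anchored at the end.

```apache
# Third octet 192-199, 200-219, or 220-223; any fourth octet.
RewriteCond %{REMOTE_ADDR} ^216\.12\.(19[2-9]|2[01][0-9]|22[0-3])\.
# Refuse the request with 403 Forbidden.
RewriteRule .* - [F]
```

When this condition is added to an existing list of OR'd conditions rather than used on its own, it would need an [OR] flag unless it is the last one before the RewriteRule. The range corresponds to the CIDR block 216.12.192.0/19, which may be handy if the block is expressed with access-control directives instead of mod_rewrite.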

larryhatch

11:48 pm on Oct 17, 2005 (gmt 0)




Angonasec:

What is a simple way to see if my pages were duplicated by an nph-proxy type gizmo? -Larry

Angonasec

11:17 pm on Oct 18, 2005 (gmt 0)



Try searching G for your domain plus terms such as:

example.com nph-proxy

example.com go.php

That sort of thing

Meanwhile...

It'd be nice to hear from the mod_rewrite gurus...