Forum Moderators: DixonJones

Message Too Old, No Replies

Mega problems with referers

over 10 000 page views that shouldent exist

         

mack

10:16 pm on Nov 2, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



I have webalizer suplied by by hosting company to look at my stats and record referals etc.

My site is a small scale search engine and I apear to have recieved literaly thousands of referals from other search engines ad url form.

On an average day I will serve 3000 page views on the 1st of November I served 10000 page views loads of the referals seamed to be comming from se add url forms. Some I have never heard of and some that I know of.

open find, Aesop, ah-ha, Claymont to name but a few...

I have never seen these referals from any search engine in the past... now i am getting them from loads of them

Does anyone have any idea what could be going on?

john316

10:41 pm on Nov 2, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



It sounds like you got added to one of those "submit to 350,000 search engines!" type sites, they probably included your url in a script.

mack

10:46 pm on Nov 2, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



Would the referal come direct from the se though?

For example the log for openfind looks like this...

POST /search/search.pl HTTP/1.1" 200 3840 "http://www.openfind.com/WriteReg.php" "Mozilla/4.0 (compatible; MSIE 4.0; Windows NT)"

In the above entry from my log /search/search.pl is the location of my search script.

john316

10:56 pm on Nov 2, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Is it possible that the script is stepping through the add url pages and leaving the last one as a referer?

mack

11:53 pm on Nov 2, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



How would I go about finding the site that is hosting the script?

It had left referals from the add URL pages of about 20 se's!

bird

12:09 am on Nov 3, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Are those requests coming from many different IPs or just from one?

This sounds very much like a rogue bot that just feeds you fake referer information.

mack

12:15 am on Nov 3, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



Openfind, referals are coming from 66.140.98.184
ah-ha, referals are from 68.36.229.193
Aesop, are coming from 66.13.13.150

None apear to be from the same IP address

mack

5:59 am on Nov 3, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



It gets worse.

My stats just apdated... figures for the 2nd whre just as bad as they where for the 1st.

What to do?

excell

6:17 am on Nov 3, 2002 (gmt 0)

10+ Year Member



Sounds like a security issue I think?

mack

6:36 am on Nov 3, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



you're not kidding... lol

It's like an enemy that I cant see.

IP address doesnt resolve so I can bar it that way.

No useragent - useless for htaccess

What is the propper thing to do in a situation like this.

alansch

6:50 am on Nov 3, 2002 (gmt 0)

10+ Year Member



Mack, I can't comment on the referrer question... but what script are you running for your site? And does it allow public submissions into your search database, which is being hammered by undesirables? Or are these strange referrals all going directly into your search function?

We've implemented three small searchable directories using Gossamer-Threads Links 2.0

One of these is a business directory which is open for public "suggestion" of URLs. This directory got put into one of these d**m autosubmitter services and we were getting literally hundreds of junk submits daily until I modified the submission script to reject automatic submissions.

The most obvious way of achieving this is for the script to check the referrer and ONLY allow posts that have as referer your own submission form. It's pretty easily worked around by an autosubmitter though - referer is the easiest header parameter to spoof!

Because of this, I added a couple of other traps as well. These are useable as long as the submission form is generated by a script.

One trap is to include the current date/time as a hidden field in the script generated submit form. Then when the data is posted, the script can compare that field with the current date/time, rejecting or ignoring the submission if the post date/time is in the future or more than, say, 10 minutes ago. This value can be tweaked as necessary to fit requirements.

The other trap ... Links 2.0 assigns a unique ID code each time it serves the submission form, assuming that it is going to be for a valid submission. The script then checks the post data and if the hidden ID field value is greater than the current counter value, or more than 5 less, the POST data can be rejected (or just plain ignored). That value may have to be tweaked if you can get a lot of valid submissions, to minimise the chances of ignoring a valid submissions.

Hope this helps :)

mack

7:14 am on Nov 3, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



I run links 2 also and they have been unable to do this on links 2 because i to have implimented a number of hacks to prevent automated submissions.

The search tool in question is fdse. I have been using it as a general web search tool but this morning I woke up to 54 meg of submited sites.

what I have done is simply delite the added sites db file. then went back to the fdes admin page and refresh until a new site is added. At this point i am grabbing their email address and asking them what submission service thay are using.

and when I find out who it is.. my htacess will be redirecting all the unwanted traffic to their homepage :)

mack

11:50 pm on Nov 4, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



I finaly found out where my traffic was coming from.

It apears that my site has been added to a piece of web positioning software, not WPG but something similar.

My question is , do these products hold their own database or is the Database remotly hosted so the software company can edit it at a later date?

excell

1:46 am on Nov 5, 2002 (gmt 0)

10+ Year Member



Usually the new data from the company is updated when the user fires up the program. So if you write and request removal hopefully you will get out soon.

mack

5:35 am on Nov 5, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



What I have done was to reposition the location of the script so that the un wanted visits get a 404. The visitors are ariving with the request "post" could i redirect these visitors using htaccess.

I am not sure that the user even gets to see any of my pages bacause of the post in the referer logs. Is it possible to direct all users to my actual website?

excell

5:59 am on Nov 5, 2002 (gmt 0)

10+ Year Member



You could but if they are coming in to that point via an auto submission software, then they are not even seeing anything, IMO don't worry about them.