Forum Moderators: DixonJones
WebTrends reported them by organization name, not by IP address, so I can't find their track details in my raw log files, even though I've searched the log file by their IP address as reported by Whois. They must be visiting my site from an IP address different than the one reported by Whois.
I very suspicious that they're up to no good. Any sleuths that might give me an idea what to look for?
I don't know too much about WebTrends, but can you get it to display a report on the activity by time for that IP?
I would be concerned to see how much time they are spending per page. ie are they taking a detailed look at your site, or are they constantly requesting the same page, or pages over and over at peak times to try and disrupt your service (a mild DOS attack)
Check your traffic by hour of the day. If you haver a couple of big spikes, find the IP they are coming from, and block it
I can't actually find them in my raw logs. Again, WebTrends reported them by organization name, not by IP, and I have not been able to find an IP associated with that organization that is to be found in my raw logs.
I looked for daily traffic spikes and saw none. The only spike I saw was in geographic data, by most popular cities, and that was Reston Virginia with over 28,000 visitor sessions (9000 which were reported as AOL visitor sessions). Since the named competitor is not located in Reston Virginia, or even in Virginia, I'm starting to wonder if WebTrends didn't blow a fuse somehow. If it weren't for the name of a competitor showing up, I would conclude this all to be a WebTrends glitch. But a "clever" scammer might try to camouflage themselves as AOL by using an ISP located in Reston Virginia.
Thanks for the suggestions!
Can you see the raw referrer URL? If you are getting '00s of hits from the same page of your competitors site, or they are requesting the same doc/docs over and over (maybe the same image) that would also indicate you are being pagejacked.
Have you been to look at your competitor site, see if you can spot any links to your own site?
somehost.affinity.com has been hitting my site hard for several months.
Webalizer version 2.01 stats for this month so far are:
Hits
22,632
Files
18,296
KBytes
173,343
Visits
1,828
Any help appreciated. I have no idea how to trace this or block it or even if it should be blocked?
(edited by: Liane at 5:06 pm (utc) on Jan. 28, 2002)
If so, you could just block the ip with an .htaccess file.
So long as its clearly marked as someone elses site/content, that shouldn't be a problem, IMO.
Its when you are presenting someone elses work, and taking the credit that you have trouble. I think the test case here was the Shetland Times vs. the Shetland Express or something similar
Basically one had been ripping off the others stories by pulling the content into a frame, thus chopping off all of the branding, and presenting it to the user as their own journalism. They got slammed
Ditto.
>what you mean by webjacked? Would this be where someone rips your code and posts it on their domain? Or something else like framing your content?
There are pageripper scripts that will pull contents from one page and write them into the page[s] tbat makes the call -sort of like having SSI to anything with an url, as I understand it. There are legitimate uses for these, I'm researching the possible use of such a script to move MLS data which I'm authorized to access into a real estate site. That said, there are plenty of opportunities for mischief. A serious jacker could use a variation to serve your page to an engine yet never have it visible to the general public.
However, the Google cache reports "this page cannot be found" on their search terms and the site description on the Google serps is quite different from what shows up on their pages so there is some kind of cloaking or deception going on there. Digging into the Google database for one non-competitive search term alone I find 3000 doorway pages in the Google serps, and they all take me to the same page on the competitor's site through some kind of re-direction (I have to hit my browser back button twice in quick succession to back out of the doorways' target page and get back to the Google serps). So this company does appear to be engaged in big time spamming.
I don't want to name them here because they are a reputable well-know company with nationwide markets and recognition and I certainly don't want to tarnish their name unless I have absolute proof that they are pagejacking, stealing and spamming. If they are, I would find it shocking.
Any suggestions how I could determine this if the cloaked page also had the Google no-archive tag?
What I guess I need is a way to read cloaked pages.
Magic 8-Ball says "A definite possibility."
I think there are less exotic ways, too -but I'm not an authority on the nitty-gritty of this. Just because they are a big name, don't dismiss the fact that they might be ripping. Some guy in the web marketing department might have his job/contract on the line and decided to go over the line.
I'll re-visit this a month but for now, life has to go on.