Do search engine robots carry http referrer content with them?

I want to deliver content based on http referrer but not to robots....

     

crowthercm

7:02 pm on Oct 9, 2003 (gmt 0)

10+ Year Member



Hello,

I would like to deliver content to a visitor that differs depending on the HTTP referrer header. What I would like to be assured of is that some of the content is NEVER displayed to a search engine bot.

My first question is, do search engine bots carry HTTP referrer data when they arrive from another site? Namely, if I have a script that, in pseudocode, does the following:

Robot visits

If HTTP referrer is blank
    deliver content A;
elseif HTTP referrer is not blank
    deliver content B;
endif;

Will the search engine robot ever be delivered content B? This is what I'd like to avoid...

Thanks,
Chris

bcolflesh

7:04 pm on Oct 9, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Many bots deliberately send false UAs & referrers with the intention of getting what a "normal" visitor would see - your plan seems flawed IMHO.

crowthercm

7:08 pm on Oct 9, 2003 (gmt 0)

10+ Year Member



It may be flawed but in this particular case no bot should ever see this content. Namely, I want to deliver something that would require a human based decision, something that a bot could never do...

Is there no way to get around this?

bcolflesh

7:18 pm on Oct 9, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I want to deliver something that would require a human based decision

I didn't suss this from your initial question - you'll need to add some type of interaction to the page that can't be "botted", like a Turing test - PHP Architect has a really good article on this:

phparch.com/issue.php?mid=6

crowthercm

8:30 pm on Oct 9, 2003 (gmt 0)

10+ Year Member



Thanks (:

That's actually a very interesting read.

A Turing test isn't exactly what I need, however, as I still want to deliver some content to the bot. The Turing tests I've seen most often on the internet (and described in that article) are based on blocking robots from proceeding. I suppose I could do a time-delayed redirect and hope that I've given the human enough time to fill in the Turing test, while also suggesting to the bot not to index the human content. (Without going into detail, I don't want to include bot exclusion for the page.)

From some reading at WebmasterWorld, it also makes sense that search engines guard against sites delivering content based on whether a visitor is a robot or a human. I really don't want to get into a non-stop redirect battle and face getting accidentally banned if my attempts are mistaken for malicious cloaking.

Can you suggest any other resources that might be helpful?

Are there any legitimate ways to do the following (in pseudocode)?

If you're a robot:
    You get content B
elseif you're a human:
    You get content A
endif;

dmorison

8:59 pm on Oct 9, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I would separate the logic; and do your cloaking the traditional way; based on IP and/or UA.

Simply have your bot test override the referer test: once you've identified a bot, serve it whatever you want.

Altavista frequently dumps funny URLs in the referer field. I've never seen G'bot present a referer tho'.

if (bot)
{
    serve bot content
}
else
{
    if (referer is "")
    {
        serve human content A
    }
    else
    {
        serve human content B
    }
}
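
For anyone who wants something closer to runnable, here is a minimal Python sketch of the same branching. The marker list in `BOT_UA_MARKERS` and the function names `is_bot` and `choose_content` are made up for illustration, and a substring match on the UA is only an assumption about how you detect bots. As noted earlier in the thread, UAs can be faked, so a real setup would normally add IP checks.

```python
from typing import Optional

# Hypothetical User-Agent substrings used to spot crawlers. A real list would
# be longer and is usually combined with IP-based checks.
BOT_UA_MARKERS = ("googlebot", "scooter", "slurp")


def is_bot(user_agent: Optional[str]) -> bool:
    """Return True if the User-Agent contains a known crawler marker."""
    ua = (user_agent or "").lower()
    return any(marker in ua for marker in BOT_UA_MARKERS)


def choose_content(user_agent: Optional[str], referer: Optional[str]) -> str:
    """Pick which content variant to serve.

    The bot test runs first and overrides the referer test, so a bot gets
    the bot content no matter what it puts in the Referer header.
    """
    if is_bot(user_agent):
        return "bot content"
    if not (referer or "").strip():
        return "human content A"  # blank or missing referer
    return "human content B"      # non-blank referer
```

Because the bot branch is checked first, a junk referer sent by a recognised bot never reaches the referer test.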

crowthercm

9:52 pm on Oct 9, 2003 (gmt 0)

10+ Year Member



I would separate the logic

I fully agree; the second portion using HTTP referrers will be quite easy for me. If a bot accidentally slipped through and started spamming random referrers, however, this would be problematic.

do your cloaking the traditional way; based on IP and/or UA.

This is something that's entirely new to me. I've printed off a few threads here, so will get cracking on learning some more.

Altavista frequently dumps funny URLs in the referer field.

I took a quick look at my logs but found that most of the time Scooter was behaving himself/herself. In one record I did find a Scooter referrer of "http://www.root.mysite.com/category.....", which doesn't actually exist. Is this the type of thing you found, and have you noticed any patterns?

Thanks,
Chris

claus

5:19 pm on Oct 12, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



One easy-to-apply tactic is to do some of the work using javascript. Of course your human visitors would have to have JS enabled, but the bots do not execute JS.

crowthercm

5:43 pm on Oct 27, 2003 (gmt 0)

10+ Year Member



I tend to design with a "keep it simple stupid" mentality so I'd prefer to avoid using something like javascript. I've been doing some rethinking as well and a Turing test seems more and more appealing.

At the same time I'm still curious about the http referrer junk that bots send. I've periodically checked my logs and empirically I've got to agree that at this time googlebots don't send referrers.

Does anyone know if Scooter follows any kind of pattern with its referrer junk dumps? I could then just add an ereg to discard any referrers that contain a known junk pattern....
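
Roughly what I have in mind, sketched in Python rather than with ereg. Everything specific here is an assumption: the single entry in `JUNK_PATTERNS` is modelled on the one bogus "www.root" referrer I found, not on any confirmed Scooter pattern, and `KNOWN_HOSTS`, `SITE_DOMAIN` and `clean_referer` are placeholder names.

```python
import re
from typing import Optional
from urllib.parse import urlsplit

# Hypothetical values: the site's domain and the hostnames that really exist on it.
SITE_DOMAIN = "mysite.com"
KNOWN_HOSTS = {"mysite.com", "www.mysite.com"}

# Hypothetical junk patterns, based only on the single bogus referrer seen in the logs.
JUNK_PATTERNS = [re.compile(r"^https?://www\.root\.", re.IGNORECASE)]


def clean_referer(referer: Optional[str]) -> str:
    """Return the referer, or "" if it is blank or looks like known junk."""
    ref = (referer or "").strip()
    if not ref:
        return ""
    if any(pattern.search(ref) for pattern in JUNK_PATTERNS):
        return ""
    try:
        host = (urlsplit(ref).hostname or "").lower()
    except ValueError:
        return ""  # malformed URL, treat as junk
    # A referer claiming to come from a host on our own domain that does not exist.
    on_our_domain = host == SITE_DOMAIN or host.endswith("." + SITE_DOMAIN)
    if on_our_domain and host not in KNOWN_HOSTS:
        return ""
    return ref
```

Junk referrers come back as blank, so they would fall into the "referrer is blank" branch of the earlier logic instead of triggering content B.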

 
