Forum Moderators: phranque

Message Too Old, No Replies

facebookexternalhit

         

lucy24

5:59 am on Aug 14, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



There's a fairly recent thread [webmasterworld.com] about "facebookexternalhit" from the user-agent ID side. Basically the thread confirms that we're dealing with another variety of hotlinking-- an annoying one, because you have to make a separate RewriteRule with its own RewriteConditions to deal with them. (Maybe I need to do something about their preliminary robotic sweep, where they pick up the whole page including the one image that will be hotlinked. But that involves close peering over logs to see what I'd have to do to nip them in the bud. Ugh.)

But it doesn't answer a Curiosity Question. I am one of the two people on this planet who isn't on Facebook.* Can someone explain exactly what leads to this particular variety of hotlinking? That is, what activity on whose part? What does the user see? Where (within Facebook) is the hotlinked image coming from? Why can't Facebook just park the blasted image on their own site? That was a rhetorical question.


* I can't say FB because to me that means FutureBasic. Sorry.

g1smd

7:42 am on Aug 14, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I think it happens when someone is creating a link on a page over there.

Their system scans your page and presents all of the images on that page as thumbnails and asks you to pick one to go next to the link you just created.

I don't know if that image is then copied to their page, or is a hotlinked image pulled from your site each time the other page is viewed. I suspect the latter.

lucy24

8:36 am on Aug 14, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Yes, it's hotlinked. Blank referer, facebooketcetera as UA (that's why you have to make a separate RewriteRule). So the sole purpose of the initial crawl is to provide their members with a choice of images to hotlink. Nice. Especially if they're rescaling a full-size jpg to some standard thumbnail size.

I do believe I was right in deciding after further investigation to deploy BrowserMatch and shut them out at the source.

Staffa

11:23 am on Aug 14, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



On one of my sites I run a creative competition. Recently, someone submitted an image and then hotlinked the thumbnail to a FB page. No problem, after all it is his image and 5 Kb is not worth talking about.
Curious about what the page is about, I follow the link and of course have to register/login to FB to see. OK, no sweat, I won't see it for I don't want an account.

Then it starts, each time the page is viewed (I guess) there's that OP UA with an FB IP number fetching the page on my site. Wooh, hold on, what's sauce for the goose ..... and blocked the UA.
I don't know if that stopped the thumbnail from being shown on the FB page but since it is the page owner's own image he can always upload his own copy.

I always riles me thet the likes of FB think that because they are XYZ the world is theirs and the rest can fall off, think again.

matrix_jan

2:54 pm on Aug 14, 2011 (gmt 0)

10+ Year Member



You can control which image to show-up in FB with og:image.

g1smd

4:27 pm on Aug 14, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



You can control which image to show-up in FB with og:image.

Care to expand on that?

matrix_jan

6:05 pm on Aug 14, 2011 (gmt 0)

lucy24

9:15 pm on Aug 14, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Does anyone else get the same interesting effect I do if your browser window is narrower than the page and you scroll horizontally? (It's sufficiently interesting that if the answer is yes, you won't need a description.) Checked in two unrelated browsers. Seems incompatible with having "developer" in the title :)

phranque

11:49 pm on Aug 14, 2011 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



facebookexternalhit:
http://www.webmasterworld.com/search_engine_spiders/3552152.htm [webmasterworld.com]

Facebook?:
http://www.webmasterworld.com/search_engine_spiders/4051279.htm [webmasterworld.com]

mentions going back 4 years...
facebookexternalhit/1.0:
http://www.webmasterworld.com/search_engine_spiders/3371259.htm [webmasterworld.com]