homepage Welcome to WebmasterWorld Guest from 23.23.22.200
register, free tools, login, search, subscribe, help, library, announcements, recent posts, open posts,
Subscribe to WebmasterWorld

Home / Forums Index / Microsoft / Bing Search Engine News
Forum Library, Charter, Moderators: mack

Bing Search Engine News Forum

    
NOW what does Bing Preview want?
lucy24




msg:4641998
 1:25 am on Feb 3, 2014 (gmt 0)

I put "now what?" in the header because I've only just noticed this behavior while --stop me if you've heard this one-- looking for something else. Poring over raw logs tells me it's been happening since April 2013.

What it's doing: The BingPreview user-agent is picking up selected supporting files belonging to specific pages, which it helpfully identifies in the referer slot. Not all supporting files, and never the page itself. For that you have to look back 61 minutes (really) earlier in logs, where you find the ordinary bingbot getting the page. It seems to focus on one page for a while, and then turns its attention to a different one.

Here is the earliest specimen I can find in logs. This particular page continued into May, but by then it was branching out into others. Note the timestamps. It isn't coincidence; it's always like that.

"bingbot" = Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)
"BingPreview" = Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/534+ (KHTML, like Gecko) BingPreview/1.0b

Everything else is unchanged.

157.56.229.138 - - [20/Apr/2013:16:03:41 -0700] "GET /hovercraft/hovercraft.html HTTP/1.1" 200 15841 "-" "bingbot"
199.30.25.130 - - [20/Apr/2013:17:05:02 -0700] "GET /hovercraft/images/eel.png HTTP/1.1" 200 1389 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.25.130 - - [20/Apr/2013:17:05:02 -0700] "GET /hovercraft/hoverstyles.css HTTP/1.1" 200 6277 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.25.130 - - [20/Apr/2013:17:05:03 -0700] "GET /hovercraft/images/list_tape.png HTTP/1.1" 200 546 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.25.130 - - [20/Apr/2013:17:05:03 -0700] "GET /piwik/piwik.js HTTP/1.1" 403 1247 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"

157.56.229.138 - - [21/Apr/2013:16:21:13 -0700] "GET /hovercraft/hovercraft.html HTTP/1.1" 200 15841 "-" "bingbot"
199.30.24.151 - - [21/Apr/2013:17:22:29 -0700] "GET /sharedstyles.css HTTP/1.1" 200 3601 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.24.151 - - [21/Apr/2013:17:22:29 -0700] "GET /hovercraft/hoverstyles.css HTTP/1.1" 200 6277 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.24.151 - - [21/Apr/2013:17:22:29 -0700] "GET /piwik/piwik.js HTTP/1.1" 403 1247 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.24.151 - - [21/Apr/2013:17:22:29 -0700] "GET /hovercraft/images/list_tape.png HTTP/1.1" 200 546 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"

157.56.229.138 - - [23/Apr/2013:10:21:01 -0700] "GET /hovercraft/hovercraft.html HTTP/1.1" 200 15841 "-" "bingbot"
199.30.24.7 - - [23/Apr/2013:11:22:21 -0700] "GET /hovercraft/images/eel.png HTTP/1.1" 200 1389 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.24.7 - - [23/Apr/2013:11:22:21 -0700] "GET /hovercraft/hoverstyles.css HTTP/1.1" 200 6277 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.24.7 - - [23/Apr/2013:11:22:21 -0700] "GET /sharedstyles.css HTTP/1.1" 200 3601 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.24.7 - - [23/Apr/2013:11:22:22 -0700] "GET /hovercraft/images/list_tape.png HTTP/1.1" 200 546 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.24.7 - - [23/Apr/2013:11:22:22 -0700] "GET /piwik/piwik.js HTTP/1.1" 403 1247 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"

157.56.229.138 - - [26/Apr/2013:11:07:02 -0700] "GET /hovercraft/hovercraft.html HTTP/1.1" 200 15841 "-" "bingbot"
199.30.25.122 - - [26/Apr/2013:12:08:19 -0700] "GET /hovercraft/images/eel.png HTTP/1.1" 200 1389 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.25.122 - - [26/Apr/2013:12:08:19 -0700] "GET /hovercraft/hoverstyles.css HTTP/1.1" 200 6277 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.25.122 - - [26/Apr/2013:12:08:19 -0700] "GET /sharedstyles.css HTTP/1.1" 200 3601 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.25.122 - - [26/Apr/2013:12:08:20 -0700] "GET /hovercraft/images/list_tape.png HTTP/1.1" 200 546 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"
199.30.25.122 - - [26/Apr/2013:12:08:20 -0700] "GET /piwik/piwik.js HTTP/1.1" 403 1247 "http://www.example.com/hovercraft/hovercraft.html" "BingPreview"

Further notes: piwik is analytics. Ordinary robots are barred; previews require brute force. The image /list_tape.png is visible on the page but is never mentioned in the page's own html; it's only referenced in CSS. (I had to look this up. I thought it would give .css as referer, but it really does come through as the page itself.) The page uses one other image file, which was never requested.

 

not2easy




msg:4642003
 4:40 am on Feb 3, 2014 (gmt 0)

Yes, there were pages of discussion here about that back then. Bing Preview shows a cached thumbnail version (that they create) of images in images search and if someone clicks to see it larger, they politely deliver the image, the page, its css and js files. Isn't that special? Saves you all that actual visitor bandwidth, you know. Best part? Because the visitor is not actually on your site, you never get to know what their IP or activity is. |Preview| is in the list of blocked UAs for some of my sites where the images are not for 'borrowing'.

lucy24




msg:4642017
 7:29 am on Feb 3, 2014 (gmt 0)

I remember when Bing Preview first came out in 2012 there was lots of discussion. Did anyone ever explain the 61 minutes and 20 seconds part? The most recent incident-- the one I finally noticed-- was:

21:43:03 page request by bingbot
22:44:43 BingPreview requests for supporting files

Oops, that one's 61 minutes and 40 seconds. Guess they're losing speed.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Microsoft / Bing Search Engine News
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved