Page is a not externally linkable
- Search Engines
-- Search Engine Spider and User Agent Identification
---- url(data:image/


Pfui - 11:09 pm on Oct 17, 2011 (gmt 0)


Going on almost a year ago, we talked about the "url(data:image/" log-exploding string, layout-busting examples of which can be seen here: [webmasterworld.com...]

At the time, consensus sort of suggested the URI might be related to the Google Toolbar -- specifically "GTB6.6" -- and Explorer.

I was wondering if anyone still sees (m)any of the URIs, or has any new thoughts/theories?

I see one or two every month (more when the person reuses the referrer during a visit), and the hits share the following characteristics (casually compiled since June):

URI Length: 1,691 characters
First 40 chars: url(data:image/png;base64,iVBORw0KGgoAAA
Location: Never root. Always /dir/url(data:image/
Type: Never jpg or gif. Always png

Referrer: Google SERPs
First 45 chars: http://www.google.com/url?sa=t&source=web&cd=
One was: http://www.google.co.uk/url?sa=t&source=web&cd=

UA: Explorer
MSIE 8.0? All but one
Trident/4.0? All but one
GTB7.x? All but one
The 'one': Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.0; Trident/5.0)

In the original thread, "chedar_ed" gave instructions on how to see what's represented by the base64 code but I stumble on those steps. What do you see?

By the way, if you want to see an example of a live "url(data:image/" image/URL, Google the following as-is --

Webmaster World

-- and scroll/advance to "Matt Cutts" or his tiny smiling face. Now check out that image's address. Hint: It's a whopping 2,199-characters long and begins -- wait for it --

data:image/jpeg;base64

I'm a Mac person. Is Explorer the only UA (still) choking on these things?

Anyone?

Bueller?


Thread source:: http://www.webmasterworld.com/search_engine_spiders/4375642.htm
Brought to you by WebmasterWorld: http://www.webmasterworld.com