Digg's UA

         

incrediBILL

8:29 am on Jan 19, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Large popular web site using default library UA, nice, real nice:
"PEAR HTTP_Request class ( [pear.php.net...] )"

Digg, Inc. EQUINIX-ED-DIGG-1 (NET-64-191-203-0-1)
64.191.203.0 - 64.191.203.255
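
For anyone who wants to spot these requests in their own logs, here is a rough sketch, not anything Digg or PEAR ships. It assumes you already have the remote address and User-Agent string in hand; the function names `ipInRange` and `looksLikeDefaultPearFromDigg` are made up for illustration, and the UA check only matches the visible start of the string quoted above.

```php
<?php
// Hypothetical helper: true if $ip lies between $start and $end (inclusive).
// Addresses are compared as integers via ip2long(); the range quoted above
// sits below 128.0.0.0, so the values are positive even on 32-bit builds.
function ipInRange(string $ip, string $start, string $end): bool
{
    $n  = ip2long($ip);
    $lo = ip2long($start);
    $hi = ip2long($end);
    if ($n === false || $lo === false || $hi === false) {
        return false; // not a valid dotted-quad IPv4 address
    }
    return $n >= $lo && $n <= $hi;
}

// Hypothetical helper: flags a request from the quoted range that still
// sends the library's default User-Agent (prefix match only).
function looksLikeDefaultPearFromDigg(string $ip, string $userAgent): bool
{
    return ipInRange($ip, '64.191.203.0', '64.191.203.255')
        && strpos($userAgent, 'PEAR HTTP_Request class') === 0;
}
```

Something like `looksLikeDefaultPearFromDigg($_SERVER['REMOTE_ADDR'], $_SERVER['HTTP_USER_AGENT'] ?? '')` would then let you log or tag the hit rather than guess at who sent it.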

incrediBILL

8:39 am on Jan 19, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Just in case someone who can fix the problem reads this, here's an example of the code you need:

$request->addHeader('User-Agent', 'Digg');

Samizdata

9:53 am on Jan 19, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



+1 Digg for the above post.

...

jdMorgan

3:22 am on Jan 20, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Since we may only get one shot at this, let's help them do it right:

$request->addHeader('User-Agent', 'Digg/1.0 (+http://digg.com/diggbot.html)');
$request->addHeader('From', 'diggbot@digg.com');

The /diggbot.html URL must resolve to an HTML page that explains the purpose of the crawl (or fetch) and the intended usage of data so collected, and diggbot@digg.com must resolve to support for Webmasters experiencing problems with the user-agent's activities.
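
To make the shape of those two headers concrete, here is a minimal sketch that builds them once and reuses them. It is an illustration under assumptions, not Digg's code: `buildBotHeaders` is a hypothetical helper, the URL and address are the ones suggested in this thread, and the string is written with a plain space between the product token and the parenthesized comment. The last part uses PHP's built-in stream context instead of PEAR so the snippet stands alone.

```php
<?php
// Hypothetical helper: returns the identifying headers as name => value pairs.
function buildBotHeaders(string $name, string $version, string $infoUrl, string $contact): array
{
    return [
        // product/version followed by a comment holding the info-page URL
        'User-Agent' => sprintf('%s/%s (+%s)', $name, $version, $infoUrl),
        // contact address for webmasters who have problems with the bot
        'From'       => $contact,
    ];
}

$headers = buildBotHeaders('Digg', '1.0', 'http://digg.com/diggbot.html', 'diggbot@digg.com');

// With PEAR HTTP_Request, each pair would go through addHeader():
//   foreach ($headers as $name => $value) { $request->addHeader($name, $value); }

// Without PEAR: fold the pairs into header lines for a stream context.
$lines = [];
foreach ($headers as $name => $value) {
    $lines[] = $name . ': ' . $value;
}
$context = stream_context_create(['http' => ['header' => implode("\r\n", $lines)]]);
// $context can then be passed as the third argument to file_get_contents().
```

Keeping the values in one place means the info page and contact address only have to be corrected once if they ever move.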

Since I'm one who gripes about invalid and uninformative user-agent strings, I thought I'd better say something now... :)

I'm not sure if my literal parentheses need escaping in the quoted string...

Jim

coopster

8:20 pm on Jan 22, 2009 (gmt 0)

WebmasterWorld Administrator 10+ Year Member



"I'm not sure if my literal parentheses need escaping in the quoted string..."

Nope. Good as is.