Forum Moderators: Robert Charlton & goodroi

Obsessive-compulsive Googlebot Spidering

kennylucius

2:14 am on Jan 31, 2008 (gmt 0)

10+ Year Member



One week ago, I moved my site to another host. Today, on the old server, there were about 80 lines in the web log, all from the same G server. I expected a few DNS servers to lag behind, but what I saw in the log really caught my attention. Here’s a taste:


"GET /wiki/Planet_of_the_Apes_(film) HTTP/1.1" ... Googlebot/2.1;
"GET /wiki/The_Truth_About_Chernobyl HTTP/1.1" ... Googlebot/2.1;
"GET /wiki/The_Forgotten_Pollinators HTTP/1.1" ... Googlebot/2.1;
"GET /wiki/New_Worlds,_Ancient_Texts HTTP/1.1" ... Googlebot/2.1;
"GET /wiki/Anarchy,_State_and_Utopia HTTP/1.1" ... Googlebot/2.1;
"GET /wiki/The_City_of_Lost_Children HTTP/1.1" ... Googlebot/2.1;
"GET /wiki/Men's_Adventure_Magazines HTTP/1.1" ... Googlebot/2.1;
"GET /wiki/Annal:1994_Carnegie_Medal HTTP/1.1" ... Googlebot/2.1;
"GET /wiki/Tess_of_the_D'Urbervilles HTTP/1.1" ... Googlebot/2.1;

Notice anything weird? All of the URIs are exactly 30 characters long! I noticed immediately because all the columns lined up in my text editor.

I am somewhat curious as to why one of G’s servers (66.249.70.70) is crawling a domain that changed IPs seven days ago, but not curious enough to ask. Here's why I am posting: Why 30 characters?

FYI, G indexes over 26,000 pages from my site. G crawled 80 pages between 2am and 5pm, and all URIs were 30 chars.
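
For anyone who wants to check the same thing against their own raw log, here is a rough sketch that tallies distinct request-URI lengths. The regex assumes an Apache/NCSA-style request field; the sample lines are the ones quoted above:

```python
import re
from collections import Counter

# Sample lines taken from the log excerpt above (truncated records,
# as quoted in the post).
SAMPLE_LINES = [
    '"GET /wiki/Planet_of_the_Apes_(film) HTTP/1.1" ... Googlebot/2.1;',
    '"GET /wiki/The_Truth_About_Chernobyl HTTP/1.1" ... Googlebot/2.1;',
    '"GET /wiki/The_Forgotten_Pollinators HTTP/1.1" ... Googlebot/2.1;',
    '"GET /wiki/New_Worlds,_Ancient_Texts HTTP/1.1" ... Googlebot/2.1;',
    '"GET /wiki/Anarchy,_State_and_Utopia HTTP/1.1" ... Googlebot/2.1;',
    '"GET /wiki/The_City_of_Lost_Children HTTP/1.1" ... Googlebot/2.1;',
    '"GET /wiki/Men\'s_Adventure_Magazines HTTP/1.1" ... Googlebot/2.1;',
    '"GET /wiki/Annal:1994_Carnegie_Medal HTTP/1.1" ... Googlebot/2.1;',
    '"GET /wiki/Tess_of_the_D\'Urbervilles HTTP/1.1" ... Googlebot/2.1;',
]

# Matches the request field of a combined/common log line.
REQUEST_RE = re.compile(r'"GET (\S+) HTTP/1\.[01]"')

def uri_length_counts(lines):
    """Tally how many requests were made at each request-URI length."""
    counts = Counter()
    for line in lines:
        m = REQUEST_RE.search(line)
        if m:
            counts[len(m.group(1))] += 1
    return counts

counts = uri_length_counts(SAMPLE_LINES)
# Every sample URI comes out at the same single length, which is the
# pattern described in the post.
```

Run it over a full day of Googlebot hits and a one-key result confirms the pattern; a normal crawl would spread across many lengths.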

tedster

5:09 am on Jan 31, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Always 30 characters long? That's one of the strangest observations ever - does the Google crawl team keep lists of URLs sorted by character length? I guess they might, but it's hard to imagine why.

It does sound like there must be a bad, not-updating DNS server somewhere, though. Otherwise how would your old server even get the request in the first place?

jomaxx

5:44 am on Jan 31, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I've seen this exact thing happen a number of times, including from major search engine spiders (not sure about Google in particular). I always assumed that the list of URLs was being sorted in some way, and that identical-length URLs end up grouped together as a kind of side effect.

For example, if I take all the Googlebot requests from my raw log file and sort them on some character position way down in the record (somewhere in the middle of the browser ID field), then all the requests for URLs of the same length will end up being grouped together.

(Technically you have to take the number of characters in the filesize field into account as well, but sorting like this can certainly generate the kind of pattern we're talking about.)
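
The mechanism jomaxx describes is easy to reproduce as a toy. In the sketch below the log lines, IP address, and sort column are all invented for illustration - the point is only the side effect, not anything about Google's actual pipeline:

```python
# Toy reconstruction: sorting raw log lines on a character position
# "way down in the record" groups same-length URIs together, because a
# longer URI shifts everything after it to the right.

UA = '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"'

def make_line(uri):
    # NCSA-style line with a fixed-width prefix, so the URI is the only
    # variable-width field before the sort column.
    return ('66.249.70.70 - - [31/Jan/2008:02:14:00 +0000] '
            '"GET %s HTTP/1.1" 200 5120 "-" %s' % (uri, UA))

uris = ["/wiki/A", "/wiki/Abcdef", "/wiki/Xy",
        "/wiki/Qrstuv", "/wiki/B", "/wiki/Mn"]
lines = [make_line(u) for u in uris]

# Sort on the tail of each record starting at an arbitrary column that
# falls after the URI (70 is made up - any offset past the longest URI
# works).  Lines whose URIs share a length present identical tails from
# that column onward, so they end up adjacent in the sorted output.
POS = 70
ordered = sorted(lines, key=lambda ln: ln[POS:])

ordered_lengths = [len(ln.split('"GET ')[1].split(' HTTP')[0])
                   for ln in ordered]
# ordered_lengths now shows each distinct URI length as one contiguous
# run, even though the input lengths were interleaved.
```

So a crawler (or any tool in the pipeline) that happens to sort its fetch list on some byte offset would emit same-length URLs in bursts, exactly like the log excerpt at the top of the thread.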