

Search Engine Spider and User Agent Identification Forum

What is this bot?

5+ Year Member

Msg#: 2845 posted 8:02 pm on May 19, 2005 (gmt 0)

Any new information about what this bot is? I've been searching around for information but only have found posts about a year old, and no final conclusion from them.



WebmasterWorld Senior Member wilderness (US), WebmasterWorld Top Contributor of All Time, 10+ Year Member, Top Contributor of the Month

Msg#: 2845 posted 2:31 pm on May 20, 2005 (gmt 0)

As is the tradition, welcome to WebmasterWorld.

I have no information on the bot's intent.

I thought I recalled it being related to media; however, I could be mistaken.

One tool that has been useful to me for spotting scrapers: I have some old files online with mixed-case names.
Most of these bogus scrapers read these filenames incorrectly.

I had two such incorrect reads from this bot in December.
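The mixed-case trick above can be sketched roughly as follows. This is only an illustration, assuming "incorrect reads" means requests that get the letter case of a filename wrong; the paths in `KNOWN_PATHS` and the function name `is_case_mangled` are hypothetical and not taken from the poster's site.

```python
# Hypothetical list of real files on the server whose names use mixed case.
KNOWN_PATHS = {"/Old/MixedCase.html", "/Archive/ReadMe.TXT"}


def is_case_mangled(requested, known=KNOWN_PATHS):
    """Return True if the request matches a known file only when case is ignored."""
    if requested in known:
        # Exact match: a normal, correct request.
        return False
    lowered = {path.lower() for path in known}
    # Same path apart from letter case: likely a client that rewrote the URL.
    return requested.lower() in lowered
```

A request for `/old/mixedcase.html` would be flagged, while `/Old/MixedCase.html` and unrelated paths would not.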



WebmasterWorld Senior Member 10+ Year Member

Msg#: 2845 posted 2:39 pm on May 20, 2005 (gmt 0)



10+ Year Member

Msg#: 2845 posted 12:37 pm on Jun 16, 2005 (gmt 0)

*** Found 1290 pages between 15/Jun/2005:03:23:20 and 15/Jun/2005:23:58:55 of with platform MSNPTC/1.0

Loop on two pages (consumer home and consumer/college) by an unusual
platform: "MSNPTC/1.0" appears to be a poorly coded spider
that does not check robots.txt. It started looping around 06:41 AM
and continued through midnight at about one pair every two minutes.

Low traffic = no harm done. is associated with Microsoft
but it is unlikely this is "official" in any sense. We also took hits
in lesser numbers from, 102, 105 and (also
a Microsoft administered domain). During June (01-14) to date there
have only been 1949 hits from MSNPTC so most occurred on 06/14. They
continue into 06/15 but with no significant traffic. Never any referer.
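For anyone wanting to run the same kind of tally on their own logs, here is a minimal sketch. It assumes the server writes Apache "combined" log format; the sample lines, the 192.0.2.10 address (a documentation-only range), the page paths, and the function name `summarize` are all made up for illustration.

```python
import re
from collections import Counter

# Apache combined log format: host ident user [time] "request" status bytes "referer" "agent"
LINE_RE = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) \S+ '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)

# Hypothetical sample lines, not real log data.
SAMPLE = [
    '192.0.2.10 - - [15/Jun/2005:06:41:00 +0000] "GET /consumer/home HTTP/1.1" 200 1234 "-" "MSNPTC/1.0"',
    '192.0.2.10 - - [15/Jun/2005:06:41:05 +0000] "GET /consumer/college HTTP/1.1" 200 2345 "-" "MSNPTC/1.0"',
    '192.0.2.10 - - [15/Jun/2005:06:43:00 +0000] "GET /consumer/home HTTP/1.1" 200 1234 "-" "MSNPTC/1.0"',
    '192.0.2.99 - - [15/Jun/2005:06:44:00 +0000] "GET /robots.txt HTTP/1.1" 200 50 "-" "OtherBot/2.0"',
]


def summarize(lines, agent_prefix="MSNPTC/"):
    """Count requested paths for one user agent and note whether it fetched robots.txt."""
    paths = Counter()
    fetched_robots = False
    for line in lines:
        match = LINE_RE.match(line)
        if not match or not match.group("agent").startswith(agent_prefix):
            continue  # unparseable line or a different user agent
        paths[match.group("path")] += 1
        if match.group("path") == "/robots.txt":
            fetched_robots = True
    return paths, fetched_robots


if __name__ == "__main__":
    counts, saw_robots = summarize(SAMPLE)
    print(counts.most_common(), "robots.txt fetched:", saw_robots)
```

A handful of paths with very high counts and no robots.txt request is the same pattern described above.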

About 1/4 in June had longer platform strings:
"MSNPTC/1.0 (compatible; MSIE 6.0; Windows NT 5.2; MyIE2; .NET CLR 1.1.4322; .NET CLR 1.0.3705)"
(MyIE2 has since become the Maxthon browser.)

There were assorted other pages during June, including 262 "404s" to
/%09%09%09 (hex 09 is the ASCII tab character).
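To check what a percent-encoded path like that decodes to, a quick sketch using Python's standard `urllib.parse.unquote` (the variable names here are just for illustration):

```python
from urllib.parse import unquote

raw_path = "/%09%09%09"
decoded = unquote(raw_path)

# repr() makes the otherwise invisible tab characters visible.
print(repr(decoded))
```

The decoded path is a slash followed by three tab characters, which suggests the bot picked up indentation whitespace from page source as if it were a link.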

(Long time lurker)

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved