Lexxe/Robot

engine

6:11 pm on Nov 2, 2007 (gmt 0)

60.240.248.23 HTTP/1.1 Lexxe/Robot

FYI

Lexxe Robot
[webmasterworld.com...]

Lexxe has been exploring more intelligent ways to find information for users in a more meaningful way. We believe this method will eventually bring far more accurate and relevant search results than the current search technology. Our technology is built upon the foundation of advanced Natural Language Processing technology. Please find out more about our technology.

[lexxe.com...]

incrediBILL

12:48 am on Nov 5, 2007 (gmt 0)

FYI - That user agent isn't permitted on my site, and I can see the UA being blocked, yet they have my current content, so they must be getting the data from some other source(s); they certainly aren't crawling it themselves. Only 6 sources get my content without reverse-cloaked tracking embedded in it, so a major crawler (Google/Yahoo/etc.) may be supplying them with data.

FYI2 - They have more IPs; it looks like they use the block 60.240.248.*:

10/22/2007 60.240.248.19 "Lexxe/Robot"
11/01/2007 60.240.248.24 "Lexxe/Robot"
11/01/2007 60.240.248.23 "Lexxe/Robot"

Prior to that, they were seen coming through TPGI.com's proxy servers, such as:

202.7.166.169 "Lexxe/Robot"
VIA 1.1 syd-pow-pr7.tpgi.com.au:3128 (squid/2.5.STABLE12)
FORWARD 60.242.227.11

202.7.166.175 "Lexxe/Robot"
VIA 1.1 syd-pow-pr12.tpgi.com.au:3128 (squid/2.5.STABLE16)
FORWARD 60.242.227.12

So it appears they may have the range 60.242.227.* as well.

BTW, don't block the TPGI proxy servers themselves, as those are shared with other customers, much like the AOL proxies.
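
For anyone who wants to act on the above, here is a minimal sketch of the check in Python, not anybody's actual blocking setup. It assumes your server hands you the remote IP, the User-Agent, and the forwarded-for value as strings. The names `should_block`, `in_suspect_range`, and `SUSPECT_NETWORKS` are made up for illustration, and treating each range as a /24 is only an inference from the `.*` wildcards above. The forwarded address is supplied by the proxy (or the client), so it can be spoofed; treat it as a hint.

```python
import ipaddress

# Ranges inferred from the log entries in this thread (assumed to be /24s).
# The second one is only suspected, based on the FORWARD values.
SUSPECT_NETWORKS = [
    ipaddress.ip_network("60.240.248.0/24"),
    ipaddress.ip_network("60.242.227.0/24"),
]
BLOCKED_UA_TOKEN = "lexxe/robot"


def in_suspect_range(ip_text):
    """Return True if ip_text parses as an IP inside a suspect network."""
    try:
        ip = ipaddress.ip_address(ip_text.strip())
    except ValueError:
        return False  # unparseable values are simply not matched
    return any(ip in net for net in SUSPECT_NETWORKS)


def should_block(remote_ip, user_agent, forwarded_for=""):
    """Decide whether to block a request (hypothetical helper).

    The shared proxy addresses are never listed, so other customers
    behind the same proxy are only affected if their own UA or
    forwarded address matches.
    """
    if BLOCKED_UA_TOKEN in (user_agent or "").lower():
        return True
    if in_suspect_range(remote_ip):
        return True
    # The forwarded-for value may be a comma-separated chain of addresses.
    parts = (forwarded_for or "").split(",")
    return any(in_suspect_range(p) for p in parts if p.strip())
```

With this, a request from 202.7.166.169 carrying the "Lexxe/Robot" UA or a forwarded address of 60.242.227.11 is blocked, while an ordinary browser coming through the same proxy is let through.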

incrediBILL

12:58 am on Nov 5, 2007 (gmt 0)

Just noticed someone's comment in the other thread pointing out that the links in all their SERPs contain "http://av.rds.yahoo.com/", which probably explains where they're getting that data: from Yahoo.
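
If you want to check that observation yourself, a rough sketch along these lines would do, assuming you have already collected the result links from one of their pages into a list. How you collect them is up to you and not shown. The function name `yahoo_redirect_share` is made up for illustration; only the prefix comes from the comment quoted above.

```python
# Prefix quoted in the other thread; everything else here is illustrative.
YAHOO_REDIRECT_PREFIX = "http://av.rds.yahoo.com/"


def yahoo_redirect_share(links):
    """Return the fraction of links that start with the Yahoo redirect prefix."""
    links = list(links)
    if not links:
        return 0.0
    hits = sum(1 for link in links if link.startswith(YAHOO_REDIRECT_PREFIX))
    return hits / len(links)
```

A share at or near 1.0 across several queries would be consistent with the results being pulled from Yahoo, though it wouldn't prove it.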