homepage Welcome to WebmasterWorld Guest from 50.19.206.49
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Home / Forums Index / Search Engines / Search Engine Spider and User Agent Identification
Forum Library, Charter, Moderators: Ocean10000 & incrediBILL

Search Engine Spider and User Agent Identification Forum

    
searchers.co.uk crawling
from 213.122.174.x
Hobbs




msg:3854667
 11:42 am on Feb 21, 2009 (gmt 0)

Hit robots.txt first, their site carries no robot exclusion page.

Came from: 213.122.174.x

Came as: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.4) Gecko/20030624 Netscape/7.1

 

incrediBILL




msg:3860089
 11:42 pm on Feb 28, 2009 (gmt 0)

I'm curious what evidence you have that links that IP to searchers.co.uk?

I was hit by the same IP and it was stopped automatically so I did a quick look for my tracking codes on their site and it appears that they are scraping Live results as it has valid msnbot crawl data embedded in their results.

Hobbs




msg:3860293
 10:00 am on Mar 1, 2009 (gmt 0)

Ripe identifies the IP as Reach Global Ltd
reach-global.co.uk redirects to rgg.co.uk
Where under 'Companies' you'll find searchers.co.uk listed

It was a logical deduction that searchers.co.uk is what crawls out of that portfolio.

incrediBILL




msg:3860485
 5:52 pm on Mar 1, 2009 (gmt 0)

It was a logical deduction but I did a few searches and the result set appears to be MSN, or some meta mashup with MSN being the core.

jetboy




msg:3918472
 1:29 pm on May 22, 2009 (gmt 0)

Just had searchers.co.uk on the phone, who claim to run their own index. They also claim to be running a large TV campaign.

Seems like the old Lycos UK/Touch setup though; buying keywords for fixed placement on a search engine that never sends any traffic.

Reach Global are based in Accrington, which I believe is the same part of the world as Touch ...

Pfui




msg:3918897
 4:56 am on May 23, 2009 (gmt 0)

I redirect the UA because it's circa June, 2003, and the likelihood of a real person actually using it is slim at best. The only okay file is robots.txt; nothing else unless a human touches base first.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Search Engines / Search Engine Spider and User Agent Identification
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved