homepage Welcome to WebmasterWorld Guest from 184.73.52.98
register, free tools, login, search, subscribe, help, library, announcements, recent posts, open posts,
Subscribe and Support WebmasterWorld
Visit PubCon.com
Home / Forums Index / Search Engines / Search Engine Spider and User Agent Identification
Forum Library, Charter, Moderators: Ocean10000 & incrediBILL

Search Engine Spider and User Agent Identification Forum

    
searchers.co.uk crawling
from 213.122.174.x
Hobbs




msg:3854667
 11:42 am on Feb 21, 2009 (gmt 0)

Hit robots.txt first, their site carries no robot exclusion page.

Came from: 213.122.174.x

Came as: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.4) Gecko/20030624 Netscape/7.1

 

incrediBILL




msg:3860089
 11:42 pm on Feb 28, 2009 (gmt 0)

I'm curious what evidence you have that links that IP to searchers.co.uk?

I was hit by the same IP and it was stopped automatically so I did a quick look for my tracking codes on their site and it appears that they are scraping Live results as it has valid msnbot crawl data embedded in their results.

Hobbs




msg:3860293
 10:00 am on Mar 1, 2009 (gmt 0)

Ripe identifies the IP as Reach Global Ltd
reach-global.co.uk redirects to rgg.co.uk
Where under 'Companies' you'll find searchers.co.uk listed

It was a logical deduction that searchers.co.uk is what crawls out of that portfolio.

incrediBILL




msg:3860485
 5:52 pm on Mar 1, 2009 (gmt 0)

It was a logical deduction but I did a few searches and the result set appears to be MSN, or some meta mashup with MSN being the core.

jetboy




msg:3918472
 1:29 pm on May 22, 2009 (gmt 0)

Just had searchers.co.uk on the phone, who claim to run their own index. They also claim to be running a large TV campaign.

Seems like the old Lycos UK/Touch setup though; buying keywords for fixed placement on a search engine that never sends any traffic.

Reach Global are based in Accrington, which I believe is the same part of the world as Touch ...

Pfui




msg:3918897
 4:56 am on May 23, 2009 (gmt 0)

I redirect the UA because it's circa June, 2003, and the likelihood of a real person actually using it is slim at best. The only okay file is robots.txt; nothing else unless a human touches base first.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Search Engines / Search Engine Spider and User Agent Identification
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About
© Webmaster World 1996-2014 all rights reserved