Forum Moderators: open

Message Too Old, No Replies

Rexyobot

         

Pfui

1:34 am on Jun 21, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



nn.ftth.concepts.nl
Mozilla/5.0 (compatible; Rexyobot/2.03; +http://www.rexyo.com)

robots.txt? NO

Referer spammer? YES: [rexyo.com...]

The Netherlands-based 'fastest, newest' SE in beta "invitation" stage. Launching...soon.

GaryK

2:35 pm on Jun 21, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Seems to be churning through the version numbers quite quickly.

On the 14th it hit a few of my sites as 2.03. Last night it came back as this:

Mozilla/5.0 (compatible; Rexyobot/2.04; http://www.example.com)
213.148.250.?
d594fa48.ftth.concepts.nl

No robots.txt text, just the default root page for each site.

Every request included referrer spam.

Pfui

4:46 pm on Jun 21, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



The log spam is newish, apparently, because an older hit I found in my notes was spam-free; also pre "Mozilla." From November, 2008:

85.113.243.125
RexyoBot 1.11

robots.txt? NO

(Also through concepts.nl)

GaryK

6:36 pm on Jun 21, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



That's what I show too. Should have looked earlier but I was in a rush to get to a father's day party.

RexyoBot 1.11
First seen: 11/8/2008
Last seen: 11/16/2008
Total Visits: 5

ADDED: Oh, I also have:

Mozilla/5.0 (compatible; Rexyobot/1.09; [rexyo.com...]

Mozilla/5.0 (compatible; Rexyobot/1.12; [rexyo.com...]

[edited by: GaryK at 6:36 pm (utc) on June 21, 2009]

Rexyo

6:11 pm on Jun 29, 2009 (gmt 0)

10+ Year Member



You guys are right. Rexyo has made some progress on its webcrawler.

A while back (in 2007) the RexyoBot was released at version 1.x Last Month (May 2009) we upgraded our webcrawler and released version 2.x to satisfy our users with a free invitation. We plan to invite a couple of hundred individuals.

Setting the referer link is not an uncommon technique for searchengines across the web. The bigger searchengines from companies like Google, Yahoo and Microsoft all make use of this referer technique.

GaryK

7:12 pm on Jun 29, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Welcome to WebmasterWorld!

Respectfully, the way you're doing the link now is seen by many of us as link spam. The way you used to do it, with a link to your bot page, was far more acceptable.

[edited by: GaryK at 7:15 pm (utc) on June 29, 2009]

Pfui

7:14 pm on Jun 29, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



1.) Thank you for chiming in, Rexyo. Any thoughts on when you'll enable robots.txt-checking?

2.) In my experience, "setting the referrer link" to a bot's base URL (a.k.a. log-spamming) is all too common among smaller (as opposed to bigger) search engines.

With the exception of MSN's fake referers [webmasterworld.com], Google, Yahoo and Microsoft include an individual's search phrases in their referers and thus the entries are not bot-PR log spam but useful info.

Seeing as how your base URL is already in your UA, why use it to referer-spam, too?

Rexyo

9:23 pm on Jun 29, 2009 (gmt 0)

10+ Year Member



Some of the features that were originally in RexyoBot 1.09 are not yet ported to the (2009) version 2.04

Checking for the file "robots.txt" has always been considered important by the owners and will always be on the list of features. In fact, RexyoBot 1.09 through 1.13 all had this feature.

For now, RexyoBot is configured to only crawl homepage indexes. It will not go deeper than your homepage url: www.example.com, so the robots.txt feature is turned off.

However, when RexyoBot will start crawling deeper into websites, the robots.txt feature will be finished by our engineers and of course turned on.

Rexyo would really like to get some beta-testers to join us as soon as it goes live. Setting the referer link provides us a way to invite our future users. As we generate more traffic, the referer feature may be configured differently.

Pfui

10:03 pm on Jun 29, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Your comments confirm your bot's conduct, thanks. Alas, the fact Rexyobot does not currently request/honor robots.txt files -- which, of course, commonly Disallow all directories and files, including homepage URLs -- plus the fact Rexyo.com spams others' logs for its own benefit, well, each fact makes Rexyobot deservedly block-worthy, sorry.

[edited by: Pfui at 10:05 pm (utc) on June 29, 2009]

Rexyo

8:41 am on Jun 30, 2009 (gmt 0)

10+ Year Member



I have pushed our team to finish up a new release of RexyoBot. RexyoBot 2.06 was released this morning. The robots.txt feature from the 1.x series has now been ported to our current crawler.

It also includes some interesting new features. The referer feature has been turned off as the number of available Rexyo Beta invitations has declined.

Both thanks for your posts on RexyoBot, Pfui and GaryK