Forum Moderators: open

Message Too Old, No Replies

Whats happened to Search Engine Identification...

i

         

Josk

3:10 pm on Nov 4, 2003 (gmt 0)

10+ Year Member



Hi,

I just noticed that the Search Engine Identification forum is being depricated... Can I ask what the rational behind this is? I find this one of the most valuable parts of WebmasterWorld...

J

pendanticist

6:56 pm on Nov 4, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Well, Brett will fill us in on his decision. Although it was mentioned before that he would be closing one or two down. That was just before opening the Apache Server one.

Oh, he also closed the Deprecated - Search Engine Spider Identification forum.

Pendanticist.

bakedjake

7:13 pm on Nov 4, 2003 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



Interested to see where the new posts should go.

Brett_Tabke

7:18 pm on Nov 4, 2003 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



There is a massively disproportionate set of problems related to that forum and subject matter. We have had so many serious issues arise, that I just don't feel it is worth the risk to let that forum remain open.

Basically, the problem is that of posting IP addresses publically. Would you want your ip address posted on the highest traffic webmaster site on the net? Think about it...

We are discussing alternatives.

bakedjake

7:24 pm on Nov 4, 2003 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



I understand. It sort of becomes a spam blacklist service - and those in turn become some of the most hated organizations on the 'net, targetted legally and technologically by people who don't like what they're doing.

Again, interested in how it pans out.

BlueSky

8:13 pm on Nov 4, 2003 (gmt 0)

10+ Year Member



In this post here [webmasterworld.com], you express concern about bots running up frauduant clicks on websites. Yet, their presence is pervasive throughout the net and affects far more than just PPC programs. Keeping bots under control is just as much or even more of a chore than placing well in a search engine or creating a well-designed site. I suspect many here get visited far more by bad bots than good ones. Some may not even realize it until they are faced with unexpected bandwidth run-overs or have had their entire site stolen. That forum gave an avenue for people to compare notes and try to research what was happening on their sites. Closing it down will just migrate the info over to other forums which some had been doing anyway.

You say you're concerned about people posting IPs. That info is needed to determine if the bot should be banned by IP or UA as well as figuring out whether it really belongs to a SE or is an imposter. Without sharing info on these, people will just blindly copy the "perfect ban" list as being gospel not having a clue what any of it means, or that they may never see the majority of those on that list, or wonder why one wasn't stopped by an incorrect directive.

Chndru

8:23 pm on Nov 4, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Just a point. we are talking about Spiders not individuals or corporations behind it. So, i see no privacy issue behind it. IMHO, if the so claimed most traffic site for webmasters can't take a stand, who else will?

Josk

10:59 am on Nov 5, 2003 (gmt 0)

10+ Year Member



My point of view is this... I run my own spider traps and my spider tracking system. Data from this is then fed through to a system that either allows these spiders to roam freely it blocks them out. (I also post IPs to help others who don't have server farms to comb for nasties)

Now, if someone out there wants to write a bot then fine. But there is a way of writing a bot that is socially responsible. Have a meaningful UA, don't grab too many pages at a time, take notice of my robots.txt file. You don't do that then you get banned.

Get banned by me at least. If your IP has been posted then you *may* banned by others. You may get banned because you are from the wrong country. You may get banned because you are the wrong type of bot. You may get banned becuase you are a malicious bot. That decision is up to each webmaster.

And that's what I'd argue for keeping the spiders forum. By exposing these IP's we can try and generate social pressure to stop malicious bots. And by giving out the IP we allow webmasters to force these bots out technically.

Peeress

2:57 am on Nov 6, 2003 (gmt 0)

10+ Year Member



Well I'm sorry to hear about it closing, as that forum was addictive. But then, (I have to admit) I was becoming influenced (obsessed?) too much from reading posts that a bot or certain IP misbehaved on someone elses site, and I would think 'oh no! i better ban them too! they're taking over the world!' which is totally unnecessary as I have a bot trap which is suppose to decide for me (which I learned from reading other Webmasterworld forums)....so yes, that whole subject definitely can get out of hand lol.

Just because an IP or bot was bad on one site did not mean it was bad in mine. I also seem to be banning more UA's (or certain words) than IP's anyways (as the bot trap points those out to me) and this seems to be effective, (unless a certain ip is bothersome).

I think the most beneficial thing for people is to learn 'how' to ban IP's and bots, (how to make a botrap/read logfiles/determine if a visitor is misbehaving/where to look for the UA etc.etc.), rather than "what" to ban, that's all we really need to know.
As said previously, it's really up to the individual webmaster as he studies his logfiles.

If the search engine identification forum was causing serious problems for Webmasterworld, then I'd rather it be closed! I'm sure this will take nothing away from this site, only add to it.

Thank you for providing us with Webmasterworld.
It's one of the very few worthwhile and valuable sites on the net.

Josk

10:48 am on Nov 6, 2003 (gmt 0)

10+ Year Member




> If the search engine identification forum was causing
> serious problems for Webmasterworld, then I'd rather it be
> closed! I'm sure this will take nothing away from this site, > only add to it.

I'd rather this could be discussed so that a more proactive decision could be determined. Deciding not to talk about (potentially) bad bots is just going to drive the discussion away to somewhere else. Also it is fine for people who have the resources to find malicious bots, but what about the smaller guys?

So are malicious bots going to be allowed to be discussed in the robots.txt forum? Are bots that aren't from the main searchengines going to be discussed? If they are then I don't see a problem with the old forum, and if they aren't then I must say I'd be very disappointed...

Brett_Tabke

1:49 pm on Nov 6, 2003 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



>bots

A few dufus heads aren't posting bots. They are posting competitors or people they want attacked ip's.

According to legal advice, posting someones ip may be determined to be personally identifiable information and against the law in some countries. aka: red flag houston - legal alert - stop the presses - end of story.

Josk

2:08 pm on Nov 6, 2003 (gmt 0)

10+ Year Member



Have there been actual legal threats due to this...?

So will posting useragents be allowed? Its very annoying that some people are posting misleading information for personal gain or for some stupid personal vendetta...

Brett_Tabke

3:06 pm on Nov 6, 2003 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



yes x 2.

sidyadav

2:58 am on Nov 12, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I was thinking about that a long time ago
How can a site allow the Insulting of certain IPs/properties when there is No Actual proof that that bot/spider/crawler/worm visited the page, I mean , If I wanted I could post a message:
"I got a hit from this bot:
[12/11/2003:09:02:23 -0500] "GET / HTTP/1.0" 200 2163 "-" "BandwidthOverloader/1.0" from IP 55.676.22.42 (Its all fake)
hmm... Looks like another one to ban"

and so people will of course believe me and Ban all the hits coming from 55.676.22.42

Sid