homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Library
Forum: Search Engine Spider and User Agent Identification
Displaying Topics 1 - 40 (57 total) Sorted by: Thread-Subject, Direction: forward
#sort orderSubjectDate
1: new post   AVG - Valid Security Tool or Malware - Part Two
AVG anti virus latest update includes a pre-fetch link scanner tool that some are viewing as malware itself.
June 14, 2008
2: new post   AVG Stops Real-Time Scanning
July 7, 2008
3: new post   AVG Toolbar Glitch May Be Causing Visitor Loss
"Web sites with tight security are turning away AVG visitors with security toolbar broadcasting malformed HTTP headers and user agent strings."
May 10, 2008
4: new post   Another Phorm Type Ad System Discovered?
"The Kindsight service...is funded through an advertising mechanism but without the use of cookies, pop-ups or spam. "
June 20, 2008
5: new post   At Home with the Robots: 2015 Edition
An extensive review of robots, or web crawlers, and behaviour, good, friendly, or unhealthy.
Feb 10, 2015
6: new post   Bad Boy Bot of the Week
Pick a bot, any bot: This weeks fun one is an affiliate bot out walking and not respecting robots.txt.
Dec 6, 2002
7: new post   Blocking WebPage Monitoring Services
Most of these services are unwelcome bandwidth sponges. Lets call them out, identify them, and block them together.
Apr 9, 2003
8: new post   Blocking non-North American Traffic Made Simple
Webmasters discuss how to make an amazingly small optimized IP block list that allows only North American traffic to access a website. The technique can easily be applied to other geographical areas.
Apr 23, 2014
9: new post   Bot ID and Blocking GET xmlrpc.php To Avoid Exploit
WebmasterWorld Members discuss bot blocking to avoid xmlrpc exploit.
July 10, 2015
10: new post   Bot-Blocking Methodology
"WebmasterWorld Members discuss various bot-blocking methodologies."
Dec 13, 2008
11: new post   Botnet Pandemic Threatens Core of Internet
"Up to a quarter of computers on the net may be used by cyber criminals in so-called "botnets", according to Vint Cerf."
Jan 26, 2007
12: new post   Casper Bot Search Attempting To Infect Sites
"Seen quite a few of these over the past few days, generally in groups of half a dozen-ish."
July 7, 2010
13: new post   Comcast Launches Anti-Botnet Initiative "Constant Guard"
"Comcast is taking a leadership role and making a huge step forward in the eradication of botnets."
Oct 12, 2009
14: new post   Dealing With WordPress Comment Spam Escalation
"Just to see what would happen I enabled full comments on my WordPress blog and at first I just let the comments pile up in the WordPress moderation queue as I was curious how bad it would get since nothing ever got published.[br][br]It quickly ramped up from a few a day to 100s a day, peaking currently at over 500 spam posts a day."
Jan 22, 2014
15: new post   Default User Agents of Programming Libraries and Command Line Tools
Apr 13, 2008
16: new post   Digsby IM Enables Web Crawlers Control of Your PC & Bandwidth
Did Digsby just go darkside?
Sept 8, 2009
17: new post   Facebook Sues Data Scraper
"Warden gathered that data from public profiles using "crawling" software similar to what's commonly available on the Web..."
Apr 4, 2010
18: new post   Filtering Out Really Hard To Find Bad Bots
WebmasterWorld Members discuss how best to filter out unwanted, bad bots that are tough to find.
Jan 18, 2013
19: new post   Fresh IP's in MSN's Many Cloaked Bot Arsenal
"No UA, no robots.txt, no REF, no nothing. Not once. Not twice. Not even three times. Try eleven."
Sept 3, 2010
20: new post   Google Messing Up Javascript Stats
This is the first time in nearly 9 years I've seen G blatantly disregard robots.txt and they're doing it with a GoogleBot UA.
May 16, 2011
21: new post   Google's Web Preview Spider
"WebmasterWorld Members discuss the Web Preview Spider, whether it obeys robots.txt, and how to block it."
Nov 19, 2010
22: new post   Googlebot Fails to Pass DNS Verification
WebmasterWorld members have reported that an apparently valid Googlebot is failing DNS verification. Major impact for sites relying on Googlebot validation.
Apr 2, 2014
23: new post   HoneyNet White Paper : Know your Enemy: Web Application Threats
"Web applications present a very high risk, and an attractive target to attackers for the following reasons: Firstly, the quality of the code is often rather poor and many vulnerabilities of commonly used code are published."
Feb 27, 2007
24: new post   Honeynet Project to Trap Spider Attackers and Name Names
"The Web Honeynet Project, an independent group of Honeynet researchers from Securiteam...they plan to name not only the spider attack details, but also to divulge the IP addresses and other tracking information about the attackers themselves."
Jan 27, 2007
25: new post   How To Block Thousands of Spambot IPs Hitting a Site
WebmasterWorld members discuss the best methods of handling and blocking spambots with thousands of unique IP addresses hitting a site, causing bandwidth to rise from 1GB a month to 12GB a day.
Dec 12, 2011
26: new post   How important is the Robots.txt file now?
"Am I missing out on a lot of traffic by not having a robots.txt file? Or is that a thing of the past?"
Jan 25, 2002
27: new post   How to Identify and Block Fake BingBot Visits
How do you identify and block fake BingBot visits to your sites.
Apr 2, 2013
28: new post   IP Banning Primer
"I won't ask why you want to block IPs. But supposing you do, here's how to do it."
Sept 21, 2009
29: new post   Identifying And Analyzing Hostile & Friendly Bot Activity
"The following items can be used to identity bots and slow down and stop most unwanted traffic if applied with proper due care."
Mar 31, 2008
30: new post   Identifying Fake User Agent Strings
User agents come in all shapes and sizes. Some, like the fake Googlebots, are easy to recognize, but what about those really long ones. WebmasterWorld Members help clarify the identification process.
June 11, 2012
31: new post   MJ12bot Implements Ground-Breaking Validation Capability
"...first distributed spider to provide validation for webmasters."
Sept 3, 2009
32: new post   MSN Robot Crawlers Renamed
WebmasterWorld Exclusive: MSNDude "These are all Microsoft bots that have been around for a while...."
July 31, 2006
33: new post   MSNbot Changing to Bingbot on Oct.1, 2010
"we will drop the beta designation from the Bing crawler and change the name of the crawler to reflect Microsoft's new brand for search."
June 29, 2010
34: new post   Microsoft Bot 157 Ranges Updated
Microsoft's 157. range bots list updated.
Nov 16, 2011
35: new post   Microsoft Disables Live Search "Fake Referrers"
"(Microsoft) are working on a fix for this."
Apr 30, 2009
36: new post   Microsoft Launches Azure, an AWS Competitor
July 15, 2009
37: new post   Mysterious User Agent and 404 Random Strings
"Altavista and DIIbot use suspicious request methods to test 404 errors."
June 11, 2002
38: new post   New Fast/Overture Robot UA
Sept 7, 2003
39: new post   New Wave of SQL Injection Vulnerability Probes
Aug 29, 2008
40: new post   Now Seeing Bingbot
"Bingbot is now in the wild."
Sept 29, 2010
#sort orderSubjectDate
open in new window=open in new window

1 2 Next >>
Home / Forums Index / Library
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved