Forum Moderators: martinibuster
This user_agent string is the exact UA string that my browser records minus the NetCaptor 7.5 entry. Either this is an odd coincidence or NetCaptor is providing the client's UA string to their ad supplier. Not only does this mean that NetCaptor is giving up personal user_agent data, but it means that the bot is not following standard practices of identifying itself correctly.
If it isn't bad enough that the latest version of NetCaptor has bugs that are causing it's ad-blocking features to be turned on while it is itself displaying ads that are based on the content of the webpage, it is denying webmasters the ability to block the bot that helps deliver NetCaptor its ads. This is the very essence of the definition of scumware or parasite.
For those who are interested the bot comes from the Class B IP block of 168.75.0.0 -168.75.255.255, which according to ARIN WHOIS belongs to ClearBlue Technologies
For those who are interested in blocking this bot, I'd recommend blocking the Class C IP address range of 168.75.65. In your .htaccess file the entry would be:
RewriteCond %{HTTP_HOST} ^168.75.65.
RewriteRule ^(.*) - [F]
I am monitoring the situation to see if NetCaptor's ad provider changes the IP address of their bot.
NetCaptor is not providing the UA of the user to our context sensitive ad provider. I'm not sure what "bot" you are referring to. NetCaptor can, at its users preference, add "NetCaptor" to the parenthetical subsection of the UA string. The user can turn that off, as it appears you have done.
¦ the content of the webpage, it is denying
¦ webmasters the ability to block the bot that
¦ helps deliver NetCaptor its ads. This is the
¦ very essence of the definition of scumware or
¦ parasite.
Does Opera let you do this? What we're doing is not any different than what Opera is doing.
Now - it appears that some of you have configs that cause NetCaptor not to display some ads. It may be a javascript problem - we can't replicate the issue. Any ad blocking in the free version is entirely accidental. We're trying to determine what might be causing this, but we've been entirely unable replicate. Sorry,
Adam
Adam Stiles
Stilesoft Inc.
The very fact that your product allows users to remove the key word "NetCaptor" from the UA string, displays context sensitive ads AND blocks banner ads on websites, makes it a parasite that should be held in contempt. If you don't like this designation, then you MUST remove the ability to remove the keyword "NetCaptor" from UA strings AND discontinue the use of context sensitive ads. By providing the ability to hide the word "NetCaptor" from the UA string, you are admitting that your practices are sleazy and you know we would block your browser if we had the ability to do so.
For the record, many of us do not feel Opera's behavior is acceptable either and have taken measures to prevent it from displaying it's ads in conjunction of our webpages.
The code fetched by the iframe in the AdSense blocks is totally different when using IE/Mozilla/Opera compared to NetCaptor. The good code draws the adverts we all know and love. The bad code, as seen in NetCaptor, looks like this:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=windows-1252">
<META content="MSHTML 6.00.2800.1276" name=GENERATOR></HEAD>
<BODY><IMG src=""></BODY></HTML> Obviously, the entire body of the page is missing for one reason or another.
When I load the 'good' code into NetCaptor, it actually displays fine. So I am left beleiving that Google is making some kind of error when generating the code.
However, Adam claims to be able to view AdSense ads. The only other clue I have is this:
<META http-equiv=Content-Type content="text/html; charset=iso-8859-1"> The above snippet is a line from the good code generated by Google when using IE6/WinXP. Perhaps some regional/language setting in conjunction with NetCaptor is causing the bug?
As a side note, why on Earth does Google generate code like this?
document.write('</ifr' + 'ame>');
Would anyone who is seeing AdSense getting blocked mind trying our 7.2.2 version? You can download it here:
download dot netcaptor dot com slash nc722 dot exe
This is an older version and does not have our context ads in it. I still don't think this problem is related to our ads, but we might be able to isolate when the "buglet" was added to our code that caused the problem some of you are seeing.
Adam
Geez, I have rarely seen such a good slam whitout having it degenerated into an unnaceptable flame.
My intention isn't to flame. Adam has taken an admiral stance of speaking up and address the issue raised by this thread. I do tend to believe that the ad blocking in unregistered NetCastor is an unintentional bug.
I am, however, trying to point out as clearly as possible that the overall problem extends well beyond just the ad-blocking issue. For instance, when I added a block for the IP address of the bot that indexes webpages so that the advertiser can provide NetCastor with context sensitive ads, it worked for a couple of pages and then ads started to appear again. While I have to wait until tomorrow to see my logs, I believe that this means that the ad supplier is using methods to circumvent efforts to prevent their bots from indexing a site. This is very sleazy and is an unethical behavior for any bot. If a website owner does not want bot 'X' to index their site, they have the right to block bot 'X'.
Now granted NetCastor is not in control of the bot their ad provider uses, however, they have an ethical responsibility to require that their advertiser properly identify themselves when they index pages and that the bots in question properly obey the robots.txt file.
AdamNetCaptor is not providing the UA of the user to our context sensitive ad provider.
When Netcapter sends an HTTP request to their advertiser to acquire text ads, the HTTP request contains the UA string in the header per normal HTTP protocols (just as normal HTTP requests do). The advertiser is then taking this UA string and striping the NetCaptor information out and using this as the string their bot uses to index a page to determine what ads to display.
I'm not sure what "bot" you are referring to. NetCaptor can, at its users preference, add "NetCaptor" to the parenthetical subsection of the UA string. The user can turn that off, as it appears you have done.
No I am using NetCaptor in it's default configuration here are the two hits to a test file I created that I knew that only NetCaptor and any bot that would index my page for the purposes of providing NetCaptor with ads would hit. The file in question was immediately deleted from my server after the test was run.
{masked for security purposes}.example.com - - [11/Jan/2004:18:09:00 -0500] "GET /NetCaptor.html HTTP/1.1" 200 16182 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; Q312461; NetCaptor 7.5.0 Gold; .NET CLR 1.0.3705; .NET CLR 1.1.4322)"
XXX.XXX.65.68 - - [11/Jan/2004:18:09:01 -0500] "GET /NetCaptor.html HTTP/1.1" 200 16182 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; .NET CLR1.0.3705; .NET CLR 1.1.4322)"
Note these things:
I don't think Adam is disingenuous, I just think he doesn't understand the mechanics of the way his browser gets its ads as well as he should.
[edited by: DaveAtIFG at 7:45 pm (utc) on Jan. 12, 2004]
[edit reason] Specifics deleted [/edit]
Note that the client UA does get sent to example.com, but those are MY servers. Our server does not pass that UA on to ContextWeb to make the ad requests.
I'll check with ContextWeb about how they make their user-agent. I will ask them to to properly obey robot.txt. If they won't make the change, I won't continue to use their services.
Adam
[edited by: DaveAtIFG at 8:28 pm (utc) on Jan. 12, 2004]
[edit reason] Removed specifics [/edit]
The Java Script I have on my pages to load images of books (linked to Amazon) works just fine. So do images & links to Amazon that are coded directly to display on the page. I have no paid ads, so I can't say about them.
To get your domains added to ContextWeb's list of sites their bot won't visit, go to ContextWeb's website (http://contextweb.com/contact.htm) and send them a polite email listing your domains you want them to stop indexing. When sending them an email politely explain to them why they need to obey the robots.txt file and user_agent string.