You seriously let that thing access your site?
Letting anything archive your content should be avoided, it has more pitfalls than benefits.
I'd base valid on the first two user agents below and if it comes from AWS or archive.org IP ranges.
USER AGENT: "ia_archiver (+http://www.alexa.com/site/help/webmasters; firstname.lastname@example.org)"
USER AGENT: "ia_archiver(OS-Wayback)"
The bad and the ugly:
USER AGENT: "ia_archiver"
Astounding but true: If you e-mail them and ask, they will tell you if it's theirs. I can count on the fingers of one hand the number of times I've got an answer to a "Please identify your robot" query.
|Letting anything archive your content should be avoided, it has more pitfalls than benefits. |
lets not be so vague here ;)
IMO that archival also applies to major SE's and "cache.
No Bill, to answer your question; nothing archives my content. However ia_archiver is also the bot Alexa uses to check your site and keep it in its index, which does have a positive affect when my advertisers check ranking prior to deciding whether to purchase one of my ad campaigns. Thanks for the IP list.
You're very welcome.
I have never permitted ia_archiver on my site yet have always showed up in Alexa.
I also sell advertising, never had anyone ask about Alexa, ever :)
Sorry to say, Alexa data displays on too many domain/traffic info sites, sometimes not identified as Alexa. I've only received feedback twice about Alexa stats during biz proposals, but I assume many more potential customers consider those numbers. Happily, I think the Alexa phenom has steadily decreased in importance over the last few years, replaced by the even more perplexing phenom of social media. Regardless, my strategy of covering all my bases remains my MO.
Alexa it to ranking what "MIPS" is to CPU benchmarks: meaningless indicator of performance.
@ motorhaven - agreed, but countless users don't know that. I never get the chance to enlighten many potential clients, they just make their money decisions with the data at hand, valid or not.
I agree with that. I recently had someone approach me about doing a major overhaul on his site and he kept bragging about his Alexa ranking. When I gave him more accurate information, he was disappointed.
I'd put Alexa up for potential advertisers, but my experience is that blocking my sites from Alexa outweighs the potential advertising benefits.
Obviously this varies for all of us depending on niche, so it comes down to a business decisions pro/con. Quite similar to Websense - sites which rely heavily on corporate traffic may not want to block it while others may find it beneficial.
|Obviously this varies for all of us depending on niche, so it comes down to a business decisions pro/con. Quite similar to Websense - sites which rely heavily on corporate traffic may not want to block it while others may find it beneficial. |
Given today's market trends and the variety of devices (PC's, cell-phones, Handhelds), I'm seeing legitimate users simply switching devices.
Course that's not economical for the mass harvester, nor are the bandwidth restrictions of most mobile devices.
This obnoxious bot using the China IPs is now asking for the index page of one site almost 100 times a day.
Why do they keep coming back with such frequency?
Today started trial period blocking all variations of ia_archiver, Alexa, etc.
I'll be monitoring the numbers with advert requests in relationship to any changes in Alexa ranking. Will post anything significant in a few weeks, thanks.
Real ia_archiver DOES respect robots.txt - haven't seen the real one in years.
It shouldn't impact Alexa rankings. Alexa builds ranking based on hits from browsers with the plug-in installed. I blocked it for years on a site in the top 10,000, and several other sites with no negative change in Alexa rank (and wouldn't care if it did!).