Forum Moderators: open

Message Too Old, No Replies

Is Ebay attempting wholesale site downloads?

Regular attempts from the EBay Network to download my entire site.

         

AlexK

10:52 pm on Sep 22, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



My site uses a bot-blocking routine [webmasterworld.com] originally posted in WebmasterWorld and refined by myself. Usefully, it logs all attempts to auto-download the site, divided into "fast-scrapers" (the majority) and "slow-scrapers".

Recently, I noticed the same IP appearing at irregular intervals across the last 7 weeks or so. The IP resolves to the EBay network [whois.]. It is difficult to believe that my little site would be picked out by someone working for EBay; far more likely that this is some kind of concerted effort to download websites wholesale. Perhaps you would be interested to check your own logs...

Here is a summary of the bot-attempts. Understand that these are all failed attempts (a 503 is returned). The number at the end is the number of tries within the same second, and gives an inkling of the speed of the attempts.

Blocked IPs:
.
# 216.113.181.** [ 216.113.181.** ] 58 lines
.
Time : Number of attempts
2006/09/22 18:57:22 : 1
2006/09/22 18:57:21 : 9
2006/09/16 01:58:08 : 2
2006/09/16 01:58:07 : 7
2006/09/14 08:19:04 : 5
2006/09/14 08:19:03 : 4
2006/09/05 15:45:34 : 6
2006/09/05 15:45:33 : 2
2006/08/13 18:21:42 : 12
2006/08/05 14:44:01 : 2
2006/08/05 14:44:00 : 8

I did take the time and effort (and expense) yesterday to make an international phone call from the UK to EBay-USA, and speak to "General Support" to notify EBay of this occurring from their network. They said that they would phone me back. No-one has done so.

[edited by: volatilegx at 3:08 pm (utc) on Nov. 30, 2006]
[edit reason] obfuscated ip addresses [/edit]

prozac

10:10 pm on Oct 10, 2006 (gmt 0)

10+ Year Member



i have the same problem

same IP address

216.113.181.67

its slowing down the server big time ( we have a really large site)

physics

10:32 pm on Oct 10, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



What does the bot UserAgent look like?

AlexK

2:17 am on Oct 12, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



physics:
What does the bot UserAgent look like?

(sorry for the long delay)

Looks exactly like this:

Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; Q312461; .NET CLR 1.1.4322)

(each hit--about 80 recently--is identical UA)

physics

6:16 pm on Oct 12, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



It's suspicious that it doesn't have a 'real' bot UA.

It could be a few different things:

- Ebay is developing some new kind of search engine (seems unlikely)
- A computer at Ebay has become infected with a virus and people are using it as a proxy (seems likely enough)
- An employee at ebay or a group there is doing some research (possible)

Do your sites have products or are they info only?

p.s. Neither of you does any business with Ebay, like affiliate right?

wilderness

12:45 pm on Oct 13, 2006 (gmt 0)

AlexK

6:52 pm on Oct 14, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Hmm, interesting links, wilderness - thanks.

For the record, no direct business from myself with EBay, whether Affiliate or other. Also, all images have hot-link prevention in httpd.conf.

Current behaviour: Hit and run tactics.

Visits daily and takes between 4 and 20 pages, very fast (recent max was 11 Oct, 20 pages in 2 secs) (any more and the bot-block would kick in - suggests human mediation). No images; page + stylesheet only. No referrer on any request. Does NOT accept compressed pages.

UA has changed:

Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.8.0.7) Gecko/20060909 Firefox/1.5.0.7

Very many page-requests have 18-bytes of white-space preceding the Request. eg:
"GET  /Broadcom/ HTTP/1.0"
(I need to re-visit my mod_rewrite as one of the rules is supposed to fix this - it is not working!)

physics:

Do your sites have products or are they info only?

Site has driver-downloads available; primarily is content (info).

wilderness

2:36 pm on Oct 25, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



As previously provided, I've had numerous problems with eBay sellers copying information from my pages and providing links to my pages to support the selling of items.
(It's certainly within the realm of "fair use" if source is credited, however that was not done in this instance.)

This a violation of eBay's UAG for listing items, however eBay makes the reporting of such violations near impossible and even requires that you have an eBay account to report under a categroy of violation which is "similar, however not the same".

Yesterday at 15:04 EST a vistitor accessed one of my pages and copied the info.
At 15:35 the eBay item was listed-active and the link was checked and verified.

At 15:37 the BOT then checked the link as well.

Later I added a redirect to one of my other sites and pages which sells widgets of a topic similar to the page that was being linked to. (eBay and seller are quite lucky that I didn't redirect their potential customers to a porn page.

I've reported the violation to eBay, however, as has been there past record, I do not really expect any response until the sale ends.
This item has a current bid of 1k and has not meet the reserve.

In addition, this morning I had a website inquiry from another IP range (not the same as the initial seller access) looking for FREE information to the page that they had been redirected away from.
My response to the inquiry was a required registration and payment of research fees. (the information is not available any where else online and even publications containing this widegt information would be difficult to find).

Is eBay spidering?
Who knows what they are doing, however the page verifications (if intended) are not used to enforce violations of UAG. (else the item would have been removed immediately from active listings.)

Don

wilderness

1:48 am on Oct 28, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Just a follow up on this.

eBay responded to my UAG violation late Friday afternoon (which seems to be there MO asking for more information, which unfortuamtely will not be responded to until Tuesday).

The eBay bot returned today and when normally has been taking five or less pages from each of my sites, gathered more than a dozen from each.

I've denied the bots range.
1) they are not using the bot to create solutions for violations of their own UAG (even though the bot is verfying these links).
2) I see no reason for their unidentified bot (no UA) to be crawling pages.

Don

AlexK

4:45 pm on Nov 3, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



The wretched EBay scraper keeps going at a maniac rate - upto 14 pages/sec attempted during one blocked attempt, and gets blocked by the site bot-block routines (due to over-fast scraping) once or twice a week.

wilderness

5:22 pm on Nov 3, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Alex,
I received a response from eBay on Monday 10/30 on report of violation of their won UAG for sellers listings.

They've insisted that I join their VERO program to allow reporting of abuses.
It's absurd!

1) They require an active eBay account to report violations
2) their menu system does not cover such abuses and must be "fudged" to report a submission.
3) Now they require VERO.

I've denied their bot and redirected any links from eBay and/or sellers to other pages.

On the tail of my previous report, there was yet another seller listing and violation of their UAG on Wednesday-11/01.

Don

volatilegx

5:28 pm on Nov 3, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I wonder what would happen if you had your attorney write them a C&D letter? Would the attorney have to be an ebay user? I think the reporting policy is ridiculous, too.

acb123

7:01 pm on Nov 15, 2006 (gmt 0)

10+ Year Member



I too am having trouble with this spider: hit 20 pages of my oscommerce store multiple times all at once today.

Any news on it?

For now I have denied access via htaccess, but wondering if there is a user-agent somewhere. Could denying ebay access to the online store have repurcussions for the ebay store?

wilderness

3:50 am on Nov 16, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



acb,
If anybody hasn't, welcome to Webmaster World.

For now I have denied access via htaccess, but wondering if there is a user-agent somewhere.

I've never seen a UA when the bot crawls.
I did see a UA (of which I have misplaced and looked for repeatedly) when a human intervened to verify the reporting of violation of UAG. It was only that single instance, not before and not since.

Could denying ebay access to the online store have repurcussions for the ebay store?

Very likely.
In fact I've even wondered if ranting on their sellers UAG violations against my websites, could possibly effect my eBay account. (no store).
Although I'm able to differentiate between my websites and my eBay account, eBay may not see the same difference as I
do :(

Don

wilderness

3:52 am on Nov 16, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I wonder what would happen if you had your attorney write them a C&D letter? Would the attorney have to be an ebay user? I think the reporting policy is ridiculous, too.

Problem is Dan that by the time the C&D letter reached eBay and filtered down to the proper person that the item listing and violation of UAG would have likely ended ;)

Don

acb123

4:30 am on Nov 16, 2006 (gmt 0)

10+ Year Member



Thank you Don, for the welcome, and for the information. I have unblocked ebay from crawling my site, but it really is pernicious; it opens up about 20 or 30 sessions all at once. I have searched all over the web, and have yet to find anything else about this phenomenon.

wilderness

5:20 am on Nov 16, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I have searched all over the web, and have yet to find anything else about this phenomenon.

You mean that eBay (and its sellers) have selected a few individuals to pester and is negelecting the entire WWW ;)

A while back, one of the sellers that had linked to one of my pages, had links going to half-a-dozen different websites in the description of his/her item. (even linking to images on sites that didn't have protection in place.)
This practice of using supporting links to provider broader descriptions of sellers items is really vague in eBays policies.
In fact, in one portion of the policies the FAQ says that it's OK to link to a for sale page on your own website of comparable products, "just as long as that page doesn't offer a link to your websites home page".
I joined an eBay discussion group for a short period to seek clarification on these policies and most everybody there was scared to venture outside the policy limb for fear of losing their eBay accounts ;)

incrediBILL

9:18 pm on Nov 18, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Here's another IP from eBay worth noting:

216.113.168.*** "Java/1.5.0_09"

Tried to access a couple of pages although the UA was blocked, so it already knew about the pages existence. VERY ODD

[edited by: volatilegx at 3:06 pm (utc) on Nov. 30, 2006]
[edit reason] obfuscated ip address [/edit]

acb123

5:59 pm on Nov 21, 2006 (gmt 0)

10+ Year Member



It's BAAAACK...

216.113.181.**

Can't seem to spot its user-agent; only identified as eBay. If I can't spot a user-agent, it opens a session every visit. Anyone else have a user agent for this IP?

-_ACB

[edited by: volatilegx at 3:06 pm (utc) on Nov. 30, 2006]
[edit reason] obfuscated ip address [/edit]

acb123

6:02 pm on Nov 21, 2006 (gmt 0)

10+ Year Member



IncrediBill, what is the user-agent you are seeing from this annoyance? Is it "Java/1.5.0_09"?

Would blocking that from generating sessions screw up anything else, you think?

Best,

ACB

acb123

5:00 am on Nov 29, 2006 (gmt 0)

10+ Year Member



Saw a user agent:

User Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.2; SV1; .NET CLR 1.1.4322)

Clearly a bot, and clearly not paying attention to robots.txt...

Grr.

38.96.247.**

This one isn't ebay, though; it is "Performance Systems International"

Anyone heard of them?

--ACB

[edited by: volatilegx at 3:05 pm (utc) on Nov. 30, 2006]
[edit reason] obfuscated ip address [/edit]

wilderness

2:56 pm on Nov 29, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



[google.com...]

acb123

12:29 am on Nov 30, 2006 (gmt 0)

10+ Year Member



Ok, the Ebay thing is back:

here is its user agent.

User Agent: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.8.0.7) Gecko/20060909 Firefox/1.5.0.7

Is there anything I can do to block this from generating sessions?

--ACB

acb123

12:32 am on Nov 30, 2006 (gmt 0)

10+ Year Member



(IP address: 216.113.181.**)

[edited by: volatilegx at 3:05 pm (utc) on Nov. 30, 2006]
[edit reason] obfuscated ip address [/edit]