homepage Welcome to WebmasterWorld Guest from 54.227.11.45
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Visit PubCon.com
Home / Forums Index / Advertising / Paid Inclusion Engines and Topics
Forum Library, Charter, Moderators: Brett Tabke

Paid Inclusion Engines and Topics Forum

  posting off  
Slurp/si and Slurp/cat
WOW
drbill




msg:17785
 3:28 pm on Aug 30, 2000 (gmt 0)

Slurp/si and Slurp/cat are spidering like crazy. started about 6:00am PT and is following everylink on every page.
WOW is has been a long time since I have seen slurp usually it is just the submission spider.. Something must be up today. Any body else getting hit hard?

Regards

Dr. Bill

 

Brett_Tabke




msg:17786
 9:11 pm on Aug 30, 2000 (gmt 0)

I have also just noticed a huge run from Slurp on several domains - full complete crawls.

littleman




msg:17787
 9:52 pm on Aug 30, 2000 (gmt 0)

Unfortunately I don't think this is going to be the golden goose. All the spidering is coming out of Japan (goo3**.goo.ne.jp 202.212.5.**) - when this has happen before it was to maintain the Japanese db. This pattern seems very similar to the other times.

drbill




msg:17788
 12:15 am on Aug 31, 2000 (gmt 0)

Hi,

I am not seeing goo in the listings at all.. do they have a spider of their own?

littleman




msg:17789
 1:18 am on Aug 31, 2000 (gmt 0)

Dr Bill,
It will come in as Slurp, but the IP and host name are out of Japan. It still is inktomi, but it is an isolated database for the Japanese portals.

drbill




msg:17790
 1:24 am on Aug 31, 2000 (gmt 0)

Hi Littleman.

Is the Slurp/si or Slurp/cat or both?

littleman




msg:17791
 1:57 am on Aug 31, 2000 (gmt 0)

The Japanese Slurp I have seen over the last couple of days is actually Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...] - they hit one server 26 thousand times over the last two days - I really hate that they do that, its almost like a DOS. Slurp/si only came in 250 times on the same server. There have been no visits from Slurp/cat for me. So maybe something is happening on your particular sight? That would be nice, eh?

PeteU




msg:17792
 3:06 pm on Aug 31, 2000 (gmt 0)

Ha ha littleman, 26k in two days huh? it only happens when you have large number of subdomains setup
I wonder what you're doing...
;)

drbill




msg:17793
 5:31 pm on Aug 31, 2000 (gmt 0)

Littleman,

Sofar I have had close to 30,000 pages taken and it is still going. I hope it never stops :)

Brett_Tabke




msg:17794
 1:24 am on Sep 2, 2000 (gmt 0)

I've been visited by the entire stable of inktomi spiders in the last 5 days. (notice the very last one).

hits id : ip domain agent
78 SLURP: 216.35.116.108 wm3023.inktomi.com Slurp/cat (slurp@inktomi.com; [inktomi.com...]
53 SLURP: 216.35.116.105 wm3020.inktomi.com Slurp/cat (slurp@inktomi.com; [inktomi.com...]
40 SLURP: 216.35.103.77 j4024.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
40 SLURP: 216.35.103.78 j4025.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
39 SLURP: 216.35.103.54 j4014.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
39 SLURP: 216.35.103.58 j4018.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
37 SLURP: 216.35.103.52 j4012.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
37 SLURP: 216.35.103.60 j4020.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
34 SLURP: 216.35.103.53 j4013.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
34 SLURP: 216.35.103.74 j4021.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
32 SLURP: 216.35.103.59 j4019.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
32 SLURP: 216.35.103.75 j4022.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
31 SLURP: 216.35.103.55 j4015.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
31 SLURP: 216.35.103.57 j4017.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
31 SLURP: 216.35.103.76 j4023.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
31 SLURP: 216.35.116.53 j3013.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
30 SLURP: 216.35.103.56 j4016.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
28 SLURP: 216.35.116.58 j3018.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
26 SLURP: 216.35.116.43 j3003.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
26 SLURP: 216.35.116.54 j3014.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
25 SLURP: 216.35.103.51 j4011.inktomisearch.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
24 SLURP: 216.35.116.57 j3017.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
23 SLURP: 216.35.116.49 j3009.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
23 SLURP: 216.35.116.55 j3015.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
22 SLURP: 216.35.116.41 j3001.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
21 SLURP: 216.35.116.42 j3002.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
20 SLURP: 216.35.116.52 j3012.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
19 SLURP: 216.35.116.45 j3005.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
19 SLURP: 216.35.116.48 j3008.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
19 SLURP: 216.35.116.50 j3010.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
18 SLURP: 216.35.116.59 j3019.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
16 SLURP: 216.35.116.44 j3004.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
15 SLURP: 216.35.116.46 j3006.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
13 SLURP: 216.35.116.47 j3007.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
13 SLURP: 216.35.116.51 j3011.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
13 SLURP: 216.35.116.88 wm3008.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
12 SLURP: 209.185.143.96 q2000.inktomisearch.com Slurp/si (slurp@inktomi.com; [inktomi.com...]
12 SLURP: 216.35.103.71 j110.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
12 SLURP: 216.35.103.72 j111.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
11 SLURP: 216.35.116.56 j3016.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
10 SLURP: 216.35.116.104 wm3019.inktomi.com Slurp/cat (slurp@inktomi.com; [inktomi.com...]
7 SLURP: 216.35.116.106 wm3021.inktomi.com Slurp/si (slurp@inktomi.com; [inktomi.com...]
6 SLURP: 202.212.5.30 goo212.goo.ne.jp Slurp/si (slurp@inktomi.com; [inktomi.com...]
6 SLURP: 209.185.143.84 j5004.inktomisearch.com Slurp/si (slurp@inktomi.com;
6 SLURP: 216.35.103.61 j100.inktomi.com Slurp/si
6 SLURP: 216.35.103.62 j101.inktomi.com Slurp/si
6 SLURP: 216.35.103.73 j5006.inktomi.com Slurp/si
6 SLURP: 216.35.103.79 si4000.inktomi.com Slurp/si
6 SLURP: 216.35.103.80 si4001.inktomi.com Slurp/si
6 SLURP: 216.35.103.81 si4002.inktomi.com Slurp/si
6 SLURP: 216.35.116.103 wm3018.inktomi.com Slurp/si
6 SLURP: 216.35.116.109 wm3024.inktomi.com Slurp/si
6 SLURP: 216.35.116.90 si3000.inktomi.com Slurp/si
6 SLURP: 216.35.116.91 si3001.inktomi.com Slurp/si
6 SLURP: 216.35.116.92 si3002.inktomi.com Slurp/si
6 SLURP: 216.35.116.93 si3003.inktomi.com Slurp/si
5 SLURP: 209.1.13.232 j301.inktomi.com Slurp.so/1.0
4 SLURP: 209.1.13.231 j300.inktomi.com Slurp.so/1.0
4 SLURP: 209.185.122.201 q2005.inktomi.com Slurp/si
3 SLURP: 216.35.116.106 wm3021.inktomi.com Slurp/cat (slurp@inktomi.com; [inktomi.com...]
2 SLURP: 209.131.48.140 crusade.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
2 SLURP: 216.35.116.112 y2300.inktomi.com Slurp/3.0 (slurp@inktomi.com; [inktomi.com...]
SLURP: 164.124.250.232 Slurp/si (slurp@inktomi.com; [inktomi.com...]
SLURP: 209.185.141.226 y400.inktomi.com Slurp/2.0-KiteWeekly (slurp@inktomi.com; [inktomi.com...]
SLURP: 209.67.206.127 j521.inktomi.com Slurp.so/1.0 (slurp@inktomi.com; [inktomi.com...]
SLURP: 211.169.241.21 Slurp/si (slurp@inktomi.com; [inktomi.com...]
y2404.inktomi.com Slurp/2.0-GreatWhiteCrawl (slurp@inktomi.com; [inktomi.com...]

Brett_Tabke




msg:17795
 1:01 pm on Sep 14, 2000 (gmt 0)

Any one else seen "the great white crawl"?

drbill




msg:17796
 1:12 pm on Sep 14, 2000 (gmt 0)

Brett,

I have yet to see Great White :( Would like him/her to visit me.

websurfer




msg:17797
 10:35 pm on Sep 16, 2000 (gmt 0)

Inktomi has been poundingmy sites as well, not as much as others are saying, however my site isn't really that big either. Out of those different spiders, which one are the good ones?

websurfer




msg:17798
 10:38 pm on Sep 16, 2000 (gmt 0)

Do you have a IP Address for that great white??

websurfer




msg:17799
 10:59 pm on Sep 16, 2000 (gmt 0)

Has any one seen this

Tue Sep 12 07:43:25 2000 -- ¦Mozilla/4.72 [en] (X11; U; NetBSD 1.4.2 i386; Nav)¦209.185.141.185¦

When I do a look up on the IP address it comes back as j6000.inktomi.com what's up with that. ??

Air




msg:17800
 12:13 am on Sep 17, 2000 (gmt 0)

Websurfer,

That's the Ink spider that retrieves new submissions, been around for a while now, you must have submitted pages recently.

WebRookie




msg:17801
 7:40 pm on Sep 20, 2000 (gmt 0)

Please clarify for me if Slurp.so is the permanent database for Ink? I'm hoping so, hit very hard on September 17 by both Slurp.so and Slurp.

And could you tell me more about the great white crawl?

littleman




msg:17802
 8:00 pm on Sep 20, 2000 (gmt 0)

websurfer - You have an interesting format for IP logging. Are you running a home grown logging script? Is that the raw perl local(time) function? Just curious.

makemetop




msg:17803
 6:55 am on Sep 21, 2000 (gmt 0)

I got hit by j6000.inktomi.com, slurp/si and slurp/so on the same day (19th) for a site which dropped out of the DB a week ago and I deliberately have not resubmitted. I wonder what will happen now?

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Advertising / Paid Inclusion Engines and Topics
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved