Forum Moderators: open

Message Too Old, No Replies

Szukacz/1.5

         

Marcia

8:34 am on Mar 20, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Szukacz/1.5 (robot; www.szukacz.pl/jakdzialarobot.html; szuka

Just saw for the first time on one site. Sorry, no IP number.

Son_House

9:18 am on Mar 20, 2002 (gmt 0)

10+ Year Member



Saw it also:

bramka.proszynski.pl
194.181.35.5

volatilegx

6:46 pm on Mar 20, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



This is a Polish engine. I also have it under:
194.181.35.6

keyplyr

7:30 am on Mar 26, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I've been seeing it lately as well.

volatilegx

5:48 pm on Apr 4, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Just saw a new IP for Szukacz... 193.218.115.254

mpr

12:30 am on Apr 5, 2002 (gmt 0)



Let me introduce myself. I am one of the three people working on the Szukacz project. The activities of our robot have been noticed in this forum.

Szukacz is a search engine we have been developing over the last 2 years. Its main goal is to search for documents prepared in the Polish language. It is supposed to be a commercial service.

The main duty of the Szukacz robot is to search for Polish documents, wherever they are. However, it also gathers English-language documents for our "The Best of the World" collection.

At present we have two main collections of documents: the Polish collection of 8 million Polish documents from 148 thousand websites and the collection of 8 million documents, mostly in English, from 450 thousand other websites.

We try to eliminate multiple copies of documents as well as multiple copies of whole websites from our archives and from our collections.

The Szukacz robot identifies as Szukacz/1.5. It operates using two IPs: bramka.proszynski.pl and brama.proszynski.pl, where brama and bramka are Polish names for gateway and small gateway, respectively.

The robot is a mature beast now, we believe. It gathers both static and dynamic pages. It has a built-in safeguard not to crawl any single website too often. In selects links to crawl from its link database at random. Moreover, it waits at least a few seconds before enetering the same website for the next page. It follows the robots.txt and the robots metatag protocols. In fact, we do not get too many complains these days anymore.

Our present task is to make the Szukacz search engine fully operational (it is now in the beta stage). Right now we work on the asterisk masking of word endings. It is our goal to be able to use asterisks inside a phrase as well.

Asterisks is one of the features where we hope to be better than Google, which is already quite a strong mark in the Polish-language world.

As a promotion of our engine we offer Polish webmasters a possibility to use Szukacz, free of charge, to let the public search their websites.

Our search engine operates at [szukacz.pl....] However, the interface is in Polish, so it is of rather little use to all non-Polish users. The description of our robot has a summary in English at [szukacz.pl...]

<link fixed ~Marcia>

(edited by: Marcia at 9:49 am (utc) on April 5, 2002)

jatar_k

12:36 am on Apr 5, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member



i would suggest if you use that link to take the period off of the end

<added>
i wish i read polish

wilderness

2:56 am on Apr 5, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



<snip>i wish i read polish>

Here is a German translation to English
It appears to be a functioning page.
There may be more in the Google search. There was no Polish to English translator at Alta Vista.

[translate.google.com...]

wilderness

2:58 am on Apr 5, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



<snip>As a promotion of our engine we offer Polish webmasters a possibility to use Szukacz, free of charge, to let the public search their websites.>

By the way, it would seem that at least one German website is using Szukacz as well?

rogerd

3:55 am on Apr 5, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member



From a phonetic point of view, it seems like a rather unfortunate choice of names...

bartek

5:14 am on Apr 5, 2002 (gmt 0)

10+ Year Member



rogerd,
>unfortunate choice of names...

Looks scarier than it really is. Say "shoe" and "catch" all at once :)
Loosely translated it describes something that searches.

mpr,
Thanks for dropping by.

mpr

9:28 am on Apr 5, 2002 (gmt 0)



The name Szukacz was created for Polish speakers. (So far we havn't thought about competing with Google for non-Polish eyes). From the marketing point of view it is a very good choice. Our marketing line "Search using Szukacz" reads in Polish as "Szukaj Szukaczem"

The German website [searchcodes.de...] contains a query box of Szukacz. It does not employ Szukacz to search this particular site (it seems to have own search engine for local searches). It looks to me like a news service reporting that everybody is free and welcomed to include a query box of Szukacz in his/her website.

However, you can do more than that while including Szukacz query box. You can also add radio buttons, one of which could carry sort of "search this website" label and do local searches. It is done by setting an appropriate value to the "ct" (collection) parameter. Of course such a website has to be crawled and indexed by Szukacz quite independently before the above service could be used.

If a particular website (say, tripod.com) shows up in our "The best of the World" collection (ct=swiat), one can use "ct=tripod.com@swiat" as the name of the collection to limit searches to tripod.com. In fact, the effect is same as adding "host:tripod.com" into the user query (space is equvalent to the AND operator). The difference is that it is the webmaster and not the user who takes care of this.

heini

9:54 am on Apr 5, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hello mpr - welcome to wmw
Thank you for providing us with an insight into your activities and plans.

Will try your engine!

Rumbas

9:57 am on Apr 5, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



Hi mpr,

Would you mind heading over to the European Forum [webmasterworld.com] and shed a little light on the Polish Thread [webmasterworld.com]?

Thanks.
Welcome to WebmasterWorld BTW :)