Forum Moderators: bakedjake

Message Too Old, No Replies

A new Metacrawler with a very good submission feature

The Realtime submission crawling should be adopt by others

         

Allergic

3:04 pm on Dec 8, 2002 (gmt 0)

10+ Year Member



A new Metacrawler as appear. Another one, some of you will think.

I mention it because he got a new submission feature that is very useful to see which pages are hit. It is showing you in real time all the pages it's fetching with the Http response (200, 403, etc).

A really great feature that should be adopt by all the other search engines.
[deltis.com...]

jimbeetle

9:06 pm on Dec 8, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Very interesting.

Did a submit and it found a couple of very vexing 404 errors that I have been unable to track down. Have been trying for a few months and I swear they don't exist, all similar to mydomain.com/directory/&. Don't know where the heck the "&" comes from. Maybe now I'm seeing something from the spiders point of view.

Kind of curious how long they can "provide a faster and more accurate search with no ads, pop-ups or other annoyances." They're gonna have to make money someplace.

Allergic

6:05 pm on Dec 9, 2002 (gmt 0)

10+ Year Member



Yes Jimbeetle it is a good tools to see what is parse by the robot.

But they should explain a bit more the Hierarchy feature by telling us it is for the result windows only? Not for the real crawling process!?!

turk182

6:17 pm on Dec 9, 2002 (gmt 0)

10+ Year Member



I've tested it and seems pretty interesting. I've also found some 404 extrange errors. I agree with you Allergic: they should explaing a bit more the Hierarchy feature.

turk182

6:21 pm on Dec 9, 2002 (gmt 0)

10+ Year Member



One more thing. In their search listings I've found some strange things, like pages with number 1 position in Google for some keywords although the first page in a normal search for these keywords should be from our web site.

Allergic

6:26 pm on Dec 9, 2002 (gmt 0)

10+ Year Member



Yes Turk182 I saw the same thing. I guess they use an older version of the Google's index!?!

andreasfriedrich

10:00 pm on Dec 9, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



As for the 404 errors it seems that their spider has some problems resolving relative URIs. I let it check one of my site and it reported total processed pages : 565, total 404 pages : 504 when in reality there are 348 pages and one 404 error.

Andreas

jeremy goodrich

10:10 pm on Dec 9, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Glorified meta engine. Sorry, but who cares?

It would take less than a week to put online the same thing, for many people here.

Are they authorized to use Google's results? If not, I expect that to go away fairly quickly...as for the rest, taking 10+ seconds to deliver me 54 results is hardly worth mentioning.

If we could please discuss some engines with 'merit' as in, something that grabs the user, *not just another meta engine* this forum would be much more valuable.

Up and coming engines, stuff with exciting, interesting technology, companies with researchers who 'know their stuff' or companies with a new idea in information retrieval - that's what we all should be talking about.

There are link checkers - free - that will do you much better for your site, if you are worried about 404's, etc. Open source, freely downloadable, and that will give you full http code eg, 200, 302, 404, 403, etc.

Nothing personal, I'm sure the people that put this thing together are happy with it - but every other time somebody goes and posts another unknown meta engine, it's one they are affiliated with. :)

Sorry, but as to 'other engines should adopt this' not hardly: it's your job, mine, and every other webmaster out there, not the search engine, to check for broken links. :)

Allergic

11:02 pm on Dec 9, 2002 (gmt 0)

10+ Year Member



Sorry but I not really agreed Jeremy.

Why use, and download, a link checker who simulate a robot (and your never sure it is acting like the bot the search engine use) and after that use another software to read your log file to see if the bot have effectively hit your page when all the search engines will do it anyway but after putting you on a waiting list?

You prefer waiting from 2 days to 4 weeks to see the crawling bot was stuck somewhere and redo the process the next month?

In a perfect world a webmaster control everything but in my case i normally deal with (and try to educate) plenty of peoples!

I sometimes deal with a agency who make the marketing part and control the project, with the back-end guy who make the database and the directory structure of the site, with the studio who make the graphic template layout and design of the site, the client who sometimes badly use the content manager and the hosting provider who sometimes misconfigured the web server.

I point out this metacrawler only for that feature! If a search engine can help to save time I will take it.

Nick_W

11:08 pm on Dec 9, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Try a Google search for "Link Checker" -- there are dozens of tools that provide you with free reports in seconds...

Nick

Allergic

11:29 pm on Dec 9, 2002 (gmt 0)

10+ Year Member



I know that Nick_W, and i use thoses.

But sometimes the link checker and the online spider simulator give you a perfect score and see all the links ok.

But one of my friends who have a 800 pages in Php, have just 21 pages in Google after 5 months! I test everything with online tools and also software link checker and everything seem perfect. But nothing change :-(

If Google or Alltheweb put a well programmed tools like this I will loved it!

Reno

2:26 am on Dec 13, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



It is a fine meta service, as meta services go. It's the second new one this week that I've seen (ZapMeta.com being the other), which only reinforces what is becoming an obvious trend:

The number of meta crawlers is going up;
The number of real search engines is going down.

An unfortunate inverse relationship, in my opinion....

Finder

3:16 am on Dec 13, 2002 (gmt 0)

10+ Year Member



But one of my friends who have a 800 pages in Php, have just 21 pages in Google after 5 months!

This may be due to too many variables in your links, not 404 errors. My site is in the same position.

index.html?section=foo&sub=blah is okay.

index.html?section=foo&sub=blah+another+variable is not.

Google takes the first url, but not the second. Fast, on the other hand, will take both. My entire site is in Fast while Google has about half.

jimbeetle

4:54 pm on Dec 13, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Just did a search in quotes for one of my keyword phrases. The results don't really parallel those from the SEs themselves and totally and completely break down at around number 30 -- nothing faintly related to a highly competitive phrase.

Must be that Find Filtered feature that ensures "the final results that get displayed to you are the most relevant to your search term." Looks like they have a bit of work to do.