Forum Moderators: bakedjake
I mention it because he got a new submission feature that is very useful to see which pages are hit. It is showing you in real time all the pages it's fetching with the Http response (200, 403, etc).
A really great feature that should be adopt by all the other search engines.
[deltis.com...]
Did a submit and it found a couple of very vexing 404 errors that I have been unable to track down. Have been trying for a few months and I swear they don't exist, all similar to mydomain.com/directory/&. Don't know where the heck the "&" comes from. Maybe now I'm seeing something from the spiders point of view.
Kind of curious how long they can "provide a faster and more accurate search with no ads, pop-ups or other annoyances." They're gonna have to make money someplace.
It would take less than a week to put online the same thing, for many people here.
Are they authorized to use Google's results? If not, I expect that to go away fairly quickly...as for the rest, taking 10+ seconds to deliver me 54 results is hardly worth mentioning.
If we could please discuss some engines with 'merit' as in, something that grabs the user, *not just another meta engine* this forum would be much more valuable.
Up and coming engines, stuff with exciting, interesting technology, companies with researchers who 'know their stuff' or companies with a new idea in information retrieval - that's what we all should be talking about.
There are link checkers - free - that will do you much better for your site, if you are worried about 404's, etc. Open source, freely downloadable, and that will give you full http code eg, 200, 302, 404, 403, etc.
Nothing personal, I'm sure the people that put this thing together are happy with it - but every other time somebody goes and posts another unknown meta engine, it's one they are affiliated with. :)
Sorry, but as to 'other engines should adopt this' not hardly: it's your job, mine, and every other webmaster out there, not the search engine, to check for broken links. :)
Why use, and download, a link checker who simulate a robot (and your never sure it is acting like the bot the search engine use) and after that use another software to read your log file to see if the bot have effectively hit your page when all the search engines will do it anyway but after putting you on a waiting list?
You prefer waiting from 2 days to 4 weeks to see the crawling bot was stuck somewhere and redo the process the next month?
In a perfect world a webmaster control everything but in my case i normally deal with (and try to educate) plenty of peoples!
I sometimes deal with a agency who make the marketing part and control the project, with the back-end guy who make the database and the directory structure of the site, with the studio who make the graphic template layout and design of the site, the client who sometimes badly use the content manager and the hosting provider who sometimes misconfigured the web server.
I point out this metacrawler only for that feature! If a search engine can help to save time I will take it.
But sometimes the link checker and the online spider simulator give you a perfect score and see all the links ok.
But one of my friends who have a 800 pages in Php, have just 21 pages in Google after 5 months! I test everything with online tools and also software link checker and everything seem perfect. But nothing change :-(
If Google or Alltheweb put a well programmed tools like this I will loved it!
The number of meta crawlers is going up;
The number of real search engines is going down.
An unfortunate inverse relationship, in my opinion....
But one of my friends who have a 800 pages in Php, have just 21 pages in Google after 5 months!
This may be due to too many variables in your links, not 404 errors. My site is in the same position.
index.html?section=foo&sub=blah is okay.
index.html?section=foo&sub=blah+another+variable is not.
Google takes the first url, but not the second. Fast, on the other hand, will take both. My entire site is in Fast while Google has about half.
Must be that Find Filtered feature that ensures "the final results that get displayed to you are the most relevant to your search term." Looks like they have a bit of work to do.