homepage Welcome to WebmasterWorld Guest from 54.196.159.11
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Search Engines / Alternative Search Engines
Forum Library, Charter, Moderators: bakedjake

Alternative Search Engines Forum

    
Advanced Keyword Search at the Wayback Machine
heini




msg:465114
 2:10 pm on Sep 4, 2003 (gmt 0)

What a cool tool:
[recall.archive.org...]

It's an experimental app searching through the 11 Bill docs stored at archive.org by keywords. It comes with date limiters, so you can search for pages on any subject in a specific timeframe. The ranking is content based.

Additionall features:
- graph displaying the number of pages over time
- related topics to further refine searches
- graph showing the main related topics popularity in time

 

jrobbio




msg:465115
 3:19 pm on Sep 4, 2003 (gmt 0)

That is an excellent tool. Thanks for that.

It doesn't seem to graph things if the url pool is too small.

Yidaki




msg:465116
 8:12 pm on Sep 4, 2003 (gmt 0)

Wow! That's a new reall search engine, I guess! Even theme clustering, Categories, Topics ... i'm very impressed -> bookmarked! This is great news, heini - thanks for spotting it!

<added>
Ohps, why does a search return my site allthough its not listed at archive.org (since ia_archiver is disallowed to index it)? Are they using alexa data? Hmm ...
</added>

papamaku




msg:465117
 8:24 pm on Sep 4, 2003 (gmt 0)

at last - i think they must have been planning this for so long - it could be an absolutely brilliant tool and so useful.

also with 11 billion pages - kinda puts Google and FAST in their place :)

Yidaki




msg:465118
 12:10 pm on Sep 7, 2003 (gmt 0)

>11 billion pages

I suppose this number includes all time snapshots from a page, or!? If so, i wonder how many "real" unique pages they indexed? The 11 billion pages could get cut down to just a few 100k unique pages. I can't find any number about the unique pages neither on archive.org nor on recall.archive.org.

sidyadav




msg:465119
 9:31 am on Sep 8, 2003 (gmt 0)

Great Tool! Well done to the Wayback machine :)

vitaplease




msg:465120
 9:46 am on Sep 8, 2003 (gmt 0)

This is amazing stuff, thanks Heini.

I can imagine it can come in handy with some copyright - who was first - stuff as well.

I do not seem to get their "before" data limiter working.
It seems to always show until April 2003?

Did not know there was a wayback forum either: [archive.org...]

Seems Wayback even has 30 billion pages - wonder why Anna Patterson limited herself to only 11 billion :)

vibgyor79




msg:465121
 7:14 am on Sep 13, 2003 (gmt 0)

I found my website allright but it displays the URL as www.myurl.com:80

what is colon 80?

percentages




msg:465122
 7:25 am on Sep 13, 2003 (gmt 0)

>what is colon 80?

The port number.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Search Engines / Alternative Search Engines
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved