Forum Library, Charter, Moderators: bakedjake

Alternative Search Engines Forum

Advanced Keyword Search at the Wayback Machine

 2:10 pm on Sep 4, 2003 (gmt 0)

What a cool tool:

It's an experimental app that searches the 11 billion documents stored at archive.org by keyword. It comes with date limiters, so you can search for pages on any subject within a specific timeframe. The ranking is content-based.

Additional features:
- graph displaying the number of pages over time
- related topics to further refine searches
- graph showing the popularity of the main related topics over time



 3:19 pm on Sep 4, 2003 (gmt 0)

That is an excellent tool. Thanks for that.

It doesn't seem to graph things if the URL pool is too small.


 8:12 pm on Sep 4, 2003 (gmt 0)

Wow! That's a new, real search engine, I guess! Even theme clustering, categories, topics ... I'm very impressed -> bookmarked! This is great news, heini - thanks for spotting it!

Oops, why does a search return my site although it's not listed at archive.org (since ia_archiver is disallowed from indexing it)? Are they using Alexa data? Hmm ...
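
For anyone wondering what "disallowed" means here, this is a minimal sketch of how a robots.txt rule for ia_archiver is evaluated, using Python's standard urllib.robotparser. The robots.txt body and the URL www.myurl.com are made-up placeholders, not the actual site's rules, and this says nothing about where the search tool really gets its data.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block only the ia_archiver crawler,
# leave every other crawler unrestricted.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow:
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if user_agent may fetch url under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("ia_archiver", "Googlebot"):
        print(agent, is_allowed(ROBOTS_TXT, agent, "http://www.myurl.com/"))
```

A rule like this only tells that one crawler to stay away; it does not stop another crawler from fetching the same pages.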


 8:24 pm on Sep 4, 2003 (gmt 0)

At last - I think they must have been planning this for so long - it could be an absolutely brilliant tool and so useful.

also with 11 billion pages - kinda puts Google and FAST in their place :)


 12:10 pm on Sep 7, 2003 (gmt 0)

>11 billion pages

I suppose this number includes every dated snapshot of a page, doesn't it? If so, I wonder how many "real" unique pages they indexed. The 11 billion pages could get cut down to just a few 100k unique pages. I can't find a figure for unique pages on either archive.org or recall.archive.org.


 9:31 am on Sep 8, 2003 (gmt 0)

Great Tool! Well done to the Wayback machine :)


 9:46 am on Sep 8, 2003 (gmt 0)

This is amazing stuff, thanks Heini.

I can imagine it can come in handy with some copyright - who was first - stuff as well.

I can't seem to get their "before" date limiter working.
It always seems to show results up to April 2003?

Did not know there was a wayback forum either: [archive.org...]

Seems Wayback even has 30 billion pages - wonder why Anna Patterson limited herself to only 11 billion :)


 7:14 am on Sep 13, 2003 (gmt 0)

I found my website all right, but it displays the URL as www.myurl.com:80

What is colon 80?


 7:25 am on Sep 13, 2003 (gmt 0)

>what is colon 80?

The port number.
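
Port 80 is the default for HTTP, so www.myurl.com:80 and www.myurl.com point to the same place. As a rough sketch (the helper name strip_default_port is made up for illustration, and it uses only Python's standard urllib.parse), this is one way to drop a default port when displaying a URL:

```python
from urllib.parse import urlsplit

# Default ports for the two common web schemes.
DEFAULT_PORTS = {"http": 80, "https": 443}


def strip_default_port(url):
    """Return url without an explicit port when that port is the scheme default.

    Simplification: any user:password@ part of the URL is discarded,
    and IPv6 literal hosts are not handled.
    """
    parts = urlsplit(url)
    host = parts.hostname or ""
    port = parts.port
    if port is None or DEFAULT_PORTS.get(parts.scheme) == port:
        netloc = host
    else:
        netloc = f"{host}:{port}"
    return parts._replace(netloc=netloc).geturl()


if __name__ == "__main__":
    print(strip_default_port("http://www.myurl.com:80/"))
    print(strip_default_port("http://www.myurl.com:8080/"))
```

A non-default port such as 8080 is kept, since dropping it would change which server the URL refers to.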

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved