
Alternative Search Engines Forum

    
Advanced Keyword Search at the Wayback Machine
heini

WebmasterWorld Senior Member, WebmasterWorld Top Contributor of All Time, 10+ Year Member



 
Msg#: 1079 posted 2:10 pm on Sep 4, 2003 (gmt 0)

What a cool tool:
[recall.archive.org...]

It's an experimental app that searches by keyword through the 11 billion documents stored at archive.org. It comes with date limiters, so you can search for pages on any subject within a specific timeframe. The ranking is content based.

Additional features:
- graph displaying the number of pages over time
- related topics to further refine searches
- graph showing the popularity of the main related topics over time

 

jrobbio

10+ Year Member



 
Msg#: 1079 posted 3:19 pm on Sep 4, 2003 (gmt 0)

That is an excellent tool. Thanks for that.

It doesn't seem to graph anything if the URL pool is too small.

Yidaki

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 1079 posted 8:12 pm on Sep 4, 2003 (gmt 0)

Wow! That's a new real search engine, I guess! Even theme clustering, categories, topics ... I'm very impressed -> bookmarked! This is great news, heini - thanks for spotting it!

<added>
Oops, why does a search return my site although it's not listed at archive.org (since ia_archiver is disallowed from indexing it)? Are they using Alexa data? Hmm ...
</added>

papamaku

10+ Year Member



 
Msg#: 1079 posted 8:24 pm on Sep 4, 2003 (gmt 0)

At last - I think they must have been planning this for a long time - it could be an absolutely brilliant and very useful tool.

Also, with 11 billion pages - kind of puts Google and FAST in their place :)

Yidaki

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 1079 posted 12:10 pm on Sep 7, 2003 (gmt 0)

>11 billion pages

I suppose this number includes every dated snapshot of a page, doesn't it? If so, I wonder how many "real" unique pages they indexed. The 11 billion pages could shrink to just a few 100k unique pages. I can't find a figure for unique pages on either archive.org or recall.archive.org.
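
To make the snapshot-versus-unique-page distinction concrete, here is a minimal sketch. The snapshot list is made-up sample data and `count_unique_pages` is a hypothetical helper; nothing here reflects how archive.org actually stores or counts its documents.

```python
# Hypothetical sample data: (url, capture date) pairs, one per stored snapshot.
snapshots = [
    ("http://example.com/", "20010101"),
    ("http://example.com/", "20020101"),
    ("http://example.com/about", "20020101"),
    ("http://example.org/", "20030401"),
]


def count_unique_pages(snapshots):
    """Count distinct URLs, ignoring how many times each one was captured."""
    return len({url for url, _date in snapshots})


total_snapshots = len(snapshots)
unique_pages = count_unique_pages(snapshots)
print(total_snapshots, unique_pages)
```

If the 11 billion figure counts snapshots the way `total_snapshots` does here, the unique-page count could be far smaller, depending on how often each page was captured.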

sidyadav

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 1079 posted 9:31 am on Sep 8, 2003 (gmt 0)

Great tool! Well done to the Wayback Machine :)

vitaplease

WebmasterWorld Senior Member, WebmasterWorld Top Contributor of All Time, 10+ Year Member



 
Msg#: 1079 posted 9:46 am on Sep 8, 2003 (gmt 0)

This is amazing stuff, thanks Heini.

I can imagine it coming in handy for copyright questions about who published first, as well.

I can't seem to get their "before" date limiter working.
It always seems to show results up to April 2003?

Did not know there was a wayback forum either: [archive.org...]

Seems Wayback even has 30 billion pages - wonder why Anna Patterson limited herself to only 11 billion :)

vibgyor79

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 1079 posted 7:14 am on Sep 13, 2003 (gmt 0)

I found my website all right, but it displays the URL as www.myurl.com:80

What is colon 80?

percentages

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 1079 posted 7:25 am on Sep 13, 2003 (gmt 0)

>what is colon 80?

The port number.
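
Port 80 is the default port for HTTP, so www.myurl.com:80 and www.myurl.com point to the same place when the scheme is http. A minimal sketch of stripping a default port with Python's standard `urllib.parse`, assuming plain http/https URLs; `strip_default_port` and `DEFAULT_PORTS` are names made up for this example, not anything the Wayback Machine uses:

```python
from urllib.parse import urlsplit, urlunsplit

# Well-known default ports for the two common web schemes.
DEFAULT_PORTS = {"http": 80, "https": 443}


def strip_default_port(url: str) -> str:
    """Return the URL without its port when that port is the scheme's default.

    Simplification: user:password@ prefixes are dropped and IPv6 hosts
    are not handled.
    """
    parts = urlsplit(url)
    host = parts.hostname or ""
    port = parts.port
    if port is None or DEFAULT_PORTS.get(parts.scheme) == port:
        netloc = host
    else:
        netloc = f"{host}:{port}"
    return urlunsplit((parts.scheme, netloc, parts.path, parts.query, parts.fragment))


print(strip_default_port("http://www.myurl.com:80/"))
print(strip_default_port("http://www.myurl.com:8080/page"))
```

The first call should drop the `:80`, while the second keeps `:8080` because that is not the default port for http.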

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved