Welcome to WebmasterWorld Guest from 54.226.67.166

Forum Moderators: bakedjake

Message Too Old, No Replies

Advanced Keyword Search at the Wayback Machine

     

heini

2:10 pm on Sep 4, 2003 (gmt 0)

WebmasterWorld Senior Member heini is a WebmasterWorld Top Contributor of All Time 10+ Year Member



What a cool tool:
[recall.archive.org...]

It's an experimental app searching through the 11 Bill docs stored at archive.org by keywords. It comes with date limiters, so you can search for pages on any subject in a specific timeframe. The ranking is content based.

Additionall features:
- graph displaying the number of pages over time
- related topics to further refine searches
- graph showing the main related topics popularity in time

jrobbio

3:19 pm on Sep 4, 2003 (gmt 0)

10+ Year Member



That is an excellent tool. Thanks for that.

It doesn't seem to graph things if the url pool is too small.

Yidaki

8:12 pm on Sep 4, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Wow! That's a new reall search engine, I guess! Even theme clustering, Categories, Topics ... i'm very impressed -> bookmarked! This is great news, heini - thanks for spotting it!

<added>
Ohps, why does a search return my site allthough its not listed at archive.org (since ia_archiver is disallowed to index it)? Are they using alexa data? Hmm ...
</added>

papamaku

8:24 pm on Sep 4, 2003 (gmt 0)

10+ Year Member



at last - i think they must have been planning this for so long - it could be an absolutely brilliant tool and so useful.

also with 11 billion pages - kinda puts Google and FAST in their place :)

Yidaki

12:10 pm on Sep 7, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



>11 billion pages

I suppose this number includes all time snapshots from a page, or!? If so, i wonder how many "real" unique pages they indexed? The 11 billion pages could get cut down to just a few 100k unique pages. I can't find any number about the unique pages neither on archive.org nor on recall.archive.org.

sidyadav

9:31 am on Sep 8, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Great Tool! Well done to the Wayback machine :)

vitaplease

9:46 am on Sep 8, 2003 (gmt 0)

WebmasterWorld Senior Member vitaplease is a WebmasterWorld Top Contributor of All Time 10+ Year Member



This is amazing stuff, thanks Heini.

I can imagine it can come in handy with some copyright - who was first - stuff as well.

I do not seem to get their "before" data limiter working.
It seems to always show until April 2003?

Did not know there was a wayback forum either: [archive.org...]

Seems Wayback even has 30 billion pages - wonder why Anna Patterson limited herself to only 11 billion :)

vibgyor79

7:14 am on Sep 13, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I found my website allright but it displays the URL as www.myurl.com:80

what is colon 80?

percentages

7:25 am on Sep 13, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



>what is colon 80?

The port number.

 

Featured Threads

Hot Threads This Week

Hot Threads This Month