homepage Welcome to WebmasterWorld Guest from 54.145.183.190
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Yahoo / Deprecated - Altavista, Alltheweb.com
Forum Library, Charter, Moderator: open

Deprecated - Altavista, Alltheweb.com Forum

    
Fast Announcing Much Larger Version
They mean ALL the web!
Laisha

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 599 posted 7:28 pm on Aug 27, 2001 (gmt 0)

Fast has announced that it will launch a new, much larger version this fall.

"The number of listed pages will be tripled to 1.8 billion webpages. Fast is aware that experts believe the Web has more pages than this. However, Fast will throw out duplicates and "junk" (probably doorway pages and pages with little or no text)."

http://www.pandia.com/sw-2001/48-fast.html

 

qianxing

10+ Year Member



 
Msg#: 599 posted 3:46 am on Aug 28, 2001 (gmt 0)

I've seen their algo eliminate duplicates pretty well as of recent. Interestingly, though, their method of doing this seems to be by which page they index first...

The potential for this is interesting - imagine someone mirroring your website and then getting crawled first. Your site would be excluded from the index as duplicate material.

And that says nothing about their ability to actually give you the most relavent pages. If there are 2,000,000 pages out there relavent to buying a cell phone, but 1,990,000 of those are people talking about their pet cat or their trip to Jamaica that happen to mention "cell phone" and buying something, then I would much rather have only 10,000 pages show up as results for that search.

Unless Fast is able to work out that algo, I doubt they're going to improve their image much with regards to delivering the best content. It seems as if they'll just end up the huge bloated index they were a few months ago, just bigger.

-qianxing

awcabot

10+ Year Member



 
Msg#: 599 posted 1:51 pm on Sep 11, 2001 (gmt 0)

I've seen their algo eliminate duplicates pretty well as of recent. Interestingly, though, their method of doing this seems to be by which page they index first...
The potential for this is interesting - imagine someone mirroring your website and then getting crawled first. Your site would be excluded from the index as duplicate material.

I do not think this is cause for concern: if you are an online store, even your telephone number and address of the CGI scripts must be the same to be counted as a duplicate, so if anything you gain even more exposure :)

As of Fast growing even bigger, I wish that they (and all other search engines) incorporate a much more powerful search engine, similar to LEXIS/NEXIS, where you can do full and complicated Boolean searches, time or size limitations, proximity to other words and so on. The works.

perkoch

10+ Year Member



 
Msg#: 599 posted 7:40 am on Sep 12, 2001 (gmt 0)

Actually, one "rumour" has it that Fast IS preparing true Boolean searching in order to compete with sites like HotBot.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Yahoo / Deprecated - Altavista, Alltheweb.com
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved