Forum Moderators: open
thus giving the bigger number.
I just tried poking around their site, to see if they said how many they had fully indexed of these 2.5 billion (wow that's a lot), but there's no mention I could find.
Anyone?
Also, AllTheWeb currently shows a total size of 2,112,188,990 pages. This is up a bit from its June 17th announcement of 2,095,568,809. The annoucement focused on how ATW had "dethroned" Google as the "world's largest search engine. Today, Google goes back to number one with the 2,469,940,685 number. Will AllTheWeb up its number soon? Will another engine join (Inktomi) in on the fun?
[google.com...]
See #4 there?
The Google index contains two types of pages--fully indexed and partially indexed pages. Your page is currently partially indexed, which means that although we know about your site, our robots have not read all the content on your page(s) in past crawls.
Took me a minute to find that! There was a better description I read before, but basically, they include in their counting pages they only have found links to, but have never indexed before.
This probably makes it a bit easier to 'build' a bigger index, because they haven't actually gone and parsed a few hundred million of those pages :)
I bet that will go for all the robots "noindex" and whatever else the bot finds.
Actually, remember a few posts a while back about G not obeying robots.txt?? hmmm- if that was true then you have to wonder if their bigger index really has any substance. Anyway- it was exactly 2.1 billion something or other for months!
Might work for English language pages, but what about the rest.
That is going to account for the missing millions of pages.
The bottom line is:
How useful the search results are - Google kills ATW in MHO.
How many people use the search engine - Again Google is king!
whats_up_skip,
the point I was trying to make is that there are already 300 million more (> ) pages than Google claims to index for a search on "the" only. (not the other way around hence "missing").
BTW, Alltheweb search for "the" 1,046,066,940
And yes the Spanish, German, Chinese, etc equivalents for high occuring words such as "the" are not even included, although the numbers Google shows for these words are very limited.
"y" in Spanish only sites for Google: 6,090,000.
"y" in Spanish only sites for Alltheweb: 29,092,521
It has been said that if you kept a certain number of monkeys in a room for an infinite amount of time and gave them all a typewriter, eventually they would reproduce the entire works of Shakespeare.
Now, thanks to the Internet, we know that's not true...
Now, thanks to the Internet, we know that's not true...
I just want you to know that I am stealing that line. It is the funniest thing i have read in a long long time.
Todd the Plagiarist