Forum Moderators: bakedjake
[prnewswire.com...]
announced today its recent database expansion from 650 million webpages to
over 1 billion. The new index also includes a comprehensive refresh of
webpages from the previous index.
If they can grow without losing their focus then they will be serious contenders in the search engine market soon.
Query for one of my important keywords and Gigablast's results:
#1 eBay Portal
#2 Meta-Search-Engine, showing eBay Portals as top reults
#3 Meta-Search-Engine, showing eBay Portals as top reults
#4 Meta-Search-Engine, showing eBay Portals as top reults
#5 AdWords, Amazon and eBay offers
#6 75% on-topic site
#7 Auction site (non-eBay)
#8 my site
Same query on Google:
#1 On-topic end-user site
#2 my site
#3 On-topic end-user site
#4 On-topic competitor site
#5 On-topic competitor site
#6 slightly off-topic end-user site
#7 On-topic commercial services site
#8 On-topic competitor site
The others seem to continue to interpret 301 as "oh yeah, this place moved, didn't it, let's get out the map, ah, here we go, we shoulda turned left a block back. no worries, let's go there now. Nah, nuthin to change, nuthin' to write down. I'll just remember it in my head and be ready the next time (yeah, right!)."
By the way, only yesterday I was using GigaBlast after failing to find what I was looking for in Google. I'm not taking a shot at Google here, but at times the "drill down" layout of GigaBlast is very helpful. (And I did find what I was looking for...)
Spent about five minutes looking around:
"widget tool box"
1. Spammy site
2. Spammy site same as #1 but different landing page.
3. #1 selling "widget tool box" site (ranked #1 by Google and Yahoo).
4. Spammy site
5. Spammy site
"country name"
1. Government Tourist Board (mainly tourism fluff and brochures)
2. Independent Country Portal (ranked #1 by Google and Yahoo).
3. Spammy site
4. Weird Wiki-type guerilla Indymedia site.
5. Spammy site.
The spammy sites mostly have inside pages named widget-tool-box.html
Finally, I hope this Matt dude leaves some room for exploration and testing from the webmsaters side. Basically, hopefully he doesnt drop you at the first sign of irregularity, but gives you the chance to re-try.
How Wells does it with a handfull of PCs is just beyond me. -Larry
From the press release, it looks like Matt has outgrown his backroom cluster:
Gigablast will spider their websites in real-time at the rate of one page every five seconds. With multiple dedicated clusters, Gigablast can handle large amounts of DSS queries and webpages.
I bet the day after the G IPO, Matt had a dozen venture capitalists knocking on his door... ;)
<added>
Which is great for Matt, I've always enjoyed seeing how Gigablast progressed through the years. Glad to see his success. :D
</added>
I like the stripped down option, you can quickly visualize how the SE sees the site, with the keywords highlithed.
Few seem to know that this is possible in Google too, if Google has a chached snapshot of the page. In the heading frame it says:
This cached page may reference images which are no longer available. Click here for the cached text only.
And if you click on the link, it gives you exactly the representation how Google sees it. Try it with a page which has a form on it. Very interesting...
nevertheless, MSN is suffering the same problem with other languages and a bit of "snobbism" for the american development team can't be so wrong, the search engine war first will be decided in english.
(*fingers crossed*)
my 2 pennies,
P!
Eventually he sorted the instability problems, and before you know it (well a few years later) he has a billion spiderd.
Query for one of my important keywords and Gigablast's results:#1 eBay Portal
#2 Meta-Search-Engine, showing eBay Portals as top reults
#3 Meta-Search-Engine, showing eBay Portals as top reults
#4 Meta-Search-Engine, showing eBay Portals as top reults
#5 AdWords, Amazon and eBay offers
#6 75% on-topic site
#7 Auction site (non-eBay)
#8 my site
Good news! Results just got better:
#1 eBay/AdSense "portal" (probably the one from Jan07 - I'm not sure)
#2 Slightly related news site
#3 Related site, offering a service related to the search term
#4 On-topic enduser site
#5 my site
#6 my site
#7 spammy SEO site
#8 On-topic enduser PDF file
Seems Gigablast got rid of all so called "Meta Search Engines". The right way to go (imho). I'd LOVE to see SERPS without ANY eBay and without ANY "metasearch" results!