Forum Moderators: open
Have fun :)
[reuters.com...]
orielly [oreillynet.com]
MOUNTAIN VIEW, Calif. (Reuters) - Google Inc. (GOOG.O: Quote, Profile, Research) on Thursday rolled out a preliminary version of its new desktop search tool, making the first move against its major competitors in the race to provide tools for finding information buried in computer hard drives.The Google Desktop offering takes direct aim at Microsoft Corp. (MSFT.O: Quote, Profile, Research) , which bought a desktop search business in July, as well as current and expected desktop product releases from other companies such as Apple Computer Inc. (AAPL.O: Quote, Profile, Research) , Ask Jeeves Inc. (ASKJ.O: Quote, Profile, Research) and Time Warner Inc.'s (TWX.N: Quote, Profile, Research) AOL.
[edited by: Brett_Tabke at 11:33 am (utc) on Oct. 15, 2004]
[edit reason] added links [/edit]
So far, I have a luke warm reception to it as it is IE integrated only (which is evil). It also doesn't allow you to specify which directories or actual file types to index. Just not enough info coming out on this thing yet.I wonder - will certain firewalls block it? I know some serurity software is going to block its operation for sure.
All-n-all, it's a cool idea. I've been using Lookout for a couple months now, and I think it is just awesome.
LookOut is OE integrated only (which is evil). Sure, it searches other things as well, but clearly Microsoft screwed up when you have to use a program within a program to finally get a decent search in Windows. LookOut + Outlook feels pretty bloated too.
In the end, I have a feeling that this is another 20% of some Googler's time. This is a geek toy with minimal penetration meant to keep the Google Groupies happy. It might become big or it might become stale like the Google Deskbar.
Lastly, I have to add that I am sort of pissed off that I have the Gmail Notifier, Google Deskbar, Google Desktop, and Google Toolbar (i don't use IE but it's there), Picasa, and Hello all installed and collecting various informations about me (which I don't mind, they obviously put it to good use). The problem is that, although they all have a central contact point, Google Headquarters, they don't talk to eachother. As a matter of fact, why aren't they optionally combined in one install application?
integrating these local systray applications should be the next step I assume. Then launching a GMessenger wouldn't require a separate application manager.I guess before that all these tools would need a clear "big picture" look, to clean up the trash from valuable
I guess you missed my thread, here [webmasterworld.com]? Google already has a messenger :)
This aint about money - it's about survival - they must compete. Yahoo and MS are gonna give theirs away for free.
You know how good Lookout is right? Now imagine when that is tagged as a little form window right next to your start button and chained to MSN search as the default off desktop engine.
The results in Lookout are better orgainized to me, but Googles is somewhat promising. Seeing the results from my machine in what looks like a Google window is just plain creepy. I don't like that part at all.
Where is the html stored for the search results? Can we change that?
"The application will give Google new opportunities for advertising in the future, however. Google's Mayer said that for now, the company will not tie text ads to personal information viewed on the desktop results pages, but it may explore such avenues eventually. Google will display texts ads adjacent to search results like it typically does. A residual effect of the service, she said, could be that people use Google search more often and therefore that will boost its ad pages."
It creeps me out too. Anyone notice your desktop results appear in a regular Google search by default? If people become accustomed to using the Toolbar for desktop searches, you have a new way for additional AdWords impressions.
1. It supposedly searches the temporary internet files. Does it search Mozilla/Firefox or Opera cache files, or only those accessed with IE?
2. What does the data store look like? Is it one big file or a bunch of little ones? Text or binary? Does it look in any way potentially accessible or readable by any other application (eg. an XML-like data repository which could be parsed)?
3. Any rough idea of the correlation between the size of the data store and the size of your "My Documents" folder (where, I assume, most of your documents are)? I see they suggest 500Mb of free space in the docs...
4. Any hackers out there captured any of the data transmitted to the mother ship? Is it readable or encoded in some way?
It also doesn't allow you to specify which directories or actual file types to index. Just not enough info coming out on this thing yet.
From a news letter we received from Search Day (Search Engine Watch)we received this explanation for file types:
You can use some of Google's search syntax to get around this. To narrow to file type, use the filetype command. For example, this:
cars filetype:email
would bring back only email matches with the word cars mentioned. A list of known file types we've tested to date and found work are:
Word: filetype:word or filetype:doc
Excel: filetype:excel or filetype:xls
PowerPoint: filetype:powerpoint or filetype:ppt
Text: filetype:text or filetype:txt
Email: filetype:email
Chat: filetype:chat
Web History/HTML Files: filetype:web or filetype:html
Images: filetype:jpg or filetype:gif
Acrobat: filetype:pdf
Windows Media: filetype:wma or filetype:wmv
MP3: filetype:mp3
For email, chat and web history, you can also narrow by clicking on the count numbers, as described above. As for images, Acrobat and other files, keep in mind that only text in the file names will be matched, not any meta data or actual text contained within the files.
I am guessing it won't since G relies too heavily on links. But it may tell you if your on page factors are better. Assuming everything else equal (links) this may tell you who would appear first. So in sense you can maximize your on page SEO.
Let me know if anyone does this.
Anyone imported like the top 10 SERPs for a keyword to see if it comes back with the same results locally?
There's no way it's going to use the same algo as on the public search - for the simple reason that the data set is so different. You aren't going to spam your own desktop search, so the algo is going to be much less complex and more trusting of the document and metadata information.
Having said that, how long before someone works out how to spam the data store via their visited pages in the temporary internet store. Search for your banking and budget documents, and get back results from your IE cache for mortgage offers...
In the Google Desktop Search folder, the following files are present:
dbc2em.cf1
dbc2emh.ht1
dbdam
dbdao
dbeam
dbeao
dbm
dbu2dm.cf1
dbu2dmh.ht1
dbvm.cf1
dbvmh.ht1
fii.cf1
fiih.ht1
hes.evt
rpm.cf1
rpmh.ht1
sites.txt
I was looking at the .exe files for the Desktop Search and noticed a c:\src\1.0_branch\googleclient\total_recall\build\release\obj\trs\GoogleDesktopIndex.pdb - this is a Windows programs so the datastore may be a proprietary "Program DataBase" build by MS Visual Studio like VC++.
As for using the desktop to game the algo. I doubt they will be the same algo since there is little link popularity through your desktop and Google knows our intentions are different when we want our PC indexed. I am not deliberatley going to try and spam myself so they can probably use simpler algorithms with fewer filters. I bet that what works for getting a high listing on your desktop search will not work for getting a high listing in Google proper.
That is hardly a fair comparison when LookOut is doing a full index hogging all processor time, while Google is gently chugging along.
The moral is for Google Desktop Search to provide a "rebuilt database in real-time" option.
You have to admit that LookOut requires .NET installation and a lot more time for the actual installation than Desktop Search.
So, to profit fully from Desktop Search, you have to be running IE as your primary browser (and use Microsoft Office for your documents and Outlook or Outlook Express for email).
sun818, what do you see if you try to open up one of those files in the data store with a text editor? Is it a text format or binary?
Finally, Google recommends allowing four applications to access the internet [desktop.google.com] without restrictions - although they don't explain why the desktop crawl program or the others should ever need to do so.