| Welcome to WebmasterWorld Guest from 22.214.171.124 |
register, login, search, subscribe, help, library, PubCon, announcements, recent posts, open posts,
|Subscribe to WebmasterWorld|
|Extracting Keywords from Word documents|
| 6:52 am on Aug 25, 2003 (gmt 0)|
There are plenty of tools available to keyword analysis of html pages.
Anyone know of a good way to analyse & extract keywords from Word documents?
| 6:14 am on Aug 26, 2003 (gmt 0)|
If you like to work online with a non Microsoft tool, you should first export the file to XML then proceed from there.
Otherwise, you can open the file using a simple windows program or a script on a windows platform using Microsoft Word COM object to extract text from the file and read it then parse it.
| 6:35 am on Aug 26, 2003 (gmt 0)|
If you are simply after Single Word Density then you could copy the text into NoteTab and then use the Statistics function. Other than that I am not much help I am afraid.
| 6:33 pm on Aug 26, 2003 (gmt 0)|
A crude but effective approach would be to convert Word to HTML and then use one of the HTML tools available.
All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld ® and PubCon ® are a Registered Trademarks of Pubcon Inc.
© Pubcon Inc. 1996-2012 all rights reserved