Welcome to WebmasterWorld Guest from 54.145.44.134

Forum Moderators: open

Message Too Old, No Replies

Extracting Keywords from Word documents

     

biggles

6:52 am on Aug 25, 2003 (gmt 0)

10+ Year Member



There are plenty of tools available to keyword analysis of html pages.

Anyone know of a good way to analyse & extract keywords from Word documents?

Thanks

bilalak

6:14 am on Aug 26, 2003 (gmt 0)

10+ Year Member



If you like to work online with a non Microsoft tool, you should first export the file to XML then proceed from there.

Otherwise, you can open the file using a simple windows program or a script on a windows platform using Microsoft Word COM object to extract text from the file and read it then parse it.

Luck!

Woz

6:35 am on Aug 26, 2003 (gmt 0)

WebmasterWorld Senior Member woz is a WebmasterWorld Top Contributor of All Time 10+ Year Member



If you are simply after Single Word Density then you could copy the text into NoteTab and then use the Statistics function. Other than that I am not much help I am afraid.

Onya
Woz

gmoney

6:33 pm on Aug 26, 2003 (gmt 0)

10+ Year Member



A crude but effective approach would be to convert Word to HTML and then use one of the HTML tools available.