
Webmaster General Forum

converting .doc to html

 9:26 pm on Jun 1, 2005 (gmt 0)

I'm not sure where I should be posting this so please move it if it isn't in the right place.

OK, I get emails about every week asking me to convert an MS Word (sometimes Excel) document into HTML. I know I can do a File > Save as Web Page, but I don't like all the extra code MS puts in there.

Is there an easier way to get the info out besides having to retype all of it?




 9:35 pm on Jun 1, 2005 (gmt 0)

Either save it as a .txt file and paste it into your HTML page, or...

If you use FrontPage, copy and paste the text straight from the doc onto the page, then click the little icon at the bottom right of the pasted text and format it as Text only.
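For the save-as-.txt route, pasting raw text into a page still leaves you adding the paragraph tags by hand. Here is a minimal Python sketch of that step, assuming blank lines separate paragraphs in the saved text file. The function name `text_to_html_paragraphs` is made up for illustration, and it does nothing for tables.

```python
from html import escape


def text_to_html_paragraphs(text):
    """Wrap plain text in <p> tags; blank lines separate paragraphs."""
    # Normalise Windows line endings, then split on blank lines.
    blocks = [b.strip() for b in text.replace("\r\n", "\n").split("\n\n")]
    return "\n".join(
        # Escape &, < and > so the text is safe to paste into a page,
        # and keep single line breaks inside a paragraph as <br>.
        "<p>" + escape(b).replace("\n", "<br>\n") + "</p>"
        for b in blocks
        if b
    )


if __name__ == "__main__":
    print(text_to_html_paragraphs("Tom & Jerry\r\n\r\nLine one\nLine two"))
```

The output can be pasted straight into the body of an existing page.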


 9:40 pm on Jun 1, 2005 (gmt 0)

Hmm, I guess I should have made myself clearer. It is typed in MS Word, but they have used tables for the names and phone numbers.

I try not to use FP as I can't stand it. I only use FP to upload, as I can't use FTP for some unknown reason.


 11:00 pm on Jun 1, 2005 (gmt 0)

I feel your pain... I had this problem a few years back and had to write myself a utility to parse that junk out and then create a new file on the fly including the template.

I don't even know if I have the code any longer; it was VB6. I can look for it if you want.
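For anyone who wants to roll their own, here is a rough Python sketch of that kind of utility. It is not the VB6 code mentioned above, just one possible approach: parse the file that Word's Save as Web Page produces, keep a small whitelist of tags (tables included), and drop everything else along with the `class`, `style` and `width` attributes. The names `WordHtmlCleaner`, `clean_word_html`, `KEEP_TAGS` and the `TEMPLATE` string are assumptions for illustration; adjust the whitelist to taste.

```python
from html import escape
from html.parser import HTMLParser

# Tags kept in the output; any other tag is dropped but its text is kept.
KEEP_TAGS = {"p", "br", "table", "tr", "td", "th", "ul", "ol", "li",
             "b", "i", "strong", "em", "h1", "h2", "h3", "a"}
# Tags whose whole content is thrown away (Word's styles and XML islands).
SKIP_CONTENT = {"head", "style", "script", "xml", "title"}
VOID_TAGS = {"br"}
# The only attributes that survive, per tag.
KEEP_ATTRS = {"a": {"href"},
              "td": {"colspan", "rowspan"},
              "th": {"colspan", "rowspan"}}

# Hypothetical page template; replace with your own site layout.
TEMPLATE = "<html><body>\n{body}\n</body></html>"


class WordHtmlCleaner(HTMLParser):
    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.skip_depth = 0  # > 0 while inside a SKIP_CONTENT element

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_CONTENT:
            self.skip_depth += 1
            return
        if self.skip_depth or tag not in KEEP_TAGS:
            return
        allowed = KEEP_ATTRS.get(tag, set())
        kept = ""
        for name, value in attrs:
            if name in allowed:
                kept += ' %s="%s"' % (name, escape(value or ""))
        self.out.append("<%s%s>" % (tag, kept))

    def handle_endtag(self, tag):
        if tag in SKIP_CONTENT:
            self.skip_depth = max(0, self.skip_depth - 1)
            return
        if self.skip_depth or tag not in KEEP_TAGS or tag in VOID_TAGS:
            return
        self.out.append("</%s>" % tag)

    def handle_data(self, data):
        if not self.skip_depth:
            self.out.append(escape(data, quote=False))


def clean_word_html(source):
    """Return the whitelisted markup from Word-generated HTML."""
    cleaner = WordHtmlCleaner()
    cleaner.feed(source)
    cleaner.close()
    return "".join(cleaner.out)


if __name__ == "__main__":
    sample = ('<html><head><style>p.MsoNormal{margin:0}</style></head><body>'
              '<p class=MsoNormal><span lang=EN-US>Jane Doe<o:p></o:p></span></p>'
              '<table class=MsoTableGrid border=1><tr><td width=100>'
              '<p class=MsoNormal>555-0100</p></td></tr></table></body></html>')
    print(TEMPLATE.format(body=clean_word_html(sample)))
```

Because the table tags are on the whitelist, the names and phone numbers keep their table layout while the Word-specific markup goes away.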


 12:48 am on Jun 2, 2005 (gmt 0)

There are several commercial packages out there which (I imagine) might do a better job with the code (hopefully ;). Try searching for
convert .doc to .html
in your favorite search engine.


 5:00 am on Jun 2, 2005 (gmt 0)

FrontPage 2003 has a very simple way to accomplish your goal. Just go to the Tools menu and select Optimize HTML... In that menu you can have FP remove all sorts of unnecessary code generated by Office programs, including Word HTML.

Another option is to use the free HTML Tidy [w3.org] program, which will also clean out Word's extraneous code.
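If you go the Tidy route and want to run it on every document that arrives, a small wrapper script helps. The sketch below assumes the `tidy` executable is installed and on your PATH. The option names (`--word-2000`, `--clean`, `--bare`, `--drop-proprietary-attributes`) are from Tidy's configuration options as I remember them, so check `tidy -help-config` for your version. The helper names `build_tidy_command` and `run_tidy` are made up for illustration.

```python
import subprocess


def build_tidy_command(src, dst):
    """Build a Tidy command line aimed at Word-generated HTML."""
    return [
        "tidy", "-quiet",
        "--word-2000", "yes",                   # strip Word 2000 markup
        "--clean", "yes",                       # replace presentational tags
        "--bare", "yes",                        # drop MS-specific characters
        "--drop-proprietary-attributes", "yes",
        "-output", dst,
        src,
    ]


def run_tidy(src, dst):
    """Run Tidy on src and write the cleaned file to dst."""
    result = subprocess.run(build_tidy_command(src, dst), check=False)
    # Tidy exits with 0 when clean, 1 for warnings and 2 for errors,
    # so only an exit status of 2 or more is treated as a failure here.
    if result.returncode >= 2:
        raise RuntimeError("tidy reported errors for %s" % src)


if __name__ == "__main__":
    # Only prints the command; call run_tidy() once tidy is installed.
    print(" ".join(build_tidy_command("word-export.html", "clean.html")))
```

Warnings are expected with Word's output, which is why an exit status of 1 is not treated as a failure.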

