Welcome to WebmasterWorld Guest from 174.129.151.95

Forum Moderators: phranque

Message Too Old, No Replies

converting .doc to html

   
9:26 pm on Jun 1, 2005 (gmt 0)

10+ Year Member



I'm not sure where I should be posting this so please move it if it isn't in the right place.

Ok I get emails about every week needing me to convert an MS Word (sometimes Excel) document into html. I know I can do a File > Save as Webpage, but I don't like all the extra code MS puts in there.

Is there an easier way to get the info out besides having to retype all of it?

Thanks
Bill

9:35 pm on Jun 1, 2005 (gmt 0)

10+ Year Member



Either save it as a txt file and paste it into your html page. or,....

If you use frontpage, copy and past the text straight from the doc onto the page, and then click on the little icon bottom right of the pasted text and format it as Text only.

9:40 pm on Jun 1, 2005 (gmt 0)

10+ Year Member



hmm, i guess i should have made myself clearer. it is typed in ms word, but they have used tables for the names and phone numbers.

I try not to use FP as I can't stand it. Only use FP to upload as I can't use ftp for some unknown reason.

11:00 pm on Jun 1, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I feel your pain... I had this problem a few years back and had to write myself a utility to parse that junk out and then create a new file on the fly including the template.

I don't even know if I have the code anylonger it was vb6. I can look for it if you want?

12:48 am on Jun 2, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



There are several commercial packages out there which (I imagine) might do a better job with the code (hopefully ;). Try searching for
convert .doc to .html
in your favorite browser.
5:00 am on Jun 2, 2005 (gmt 0)

WebmasterWorld Administrator bill is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month



FrontPage 2003 has a very simple way to accomplish your goal. Just go to the Tools menu and select Optimize HTML... In that menu you can have FP remove all sorts of unnecessary code generated by Office programs, including Word HTML.

Another option is to use the free HTML TIDY [w3.org] program, which also will clean out Word's extraneous code.