converting .doc to html
badams1 msg:377215 9:26 pm on Jun 1, 2005 (gmt 0)
I'm not sure where I should be posting this, so please move it if it isn't in the right place.
OK, about every week I get emails asking me to convert an MS Word (sometimes Excel) document into HTML. I know I can do a File > Save as Web Page, but I don't like all the extra code MS puts in there.
Is there an easier way to get the info out besides having to retype all of it?
netchicken1 msg:377216 9:35 pm on Jun 1, 2005 (gmt 0)
Either save it as a txt file and paste it into your HTML page, or...
If you use FrontPage, copy and paste the text straight from the doc onto the page, then click on the little icon at the bottom right of the pasted text and format it as Text only.
badams1 msg:377217 9:40 pm on Jun 1, 2005 (gmt 0)
Hmm, I guess I should have made myself clearer. It is typed in MS Word, but they have used tables for the names and phone numbers.
I try not to use FP as I can't stand it. I only use FP to upload, as I can't use FTP for some unknown reason.
Easy_Coder msg:377218 11:00 pm on Jun 1, 2005 (gmt 0)
I feel your pain... I had this problem a few years back and had to write myself a utility to parse that junk out and then create a new file on the fly including the template.
I don't even know if I have the code any longer; it was VB6. I can look for it if you want.
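For anyone without that VB6 code, here is a minimal sketch of the same idea in Python: parse the Word-generated HTML, throw away the Office-specific markup, and drop what is left into a page template. This is not Easy_Coder's utility. The names WordHtmlCleaner, clean_word_html, render_page and TEMPLATE are made up for illustration, and the list of tags worth keeping is an assumption you would adjust for your own documents. It keeps table structure, so names and phone numbers laid out in a Word table survive.

```python
from html import escape
from html.parser import HTMLParser

# Assumption: these are the only tags worth keeping. Any other tag is
# dropped, but the text inside it is kept.
KEEP_TAGS = {"p", "br", "table", "tr", "td", "th", "ul", "ol", "li",
             "h1", "h2", "h3", "b", "i", "strong", "em", "a"}
# Elements whose whole content is discarded (Word's CSS, metadata, etc.).
SKIP_CONTENT = {"head", "style", "script", "xml"}
# Tags that never get a closing tag in the output.
VOID_TAGS = {"br"}
# Attributes that survive; style, class, lang and width are all dropped.
KEEP_ATTRS = {"href", "colspan", "rowspan"}


class WordHtmlCleaner(HTMLParser):
    """Hypothetical helper: collects a cleaned copy of the input in self.out."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.skip_depth = 0  # > 0 while inside a SKIP_CONTENT element

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_CONTENT:
            self.skip_depth += 1
            return
        if self.skip_depth or tag not in KEEP_TAGS:
            return
        kept = "".join(
            f' {name}="{escape(value or "")}"'
            for name, value in attrs
            if name in KEEP_ATTRS
        )
        self.out.append(f"<{tag}{kept}>")

    def handle_endtag(self, tag):
        if tag in SKIP_CONTENT:
            self.skip_depth = max(0, self.skip_depth - 1)
            return
        if self.skip_depth or tag not in KEEP_TAGS or tag in VOID_TAGS:
            return
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            self.out.append(escape(data, quote=False))

    # Comments are ignored by the base class, so they never reach self.out.


def clean_word_html(raw_html):
    """Return raw_html with Word-specific tags and attributes removed."""
    parser = WordHtmlCleaner()
    parser.feed(raw_html)
    parser.close()
    return "".join(parser.out)


# Hypothetical site template; "{content}" marks where the cleaned HTML goes.
TEMPLATE = "<html><body>\n{content}\n</body></html>"


def render_page(raw_html, template=TEMPLATE):
    """Clean the Word HTML and place it inside the template."""
    return template.replace("{content}", clean_word_html(raw_html))
```

Save the document from Word as a web page, read that file into a string, and pass it to render_page. An unclosed element from SKIP_CONTENT would swallow everything after it, so check the output against the original before publishing.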
physics msg:377219 12:48 am on Jun 2, 2005 (gmt 0)
There are several commercial packages out there which (I imagine) might do a better job with the code (hopefully ;). Try searching for "convert .doc to .html" in your favorite search engine.
bill msg:377220 5:00 am on Jun 2, 2005 (gmt 0)
FrontPage 2003 has a very simple way to accomplish your goal. Just go to the Tools menu and select Optimize HTML... In that menu you can have FP remove all sorts of unnecessary code generated by Office programs, including Word HTML.
Another option is to use the free HTML TIDY [w3.org] program, which will also clean out Word's extraneous code.
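If the conversions come in weekly, the Tidy step could be scripted. A rough sketch in Python, assuming a tidy executable is on the PATH and that the installed version accepts the word-2000, bare and clean options (check your version's documentation; option support has changed over releases). The names build_tidy_command and run_tidy are made up for illustration, as are any file names you pass in.

```python
import subprocess


def build_tidy_command(src, dest):
    """Build the argument list for one Tidy run over a Word-saved HTML file."""
    return [
        "tidy",
        "-quiet",               # suppress the summary banner
        "--word-2000", "yes",   # strip Word 2000 specific markup
        "--bare", "yes",        # replace smart quotes, non-breaking spaces, etc.
        "--clean", "yes",       # replace presentational tags with CSS rules
        "-output", dest,        # write the cleaned HTML here
        src,
    ]


def run_tidy(src, dest):
    """Run Tidy and return its exit status; raise if Tidy reported errors."""
    result = subprocess.run(
        build_tidy_command(src, dest),
        capture_output=True,
        text=True,
    )
    # Tidy exits with 0 for a clean run, 1 for warnings and 2 for errors.
    if result.returncode >= 2:
        raise RuntimeError(f"tidy failed on {src}:\n{result.stderr}")
    return result.returncode
```

A return value of 1 only means Tidy printed warnings, which is normal for Word output, so the sketch treats only status 2 as a failure.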