Forum Moderators: open
What is the quickest way to transform Word documents into HTML thus creating an uploadable page? I could just code everything by hand but if a quicker way exists out there, please let me know.
Dreamweaver has a 'clean up word' command - isn't that good of them ;)
Apart from that and maybe PDF'ing the docs there is nothing more you can do really.
Copy and pasting large chunks of text reduces code bloat for the Save As HTML feature but for larger documents it is a bit time consuming.
And limbo came up with a good suggestion on the clean-up tool, this should help reduce the bloated code!
-gs
As in,
<html>
<body>
</body>
</html>
.
If so, save as "plain text"(.txt). That will remove all fluff and cruft that Word throws in there.
Actually, for html editing, I would then suggest you use another application, as Word isn't really ideal. There are many good free text editors out there. HapEdit I use for html, and topStyle lite for css.
I can't use another editor because the info is forwarded to me from a source that uses Word. I'd feel pushy if I insisted they use something else. :o)
Thank you for your welcome, Mohamed_E. webmasterworld comes highly recommended from a lot of my fellow webmistresses and they certainly were right about this forum!
I really can't understand what's going so let me say this, and see what you got:
Definition:
.txt means the file name *ends* in .txt
.doc means the file name ends in .doc
in otherwords, im talking about a file extension.
if you can't see the file extension, in windows, open up a folder. Go to Tools -> Folder Options -> View -> Unclick " Hide File Extensions for Known File Types"
.txt file = any editor will open it. And, its pure text. No formatting information. This is what's needed for a html page.
.doc file = MS Word only doc. It has extra stuff in it that will break a web page. In fact, if you actually look at whats in a .doc file, there is no way it would ever work on the web.
So, if you are receiving a .doc file, you will have to open MS Word, and do the Save As .txt.
If you are receiving a .txt file, and you simply prefer to edit the html in MS Word, then you shouldn't have to do anything. When you save, MS WOrd will prompt you that all formatting will be removed (which is good).
Finally, before you upload the file to your webserver, you should change the file extension of your web page to .html, .php, .html, whatever.
If you are *working* with .html files, I would seriously suggest not using Ms Word. WHen you open a .html file in Internet Explorer and then SAVE it, it then saves the file in the "Ms Word-ugly web page format". This is because when MS Word opens up a .html file, is sees the .html on the end of the file and decides it will act as a HTML editor. So the solution is to keep your html files as ending in .txt, if you REALLY want to use MS word, or play russian roulette with your file, and always remember to do a "save as" plain text. If you forget even once, and do a regular save, and close word, your file will be ruined for good.
Do a search on this site, you will find links to good html text editors, that won't cause you these problems.