Forum Moderators: open

Message Too Old, No Replies

Converting content to HTML.

Looking for a tool.

         

mep00

5:41 am on Jan 15, 2004 (gmt 0)

10+ Year Member



I'm looking for a tool which will either convert MSWord files into clean, symantic HTML (Have you ever looked at the garbage Word puts out?) or to allow someone who has at least a clue on how to use a word processor to write HTML formated content. In either case, the content will be stored in a db and thrown into a template upon display. In truth, it doesn't even need to be true HTML, as long as I can easily convert it on the fly (via php).

I've thought about writing a javascript app (or maybe vbscript) or a stand alone app to do it, but I don't have the time right now.

Sinner_G

9:41 am on Jan 15, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Dreamweaver has a tool to clean up Word HTML. So basically what you do is open the Word document, save as HTML, open that in Dreamweaver and clean it up.

That's the theory, I've never used it, so am not sure how effective it is.

piskie

9:54 am on Jan 15, 2004 (gmt 0)

10+ Year Member



DW cleans up Word HTML better than other Apps that I have seen.
If this is a one off exercise, you can download a trial version (30 days I think) and do the job before it expires.

NeedScripts

10:43 am on Jan 15, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



In FP 2003 you can just copy paste the stuff and it should take care of it, Also, if I am not wrong, FP2003 has another inbuild function to handle pages created in word.

NS

Mohamed_E

12:46 pm on Jan 15, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



HTML-Tidy has a function that will substantially improve the output of Word, though I now usually save as text and add the html myself.

robert adams

3:38 pm on Jan 15, 2004 (gmt 0)

10+ Year Member



Your best bet is to no try to convert word to html. Just write the html.

Why make the extra step?

robert

krieves

3:47 pm on Jan 15, 2004 (gmt 0)

10+ Year Member



Try cutting the text from Word then paste it into Notepad. Then copy from Notepad and paste into FP.

robert adams

5:29 pm on Jan 15, 2004 (gmt 0)

10+ Year Member



Try cutting the text from Word then paste it into Notepad. Then copy from Notepad and paste into FP.

that's almost as bad as converting from word :)

if you are going to put it in notepad just add html on and off tags and save as .html and poof, it is a webpage.

luck,
robert

bcolflesh

5:36 pm on Jan 15, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



MS Office HTML filter:

microsoft.com/downloads/details.aspx?FamilyID=209ADBEE-3FBD-482C-83B0-96FB79B74DED&displaylang=EN

office.microsoft.com/assistance/preview.aspx?AssetID=HA010548651033&CTT=6&Origin=EC010553071033

isitreal

8:17 pm on Jan 15, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Openoffice.org puts out reasonably clean HTML, you can load your word doc into that then save it as html, still needs some touchup but it's way better than MS Word's version.

mep00

10:42 pm on Jan 15, 2004 (gmt 0)

10+ Year Member



bcolflesh-- Thanks, this looks like it might be an answer. Even if it will still need a little tweeking, it's make my life easier. Two things I'm still wondering about it: how does it handle headings and does it also work with Ofice XP?

Then copy from Notepad and paste into FP.
Out of the frying pan and into the fire. What will FP offer me? I understand why I might copy it into TopStyle Pro, which I often use, but why FP?

Openoffice.org puts out reasonably clean HTML.
I'll need to check this out, too. Thanks.

Just write the html.
Of course I can do this, but it's something I was trying to avoid; coding a page is one thing, but just adding markup to content generated by others is one of the few time I would prefer a WYSIWYGish solution. Also, I want others to be able to generate HTML, leaving me only to proofread.

bill

3:47 am on Jan 16, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Then copy from Notepad and paste into FP.

Out of the frying pan and into the fire. What will FP offer me? I understand why I might copy it into TopStyle Pro, which I often use, but why FP?
Your original question was: I'm looking for a tool which will either convert MSWord files into clean, symantic HTML.... FP will do just that if you follow the advice given. You'll get 100% validating HTML. Nothing more, nothing less.

TopStyle has an HTML editor, but you'll still have to write the code, it will not convert anything from Word. If you wanted to hand code the page you won't need any tools other than NotePad or TopStyle.

g1smd

9:54 pm on Jan 19, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Right at the beginning of this thread you asked about semantic markup.

Basic HTML to be used with CSS only needs the block-level elements of headings, paragraphs, lists, tables and forms.

I'll bet that any tool that converts documents to HTML doesn't put <hx> ... </hx> tags around things that realy ought to be headings.

Most of the tools produce massive code bloat with either many font, bold, etc tags, or replace all of those with inline styles which are also code bloat.

g1smd

10:00 pm on Jan 19, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



See also: [webmasterworld.com...]