...Ok...fine...
Is there a way to force Word to produce stripped-down HTML (h1, h2, p, etc.)? I went to their site and had them download an extension that supposedly cleans it up. It did make the output 'cleaner', but it is still messy.
Times change quickly in cyberspace, and it would not surprise me if full releases of such programs were available today.
If it were my job assignment and important to determine an answer to this question, I would print out a list of the Fortune 100, get on the phone, call each company, ask to speak to the webmaster and ask them the question.
My thoughts on your chances of success with the phone are: (a) who calls the webmaster for XXX big.corp and asks them a question they can relate to? It is a low-percentage call with a high-percentage chance of a response; (b) what are the chances they have solved this problem with dogged creativity for their intranet? [High, I think]; and finally (c) if you ask, they will spill the beans, because you will be the first one to have expressed any interest at all in the details.
Interest in your experience with this question will be high. Best of luck and good wishes.
-pshea
I downloaded the trial version and I think, with a bit of work on both the client's side and ours, it will strip out the 'garbage'.
If the client can discipline themselves to use H1, H2 tags in their file, WordCleaner takes out the font and span tags.
It's a pretty good tool from what I can see. There is also a level of customization that you can create as well.
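For anyone curious what that font and span cleanup amounts to, here is a rough sketch in Python. It is not how WordCleaner works internally (I have no idea what it does under the hood); the function name strip_font_and_span is made up for illustration, and the regex assumes no attribute value contains a ">" character.

```python
import re

def strip_font_and_span(html: str) -> str:
    """Remove <font> and <span> tags (opening and closing), keeping their inner text.

    Hypothetical helper for illustration only. Headings and paragraphs are
    left untouched because only the two tag names below are matched.
    """
    return re.sub(r"</?(?:font|span)\b[^>]*>", "", html, flags=re.IGNORECASE)


if __name__ == "__main__":
    sample = '<h1><span style="mso-x:1"><font face="Arial">Title</font></span></h1>'
    print(strip_font_and_span(sample))
```

The heading tags survive only if they are in the file to begin with, which is why the client still has to apply real heading styles in Word.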
All that said, it's still a shame that Word does not give you an option to output simple code. What a waste of time and effort.
Thank you both for your help!
Cheers,
ken
All that said, it's still a shame that Word does not give you an option to output simple code.
Word is not an HTML editor; it is a word processing program. The only way, and I do mean the only way, to strip out anything from Word is using Notepad or another text editor. Even then, a visual inspection is still required to verify that all MS code has been removed. In most instances, you will end up manually removing the ghosts, usually mso stuff.
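If you want a head start before the manual pass, something like the following Python sketch can knock out the most common mso leftovers and then list whatever still looks suspicious. Both function names are invented for this example, the patterns are assumptions about what typical Word output contains, and it does not replace the visual inspection.

```python
import re

def remove_mso_leftovers(html: str) -> str:
    """Strip common Word-specific markup. Hypothetical helper, not exhaustive."""
    # Conditional comments such as <!--[if gte mso 9]> ... <![endif]-->
    html = re.sub(r"<!--\[if[^\]]*\]>.*?<!\[endif\]-->", "", html,
                  flags=re.DOTALL | re.IGNORECASE)
    # Office namespace tags such as <o:p> and </o:p> (their content is kept)
    html = re.sub(r"</?[ovw]:[a-z0-9]+\b[^>]*>", "", html, flags=re.IGNORECASE)
    # class attributes whose value starts with Mso, e.g. class="MsoNormal"
    html = re.sub(r"""\s+class\s*=\s*(?:"Mso[^"]*"|'Mso[^']*'|Mso[^\s>]*)""",
                  "", html, flags=re.IGNORECASE)
    # Quoted style attributes that mention an mso- property. Note that this
    # drops the whole attribute, including any non-mso declarations in it.
    html = re.sub(r"""\s+style\s*=\s*(?:"[^"]*mso-[^"]*"|'[^']*mso-[^']*')""",
                  "", html, flags=re.IGNORECASE)
    return html


def find_mso_ghosts(html: str) -> list[str]:
    """Return leftover tokens starting with 'mso' so they can be checked by hand."""
    return re.findall(r"\bmso[-\w]*", html, flags=re.IGNORECASE)


if __name__ == "__main__":
    sample = ('<p class="MsoNormal" style="mso-margin-top-alt:auto">Hi<o:p></o:p></p>'
              '<!--[if gte mso 9]><xml>x</xml><![endif]-->')
    cleaned = remove_mso_leftovers(sample)
    print(cleaned)
    print(find_mso_ghosts(cleaned))
```

Anything find_mso_ghosts reports is only a hint about where to look; you still open the file in a text editor and remove those by hand.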
P.S. I have a secret weapon for Word files... FrontPage! ;)