HTML Stripper?

To convert an old FP site to Expression

   
4:09 pm on Aug 13, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



We have kind of a "side" site off our main one that is mainly detailed tech info on some products. It was done in FrontPage, but I want to redo it from scratch in the new Expression Web Designer.

Basically what I need to do is strip out ALL the existing HTML, Java, anything that is not pure text, so that I can cut and paste just the text into the new site.

Anyone know of a good stripper that really works? (have tried a couple of the freeware ones, with mixed results).

5:02 pm on Aug 13, 2006 (gmt 0)

5+ Year Member



Look at the page in a browser, select the text, copy, and paste into Notepad.

5:24 pm on Aug 13, 2006 (gmt 0)

pageoneresults (WebmasterWorld Senior Member, Top Contributor of All Time, 10+ Year Member)



If the original site was built using WYSIWYG functions that add <font> tags and the like, you can use FP's built-in Remove Formatting command. This will strip all formatting tags.

Open the page in FP Normal View. Ctrl + A to Select All, Ctrl + Shift + Z to Remove Formatting. You can also do this through your menu commands.

P.S. jessejump's option may be the best one for you based on your request to "strip everything". Although, with some sites, you might find it difficult to select everything. ;)

5:30 pm on Aug 13, 2006 (gmt 0)

pageoneresults (WebmasterWorld Senior Member, Top Contributor of All Time, 10+ Year Member)



After reading this again...

We have kind of a "side" site off our main one that is mainly detailed tech info on some products. It was done in FrontPage, but I want to redo it from scratch in the new Expression Web Designer.

You really won't need to do anything. They are basically the same program, with Expression Web Designer being the next generation of FrontPage.

11:42 pm on Aug 13, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



The reason I want to strip it all out is that the site has been worked on over a period of years by several people, some less qualified than others, and the various pages have a hodgepodge mix of java, flash, FP elements (most of which Expression no longer uses, BTW), and other trash. Also, the Doctypes etc. are a total mess, with at least 8 variations ;p

Expression does not really use a lot of the old FP methods. For example, you no longer need to have FP extensions installed, which is the main reason I want to get totally away from FP, and this lightly used site is a good guinea pig for the beta version.

The select-text-and-paste-into-Notepad method does not get rid of all of it, but it does get rid of 98%, so that might actually be the easiest way to go in the long run. The strippers I have tried all had bugs, like leaving <p> tags or not taking out the </p>.
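For anyone who would rather script this than rely on a freeware stripper, here is a minimal sketch using only Python's standard-library html.parser. It is not what anyone in this thread used; the names TextExtractor and strip_html are made up for illustration, and the list of block-level tags is an assumption you would adjust for your own pages. Because it walks the markup with a real parser rather than deleting tags by pattern, opening and closing <p> tags are both dropped, and the contents of script and style blocks are skipped.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text and ignores tags, comments and doctypes."""

    # Tags whose contents should never end up in the output.
    SKIP = {"script", "style"}
    # Tags treated as line breaks (an assumed, non-exhaustive list).
    BLOCK = {"p", "div", "br", "li", "tr", "table",
             "h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__(convert_charrefs=True)  # turns &amp; into & etc.
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in self.SKIP:
            if self.skip_depth:
                self.skip_depth -= 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_data(self, data):
        # Keep text only when we are not inside <script> or <style>.
        if not self.skip_depth:
            self.parts.append(data)


def strip_html(markup):
    """Return the text of `markup`, one non-empty line per block."""
    parser = TextExtractor()
    parser.feed(markup)
    parser.close()
    text = "".join(parser.parts)
    # Collapse runs of whitespace and drop blank lines.
    lines = [" ".join(line.split()) for line in text.splitlines()]
    return "\n".join(line for line in lines if line)


sample = ('<p>Hello <font color="red">world</font></p>'
          '<script>var x = 1;</script><p>Second &amp; last</p>')
print(strip_html(sample))
```

Run over each saved page in turn, this gives plain text you can paste into the new site. Badly broken markup may still leave stray fragments, so it is worth skimming the output.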

[edited by: Wlauzon at 11:43 pm (utc) on Aug. 13, 2006]

12:10 am on Aug 14, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



<<...and paste to notepad does not get rid of all of it>>

What do you have remaining after you use notepad?

12:59 am on Aug 14, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



A few pages had some odd HTML tags left over, but after looking closer, it appears that whoever made the page did not close some tags (i.e. "<http equiv" instead of "<http equiv>")...
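If you want to find those pages before pasting, a rough check for tags that were never closed can be sketched with a regular expression. This is only a heuristic under stated assumptions: the function name find_unclosed_tags is hypothetical, and it simply flags any "<" that starts a tag and runs into the next "<" (or the end of the file) without meeting a ">".

```python
import re

# A "<" followed by a letter, "!" or "/", then anything that is not
# "<" or ">", ending at the next "<" or at the end of the input.
UNCLOSED_TAG = re.compile(r"<[A-Za-z!/][^<>]*(?=<|$)")


def find_unclosed_tags(markup):
    """Return tag fragments that are missing their closing '>'."""
    return [m.group(0).strip() for m in UNCLOSED_TAG.finditer(markup)]


page = '<meta http-equiv="x"\n<p>Hi</p>'
print(find_unclosed_tags(page))
```

Anything it reports is a spot where a browser may show leftover markup as visible text, which is what then ends up in the Notepad copy.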