save rendered .aspx locally?

scraper

         

kensav

7:42 pm on Aug 16, 2006 (gmt 0)

10+ Year Member



I need to save press releases from a client's site and burn them to a disc for one of their customers.

The problem is that all the pages are .aspx and I need an HTML version of each page. Obviously, I can't just copy the files.

I could browse to the website and save each page as .html, but 7 years of press releases at 5-10 a month is too much to do by hand.

Is there any way to save all the pages with a scraper script, so each page is saved as rendered HTML with its images?

Not having any luck with 3rd party applications either.

Please send me a sticky note if you have something you were successful with.

webdoctor

4:44 am on Aug 17, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Have you tried wget? You can mirror with 'wget -m http://www.example.com/', although you might want to rename the mirrored files from .aspx to .html for easier browsing.
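For the renaming step, something like the following might do as a rough sketch. The function name `rename_aspx_to_html` is made up for illustration, and it assumes the mirrored files end in plain `.aspx` (names carrying a query string, such as `page.aspx?id=3`, would not match).

```shell
# Hypothetical helper (not part of wget): rename every .aspx file
# under a directory tree to .html. Defaults to the current directory.
rename_aspx_to_html() {
  dir="${1:-.}"
  # Walk the tree and strip the .aspx suffix from each regular file.
  find "$dir" -type f -name '*.aspx' | while IFS= read -r f; do
    mv -- "$f" "${f%.aspx}.html"
  done
}

# Example (directory name is a placeholder):
# rename_aspx_to_html www.example.com
```

Note that renaming alone does not touch the links inside the pages, so they would still point at the old .aspx names. wget's own -E and -k options are an alternative that handles the extension and the links during the download.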

kensav

1:46 pm on Aug 17, 2006 (gmt 0)

10+ Year Member



Won't wget pull the source code from the page? I need the page rendered as HTML, as you would see it in the browser.

Ocean10000

2:58 pm on Aug 17, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Wget will pull the HTML as rendered by ASP.NET, just like a browser or crawler does. There is what they refer to as a mirror mode, which should also download style sheets and images, if my memory serves me correctly.
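A minimal sketch of such a mirror run, in case it helps. The wrapper name `mirror_site`, the `WGET` override, and the example URL are assumptions for illustration; the flags themselves are standard wget options.

```shell
# Sketch: mirror part of a site as browsable static HTML.
# Set WGET=echo to print the arguments instead of downloading anything.
mirror_site() {
  # -m   mirror mode (recursive download with timestamping)
  # -p   also fetch page requisites such as images and style sheets
  # -k   convert links in the saved pages so they work locally
  # -E   save HTML pages with an .html suffix
  # -np  never ascend to the parent directory
  "${WGET:-wget}" -m -p -k -E -np "$1"
}

# Example (placeholder URL):
# mirror_site http://www.example.com/press/
```

Since the server runs the .aspx code before responding, what lands on disk is the same HTML a browser would receive, not the ASP.NET source.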

kensav

8:28 pm on Aug 17, 2006 (gmt 0)

10+ Year Member



Thanks, guys, for the help.

If anyone cares, the command I used was:

wget -r -np http:/