Forum Moderators: coopster


Text to spreadsheet

html to excel

         

scripthelp

9:44 pm on Jul 12, 2008 (gmt 0)

10+ Year Member



Hello,

I currently have a site that has about 150 html pages on it.

I am restructuring the site and am trying to find a quick way to transfer the information into a database or spreadsheet.

For example, on most pages it will just be the title, subtitle, and then the article text.

I want an automatic tool that will go through my site and put each piece of information in a different column (one for title, one for subtitle, one for article) and each page in its own row in Excel.

Is there such a tool?

Thank You,

Alex

PHP_Chimp

3:21 pm on Jul 13, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I don't know if there is a tool to do that for you. Have you tried looking on Google?

However, it shouldn't take too much work with regular expressions to search for <title> and the other tags you want, then put them into CSV format. You can then read the CSV into a spreadsheet.

preg_match [uk.php.net] will get you started.
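
As a rough sketch of that approach (not a finished tool): the function names `extract_fields` and `pages_to_csv` are made up for illustration, and the patterns assume the subtitle sits in an <h2> and the article in a <div id="article"> with no nested divs. Your markup will likely differ, so adjust the patterns. The empty escape argument to fputcsv assumes PHP 7.4 or later.

```php
<?php
// Hypothetical helper: pull title, subtitle and article text out of one page.
// The <h2> and <div id="article"> patterns are assumptions about the markup.
function extract_fields(string $html): array
{
    $patterns = [
        'title'    => '#<title>(.*?)</title>#is',
        'subtitle' => '#<h2[^>]*>(.*?)</h2>#is',
        'article'  => '#<div id="article">(.*?)</div>#is',
    ];
    $row = [];
    foreach ($patterns as $name => $pattern) {
        // preg_match returns 1 on a match; $m[1] holds the captured group.
        $found = preg_match($pattern, $html, $m) === 1;
        $text  = $found ? html_entity_decode(strip_tags($m[1])) : '';
        // Collapse runs of whitespace so each field fits on one line.
        $row[$name] = trim(preg_replace('/\s+/', ' ', $text));
    }
    return $row;
}

// Hypothetical helper: write one CSV row per page.
// fputcsv takes care of quoting commas and double quotes inside fields.
function pages_to_csv(array $files, string $csvPath): void
{
    $out = fopen($csvPath, 'w');
    fputcsv($out, ['file', 'title', 'subtitle', 'article'], ',', '"', '');
    foreach ($files as $file) {
        $fields = extract_fields(file_get_contents($file));
        $line   = array_merge([basename($file)], array_values($fields));
        fputcsv($out, $line, ',', '"', '');
    }
    fclose($out);
}
```

Calling something like pages_to_csv(glob('site/*.html'), 'pages.csv') should then give you a file Excel can open, with one row per page. Pages where a pattern does not match get an empty cell, which makes it easy to spot the ones that need a different pattern.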

brotherhood of LAN

2:36 pm on Jul 15, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



On top of using PHP and regex, wget can store all of your HTML pages with a simple single-line command, if you wish to scrape the pages from the front end.

scripthelp

3:53 pm on Jul 16, 2008 (gmt 0)

10+ Year Member



Thanks for the advice.

I will try it and report back with my results.