Does anyone know of any good utility or use of a program to parse HTML? What I need to do is take a folder with hundreds of HTML docs (product pages from an online catalog of one of our suppliers) and parse the product info out of the HTML and into a CSV, tab delimited, or XML file so that we can import it into our database.
The HTML files are all the same formatting, only the content is different.
I am not too interested in command line style interfaces. Is there a GUI solution for this type of work?
Thanks in advance!