Welcome to WebmasterWorld Guest from 188.8.131.52 , register , free tools , login , search , pro membership , help , library , announcements , recent posts , open posts Become a Pro Member
Converting PDF into HTML SilverLining msg:4221572 12:13 pm on Oct 25, 2010 (gmt 0) Hi, What is the best way to get table-based data from a PDF converted into an HTML table. I used Acrobat to export as HTML, but this did not work well (too many unnecessary SPANs and the data is mixed-up). I saved the data as .csv and was hoping to use that in combination with Regex to get table fields wrapped around the data - maybe I gave up too soon.
milosevic msg:4231357 10:30 am on Nov 17, 2010 (gmt 0)
Hi SilverLining, I was tackling this same problem yesterday (specifically tables too) and gave up (I don't have Acrobat though). If you managed to get to a csv of the table data formatted correctly, it should indeed be possible to process this into a HTML table. The easiest way would be to copy the data from the csv and paste it into Word and then view the .doc in Google Docs "view as HTML" option and copy the source code.