
WYSIWYG and Text Code Editors Forum

Converting PDF into HTML


Msg#: 4221570 posted 12:13 pm on Oct 25, 2010 (gmt 0)


What is the best way to get table-based data from a PDF converted into an HTML table? I used Acrobat to export as HTML, but this did not work well (too many unnecessary SPANs, and the data is mixed up).

I saved the data as .csv and was hoping to use that in combination with regex to wrap table tags around the data. Maybe I gave up too soon.



Msg#: 4221570 posted 10:30 am on Nov 17, 2010 (gmt 0)

Hi SilverLining, I was tackling this same problem yesterday (specifically tables too) and gave up (I don't have Acrobat though).

If you managed to get a CSV of the table data formatted correctly, it should indeed be possible to process this into an HTML table.

The easiest way would be to copy the data from the CSV, paste it into Word, then open the .doc with the Google Docs "view as HTML" option and copy the source code.
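If you would rather script the CSV route than go through Word, here is a minimal sketch in Python. It is only one possible approach, not something from the Acrobat export: the function name csv_to_html_table and the sample data are made up for illustration, and it assumes the first CSV row holds the column headings. It uses a real CSV parser instead of regex, so quoted fields that contain commas stay in one cell.

```python
import csv
import html
import io


def csv_to_html_table(csv_text, header=True):
    """Turn CSV text into a plain HTML table (hypothetical helper).

    Assumes the first row is a heading row when header=True.
    """
    lines = ["<table>"]
    first_row = True
    for row in csv.reader(io.StringIO(csv_text)):
        if not row:
            continue  # skip blank lines in the CSV
        # Heading cells for the first row, ordinary cells afterwards.
        tag = "th" if header and first_row else "td"
        first_row = False
        # Escape &, < and > so the data cannot break the markup.
        cells = "".join(
            f"<{tag}>{html.escape(cell.strip())}</{tag}>" for cell in row
        )
        lines.append(f"  <tr>{cells}</tr>")
    lines.append("</table>")
    return "\n".join(lines)


# Made-up sample data; replace with the contents of the exported .csv file.
sample = 'Item,Price\n"Widget, large",4.50\nNuts & bolts,1.25\n'
print(csv_to_html_table(sample))
```

To use it on a real file, read the file contents into a string first and pass that in. The output has no SPANs or inline styles, so it can be pasted straight into a page and styled with CSS.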
