homepage Welcome to WebmasterWorld Guest from 54.237.38.30
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Code, Content, and Presentation / WYSIWYG and Text Code Editors
Forum Library, Charter, Moderator: open

WYSIWYG and Text Code Editors Forum

    
Converting PDF into HTML
SilverLining

5+ Year Member



 
Msg#: 4221570 posted 12:13 pm on Oct 25, 2010 (gmt 0)

Hi,

What is the best way to get table-based data from a PDF converted into an HTML table. I used Acrobat to export as HTML, but this did not work well (too many unnecessary SPANs and the data is mixed-up).

I saved the data as .csv and was hoping to use that in combination with Regex to get table fields wrapped around the data - maybe I gave up too soon.

 

milosevic



 
Msg#: 4221570 posted 10:30 am on Nov 17, 2010 (gmt 0)

Hi SilverLining, I was tackling this same problem yesterday (specifically tables too) and gave up (I don't have Acrobat though).

If you managed to get to a csv of the table data formatted correctly, it should indeed be possible to process this into a HTML table.

The easiest way would be to copy the data from the csv and paste it into Word and then view the .doc in Google Docs "view as HTML" option and copy the source code.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Code, Content, and Presentation / WYSIWYG and Text Code Editors
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved