| Bite Size Chunking PDFs - Software Solution?
|
akreider

msg:4084393 | 7:44 pm on Feb 21, 2010 (gmt 0) | I have a website with 400 PDFs. I once converted two of them into html, creating 10-15 webpages (one for every section of the pdf). This dramatically increased my search engine traffic for the files. I'm wondering if there is a tool to do this automatically? I guess detecting when a new section begins might be tricky. You could also just automatically break it up every page or two, but that wouldn't look professional. Or what tools would you recommend for doing this manually?
|
neostar

msg:4084506 | 12:52 am on Feb 22, 2010 (gmt 0) | Use server side programming lang to convert it like php.
|
akreider

msg:4087290 | 7:04 pm on Feb 25, 2010 (gmt 0) | I've been thinking about php, but it seems like it will be very hard to do it intelligently. Any advice how the pdf format works? Can you detect paragraph breaks, page breaks by manually parsing it?
|
|
|