Forum Library, Charter, Moderators: ergophobe

Content Management Forum

    
Mass PDF MetaData Generators?
Pjman




msg:4407654
 12:00 pm on Jan 17, 2012 (gmt 0)

I'm working on a project that works with thousands of small PDF documents.

Has anyone heard of software that would automatically generate metadata from the content of each PDF and embed it in the file?

Thanks For Your Time

 

ergophobe




msg:4407795
 5:17 pm on Jan 17, 2012 (gmt 0)

This is built into Acrobat, depending on which metadata you actually want to modify. I'm using Acrobat 8, and it's pretty simple:

1. On the upper menu bar, go to Advanced -> Document Processing -> Batch Processing.

2. This opens the Batch Processing dialog box. Create a New Sequence. Give it a name.

3. Now you are looking at the sequence editor. Choose Select Commands.

4. From the list of commands, choose Description and Add to move it to the left side.

5. Double click and change the author, title, keywords and all that.
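For anyone without Acrobat, the same kind of fixed-value batch edit can be roughly sketched as a script. This is only an illustration, not something tested on the original poster's files: it assumes the third-party pypdf library is installed, and the names build_info, stamp_folder and the "stamped" output folder are made up for the example.

```python
from pathlib import Path


def build_info(author, title, keywords):
    """Return a PDF document-information dictionary for the given fields."""
    return {
        "/Author": author,
        "/Title": title,
        "/Keywords": ", ".join(keywords),
    }


def stamp_folder(folder, info):
    """Write the same metadata into a copy of every PDF in `folder`."""
    # Assumption: pypdf is installed (pip install pypdf). It is imported
    # here, not at module level, so build_info() works without it.
    from pypdf import PdfReader, PdfWriter

    out_dir = Path(folder) / "stamped"  # hypothetical output location
    out_dir.mkdir(exist_ok=True)
    for pdf_path in sorted(Path(folder).glob("*.pdf")):
        reader = PdfReader(str(pdf_path))
        writer = PdfWriter()
        # Copy the pages across, then attach the new description fields.
        for page in reader.pages:
            writer.add_page(page)
        writer.add_metadata(info)
        with open(out_dir / pdf_path.name, "wb") as fh:
            writer.write(fh)
```

Calling stamp_folder("pdfs", build_info("Example Author", "Example Title", ["example", "pdf"])) would write stamped copies next to the originals. Copying only the pages may drop bookmarks and any existing metadata, so it is worth checking the output on a few files first.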

Now this only works for simple cases. If you want to handle more complex cases where the keywords and such will be taken from the document itself, you'll need to do some scripting. I don't know how to do that. I've done a bit of InDesign scripting and, like MS Office applications, the Adobe scripting options are powerful, but there's real programming involved and often a relatively esoteric API.
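To give an idea of what that scripting might look like outside Acrobat, here is a minimal sketch that picks keywords from a document's own text by simple word frequency. The frequency count is a naive stand-in, not a method anyone in this thread has used; the stopword list, top_keywords and keywords_for_pdf are hypothetical names, and the PDF text extraction assumes the third-party pypdf library.

```python
import re
from collections import Counter

# Small illustrative stopword list; a real one would be much longer.
STOPWORDS = {"the", "and", "for", "that", "with", "this", "from", "are", "was"}


def top_keywords(text, limit=5):
    """Return the most frequent words of 3+ letters, ignoring stopwords."""
    words = re.findall(r"[a-z]{3,}", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS)
    return [word for word, _ in counts.most_common(limit)]


def keywords_for_pdf(path, limit=5):
    """Extract text from a PDF and return its most frequent words."""
    # Assumption: pypdf is installed (pip install pypdf). Imported lazily
    # so top_keywords() can be used on plain text without it.
    from pypdf import PdfReader

    reader = PdfReader(str(path))
    # extract_text() can come back empty for scanned or image-only pages.
    text = " ".join(page.extract_text() or "" for page in reader.pages)
    return top_keywords(text, limit)
```

The result could then be joined into a keywords string and written back to each file. How useful the keywords are will depend heavily on how the PDFs were produced, and scanned documents would need OCR before any of this works.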

Whether or not someone has created an app that will already do this dynamic stuff, I don't know, but since extracting data from the document itself will be highly variable and depend on how the document is constructed, it seems like it would be a challenge.

Pjman




msg:4407805
 5:26 pm on Jan 17, 2012 (gmt 0)

Thank you very much for your input. It was extremely helpful. I have been looking all day, and from what I can tell no one makes something that extracts words and adds them to the metadata.

Your method will give me a good start though. Thanks.

