Spidering pdf files

WHAT TO USE?

         

ljr2u

11:30 am on Dec 30, 2001 (gmt 0)



What is the best programming language or library to use for spidering PDF documents?

Air

8:37 pm on Jan 1, 2002 (gmt 0)



Welcome to WebmasterWorld, ljr2u!

I would start with Perl's LWP module(s) to retrieve the PDF documents, then use the PDF::Parse library to parse the documents you retrieve.
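
To give a feel for the retrieval half of that, here is a rough sketch of the same idea. Note that it is written in Python with only the standard library, not Perl/LWP, so treat it as an illustration of the approach rather than the setup suggested above. The names `LinkCollector`, `find_pdf_links`, `looks_like_pdf`, `fetch`, and `spider_pdfs`, plus the user-agent string, are all made up for this example. It only looks one page deep, and it stops at handing back the raw PDF bytes; parsing them is left to whatever PDF library you choose.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
from urllib.request import Request, urlopen


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def find_pdf_links(html, base_url):
    """Return absolute URLs of links whose path ends in .pdf."""
    collector = LinkCollector()
    collector.feed(html)
    links = []
    for href in collector.hrefs:
        absolute = urljoin(base_url, href)
        # Compare the path only, so query strings do not hide the extension.
        if urlparse(absolute).path.lower().endswith(".pdf"):
            links.append(absolute)
    return links


def looks_like_pdf(data):
    """PDF files begin with the "%PDF-" header bytes."""
    return data.startswith(b"%PDF-")


def fetch(url, timeout=30):
    """Download a URL and return its raw bytes."""
    # Hypothetical user-agent string; use one that identifies your spider.
    request = Request(url, headers={"User-Agent": "pdf-spider-sketch/0.1"})
    with urlopen(request, timeout=timeout) as response:
        return response.read()


def spider_pdfs(start_url):
    """Yield (url, bytes) for each PDF linked directly from start_url."""
    page = fetch(start_url).decode("utf-8", errors="replace")
    for link in find_pdf_links(page, start_url):
        data = fetch(link)
        if looks_like_pdf(data):
            yield link, data
```

You would loop over `spider_pdfs("http://example.com/")` and pass each `data` blob to your PDF parser. A real spider should also honor robots.txt, throttle its requests, and follow links beyond the first page.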