Hi there,
the project I am working on is an online respository that uses iVia open source software, which crawls, collects, and indexes and then makes items searchable and accessible. It utilizes php/mysql, c++, and operates on linux (ubuntu.) The problem is, iVia was designed to manage HTML pages, and the majority of the documents we want to include in the repository are going to be PDF files, which are apparently to large for it to process. Does anyone know of any kind of add on piece of software that that can read pdfs and dump them into a MySQL database? Or even where I should start looking?
Thank you so so so so much for any advice and/or help.