Posted by Justin Koivisto on 03/08/06 19:07
I am looking for a way to convert PDF files into text content. I don't
care about layout or formatting, just the plain text that I can use to
search against in a database.
I've looked into the pdftotext tool from:
http://www.foolabs.com/xpdf/download.html
It works fine when I run it from the command line. However, if I issue
the same command via a system() call, there are major problems that
cause the server to crash. (I don't know why; no error messages are
being generated anywhere.)
I want to run this when a PDF file is uploaded via a form and store the
extracted text in a database for a search function.
TIA
-- Justin
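
For illustration only, here is a minimal sketch of calling pdftotext from a
script in a way that surfaces errors instead of failing silently. The post
does not say which language the system() call is made from, so Python is an
assumption here, and the names build_command and extract_text are
hypothetical. It relies on pdftotext writing to stdout when the output file
is given as "-". It does not diagnose the crash described above; it only
shows one way to capture stderr, check the exit code, and bound the run time.

```python
import subprocess

# Assumption: pdftotext is on PATH. On a web server an absolute path is
# usually safer, because the server's PATH may differ from your shell's.
PDFTOTEXT = "pdftotext"


def build_command(pdf_path, binary=PDFTOTEXT):
    """Build the argument list (hypothetical helper, not part of xpdf)."""
    # Passing a list avoids shell quoting problems with uploaded file names.
    # "-" as the output file makes pdftotext write the text to stdout.
    return [binary, "-enc", "UTF-8", pdf_path, "-"]


def extract_text(pdf_path, binary=PDFTOTEXT, timeout=30):
    """Return the plain text of a PDF, raising RuntimeError on any failure."""
    try:
        result = subprocess.run(
            build_command(pdf_path, binary),
            capture_output=True,  # collect stdout and stderr instead of losing them
            timeout=timeout,      # stop a runaway conversion
        )
    except FileNotFoundError as exc:
        raise RuntimeError(f"converter not found: {binary}") from exc
    except subprocess.TimeoutExpired as exc:
        raise RuntimeError(f"conversion exceeded {timeout}s") from exc

    if result.returncode != 0:
        # Report whatever pdftotext printed on stderr.
        raise RuntimeError(result.stderr.decode("utf-8", "replace").strip())
    return result.stdout.decode("utf-8", "replace")
```

An upload handler could call extract_text() on the saved file and store the
returned string in the search table, logging any RuntimeError rather than
letting it pass unnoticed.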