Posted by Miguel Cruz on 11/18/12 11:42
Justin Koivisto <justin@koivi.com> wrote:
> I am looking for a way to convert PDF files into text content. I don't
> care about layout or formatting, just the plain text that I can use to
> search against in a database.
>
> I've looked into the pdftotext tool from:
> http://www.foolabs.com/xpdf/download.html
>
> However, when I use it via the command line, it works fine. If I issue
> the same command via a system() call, there are major problems that
> cause the server to crash. (Don't know why, there aren't any error
> messages being generated anywhere.)
What do you mean when you say the server crashes? The Apache process
dies? The entire machine locks up? The server physically falls off the
rack and lands on the floor?
How about doing an experiment where you use system() to call a shell
script that sets up some debugging and dumps the environment, and see
what you come up with?
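
A rough sketch of what such a debugging wrapper might look like, as a starting point rather than a definitive recipe. The function name debug_wrap and the log file argument are made up for illustration. It appends the identity, working directory, environment, and resource limits of the calling process to a log file, then runs the real command with its stdout and stderr captured in the same log.

```shell
# Hypothetical helper: debug_wrap LOGFILE COMMAND [ARGS...]
# Records the context the command runs in, then runs the command and
# logs everything it prints plus its exit status.
debug_wrap() {
    log=$1
    shift
    {
        echo "=== $(date) ==="
        echo "user: $(id)"          # which account the web server runs as
        echo "cwd: $(pwd)"          # working directory seen by the child
        echo "--- environment ---"
        env                         # PATH, HOME, etc. often differ from a login shell
        echo "--- limits ---"
        ulimit -a                   # memory/CPU/file limits imposed on the process
        echo "--- command: $* ---"
    } >>"$log" 2>&1

    # Run the real command; capture both stdout and stderr in the log.
    "$@" >>"$log" 2>&1
    status=$?

    echo "--- exit status: $status ---" >>"$log"
    return "$status"
}
```

To try it, one could save the function in a script that ends with `debug_wrap "$@"`, point system() at that script instead of calling pdftotext directly, and then compare the log against the output of the same script run from an interactive shell. Differences in PATH, user, working directory, or limits are the usual suspects.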
miguel
--
Photos from 38 countries on 5 continents: http://travel.u.nu
Latest photos: Australia; Malaysia; Burma; Thailand; Hong Kong
Airports of the world: http://airport.u.nu