|
Posted by Shelly on 09/19/07 15:14
I had to do my first investigation regarding PDF files. Surprisingly, I
found that the only functions in PHP were for creating PDF files.
The potential customer receives order forms from the corporate headquarters
and they are PDF forms. What we want to do is to extract information from
these forms and process the data into a database. To do this we need to
read certain set fields. Nowhere did I find a function to be able to read
PDF files, let alone extract information from them.
My thoughts, in the absence of this function, would be if there were a way
to open the file, strip the formatting, and then work on the text stream.
The key unknown for me in this is how to strip the formatting.
So, do I hear any suggestions for either?:
(1) How to read predetermined field entries from a PDF file or
(2) How to convert a PDF into an unformatted text stream
Shelly
Navigation:
[Reply to this message]
|