bill lam wrote:
> On Wed, 28 Jan 2009, Alex Rufon wrote:
>> PDF format, they have to go through these steps:
>>
>> 1. Export the PDF file into HTML
>>
>> 2. Parse the HTML file
>>
>> 3. Insert/Update the databases
>
> Theoretically you can parse pdf file yourself because it is a plain
> test format with possible embedded graphic or compressed text in zlib.
> However I think that you can just export pdf into txt directly using
> some utilities (please google yourself). Openoffice can be used as a
> command line converter between various formats including pdf and txt
> iirc.

Just to add to Bill's comment: PDF files contain drawing instructions that are arbitrarily complex. You might be able to parse simple examples, but in general it would be very difficult to write a J program to turn these instructions into data.

Why not ask your clients to send EDI in plain text?

----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm
