GhostScript package should have a command-line tool to extract text. But then you'll have to identify where the information is.
Good thing you don't have to OCR. ----- Original Message ---- > From: Alex Rufon <[email protected]> > > Oh. Its a PO alright. It can be sent as an EDI-850 in standard ASCII format. > The > problem is the client is giving us the PO with the bill of materials, some > production notes, pictures of the garment, costings, prefered suppliers ... > the > works. They are actually providing us the style/production book disguised as > the > PO in PDF format. Getting it in excel would be more easier. > > I did recommend that they send each separately in manageable format but I am > several layers away from the client. NOTE: Layers = Position/Job Title/Person > ... so I am only on the "advising" and "fixing what is wrong with whay they > have > now but not too much or someone might loose their job" capacity. ;) > > Oh well, I saw the sample and its a mess when read directly into J. > > I did stumbled upon XPDF which can rip text and images from a PDF file and it > looks hopefull ... theres just issues like how to automatically identify the > exported images. > > r/alex > > ________________________________ > > From: [email protected] on behalf of Chris Burke > Sent: Wed 1/28/2009 8:29 PM > To: General forum > Subject: Re: [Jgeneral] Parsing PDF documents in J > > > > Alex Rufon wrote: > > Oh, I did ask if the client can send the PO in EDI (way much easier all > > around) but it seems that somebody from the client is saying that "we > > are your client and you should be happy with what we give you". Cue > > lightning and organ music!!! > > This cannot be right. EDI is a plain text format. If your customer is > making a pdf out of it, then they are not sending in standard EDI > format. Just call the customer and lean on them to send you the proper > format. > > > > ---------------------------------------------------------------------- > For information about J forums see http://www.jsoftware.com/forums.htm ---------------------------------------------------------------------- For information about J forums see http://www.jsoftware.com/forums.htm
