Virgiliu Craciun wrote:
> Unfortunately, it seems that there is no 'podofotxtextract' tool included in the
> package, and so far we've been unsuccessful in finding it.

In the source tree it's in tools/podofotxtextract/

> We had the feeling that this wouldn't be straightforward at all. Fortunately,
> the PDFs we're interested in are generated by one single application. I will
> try to use the PDFBrowser to understand the structure (are there any binaries
> for Windows?)

Yes. See the SourceForge download page for PoDoFo.

> Another good thing would be that the string has always the same format (pattern,
> number of digits), please see example:
> Doc: 17220080930.121655.008
> So we may be able to distinguish it from the crowd.

Yes, *maybe*. There's no guarantee the app actually draws it as a single string, though; nothing stops it from doing things like:

    Draw digit "8"
    Reposition cursor to left of digit
    Draw digit "0"
    Reposition cursor to left of digit
    ...

(and some apps do use inefficient methods like this for, say, right justification). Most likely you will find the string as a whole; just be aware that it's not guaranteed.

> If I am stuck, I will post an anonymised file. We do work now under time
> pressure with this (and it's just a small bit of the project!), so maybe
> someone could help us with podofotxtextract or other code example to get the
> text out from a PDF.

Sorry, I'm not really inclined to do commercial support for free. See how you go with the explanations provided, the code in podofotxtextract, etc. If you have questions, feel free to ask, but I'm afraid I'm not going to spend ages fiddling around writing example code.

>> Pierre pointed out podofotxtextract, which is probably a good start.

Pierre's reply was just to point you at podofotxtextract. That's all.
--
Craig Ringer
