You are describing two different features. One is called OCR zones: reading text from a fixed region of the page, like your bill number. The other is layout analysis: locating content whose position changes from one document to the next, like your total (example: https://github.com/tabulapdf/tabula). I've seen forks of Mayan with this kind of feature added as a commercial plugin, but none have donated the code to be added to the core version I develop. These features are very complex and costly to build, and they sit in a patent minefield (http://patents.justia.com/patents-by-us-classification/382/321). Without external sponsorship I'm not able to implement them.
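To make the difference concrete, here is a minimal sketch in Python. It is not Mayan EDMS code and not how any particular product does it. It assumes an OCR engine has already returned each word with its bounding box. The names `Word`, `Zone`, `read_zone` and `find_after_label`, and the sample coordinates, are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """One OCR'd word with its bounding box (hypothetical structure)."""
    text: str
    x: float
    y: float
    width: float
    height: float


@dataclass(frozen=True)
class Zone:
    """A fixed rectangle on the page, defined once per document template."""
    name: str
    x: float
    y: float
    width: float
    height: float

    def contains(self, word: Word) -> bool:
        # A word belongs to the zone if its center point falls inside it.
        cx = word.x + word.width / 2
        cy = word.y + word.height / 2
        return (self.x <= cx <= self.x + self.width
                and self.y <= cy <= self.y + self.height)


def read_zone(words, zone):
    """OCR-zone style: return the text found inside a fixed region."""
    inside = sorted((w for w in words if zone.contains(w)),
                    key=lambda w: (w.y, w.x))
    return " ".join(w.text for w in inside)


def find_after_label(words, label):
    """Very rough stand-in for layout analysis: find a label anywhere on
    the page and return the nearest word to its right on the same line."""
    for anchor in words:
        if anchor.text.lower() != label.lower():
            continue
        same_line = [w for w in words
                     if w is not anchor
                     and abs(w.y - anchor.y) < anchor.height / 2
                     and w.x > anchor.x]
        if same_line:
            return min(same_line, key=lambda w: w.x).text
    return None


# Made-up word boxes standing in for the OCR output of one bill.
SAMPLE_WORDS = [
    Word("Apple", 40, 30, 60, 12),
    Word("Invoice", 400, 30, 50, 12),
    Word("No.", 455, 30, 20, 12),
    Word("A-1001", 480, 30, 50, 12),
    Word("Total", 400, 620, 40, 12),
    Word("129.00", 480, 620, 50, 12),
]

# The bill number always sits in the same place, so a fixed zone works.
INVOICE_NUMBER_ZONE = Zone("invoice_number", 470, 20, 80, 30)

if __name__ == "__main__":
    print(read_zone(SAMPLE_WORDS, INVOICE_NUMBER_ZONE))
    # The total moves with the number of articles, so search by label.
    print(find_after_label(SAMPLE_WORDS, "Total"))
```

The fixed zone is the easy part. Real layout analysis has to cope with skewed scans, tables, multi-line values and OCR errors, which is where the complexity and cost come from.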
On Friday, December 16, 2016 at 5:36:25 AM UTC-4, [email protected] wrote:
>
> Hello,
> I'm looking for a program that can read a document and extract
> information from it.
> For example, I receive a bill from Apple (the program would recognize it
> because I would have defined that Apple and its address appear in a given
> region, along with the places that tell it this is a bill from Apple), and
> I would like to extract from it, for example, the bill number (which should
> always be in the same place) and the total price of the bill (whose place
> differs, depending on the number of articles I ordered).
>
> Unfortunately I didn't find the technical term for this on the web.
> What is this called? Is it possible with Mayan EDMS?
>
> Thank you in advance for replying, and have a good day,
>
> Cheers,
>
> Sam
