The page segmentation file (.pseg.png) contains pixel accurate information about where the lines are.
For finding words within lines, it depends on the recognizer and the script of how to get at that information. ocropus-lattices gives you bounding boxes relative to the text line. ocropus-rpred (the new recognizer) outputs a sequence of classification vectors that you could use. Tom On Monday, March 4, 2013 4:06:55 AM UTC-8, Al Byers wrote: > > I would like to have ocropus analyze a document and report the position of > key words so that I can go back and do a more detailed analysis of > particular areas. Is there output that will give me that information? -- You received this message because you are subscribed to the Google Groups "ocropus" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msg/ocropus/-/7iXmJxGvOSMJ. For more options, visit https://groups.google.com/groups/opt_out.
