John,

Excellent answer yet again, much appreciated!

-Aaron

On Wednesday, July 16, 2014, John Hewson <[email protected]> wrote:

> Hi Aaron,
>
> You could store a map of TextPosition => Color which you populate in
> processTextPosition.
> Lines are not known until the end of TextStripper’s processing (they have
> to be inferred) so you could override a method from one of the phases at
> the end and I think you should have access to either lines or TextPositions
> which are merged into continuous runs (I can’t remember which).
> Alternatively you can always grab all TextPositions with the same y
> position.
>
> -- John
>
> On 16 Jul 2014, at 13:27, Aaron Hartman <[email protected]> wrote:
>
> > Hi everyone;
> > I am currently scanning PDF’s for errors that have red text in them. I
> > accomplished this by extending the PDFTextStripper class and overriding
> > the processTextPosition method to examine the PDGraphicsState for the
> > appropriate color values.
> >
> > Once this position is found is it possible to extract only the line
> > where that red text resides? For the user it would be beneficial to see
> > the line in which the error occurs. Since the processTextPosition has
> > the actual position I was hoping there may be a way to extract the line
> > with the error from within this method, or by storing the position and
> > accessing it elsewhere.
> >
> > If there is a way to accomplish this, please let me know!
> >
> > Thank you for your time.
> >
> > -Aaron
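
For anyone finding this thread later, here is a rough sketch of the "grab all TextPositions with the same y position" idea from John's reply. It is only an illustration under some assumptions: Glyph is a hypothetical stand-in for whatever you record per TextPosition inside processTextPosition (the text, the x and y values such as those from getXDirAdj() and getYDirAdj(), and whether the color was red), and RedLineSketch, linesWithRedText and yTolerance are made-up names, not part of PDFBox. The PDFBox wiring itself is left out so that the grouping logic can be run on its own.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class RedLineSketch {

    /** Hypothetical record of what was captured for one TextPosition. */
    static final class Glyph {
        final String text;
        final float x;
        final float y;
        final boolean red;

        Glyph(String text, float x, float y, boolean red) {
            this.text = text;
            this.x = x;
            this.y = y;
            this.red = red;
        }
    }

    /**
     * Groups glyphs into lines by y position and returns the text of every
     * line that contains at least one red glyph, ordered by ascending y.
     */
    static List<String> linesWithRedText(List<Glyph> glyphs, float yTolerance) {
        // Bucket by rounded y so small baseline jitter lands in the same line.
        // Values that straddle a bucket boundary can still be split apart.
        Map<Long, List<Glyph>> byLine = new TreeMap<>();
        for (Glyph g : glyphs) {
            long bucket = Math.round((double) g.y / yTolerance);
            byLine.computeIfAbsent(bucket, k -> new ArrayList<>()).add(g);
        }

        List<String> result = new ArrayList<>();
        for (List<Glyph> line : byLine.values()) {
            boolean hasRed = false;
            for (Glyph g : line) {
                hasRed |= g.red;
            }
            if (!hasRed) {
                continue;
            }
            // Restore reading order within the line before joining the text.
            line.sort(Comparator.comparingDouble(g -> g.x));
            StringBuilder sb = new StringBuilder();
            for (Glyph g : line) {
                sb.append(g.text);
            }
            result.add(sb.toString());
        }
        return result;
    }

    public static void main(String[] args) {
        List<Glyph> glyphs = Arrays.asList(
                new Glyph("FAILED", 60f, 100.2f, true),
                new Glyph("Total: ", 10f, 100f, false),
                new Glyph("All good", 10f, 120f, false));
        System.out.println(linesWithRedText(glyphs, 2.0f));
    }
}
```

The tolerance is there because glyphs on one visual line do not always report exactly the same y value. Bucketing by rounded y is the simplest choice; if the stripper's own inferred lines are available from one of the later phases, as John mentions, those are likely to be more reliable than this grouping.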
