Used font in the PDF file is "Kingfisher-Heavy", is it one of the unmatching fonts?
Can I use character height values in order to solve correct this problem? For example; if I can get the height for each character in the pdf file, I can compare this value with nearer characters, then I could convert a lowercase character to uppercase using some logic. Does PDFBox provide an interface to get height values for textposition objects, or characters? On Wed, Oct 12, 2011 at 8:29 PM, Kévin Sailly <[email protected]>wrote: > Hello, > > May be a font problem, the embended one in the pdf file is matching the > standard font mapping to unicode? > > Regards, > Kévin > > 2011/10/12 Yavuz Nuzumlalı <[email protected]> > > > Hi, > > > > When I try to use TextPosition to get text in a PDF file, it sometimes > > gives > > me related character with changed case. > > > > For example, The text in the pdf is like this: > > > > "BEBEK RANGE ROVER " > > > > And PDFBox returns the text like this: > > > > "bebek RANGe ROVeR " > > > > I'm using processTextPosition() method to get text. What could be the > > problem, I can't figured out how to solve the problem. > > > > Thanks. > > >

