I am using the PDFBox 2.0.7 release <https://pdfbox.apache.org/download.cgi#20x>.

2017-08-09 15:47 GMT-03:00 Tilman Hausherr <thaush...@t-online.de>:

> On 08.08.2017 at 22:36, Zubiri, Tomas wrote:
>
>>
>> Good evening!
>>
>> I am having trouble with the following Chinese file:
>> http://www.filedropper.com/1327415361
>>
>>
>> Page 2 contains only 7 characters (3 digits and 4 Chinese characters),
>> but TextStripper reports 19 TextPositions.
>> The Chinese characters appear 4 times, sometimes with different x
>> coordinates.
>>
>>
> That is page 3.
>
>
>> It is worth noting that TextPosition.getWidthOfSpace() returns NaN for
>> each of these characters.
>>
>>
> What version are you using?
>
> Here's what I get:
>
>
> String[42.6,55.560303 fs=14.04 xscale=14.031576 height=8.09406
> space=3.507894 width=3.5078926]
> String[42.6,79.6803 fs=14.04 xscale=14.031576 height=8.09406
> space=3.507894 width=3.5078926]
> String[42.6,103.80029 fs=14.04 xscale=14.031576 height=8.09406
> space=3.507894 width=3.5078926]
> String[42.6,127.92029 fs=14.04 xscale=14.031576 height=8.09406
> space=3.507894 width=3.5078926]
> String[42.6,152.04028 fs=14.04 xscale=14.031576 height=8.09406
> space=3.507894 width=3.5078926]
> String[42.6,176.16028 fs=14.04 xscale=14.031576 height=8.09406
> space=3.507894 width=3.5078926]
> String[42.6,200.28027 fs=14.04 xscale=14.031576 height=8.09406
> space=3.507894 width=3.5078926]
> String[42.6,224.40027 fs=14.04 xscale=14.031576 height=8.09406
> space=3.507894 width=3.5078926]
> String[434.3998,254.6402 fs=20.04 xscale=20.027977 height=11.923801
> space=5.567778 width=11.135559]1
> String[445.5554,254.6402 fs=20.04 xscale=20.027977 height=11.923801
> space=5.567778 width=11.135559]0
> String[456.71097,254.6402 fs=20.04 xscale=20.027977 height=11.923801
> space=5.567778 width=11.135559]3
> String[473.0396,254.6402 fs=20.04 xscale=20.027977 height=9.959881
> space=20.027977 width=20.027985]?
> String[493.08762,254.6402 fs=20.04 xscale=20.027977 height=9.959881
> space=20.027977 width=20.027985]?
> String[513.1356,254.6402 fs=20.04 xscale=20.027977 height=9.959881
> space=20.027977 width=20.027954]?
> String[533.1836,254.6402 fs=20.04 xscale=20.027977 height=9.959881
> space=20.027977 width=20.027954]?
> String[552.8387,254.6402 fs=20.04 xscale=20.027977 height=11.553061
> space=5.0069942 width=5.007019]
>
> There are some space characters. You can see their position with the
> DrawPrintTextLocations example from the source code download.
>
> The only oddity is that the cyan rectangle is too wide for some glyphs.
> Maybe a bug in getBounds2D(), or an invisible point...
>
> Tilman
>
>
>
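
To make the two symptoms from the quoted message easier to check (repeated glyphs at slightly different x coordinates, and NaN space widths), here is a minimal sketch in plain Java. It does not call PDFBox itself: the Glyph class is a hypothetical stand-in for the values one would read from a TextPosition (getUnicode(), getXDirAdj(), getYDirAdj(), getWidthOfSpace()), and the class name, the method names, the tolerance and the sample coordinates are all assumptions made for illustration, not values taken from the file.

```java
import java.util.ArrayList;
import java.util.List;

final class GlyphDiagnostics {
    private GlyphDiagnostics() {}

    // Hypothetical stand-in for the fields read from a PDFBox TextPosition.
    static final class Glyph {
        final String text;
        final float x;
        final float y;
        final float widthOfSpace;

        Glyph(String text, float x, float y, float widthOfSpace) {
            this.text = text;
            this.x = x;
            this.y = y;
            this.widthOfSpace = widthOfSpace;
        }
    }

    // Counts glyphs whose space width is NaN.
    static int countNaNSpaceWidths(List<Glyph> glyphs) {
        int count = 0;
        for (Glyph g : glyphs) {
            if (Float.isNaN(g.widthOfSpace)) {
                count++;
            }
        }
        return count;
    }

    // Keeps the first occurrence of each glyph and drops later glyphs with
    // the same text whose x and y both lie within the given tolerance.
    static List<Glyph> dropOverlappingDuplicates(List<Glyph> glyphs, float tolerance) {
        List<Glyph> kept = new ArrayList<>();
        for (Glyph g : glyphs) {
            boolean duplicate = false;
            for (Glyph k : kept) {
                if (k.text.equals(g.text)
                        && Math.abs(k.x - g.x) <= tolerance
                        && Math.abs(k.y - g.y) <= tolerance) {
                    duplicate = true;
                    break;
                }
            }
            if (!duplicate) {
                kept.add(g);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Made-up sample values, not taken from the PDF discussed above.
        List<Glyph> glyphs = new ArrayList<>();
        glyphs.add(new Glyph("1", 434.4f, 254.64f, 5.57f));
        glyphs.add(new Glyph("\u4e2d", 473.0f, 254.64f, Float.NaN));
        glyphs.add(new Glyph("\u4e2d", 473.5f, 254.64f, Float.NaN)); // same glyph, shifted x
        System.out.println("NaN space widths: " + countNaNSpaceWidths(glyphs));
        System.out.println("After de-duplication: "
                + dropOverlappingDuplicates(glyphs, 1.0f).size());
    }
}
```

Within PDFBox, PDFTextStripper.setSuppressDuplicateOverlappingText(boolean) controls a built-in filter of this kind; the sketch only shows the idea so the counts can be compared by hand.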
