[ 
https://issues.apache.org/jira/browse/PDFBOX-3715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15904477#comment-15904477
 ] 

Tilman Hausherr commented on PDFBOX-3715:
-----------------------------------------

PrintTextLocations shows no problem either; the space after "contents" is present in its output (at x=213.27559):
{code}
String[130.8572,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=6.498001]N rot: 0.0
String[137.24539,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=4.5]o rot: 0.0
String[141.63559,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.25]  rot: 0.0
String[144.33559,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=4.5]p rot: 0.0
String[148.72578,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=3.9960022]a rot: 0.0
String[152.61198,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.996994]r rot: 0.0
String[155.49919,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.501999]t rot: 0.0
String[157.9634,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.25]  rot: 0.0
String[160.66339,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=4.5]o rot: 0.0
String[165.05359,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.996994]f rot: 0.0
String[168.02365,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.25]  rot: 0.0
String[170.72365,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.501999]t rot: 0.0
String[173.12125,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=4.5]h rot: 0.0
String[177.51685,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=3.9960022]e rot: 0.0
String[181.40845,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.25]  rot: 0.0
String[184.10844,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=3.9960022]c rot: 0.0
String[188.00005,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=4.5]o rot: 0.0
String[192.39565,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=4.5]n rot: 0.0
String[196.79124,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.501999]t rot: 0.0
String[199.18884,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=3.9960022]e rot: 0.0
String[203.08044,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=4.5]n rot: 0.0
String[207.47604,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.501999]t rot: 0.0
String[209.93665,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=3.501007]s rot: 0.0
String[213.27559,133.08313 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.25]  rot: 0.0
String[216.05125,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=4.5]o rot: 0.0
String[220.44685,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.996994]f rot: 0.0
{code}
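
One way to check such a dump for a particular space is to reassemble the glyphs in the order they were printed. The following is only a rough sketch, not part of PDFBox: the class name GlyphLines and its regular expression are made up for illustration, and it assumes every record sits on a single line (mail line wrapping undone) in the "String[x,y ... width=w]c rot: r" shape shown above.

```java
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper (not a PDFBox class): rebuilds text from
// PrintTextLocations-style records, keeping their printed order.
final class GlyphLines {
    // Assumed record shape: "String[x,y fs=... width=w]<glyph> rot: <angle>"
    private static final Pattern RECORD = Pattern.compile(
            "^String\\[([0-9.]+),([0-9.]+) .*width=([0-9.]+)\\](.*) rot: ([0-9.]+)$");

    // Concatenates the glyph of every line that matches the record shape;
    // lines that do not match are skipped.
    static String reassemble(List<String> lines) {
        StringBuilder text = new StringBuilder();
        for (String line : lines) {
            Matcher m = RECORD.matcher(line);
            if (m.matches()) {
                text.append(m.group(4));
            }
        }
        return text.toString();
    }

    public static void main(String[] args) {
        // Three records copied from the output above: "s", the space, "o".
        List<String> lines = List.of(
                "String[209.93665,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=3.501007]s rot: 0.0",
                "String[213.27559,133.08313 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=2.25]  rot: 0.0",
                "String[216.05125,133.11017 fs=1.0 xscale=9.0 height=5.9040003 space=2.25 width=4.5]o rot: 0.0");
        System.out.println("[" + reassemble(lines) + "]"); // prints "[s o]"
    }
}
```

Applied to the three records around x=213.27559, this yields "s o", so the space after "contents" is in the stream of text positions and sits between "s" and "o".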


> Text Stripper trims last spaces - regression of 2.0
> ---------------------------------------------------
>
>                 Key: PDFBOX-3715
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3715
>             Project: PDFBox
>          Issue Type: Bug
>            Reporter: Roman
>         Attachments: WindowsPhone7.pdf_page1_qdf.pdf
>
>
> After migrating from 1.8 to 2.0, we noticed that some spaces have disappeared. 
> Please see the attached PDF, in which the missing spaces are shown as blue boxes. 
> Those spaces WERE present in version 1.8.
> Our app subclasses *PDFTextStripper*, overrides the *writePage()* method, 
> and uses the *charactersByArticle* property, which is a list of the 
> *TextPosition* objects for every character in the document.
> Some trailing spaces are missing from it. At the same time, those spaces 
> are present in the PDF via explicit declaration. For example, this piece of 
> the attached PDF contains the space right after the word "contents":
> {code}
> [( the content)-7(s )-2(of t)...]TJ
> {code}
> PS
>   I found that this bug occurs only when *sortExtractedTextByPosition* is 
> set to *false*. The spaces have not actually disappeared; they have moved to the 
> beginning of the *charactersByArticle* list. Such behavior is not expected when 
> sorting is off.
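
The claim that the space is declared explicitly can be checked against the quoted TJ array by concatenating its literal strings. This is a simplified sketch with a made-up class name (TjText), not PDFBox code: it assumes plain literal strings without escapes, nested parentheses, or hex strings, and it treats the string bytes as text, which in general depends on the font encoding.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper (not a PDFBox class): joins the literal strings
// of a TJ array and ignores the numeric kerning adjustments between them.
final class TjText {
    // A literal string is taken to be "(...)" with no escapes or nested parentheses.
    private static final Pattern LITERAL = Pattern.compile("\\(([^()\\\\]*)\\)");

    static String shownText(String tjArray) {
        StringBuilder text = new StringBuilder();
        Matcher m = LITERAL.matcher(tjArray);
        while (m.find()) {
            text.append(m.group(1));
        }
        return text.toString();
    }

    public static void main(String[] args) {
        // The fragment quoted in the issue; the "..." elision is kept as-is.
        String fragment = "[( the content)-7(s )-2(of t)...]TJ";
        System.out.println("[" + shownText(fragment) + "]"); // prints "[ the contents of t]"
    }
}
```

The second literal, "(s )", carries the space after "contents", which matches the description in the report.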


