[ 
https://issues.apache.org/jira/browse/PDFBOX-4000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harun Reşit Zafer updated PDFBOX-4000:
--------------------------------------
    Description: 
The 3 attached documents contain lines similar to {{THIS AGREEMENT is made as of the 
5th day of February, 2016.}} PDFBox returns this line as 3 separate lines:
`THIS AGREEMENT is made as of the 5`
`th`
` day of`

Each such line appears near the top of its document.



  was:
The 3 attached documents contain lines similar to `THIS AGREEMENT is made as of the 
5th day of February, 2016.` PDFBox returns this line as 3 separate lines:
`THIS AGREEMENT is made as of the 5`
`th`
` day of`

Each such line appears near the top of its document.




> Wrong line break detection before ordinal indicator superscripts.
> -----------------------------------------------------------------
>
>                 Key: PDFBOX-4000
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4000
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.6, 2.0.7, 2.0.8
>         Environment: Windows 10 64-bit
>            Reporter: Harun Reşit Zafer
>         Attachments: contract_00569_SEDAR.pdf, contract_00882_SEDAR.pdf, 
> contract_00968_SEDAR.pdf
>
>
> The 3 attached documents contain lines similar to {{THIS AGREEMENT is made as of the 
> 5th day of February, 2016.}} PDFBox returns this line as 3 separate lines:
> `THIS AGREEMENT is made as of the 5`
> `th`
> ` day of`
> Each such line appears near the top of its document.
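
Until the line break detection itself is corrected, the symptom reported above could be patched up after extraction. The following is only a minimal sketch of such a post-processing step, not a fix inside PDFBox and not anything the project provides. The class name OrdinalLineMerger and the method merge are hypothetical. It assumes the extracted text has already been split into lines, that a line consisting solely of st, nd, rd or th directly after a line ending in a digit is a detached superscript, and that the line following the suffix continues the same visual line.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper (not part of PDFBox): re-joins lines that were split
// around an ordinal suffix such as "th" in "5th".
final class OrdinalLineMerger {

    // A line that, once trimmed, is nothing but an English ordinal suffix.
    private static final Pattern ORDINAL_SUFFIX =
            Pattern.compile("(?i)(st|nd|rd|th)");

    static List<String> merge(List<String> lines) {
        List<String> out = new ArrayList<>();
        for (int i = 0; i < lines.size(); i++) {
            String line = lines.get(i);
            boolean isSuffix = ORDINAL_SUFFIX.matcher(line.trim()).matches();
            boolean prevEndsWithDigit =
                    !out.isEmpty() && endsWithDigit(out.get(out.size() - 1));

            if (isSuffix && prevEndsWithDigit) {
                // Glue the suffix back onto the previous line.
                StringBuilder joined =
                        new StringBuilder(out.remove(out.size() - 1));
                joined.append(line.trim());
                // Assumption: the next line is the rest of the same visual
                // line, so it is appended as well.
                if (i + 1 < lines.size()) {
                    joined.append(lines.get(++i));
                }
                out.add(joined.toString());
            } else {
                out.add(line);
            }
        }
        return out;
    }

    private static boolean endsWithDigit(String s) {
        return !s.isEmpty() && Character.isDigit(s.charAt(s.length() - 1));
    }
}
```

Passing the three fragments from the report to merge yields the single line "THIS AGREEMENT is made as of the 5th day of". The heuristic is deliberately narrow: it only fires when a digit immediately precedes the suffix line, so ordinary short lines are left alone, but it will not recover superscripts other than English ordinal suffixes.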



