[jira] [Commented] (PDFBOX-3044) Test files character encoding

Tilman Hausherr (JIRA) Thu, 22 Oct 2015 12:25:40 -0700

    [ 
https://issues.apache.org/jira/browse/PDFBOX-3044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14969718#comment-14969718
 ]


Tilman Hausherr commented on PDFBOX-3044:
-----------------------------------------

1.8 and 2.0. My argument is that because the core logic of text extraction is 
the same, it is easier to fix regressions. (I was able to fix three regressions 
by comparing variables in both versions)

> Test files character encoding
> -----------------------------
>
>                 Key: PDFBOX-3044
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3044
>             Project: PDFBox
>          Issue Type: Bug
>            Reporter: Ben McCann
>
> The files in pdfbox/src/test/resources/input all seem to be UTF16 encoded. 
> I'm having a really difficult time using these files with the tools that I 
> typically use (git, meld, etc.)  Would it be possible to change the encoding 
> to UTF8?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (PDFBOX-3044) Test files character encoding

Reply via email to