[
https://issues.apache.org/jira/browse/PDFBOX-420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15536309#comment-15536309
]
Tilman Hausherr commented on PDFBOX-420:
----------------------------------------
I assume your patch was used, thanks. To be sure, I just did an extraction with
the map10.pdf file from this issue with today's code, the result is very
similar to mat10_After.txt file (only a few spaces are different).
> Japanese Characters are garbled.
> --------------------------------
>
> Key: PDFBOX-420
> URL: https://issues.apache.org/jira/browse/PDFBOX-420
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 0.8.0-incubator
> Reporter: Takashi Komatsubara
> Priority: Critical
> Fix For: 1.1.0
>
> Attachments: TestFilesForJapaneseGarbledIssue.zip,
> supportJapanese-fontbox.patch, supportJapanese.patch,
> textextract._20090326_01.zip
>
>
> The extracted Japanese characters are completely garbled.
> This issue is very critical for Japanese users.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]