gopalbhalala created PDFBOX-3445:
------------------------------------
Summary: Can not read PDF correctly
Key: PDFBOX-3445
URL: https://issues.apache.org/jira/browse/PDFBOX-3445
Project: PDFBox
Issue Type: Bug
Components: FontBox, Text extraction
Affects Versions: 2.0.2
Reporter: gopalbhalala
Hi Team,
I have two PDFs in the Gujarati language that use different fonts: the first
PDF uses the Shruti font and the second uses the LMG-RUPE font. The Tika parser
reads the Shruti PDF correctly and gives the correct output, but the LMG-RUPE
PDF gives wrong output. The metadata is the same for both PDFs.
1) https://drive.google.com/open?id=0B4Sse_x7pvrqRnRETzNsUk1BY0k (Shruti font)
2) https://drive.google.com/open?id=0B4Sse_x7pvrqVC0zb2NqTzNvYVU (LMG-RUPE font)