[
https://issues.apache.org/jira/browse/PDFBOX-748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andreas Lehmkühler closed PDFBOX-748.
-------------------------------------
Resolution: Cannot Reproduce
Assignee: Andreas Lehmkühler
Can't reproduce any issue with the attached pdf. At least since PDFBox 1.3.1
everything works as expected.
> Unable to extract special characters from pdf
> ---------------------------------------------
>
> Key: PDFBOX-748
> URL: https://issues.apache.org/jira/browse/PDFBOX-748
> Project: PDFBox
> Issue Type: Bug
> Environment: Windows XP, .Net 2.0
> Reporter: Vipul Pujari
> Assignee: Andreas Lehmkühler
> Attachments: msnet-formatting-strings.pdf
>
>
> Using below code
> Dim ObjBytesRead As String
> Dim doc As Org.pdfbox.pdmodel.PDDocument =
> Org.pdfbox.pdmodel.PDDocument.load(FileName)
> Dim stripper As New Org.pdfbox.util.PDFTextStripper
> ObjBytesRead = stripper.getText(doc)
> I am unable to extract special characters("_") from pdf
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira