[ 
https://issues.apache.org/jira/browse/PDFBOX-957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12991754#comment-12991754
 ] 

Ashok Chigullapally commented on PDFBOX-957:
--------------------------------------------

Two PDF resumes are included in the attachments.

> Text extraction using ExtractText (pdf file is input file) generates some 
> weird characters
> -------------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-957
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-957
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.4.0
>         Environment: Windows 7
>            Reporter: Ashok Chigullapally
>            Priority: Critical
>              Labels: pdfbox, textExtraction
>         Attachments: Resume1.pdf, Resume2.pdf
>
>
> When I tried to extract text from a PDF document, it generated some 
> gibberish text. 
> ExtractText.exe "\Jobvite\Resumes\Resume-Boston.pdf Resume-Boston.txt
> I will provide the PDF documents on request; I could not find a way to 
> include attachments.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
