[ 
https://issues.apache.org/jira/browse/PDFBOX-3055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14973246#comment-14973246
 ] 

Andreas Lehmkühler commented on PDFBOX-3055:
--------------------------------------------

I agree with [~tilman]. ExtractText is nothing more than a parameter parser 
calling PDFTextStripper. Your code doesn't extend ExractText but omits most of 
the implemented functionality. Saying that, IMHO it would make more sense to 
simply write your own little command line tool.

> Change visibility of ExtractText constructor to public
> ------------------------------------------------------
>
>                 Key: PDFBOX-3055
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3055
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Text extraction
>    Affects Versions: 1.8.10
>         Environment: (any)
>            Reporter: Michael Goddard
>            Priority: Minor
>   Original Estimate: 10m
>  Remaining Estimate: 10m
>
> I was just hoping to do a simple extension, using this class but, due to the 
> private constructor, can't:
> {code}
> import org.apache.pdfbox.ExtractText;
> public class MultiPDFtoText
> {
>   public static void main (String[] args)
>   {
>     for (String pdf : args)
>     {
>       String txt = pdf.replaceFirst("\\.pdf$", ".txt");
>       System.out.println("PDF: " + pdf + ", TXT: " + txt);
>       ExtractText extractor = new ExtractText();
>       extractor.startExtraction(new String[] { pdf, txt });
>     }
>   }
> }
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to