[ 
https://issues.apache.org/jira/browse/PDFBOX-3055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14973227#comment-14973227
 ] 

Tilman Hausherr commented on PDFBOX-3055:
-----------------------------------------

Why don't you take the source code and change it what you really need, i.e. 
call the stripper methods directly? There's lots of stuff in ExtractText that 
you probably don't need.

> Change visibility of ExtractText constructor to public
> ------------------------------------------------------
>
>                 Key: PDFBOX-3055
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3055
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Text extraction
>    Affects Versions: 1.8.10
>         Environment: (any)
>            Reporter: Michael Goddard
>            Priority: Minor
>   Original Estimate: 10m
>  Remaining Estimate: 10m
>
> I was just hoping to do a simple extension, using this class but, due to the 
> private constructor, can't:
> {code}
> import org.apache.pdfbox.ExtractText;
> public class MultiPDFtoText
> {
>   public static void main (String[] args)
>   {
>     for (String pdf : args)
>     {
>       String txt = pdf.replaceFirst("\\.pdf$", ".txt");
>       System.out.println("PDF: " + pdf + ", TXT: " + txt);
>       ExtractText extractor = new ExtractText();
>       extractor.startExtraction(new String[] { pdf, txt });
>     }
>   }
> }
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to