Ah yes, I see that there is an open issue for Android support 
https://issues.apache.org/jira/browse/PDFBOX-586

-- John

On 5 Feb 2014, at 23:02, Andreas Lehmkuehler <[email protected]> wrote:

> Hi,
> 
> Am 06.02.2014 00:39, schrieb John Hewson:
>> PDFBox depends on AWT for handling fonts and graphics. Unfortunately Android 
>> does not support AWT and there is currently no suitable replacement.
>> 
>> However, for text extraction it seems likely that you only need a subset of 
>> PDFBox which doesn't require AWT. You could try removing all code not used 
>> by PDFToText and see if you can build a minimal PDFBox which doesn't depend 
>> on AWT. If you succeed we can add an Android configuration to the Maven 
>> build.
> There are already some discussions/plans to modularize PDFBox so that it 
> would be possible to choose a reduced subset of PDFBox (with less/no AWT 
> dependencies) to be used on android to extract text and similar things.
> 
>> -- John
> 
> BR
> Andreas Lehmkühler
> 
>> 
>>> On 4 Feb 2014, at 15:29, Sharda <[email protected]> wrote:
>>> 
>>> Hello all,
>>> 
>>> I am trying to extract the text from a PDF for annotation purposes
>>> (highlight, strikeout and underline). Some posts in stackoverflow posted in
>>> 2012, says that android and pdfbox are not compatible with each other.
>>> Thus, I wanted to know if this is still an issue or its solved by now?
>>> 
>>> thanks and regards,
>>> 
>>> --
>>> Shobhit Sharda
>>> MSc in SSE
>>> RWTH Aachen University
>>> Aachen
>>> Germany
>>> blog: http://shobhitsharda.wordpress.com
> 

Reply via email to