[
https://issues.apache.org/jira/browse/TIKA-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13404833#comment-13404833
]
Michael McCandless commented on TIKA-758:
-----------------------------------------
Looks like the TODOs are all in PDF2XHTML.java, currently:
{noformat}
mike@vine:/l/tika.trunk$ grep -r TODO . | grep -i PDFBOX | grep .java:
./tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java: //
TODO: remove once PDFBOX-1130 is fixed:
./tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java:
// TODO: remove once PDFBOX-1143 is fixed:
./tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java:
// TODO: remove once PDFBOX-1130 is fixed
./tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java:
// TODO: remove once PDFBOX-1130 is fixed
{noformat}
> Address TODOs when we upgrade to next PDFBox release
> ----------------------------------------------------
>
> Key: TIKA-758
> URL: https://issues.apache.org/jira/browse/TIKA-758
> Project: Tika
> Issue Type: Improvement
> Reporter: Michael McCandless
>
> Like TIKA-757 for POI, I'm opening this blanket issue to address any TODOs in
> the code when we next upgrade PDFBox.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira