[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tilman Hausherr updated TIKA-1442:
----------------------------------
Attachment: PDFBox_1_8_8-CLASSICVPDFBox_1_8_8-NONSEQ-b162.xlsx
Thanks... one problem in both Excel files: the copying of my remarks doesn't
work. In the Excel file, I saw this formula:
{code}
=_xlfn.IFNA(SVERWEIS($A2;Sheet2!$A$2:$C$108;2;FALSCH);"")
{code}
IFNA was introduced in Excel 2013 and is not available in earlier versions.
Next time, please use IFERROR. (I made that change and now it works.) What I
didn't do is replace the formulas with their results. (But what is "_xlfn."?)
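For illustration, the substitution I made by hand could be sketched roughly as follows. This is only a sketch: the helper name ifna_to_iferror is hypothetical and not part of Tika or of the comparison tooling, and it assumes the formula is available as a plain string. As far as I can tell, "_xlfn." is a prefix Excel stores in the file for functions newer than the original file format, so the sketch drops it together with IFNA.

```python
import re

# Hypothetical helper, not part of Tika or the comparison tooling.
# Matches IFNA( with or without the "_xlfn." prefix, but not when it is
# the tail of a longer identifier.
_IFNA_CALL = re.compile(r"(?<![\w.])(?:_xlfn\.)?IFNA\(", re.IGNORECASE)


def ifna_to_iferror(formula: str) -> str:
    """Rewrite IFNA(...) calls as IFERROR(...) so older Excel versions can evaluate them."""
    return _IFNA_CALL.sub("IFERROR(", formula)


original = '=_xlfn.IFNA(SVERWEIS($A2;Sheet2!$A$2:$C$108;2;FALSCH);"")'
print(ifna_to_iferror(original))
```

Note that IFERROR catches every error type, not only #N/A, so the two functions are not strictly equivalent; for a lookup that should simply return an empty string on failure, that difference should not matter much.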
About seq vs. nonseq: I think the nonseq parser is now slightly better; see my
comments.
I'll look at the other file tomorrow :-)
> Upgrade to PDFBox 1.8.8
> -----------------------
>
> Key: TIKA-1442
> URL: https://issues.apache.org/jira/browse/TIKA-1442
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
> Assignee: Tim Allison
> Fix For: 1.8
>
> Attachments: PDFBox_1_8_6DVPDFBox_1_8_8-TRAD-b156.xlsx,
> PDFBox_1_8_6VPDFBox_1_8_8-CLASSIC-b162.xlsx,
> PDFBox_1_8_6VPDFBox_1_8_8-b145.xlsx, PDFBox_1_8_6VPDFBox_1_8_8-b145.zip,
> PDFBox_1_8_8-CLASSICVPDFBox_1_8_8-NONSEQ-b162.xlsx,
> PDFBox_1_8_8-CLASSICVPDFBox_1_8_8-NONSEQ-b162.xlsx,
> PDFBox_1_8_8-ClassicVPDFBox_1_8_8-NonSeq.xlsx,
> PDFBox_1_8_8-ClassicVPDFBox_1_8_8-NonSeq.xlsx,
> PDFBox_1_8_8-TRADVPDFBox_1_8_8-NONSEQ-b156.xlsx,
> pdfbox_1_8_6V1_8_8-SNAPSHOT.xlsx, pdfbox_1_8_6V1_8_8-SNAPSHOTb.xlsx,
> pdfbox_1_8_6V1_8_8-SNAPSHOTc.xlsx, pdfbox_1_8_6V1_8_8-SNAPSHOTc.zip
>
>
> Given the regressions we identified in PDFBox 1.8.7, we should upgrade to
> 1.8.8 as soon as it is ready. I'm tempted to call this a blocker on Tika
> 1.7. Let's use this issue to carry on the discussion of regression testing
> (if any further discussion is necessary) or any other prep that needs to
> happen before 1.8.8's release.