I'm almost done... then I got this when doing the last step:
[tilman@cloud-server-02 eval]$ java -jar tika-eval-1.17-SNAPSHOT.jar
Report -db pdfboxAvsB
0 [main] INFO org.apache.tika.eval.reports.Report - Writing report:
All Mimes In A to mimes/all_mimes_A.xlsx
Exception in thread "main" java.io.IOException: Permission denied
at java.io.UnixFileSystem.createFileExclusively(Native Method)
at java.io.File.createTempFile(File.java:2024)
at
org.apache.poi.util.DefaultTempFileCreationStrategy.createTempFile(DefaultTempFileCreationStrategy.java:110)
at org.apache.poi.util.TempFile.createTempFile(TempFile.java:66)
at
org.apache.poi.xssf.streaming.SXSSFWorkbook.write(SXSSFWorkbook.java:924)
at org.apache.tika.eval.reports.Report.dumpXLSX(Report.java:85)
at org.apache.tika.eval.reports.Report.writeReport(Report.java:64)
at
org.apache.tika.eval.reports.ResultsReporter.execute(ResultsReporter.java:305)
at
org.apache.tika.eval.reports.ResultsReporter.main(ResultsReporter.java:266)
at
org.apache.tika.eval.TikaEvalCLI.handleReport(TikaEvalCLI.java:264)
at org.apache.tika.eval.TikaEvalCLI.execute(TikaEvalCLI.java:52)
at org.apache.tika.eval.TikaEvalCLI.main(TikaEvalCLI.java:273)
I changed the source, and now I got the path, it is
/work/eval/reports/mimes/all_mimes_A.xlsx . The file exists and it is empty.
I tried with a 1.16 version and the same happened.
Then I thought, maybe the file with the permission problem isn't the
target at all; could this be some temp file / temp directory where I
don't have permission?
smaller improvements for the documentation:
- appBatchExecutor.sh should have 775 permission or the documentation
should have "nohup sh ./appBatchExecutor.sh &"
- "H" is missing, which is identical to "C"
- mention that "pdfboxAvsB" db files are to be removed before starting?
I had accidentally aborted a run and couldn't restart.
Tilman
memo for me:
java -jar tika-eval-1.17-SNAPSHOT.jar Compare -extractsA
/data4/batch_runs/pdfbox_2_0_4 -extractsB
/data4/batch_runs/pdfbox_2_0_9-SNAPSHOT1 -db pdfboxAvsB
java -jar tika-eval-1.17-SNAPSHOT.jar Report -db pdfboxAvsB