This is an automated email from the ASF dual-hosted git repository.
tallison pushed a change to branch branch_1x
in repository https://gitbox.apache.org/repos/asf/tika.git.
from b13e418 TIKA-3210 -- info -> debug
new 69fd2c6 TIKA-3215 -- add a wrapper for the commandline linux file
command as a detector
new 65c3183 TIKA-3216 -- Add FileProfiler
new 380e9f7 TIKA-3217 -- add XMP pdf schema to metadata extraction for
PDFs.
The 3 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
.../apache/tika/detect/FileCommandDetector.java | 208 +++++++++++++++++++++
.../main/java/org/apache/tika/metadata/PDF.java | 2 +
.../tika/detect/FileCommandDetectorTest.java | 44 +++++
...-xmlreaderutils.xml => FileCommandDetector.xml} | 6 +-
.../java/org/apache/tika/eval/ExtractComparer.java | 2 +-
.../java/org/apache/tika/eval/ExtractProfiler.java | 2 +-
.../java/org/apache/tika/eval/FileProfiler.java | 158 ++++++++++++++++
.../java/org/apache/tika/eval/TikaEvalCLI.java | 62 +++++-
.../org/apache/tika/eval/XMLErrorLogUpdater.java | 1 -
.../tika/eval/batch/EvalConsumerBuilder.java | 3 +-
...ofilerBuilder.java => FileProfilerBuilder.java} | 48 ++---
.../main/java/org/apache/tika/eval/db/Cols.java | 3 +
.../java/org/apache/tika/eval/db/JDBCUtil.java | 49 ++---
.../java/org/apache/tika/eval/db/MimeBuffer.java | 1 -
.../java/org/apache/tika/eval/io/DBWriter.java | 11 +-
.../resources/tika-eval-file-profiler-config.xml} | 28 ++-
.../java/org/apache/tika/parser/pdf/PDFParser.java | 10 +-
.../tika/parser/pdf/PDMetadataExtractor.java | 78 +++++---
.../tika/detect/TestFileCommandDetector.java | 59 ++++++
.../org/apache/tika/parser/pdf/PDFParserTest.java | 10 +-
20 files changed, 660 insertions(+), 125 deletions(-)
create mode 100644
tika-core/src/main/java/org/apache/tika/detect/FileCommandDetector.java
create mode 100644
tika-core/src/test/java/org/apache/tika/detect/FileCommandDetectorTest.java
copy
tika-core/src/test/resources/org/apache/tika/config/{TIKA-2732-xmlreaderutils.xml
=> FileCommandDetector.xml} (83%)
create mode 100644
tika-eval/src/main/java/org/apache/tika/eval/FileProfiler.java
copy
tika-eval/src/main/java/org/apache/tika/eval/batch/{ExtractProfilerBuilder.java
=> FileProfilerBuilder.java} (65%)
copy
tika-eval/src/{test/resources/single-file-profiler-crawl-extract-config.xml =>
main/resources/tika-eval-file-profiler-config.xml} (73%)
create mode 100644
tika-parsers/src/test/java/org/apache/tika/detect/TestFileCommandDetector.java