Dear everyone,

I am trying to integrate the Apache OODT Crawler with the Apache Tika.
According
to the Apache OODT Crawler Help (
https://cwiki.apache.org/confluence/display/OODT/OODT+Crawler+Help), I can
use the TikaCmdLineMetExtractor directly. However, when I run the command:

./crawler_launcher
--filemgrUrl http://localhost:9000
--operation --launchMetCrawler
--clientTransferer
org.apache.oodt.cas.filemgr.datatransfer.LocalDataTransferFactory
--productPath /usr/local/meerkat/data/staging/products/hdf5
--metExtractor
org.apache.oodt.cas.metadata.extractors.TikaCmdLineMetExtractor
--metExtractorConfig
/usr/local/meerkat/extractors/tikaextractor/tikaextractor.config

It raises an error:
ERROR: Validation Failures: - Value
'org.apache.oodt.cas.metadata.extractors.TikaCmdLineMetExtractor' for
option metExtractor is not a valid class

I am wondering is this a software version problem? Which version of the
Apache OODT Crawler and the Apache Tika should I use to run such a command?
Currently, I am using the Apache OODT Cas-Crawler-0.6 and the Apache
Tika-1.6.

Thank you very much for your time and help!

Best,
Angela Wang

Reply via email to