Dear everyone,

I am trying to integrate the Apache OODT Crawler with Apache Tika. According to the Apache OODT Crawler Help (https://cwiki.apache.org/confluence/display/OODT/OODT+Crawler+Help), I can use the TikaCmdLineMetExtractor directly. However, when I run the following command:

    ./crawler_launcher --filemgrUrl http://localhost:9000 \
        --operation --launchMetCrawler \
        --clientTransferer org.apache.oodt.cas.filemgr.datatransfer.LocalDataTransferFactory \
        --productPath /usr/local/meerkat/data/staging/products/hdf5 \
        --metExtractor org.apache.oodt.cas.metadata.extractors.TikaCmdLineMetExtractor \
        --metExtractorConfig /usr/local/meerkat/extractors/tikaextractor/tikaextractor.config

it raises an error:

    ERROR: Validation Failures:
     - Value 'org.apache.oodt.cas.metadata.extractors.TikaCmdLineMetExtractor' for option metExtractor is not a valid class

Is this a software version problem? Which versions of the Apache OODT Crawler and Apache Tika should I use to run such a command? Currently, I am using Apache OODT Cas-Crawler-0.6 and Apache Tika-1.6.

Thank you very much for your time and help!

Best,
Angela Wang
