Hi Angela,

The TikaExtractor is available here:

svn.apache.org/repos/asf/oodt/trunk/metadata/src/main/java/org/apache/oodt/
cas/metadata/extractors/

Looking at CHANGES.txt looks like this only showed up in 0.7, so you'll
need to upgrade to 0.7 (just released).

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: [email protected]
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++






-----Original Message-----
From: Mengying Wang <[email protected]>
Date: Friday, September 19, 2014 12:46 PM
To: <[email protected]>
Subject: Where is the TikaCmdLineMetExtractor?

>Dear everyone,
>I am trying to integrate the Apache OODT Crawler with the Apache Tika.
>According to the Apache OODT Crawler Help
>(https://cwiki.apache.org/confluence/display/OODT/OODT+Crawler+Help), I
>can use the TikaCmdLineMetExtractor directly. However, when I run the
>command: 
>
>./crawler_launcher
>--filemgrUrl http://localhost:9000
>--operation --launchMetCrawler
>--clientTransferer
>org.apache.oodt.cas.filemgr.datatransfer.LocalDataTransferFactory
>--productPath /usr/local/meerkat/data/staging/products/hdf5
>--metExtractor 
>org.apache.oodt.cas.metadata.extractors.TikaCmdLineMetExtractor
>--metExtractorConfig
>/usr/local/meerkat/extractors/tikaextractor/tikaextractor.config
>
>It raises an error:
>ERROR: Validation Failures: - Value
>'org.apache.oodt.cas.metadata.extractors.TikaCmdLineMetExtractor' for
>option metExtractor is not a valid class
>
>
>I am wondering is this a software version problem? Which version of the
>Apache OODT Crawler and the Apache Tika should I use to run such a
>command? Currently, I am using the Apache OODT Cas-Crawler-0.6 and the
>Apache Tika-1.6.
>
>Thank you very much for your time and help!
>
>Best,
>Angela Wang 
>

Reply via email to