Hi K,

OK, so I did a bit of searching and located a number of files that are marked as legacy. You can check the search results here:

https://github.com/apache/oodt/search?utf8=%E2%9C%93&q=AutoDetectProductCrawler&type=Code

I would urge you to have a look at the AutoDetectProductCrawler Javadoc description in the master branch [0] as well, to see if you've got everything it requires. Finally, I came across some documentation on the wiki which may guide you in the right direction [1]. It may be outdated though, so please let us know if that is the case.

hth
[0] https://github.com/apache/oodt/blob/91d0bafe71124906bd94baad746189caf35fb39c/crawler/src/main/java/org/apache/oodt/cas/crawl/AutoDetectProductCrawler.java#L40-L64
[1] https://cwiki.apache.org/confluence/display/OODT/Mime+type+detection+with+the+AutoDetectProductCrawler

On Mon, Apr 4, 2016 at 10:54 PM, Konstantinos Mavrommatis <[email protected]> wrote:

> Hi,
> It seems to be happening for a number of the file types that I have in
> mimetypes.xml.
> A few things are puzzling to me: this file, which is a .gz file, is not
> processed by the regular Tika mimetypes, which contain the gzip files.
> A file that has no extension, which defaults to txt, is passed to the
> MetExtractor.pl and processed.
>
> Any ideas how I can find which preconditions fail? I tried to
> change the log level to DEBUG for all components but I did not get much
> more information. This must be something that changed in the OODT releases
> after 0.6, but I could not find anything relevant in the release notes.
> I also noticed in the documentation of the AutoDetectProductCrawler that
> it uses the file met-extr-preconditions.xml, which I could not find anywhere
> in the deployed OODT or the src directories. Could that be a reason for the
> problem I observe?
>
> Thanks
> K
>
> -----Original Message-----
> From: Lewis John Mcgibbney [mailto:[email protected]]
> Sent: Monday, April 04, 2016 3:24 PM
> To: [email protected]
> Subject: Re: Transition from OODT 0.6 to 0.12 cannot find extractor
> specifications
>
> Hi Konstantinos,
> It appears to be happening with a tar.gz file as well, right?
>
> WARNING: No extractor specs specified for
> /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fastq/cas-crawler-04-02-16.log.gz
>
> I wonder if it is the file names... However, I would be extremely surprised,
> as I've seen some much more verbose file naming.
> Lewis
>
> On Saturday, April 2, 2016, Konstantinos Mavrommatis <
> [email protected]> wrote:
>
> > Hi,
> > I am trying to replicate a fully functional service that I set up a
> > long time ago using OODT 0.6, but I am having the following problem
> > that does not allow me to ingest files. When I try to ingest files
> > with the extension fastq.gz I get the line:
> >
> > WARNING: No extractor specs specified for
> > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fastq/E837642_R1.fastq.gz
> > Apr 02, 2016 10:12:14 PM org.apache.oodt.cas.crawl.ProductCrawler handleFile
> >
> > And of course the file is not ingested. This process works without
> > problem with OODT 0.6 on a different server.
> >
> > The crawler command I am running is:
> >
> > ./crawler_launcher \
> >   --operation \
> >   --launchAutoCrawler \
> >   --productPath $FILEPATH \
> >   --filemgrUrl $OODT_FILEMGR_URL \
> >   --clientTransferer org.apache.oodt.cas.filemgr.datatransfer.InPlaceDataTransferFactory \
> >   --mimeExtractorRepo ../policy/mime-extractor-map.xml \
> >   --noRecur \
> >   --crawlForDirs 2>&1
> >
> > I have set up OODT 0.12 on a server which runs FM listening on port 9000.
> > From a client machine I have verified that I can use FM to ingest products.
> > I am now trying to use the crawler to crawl and ingest all files in a
> > directory. Since I have non-standard MIME types in these directories I
> > have done the following:
> >
> > 1. Added my own mime types in policy/mimetypes.xml, e.g.
> >
> > <mime-type type="text/fastq">
> >   <glob pattern="*.fastq"/>
> >   <glob pattern="*.fastq.gz"/>
> >   <glob pattern="*.fastq.bz"/>
> >   <glob pattern="*.fastq.bz2"/>
> >   <glob pattern="*.fastq.bzip"/>
> >   <glob pattern="*.fq"/>
> >   <glob pattern="*.fq.gz"/>
> >   <glob pattern="*.fq.bz"/>
> >   <glob pattern="*.fq.bz2"/>
> >   <glob pattern="*.fq.bzip"/>
> > </mime-type>
> >
> > 2. Created the file policy/mime-extractor-map.xml
> >
> > <mime type="text/fastq">
> >   <extractor class="org.apache.oodt.cas.metadata.extractors.ExternMetExtractor">
> >     <config file="/apache-oodt/crawler/bin/fastq.config"/>
> >     <preCondComparators>
> >       <preCondComparator id="CheckThatDataFileSizeIsGreaterThanZero"/>
> >     </preCondComparators>
> >   </extractor>
> > </mime>
> >
> > 3. Created the file fastq.config
> >
> > <?xml version="1.0" encoding="UTF-8"?>
> > <cas:externextractor xmlns:cas="http://oodt.jpl.nasa.gov/1.0/cas">
> >   <exec workingDir="">
> >     <extractorBinPath>/apache-oodt/crawler/bin/MetExtractorNGS.pl</extractorBinPath>
> >     <args>
> >       <arg isDataFile="true"></arg>
> >       <arg>fastq</arg>
> >     </args>
> >   </exec>
> > </cas:externextractor>
> >
> > MetExtractorNGS.pl is a small Perl script that opens the file to
> > be ingested, gets some information and stores it in the .met file that
> > corresponds to the file to be ingested. I have manually verified that it
> > works as expected, producing the correct met file.
> >
> > What am I missing here? Any ideas, comments or suggestions will be greatly
> > appreciated.
> > Thanks in advance for any help > > Kostas > > > > > > > > PS1 The full output from running the crawler command follows: > > > > > > Setting property 'StdProductCrawler.filemgrUrl' > > Setting property 'MetExtractorProductCrawler.filemgrUrl' > > Setting property 'AutoDetectProductCrawler.filemgrUrl' > > Setting property 'StdProductCrawler.clientTransferer' > > Setting property 'MetExtractorProductCrawler.clientTransferer' > > Setting property 'AutoDetectProductCrawler.clientTransferer' > > Setting property 'StdProductCrawler.noRecur' > > Setting property 'MetExtractorProductCrawler.noRecur' > > Setting property 'AutoDetectProductCrawler.noRecur' > > Setting property 'AutoDetectProductCrawler.mimeExtractorRepo' > > Setting property 'StdProductCrawler.productPath' > > Setting property 'MetExtractorProductCrawler.productPath' > > Setting property 'AutoDetectProductCrawler.productPath' > > Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'AutoDetectProductCrawler.noRecur' set to value [true] > > Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'StdProductCrawler.productPath' set to value > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq] > > Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'MetExtractorProductCrawler.noRecur' set to value > > [true] Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'AutoDetectProductCrawler.mimeExtractorRepo' set to > > value [../policy/mime-extractor-map.xml] > > Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'MetExtractorProductCrawler.clientTransferer' set to > > value > > 
[org.apache.oodt.cas.filemgr.datatransfer.InPlaceDataTransferFactory] > > Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'AutoDetectProductCrawler.filemgrUrl' set to value [ > > https://urldefense.proofpoint.com/v2/url?u=http-3A__192.168.8.44-3A900 > > 0&d=CwIBaQ&c=CZZujK3G2KuXGEKnzq-Hrg&r=wndYZ4MLMT9l3Zb2WZv2hq2O6yvZ1Cs- > > T2gHY95y7ZA&m=Qaz0eKz7FHe35NMF43A17ey59ANhAqJD5ZfwZQC0VRo&s=OvpwZVR1Xq > > gKclL83VXAWh__c7nz87xK_nS-O7hIXqc&e= ] Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'AutoDetectProductCrawler.clientTransferer' set to > > value > > [org.apache.oodt.cas.filemgr.datatransfer.InPlaceDataTransferFactory] > > Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'StdProductCrawler.noRecur' set to value [true] Apr 02, > > 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'StdProductCrawler.filemgrUrl' set to value [ > > https://urldefense.proofpoint.com/v2/url?u=http-3A__192.168.8.44-3A900 > > 0&d=CwIBaQ&c=CZZujK3G2KuXGEKnzq-Hrg&r=wndYZ4MLMT9l3Zb2WZv2hq2O6yvZ1Cs- > > T2gHY95y7ZA&m=Qaz0eKz7FHe35NMF43A17ey59ANhAqJD5ZfwZQC0VRo&s=OvpwZVR1Xq > > gKclL83VXAWh__c7nz87xK_nS-O7hIXqc&e= ] Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'AutoDetectProductCrawler.productPath' set to value > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq] > > Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'MetExtractorProductCrawler.filemgrUrl' set to value [ > > https://urldefense.proofpoint.com/v2/url?u=http-3A__192.168.8.44-3A900 > > 
0&d=CwIBaQ&c=CZZujK3G2KuXGEKnzq-Hrg&r=wndYZ4MLMT9l3Zb2WZv2hq2O6yvZ1Cs- > > T2gHY95y7ZA&m=Qaz0eKz7FHe35NMF43A17ey59ANhAqJD5ZfwZQC0VRo&s=OvpwZVR1Xq > > gKclL83VXAWh__c7nz87xK_nS-O7hIXqc&e= ] Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'StdProductCrawler.clientTransferer' set to value > > [org.apache.oodt.cas.filemgr.datatransfer.InPlaceDataTransferFactory] > > Apr 02, 2016 10:12:13 PM > > org.springframework.beans.factory.config.PropertyOverrideConfigurer > > processKey > > FINE: Property 'MetExtractorProductCrawler.productPath' set to value > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq] Apr 02, 2016 10:12:13 PM org.apache.oodt.cas.crawl.ProductCrawler > > crawl > > INFO: Crawling > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q Apr 02, 2016 10:12:13 PM org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > INFO: Handling file > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/E837642_R1.fastq.gz > > Apr 02, 2016 10:12:14 PM > > org.apache.oodt.cas.crawl.AutoDetectProductCrawler passesPreconditions > > WARNING: No extractor specs specified for > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/E837642_R1.fastq.gz Apr 02, 2016 10:12:14 PM > > org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > WARNING: Failed to pass preconditions for ingest of product: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq/E837642_R1.fastq.gz] Apr 02, 2016 10:12:14 PM > > org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > INFO: Handling file > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/E837642_R1.fastq.gz.met > > Apr 02, 2016 10:12:14 PM > > org.apache.oodt.cas.metadata.preconditions.PreCondEvalUtils eval > > INFO: Passed precondition comparator id > > 
CheckThatDataFileSizeIsGreaterThanZero > > Apr 02, 2016 10:12:14 PM > > org.apache.oodt.cas.metadata.extractors.ExternMetExtractor > > extrMetadata > > INFO: Generating met file for product file: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq/E837642_R1.fastq.gz.met] > > Apr 02, 2016 10:12:14 PM > > org.apache.oodt.cas.metadata.extractors.ExternMetExtractor > > extrMetadata > > INFO: Executing command line: > > [/celgene/software/apache-oodt/crawler/bin/MetExtractorNGS.pl > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/E837642_R1.fastq.gz.met > > text ] with workingDir: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq] > > to extract metadata > > OUTPUT: [WARN : MetExtractorNGS - 2016/04/02 22:12:15] - Input file > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/E837642_R1.fastq.gz.met will be ignored. .met files are not > > processed ! > > Apr 02, 2016 10:12:15 PM org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > SEVERE: Failed to get metadata for product : Met extractor failed to > > create metadata file > > org.apache.oodt.cas.metadata.exceptions.MetExtractionException: Met > > extractor failed to create metadata file > > at > > > org.apache.oodt.cas.metadata.extractors.ExternMetExtractor.extrMetadata(ExternMetExtractor.java:120) > > at > > > org.apache.oodt.cas.metadata.AbstractMetExtractor.extractMetadata(AbstractMetExtractor.java:74) > > at > > > org.apache.oodt.cas.crawl.AutoDetectProductCrawler.getMetadataForProduct(AutoDetectProductCrawler.java:84) > > at > > > org.apache.oodt.cas.crawl.ProductCrawler.handleFile(ProductCrawler.java:136) > > at > > org.apache.oodt.cas.crawl.ProductCrawler.crawl(ProductCrawler.java:104) > > at > > org.apache.oodt.cas.crawl.ProductCrawler.crawl(ProductCrawler.java:74) > > at > > > org.apache.oodt.cas.crawl.cli.action.CrawlerLauncherCliAction.execute(CrawlerLauncherCliAction.java:58) > > 
at > > org.apache.oodt.cas.cli.CmdLineUtility.execute(CmdLineUtility.java:331) > > at > > org.apache.oodt.cas.cli.CmdLineUtility.run(CmdLineUtility.java:188) > > at > > org.apache.oodt.cas.crawl.CrawlerLauncher.main(CrawlerLauncher.java:36 > > ) > > > > Apr 02, 2016 10:12:15 PM org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > INFO: Handling file > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/E837642_R2.fastq.gz > > Apr 02, 2016 10:12:15 PM > > org.apache.oodt.cas.crawl.AutoDetectProductCrawler passesPreconditions > > WARNING: No extractor specs specified for > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/E837642_R2.fastq.gz Apr 02, 2016 10:12:15 PM > > org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > WARNING: Failed to pass preconditions for ingest of product: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq/E837642_R2.fastq.gz] Apr 02, 2016 10:12:15 PM > > org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > INFO: Handling file > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/E837642_R2.fastq.gz.met > > Apr 02, 2016 10:12:15 PM > > org.apache.oodt.cas.metadata.preconditions.PreCondEvalUtils eval > > INFO: Passed precondition comparator id > > CheckThatDataFileSizeIsGreaterThanZero > > Apr 02, 2016 10:12:16 PM > > org.apache.oodt.cas.metadata.extractors.ExternMetExtractor > > extrMetadata > > INFO: Generating met file for product file: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq/E837642_R2.fastq.gz.met] > > Apr 02, 2016 10:12:16 PM > > org.apache.oodt.cas.metadata.extractors.ExternMetExtractor > > extrMetadata > > INFO: Executing command line: > > [/celgene/software/apache-oodt/crawler/bin/MetExtractorNGS.pl > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/E837642_R2.fastq.gz.met > > text ] with workingDir: > > 
[/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq] > > to extract metadata > > OUTPUT: [WARN : MetExtractorNGS - 2016/04/02 22:12:16] - Input file > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/E837642_R2.fastq.gz.met will be ignored. .met files are not > > processed ! > > Apr 02, 2016 10:12:17 PM org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > SEVERE: Failed to get metadata for product : Met extractor failed to > > create metadata file > > org.apache.oodt.cas.metadata.exceptions.MetExtractionException: Met > > extractor failed to create metadata file > > at > > > org.apache.oodt.cas.metadata.extractors.ExternMetExtractor.extrMetadata(ExternMetExtractor.java:120) > > at > > > org.apache.oodt.cas.metadata.AbstractMetExtractor.extractMetadata(AbstractMetExtractor.java:74) > > at > > > org.apache.oodt.cas.crawl.AutoDetectProductCrawler.getMetadataForProduct(AutoDetectProductCrawler.java:84) > > at > > > org.apache.oodt.cas.crawl.ProductCrawler.handleFile(ProductCrawler.java:136) > > at > > org.apache.oodt.cas.crawl.ProductCrawler.crawl(ProductCrawler.java:104) > > at > > org.apache.oodt.cas.crawl.ProductCrawler.crawl(ProductCrawler.java:74) > > at > > > org.apache.oodt.cas.crawl.cli.action.CrawlerLauncherCliAction.execute(CrawlerLauncherCliAction.java:58) > > at > > org.apache.oodt.cas.cli.CmdLineUtility.execute(CmdLineUtility.java:331) > > at > > org.apache.oodt.cas.cli.CmdLineUtility.run(CmdLineUtility.java:188) > > at > > org.apache.oodt.cas.crawl.CrawlerLauncher.main(CrawlerLauncher.java:36 > > ) > > > > Apr 02, 2016 10:12:17 PM org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > INFO: Handling file > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/cas-crawler-04-02-16.log.gz > > Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.crawl.AutoDetectProductCrawler passesPreconditions > > WARNING: No extractor specs specified for > > 
/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/cas-crawler-04-02-16.log.gz Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > WARNING: Failed to pass preconditions for ingest of product: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq/cas-crawler-04-02-16.log.gz] Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > INFO: Handling file > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/cas-crawler-04-02-16.tar.gz > > Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.crawl.AutoDetectProductCrawler passesPreconditions > > WARNING: No extractor specs specified for > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/cas-crawler-04-02-16.tar.gz Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > WARNING: Failed to pass preconditions for ingest of product: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq/cas-crawler-04-02-16.tar.gz] Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > INFO: Handling file > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/cas-crawler-mnt-celgene.rnd.combio.mmgp.external-TestSeqData-RNA-Seq > > -RawData-fastq-04-02-16.tar.gz > > Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.crawl.AutoDetectProductCrawler passesPreconditions > > WARNING: No extractor specs specified for > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/cas-crawler-mnt-celgene.rnd.combio.mmgp.external-TestSeqData-RNA-Seq > > -RawData-fastq-04-02-16.tar.gz Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > WARNING: Failed to pass preconditions for ingest of product: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > 
tq/cas-crawler-mnt-celgene.rnd.combio.mmgp.external-TestSeqData-RNA-Se > > q-RawData-fastq-04-02-16.tar.gz] Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > INFO: Handling file > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/test > > Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.metadata.preconditions.PreCondEvalUtils eval > > INFO: Passed precondition comparator id > > CheckThatDataFileSizeIsGreaterThanZero > > Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.metadata.extractors.ExternMetExtractor > > extrMetadata > > INFO: Generating met file for product file: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq/test] > > Apr 02, 2016 10:12:17 PM > > org.apache.oodt.cas.metadata.extractors.ExternMetExtractor > > extrMetadata > > INFO: Executing command line: > > [/celgene/software/apache-oodt/crawler/bin/MetExtractorNGS.pl > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/test > > text ] with workingDir: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq] > > to extract metadata > > OUTPUT: [DEBUG : MetExtractorNGS - 2016/04/02 22:12:18] - Accessing > > NGS server at > > https://urldefense.proofpoint.com/v2/url?u=http-3A__192.168.8.44-3A808 > > 2_RPC2&d=CwIBaQ&c=CZZujK3G2KuXGEKnzq-Hrg&r=wndYZ4MLMT9l3Zb2WZv2hq2O6yv > > Z1Cs-T2gHY95y7ZA&m=Qaz0eKz7FHe35NMF43A17ey59ANhAqJD5ZfwZQC0VRo&s=tSci2 > > Q1bJj0cQnBHjjOwtZjjx9uNMoN5Bi-ABG0Q7Y4&e= > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - addMetadata: > > metadata for file_host are not in array format.Converting.. > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - addMetadata: > > adding key/value [file_host]/[ip-192-168-8-66] > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - addMetadata: > > metadata for ProductType are not in array format.Converting.. 
> > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - addMetadata: > > adding key/value [ProductType]/[GenericFile] > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - addMetadata: > > metadata for ingest_user are not in array format.Converting.. > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - addMetadata: > > adding key/value [ingest_user]/[kmavrommatis] > > OUTPUT: [DEBUG : MetExtractorNGS - 2016/04/02 22:12:18] - The file > > path is ARRAY(0x22d3f48). It will be added under the FilePath metadata > > field > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - addMetadata: > > metadata for FilePath are not in array format.Converting.. > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - addMetadata: > > adding key/value > > [FilePath]/[/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/ > > RawData/fastq/test] > > OUTPUT: [DEBUG : MetExtractorNGS - 2016/04/02 22:12:18] - This file is > > of type text > > OUTPUT: [DEBUG : MetExtractorNGS - 2016/04/02 22:12:18] - Storing > > metadata in file > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/test.met > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - Changing > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/test > > to > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - > > /mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fast > > q/test > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - Changing > > kmavrommatis to > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - kmavrommatis > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - Changing > > GenericFile to > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - GenericFile > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - Changing > > ip-192-168-8-66 to > > OUTPUT: [DEBUG : metadataPrepare - 2016/04/02 22:12:18] - > > ip-192-168-8-66 > > OUTPUT: [DEBUG : 
MetExtractorNGS - 2016/04/02 22:12:19] - Process > > finished SUCCESSFULLY Apr 02, 2016 10:12:19 PM > > org.apache.oodt.cas.metadata.extractors.ExternMetExtractor > > extrMetadata > > INFO: Met extraction successful for product file: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq/test] Apr 02, 2016 10:12:19 PM > > org.apache.oodt.cas.crawl.ProductCrawler ingest > > INFO: ProductCrawler: Ready to ingest product: > > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fastq/test]: > > ProductType: [GenericFile] > > Apr 02, 2016 10:12:19 PM > > org.apache.oodt.cas.filemgr.ingest.StdIngester > > setFileManager > > INFO: StdIngester: connected to file manager: > > [https://urldefense.proofpoint.com/v2/url?u=http-3A__192.168.8.44-3A90 > > 00&d=CwIBaQ&c=CZZujK3G2KuXGEKnzq-Hrg&r=wndYZ4MLMT9l3Zb2WZv2hq2O6yvZ1Cs > > -T2gHY95y7ZA&m=Qaz0eKz7FHe35NMF43A17ey59ANhAqJD5ZfwZQC0VRo&s=OvpwZVR1X > > qgKclL83VXAWh__c7nz87xK_nS-O7hIXqc&e= ] Apr 02, 2016 10:12:19 PM > > org.apache.oodt.cas.filemgr.datatransfer.InPlaceDataTransferer > > setFileManagerUrl > > INFO: In Place Data Transfer to: > > [https://urldefense.proofpoint.com/v2/url?u=http-3A__192.168.8.44-3A90 > > 00&d=CwIBaQ&c=CZZujK3G2KuXGEKnzq-Hrg&r=wndYZ4MLMT9l3Zb2WZv2hq2O6yvZ1Cs > > -T2gHY95y7ZA&m=Qaz0eKz7FHe35NMF43A17ey59ANhAqJD5ZfwZQC0VRo&s=OvpwZVR1X > > qgKclL83VXAWh__c7nz87xK_nS-O7hIXqc&e= ] enabled Apr 02, 2016 10:12:19 > > PM org.apache.oodt.cas.filemgr.ingest.StdIngester > > ingest > > INFO: StdIngester: ingesting product: ProductName: [test]: ProductType: > > [GenericFile]: FileLocation: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq/] > > Apr 02, 2016 10:12:19 PM > > org.apache.oodt.cas.filemgr.system.XmlRpcFileManagerClient > > ingestProduct > > FINEST: File Manager Client: clientTransfer enabled: transfering > > product [test] Apr 02, 2016 10:12:19 PM > > org.apache.oodt.cas.filemgr.versioning.VersioningUtils > > 
createBasicDataStoreRefsFlat > > FINE: VersioningUtils: Generated data store ref: > > file:/opt/oodt/data/archive/test/test from origRef: > > file:/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData > > /fastq/test Apr 02, 2016 10:12:19 PM > > org.apache.oodt.cas.crawl.ProductCrawler ingest > > INFO: Successfully ingested product: > > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fastq/test]: > > product id: 4c8de2da-265a-48c4-8380-3f1103dfecfc > > Apr 02, 2016 10:12:19 PM org.apache.oodt.cas.crawl.ProductCrawler > > handleFile > > INFO: Successful ingest of product: > > [/mnt/celgene.rnd.combio.mmgp.external/TestSeqData/RNA-Seq/RawData/fas > > tq/test] > > > > > > ********************************************************* > > THIS ELECTRONIC MAIL MESSAGE AND ANY ATTACHMENT IS CONFIDENTIAL AND > > MAY CONTAIN LEGALLY PRIVILEGED INFORMATION INTENDED ONLY FOR THE USE > > OF THE INDIVIDUAL OR INDIVIDUALS NAMED ABOVE. > > If the reader is not the intended recipient, or the employee or agent > > responsible to deliver it to the intended recipient, you are hereby > > notified that any dissemination, distribution or copying of this > > communication is strictly prohibited. If you have received this > > communication in error, please reply to the sender to notify us of the > > error and delete the original message. Thank You. > > > > > -- > *Lewis* > -- *Lewis*
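
One way to narrow this down is to check which glob patterns in mimetypes.xml actually match the file names from the crawler log. The sketch below is a standalone Python diagnostic, not part of OODT or Tika, and it does not reproduce how Tika decides between competing matches; it only lists every glob that matches a given name. The function name matching_types and the sample XML are made up for illustration. The text/fastq globs are taken from the message above, while the application/gzip entry is an assumed stand-in for a stock gzip definition.

```python
import fnmatch
import xml.etree.ElementTree as ET

# Sample in the Tika mimetypes.xml layout. The text/fastq globs come from the
# thread; the application/gzip entry is an assumed stand-in for the stock
# gzip definition and may differ from what your Tika version ships.
SAMPLE_MIMETYPES = """<mime-info>
  <mime-type type="text/fastq">
    <glob pattern="*.fastq"/>
    <glob pattern="*.fastq.gz"/>
    <glob pattern="*.fq"/>
    <glob pattern="*.fq.gz"/>
  </mime-type>
  <mime-type type="application/gzip">
    <glob pattern="*.gz"/>
  </mime-type>
</mime-info>"""


def matching_types(filename, mimetypes_xml=SAMPLE_MIMETYPES):
    """Return (mime type, glob) pairs whose glob matches the file name."""
    root = ET.fromstring(mimetypes_xml)
    hits = []
    for mime in root.iter("mime-type"):
        for glob in mime.iter("glob"):
            pattern = glob.get("pattern", "")
            # Case-sensitive match of the whole file name against the glob.
            if fnmatch.fnmatchcase(filename, pattern):
                hits.append((mime.get("type"), pattern))
    return hits


if __name__ == "__main__":
    # File names taken from the crawler log in this thread.
    for name in ("E837642_R1.fastq.gz", "E837642_R1.fastq.gz.met", "test"):
        print(name, "->", matching_types(name))
```

To run it against a real policy file, pass the file contents as the second argument, for example matching_types(name, open("policy/mimetypes.xml").read()). If a name matches globs from more than one type, it is worth confirming which type the crawler resolves it to, since the "No extractor specs specified" warning would be consistent with the detected type having no entry in mime-extractor-map.xml.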
