[
https://issues.apache.org/jira/browse/OODT-817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann resolved OODT-817.
------------------------------------
Resolution: Fixed
- I fixed this in r1660501. I also took the opportunity to use
StringEscapeUtils to also escape the information going into the extracted met
(since I was noticing using SolrCatalog that the extracted Tika met using
OCR/Tesseract had a content field with bad XML chars like '&').
> TikaCmdLineExtractor needs to add Filename and FileLocation fields
> ------------------------------------------------------------------
>
> Key: OODT-817
> URL: https://issues.apache.org/jira/browse/OODT-817
> Project: OODT
> Issue Type: Bug
> Components: metadata container
> Reporter: Chris A. Mattmann
> Assignee: Chris A. Mattmann
> Fix For: 0.9
>
>
> The contract for client-side met extraction is that the extractors will
> provide Filename, FileLocation and ProductType as core met fields.
> TikaCmdLineMetExtractor allows specification of java property met fields,
> which works fine for ProductType, but doesn't for dynamic met like Filename
> and FileLocation.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)