[ 
https://issues.apache.org/jira/browse/OODT-817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris A. Mattmann resolved OODT-817.
------------------------------------
    Resolution: Fixed

- I fixed this in r1660501. I also took the opportunity to use 
StringEscapeUtils to escape the values going into the extracted met 
(while using SolrCatalog, I noticed that the Tika met extracted via 
OCR/Tesseract had a content field with unescaped XML characters like '&').
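
To illustrate the escaping step, here is a minimal sketch that replaces the five XML special characters in a met value before it is written out. It is a dependency-free stand-in for what StringEscapeUtils from Apache Commons Lang provides, not the code committed in r1660501; the class name MetEscapeSketch and the method escapeXml are hypothetical names chosen for this example.

```java
// Hypothetical helper, not part of OODT: escapes XML special characters
// in a single met value so it can be embedded safely in an XML document.
class MetEscapeSketch {

    static String escapeXml(String value) {
        StringBuilder out = new StringBuilder(value.length());
        for (int i = 0; i < value.length(); i++) {
            char c = value.charAt(i);
            switch (c) {
                case '&':  out.append("&amp;");  break;
                case '<':  out.append("&lt;");   break;
                case '>':  out.append("&gt;");   break;
                case '"':  out.append("&quot;"); break;
                case '\'': out.append("&apos;"); break;
                default:   out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Example: an OCR'd content value containing a raw ampersand.
        System.out.println(escapeXml("Research & Development <draft>"));
    }
}
```

In a real extractor one would more likely call the Commons Lang utility directly rather than maintain a hand-written escaper.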

> TikaCmdLineExtractor needs to add Filename and FileLocation fields
> ------------------------------------------------------------------
>
>                 Key: OODT-817
>                 URL: https://issues.apache.org/jira/browse/OODT-817
>             Project: OODT
>          Issue Type: Bug
>          Components: metadata container
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 0.9
>
>
> The contract for client-side met extraction is that the extractors will 
> provide Filename, FileLocation and ProductType as core met fields. 
> TikaCmdLineMetExtractor allows met fields to be specified as Java 
> properties, which works fine for a static field like ProductType, but not 
> for dynamic met like Filename and FileLocation, whose values depend on the 
> file being extracted. 
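
The difference between static and dynamic met described above can be sketched as follows. This is only an illustration under stated assumptions: it uses a plain Map instead of OODT's Metadata class, it assumes FileLocation means the directory containing the product, and the class name CoreMetSketch and method coreMet are hypothetical.

```java
import java.io.File;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch, not the OODT implementation: derives the per-file
// core met fields from the file itself and takes ProductType as a static
// value (for example, one supplied via a Java property).
class CoreMetSketch {

    static Map<String, String> coreMet(File product, String productType) {
        File abs = product.getAbsoluteFile();
        File parent = abs.getParentFile();

        Map<String, String> met = new LinkedHashMap<>();
        // Dynamic fields: they change with every file that is extracted.
        met.put("Filename", abs.getName());
        // Assumption: FileLocation is the containing directory.
        met.put("FileLocation", parent != null ? parent.getPath() : "");
        // Static field: the same for every file handled by this extractor.
        met.put("ProductType", productType);
        return met;
    }

    public static void main(String[] args) {
        System.out.println(coreMet(new File("staging", "scan.png"), "GenericFile"));
    }
}
```

Because Filename and FileLocation are only known once the extractor is handed a concrete file, they cannot be fixed in a properties file the way ProductType can.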



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
