[ 
https://issues.apache.org/jira/browse/SOLR-2332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated SOLR-2332:
---------------------------

    Affects Version/s:     (was: 4.0)
        Fix Version/s: 3.2

I can't find any docs suggestion how exactly TikaEntityProcessor should be 
expected to deal with zip files, particularly what to expect if a zip files 
contains multiple documents.

FWIW: TikaEntityProcessor did not exist in Solr 1.4.1, so the behavior 
currently seen in the 3x branch (and the 3.1rc1 artifacts) is not a regression.

> TikaEntityProcessor retrieves only File Names from Zip extraction
> -----------------------------------------------------------------
>
>                 Key: SOLR-2332
>                 URL: https://issues.apache.org/jira/browse/SOLR-2332
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - DataImportHandler
>            Reporter: Jayendra Patil
>             Fix For: 3.2
>
>         Attachments: SOLR-2332.patch, solr-word.zip
>
>
> Extraction of Zip files using TikaEntityProcessor results in only names of 
> file.
> It does not extract the contents of the Files in the Zip

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to