[ 
https://issues.apache.org/jira/browse/SOLR-6007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shinichiro Abe updated SOLR-6007:
---------------------------------

    Attachment: japanese-sjis.zip

The unit test passed using Tika 1.6-dev (trunk).

> Add param "archive.encoding" for ExtractingRequestHandler
> ---------------------------------------------------------
>
>                 Key: SOLR-6007
>                 URL: https://issues.apache.org/jira/browse/SOLR-6007
>             Project: Solr
>          Issue Type: New Feature
>          Components: contrib - Solr Cell (Tika extraction)
>            Reporter: Shinichiro Abe
>            Priority: Minor
>         Attachments: SOLR-6007.patch, japanese-sjis.zip
>
>
> When extracting zip files that were created on Windows (Japanese), the
> file names read from the archive are garbled (the names are written in a
> CJK language). TIKA-936 allows a custom encoding (e.g. SJIS) to be set, which
> yields file names that are not garbled. It would be nice if the archive
> encoding could be specified as a parameter in Solr Cell.
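
The garbling described above can be reproduced without Solr or Tika. The
following is a minimal sketch, not the patch itself: it assumes the entry name
was stored as Shift_JIS bytes and that the reader decodes it with a different
encoding (CP437 is used here purely as an example). The file name and the
helper decode_entry_name are hypothetical and chosen for illustration.

```python
def decode_entry_name(raw: bytes, encoding: str) -> str:
    """Decode the raw bytes of a zip entry name with the given encoding.

    Hypothetical helper for illustration; it is not part of Solr or Tika.
    """
    return raw.decode(encoding)


# Hypothetical file name, not taken from the attached japanese-sjis.zip.
original_name = "日本語.txt"

# Assumption: the archiver stored the entry name as Shift_JIS bytes.
raw_name = original_name.encode("shift_jis")

# Decoding with the wrong encoding (CP437 as an example) garbles the name.
garbled = decode_entry_name(raw_name, "cp437")

# Decoding with the encoding the archive was written in recovers the name.
recovered = decode_entry_name(raw_name, "shift_jis")

print(repr(garbled))
print(recovered)
```

An archive encoding parameter would play the role of the second argument here:
it tells the extractor which encoding to use for entry names instead of a
default that does not match the archive.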



