Hi,

On Sun, May 31, 2015 at 12:30 AM, <[email protected]> wrote:

>
>
> Hi community,
> I'm using Nutch 1.9 and Solr 4.10.
> I use Nutch to parse zip documents, but the language field is empty in
> Solr for all of these documents, and this is a problem for me.
> The ParseZip plugin uses Tika to detect the MIME type and to extract the
> content of the files, but the language is missing.
> I was thinking that if the package has 3 documents, the language could
> be a multivalued field containing the languages of all the documents inside.
> What do you think about this?
>

Please open a Jira issue and, if possible, attach a patch for the
functionality. I think it would be a nice addition to the parse-zip plugin,
and it makes good sense to me.
Thanks
Lewis
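
For anyone picking this up, here is a minimal sketch of the multivalued
language idea from the quoted message: walk the entries of a zip archive,
detect a language for each one, and collect the distinct values. It is not
the parse-zip plugin's code. The class name ZipLanguages and the detector
argument are hypothetical stand-ins (a real patch would presumably plug in
whatever language detection Nutch or Tika provides), and the sketch assumes
every entry is plain UTF-8 text rather than a document that needs Tika to
extract its content.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashSet;
import java.util.Set;
import java.util.function.Function;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical helper, not part of Nutch or Tika.
public class ZipLanguages {

    // Returns the distinct languages of all file entries in the zip stream,
    // in the order they are first seen. `detector` is an assumed callback
    // that maps extracted text to a language code such as "en".
    public static Set<String> collect(InputStream zip,
                                      Function<String, String> detector)
            throws IOException {
        Set<String> languages = new LinkedHashSet<>();
        try (ZipInputStream in = new ZipInputStream(zip)) {
            ZipEntry entry;
            while ((entry = in.getNextEntry()) != null) {
                if (entry.isDirectory()) {
                    continue; // directories carry no text
                }
                // Read the current entry fully; read() returns -1 at entry end.
                ByteArrayOutputStream buf = new ByteArrayOutputStream();
                byte[] chunk = new byte[4096];
                int n;
                while ((n = in.read(chunk)) != -1) {
                    buf.write(chunk, 0, n);
                }
                // Assumption: entries are plain UTF-8 text.
                String text = new String(buf.toByteArray(), StandardCharsets.UTF_8);
                String lang = detector.apply(text);
                if (lang != null && !lang.isEmpty()) {
                    languages.add(lang);
                }
            }
        }
        return languages;
    }
}
```

The resulting set could then be written to a field declared as multivalued
in the Solr schema, one value per language found in the package.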
