[
https://issues.apache.org/jira/browse/TIKA-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15296375#comment-15296375
]
Tim Allison edited comment on TIKA-1984 at 5/23/16 1:50 PM:
------------------------------------------------------------
[~kkrugler] and [~chrismattmann], should we add configuration of language
detection to TikaConfig?
Unlike parsers, language detectors are stateful. It would be neat if we could
change that, or at least create language detector factories, so that when wrapped
code (e.g. tika-server's /meta) needs a new language detector, the detector
can be built via configuration.
As a separate but related item, I imagine there may be some language detectors
that could statically load their language models once and share those models
among all detector instances.
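A rough sketch of the two ideas above, only to make the shape concrete. Every
name here (SharedModels, SketchDetector, SketchDetectorFactory) is hypothetical
and is not Tika API, and the "models" are stand-in marker words rather than real
language models. The factory hands out a fresh stateful detector per call, while
the immutable models are loaded once and shared by all instances:

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical store of immutable models, loaded once and shared. */
final class SharedModels {
    /** Counts how often the (pretend) expensive load actually runs. */
    static final AtomicInteger LOADS = new AtomicInteger();

    // Initialization-on-demand holder: the JVM initializes Holder exactly
    // once, on first access, and does so in a thread-safe way.
    private static final class Holder {
        static final Map<String, String> MODELS = load();
    }

    private static Map<String, String> load() {
        LOADS.incrementAndGet();
        // Stand-in for real model files: one marker word per language.
        Map<String, String> m = new LinkedHashMap<>();
        m.put("en", "the");
        m.put("fr", "le");
        m.put("de", "der");
        return Collections.unmodifiableMap(m);
    }

    static Map<String, String> get() {
        return Holder.MODELS;
    }
}

/** Hypothetical stateful detector: it accumulates the text it has seen. */
final class SketchDetector {
    private final Map<String, String> models;
    private final StringBuilder seen = new StringBuilder();

    SketchDetector(Map<String, String> models) {
        this.models = models;
    }

    void addText(String text) {
        seen.append(' ').append(text.toLowerCase(Locale.ROOT));
    }

    /** Returns the first language whose marker word occurs as a whole word. */
    String detect() {
        String padded = seen + " ";
        for (Map.Entry<String, String> e : models.entrySet()) {
            if (padded.contains(" " + e.getValue() + " ")) {
                return e.getKey();
            }
        }
        return "unknown";
    }
}

/** Hypothetical factory that configuration could instantiate once. */
final class SketchDetectorFactory {
    /** New per-document state on every call; the models are shared. */
    SketchDetector newDetector() {
        return new SketchDetector(SharedModels.get());
    }
}
```

With something like this, code that needs one detector per embedded document
would only have to hold on to the factory, and the per-document state would
never leak between documents.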
was (Author: [email protected]):
[~kkrugler] and [~chrismattmann], should we add configuration of language
detection to TikaConfig?
> Add configurability for language detection to BasicContentHandlerFactory
> ------------------------------------------------------------------------
>
> Key: TIKA-1984
> URL: https://issues.apache.org/jira/browse/TIKA-1984
> Project: Tika
> Issue Type: New Feature
> Reporter: Tim Allison
> Priority: Trivial
>
> The RecursiveParserWrapper needs a new handler via a ContentHandlerFactory
> for each embedded document. There is currently no way to configure the
> BasicContentHandlerFactory to include language detection. Let's add that.
> This request is driven by [[email protected]]'s request on
> TIKA-1982.