[ 
https://issues.apache.org/jira/browse/TIKA-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15296375#comment-15296375
 ] 

Tim Allison edited comment on TIKA-1984 at 5/23/16 1:50 PM:
------------------------------------------------------------

[~kkrugler] and [~chrismattmann], should we add configuration of language 
detection to TikaConfig?

Unlike parsers, language detectors are stateful.  It would be neat if we could 
change that, or at least create language detector factories, so that when 
wrapped code (e.g. tika-server's /meta) needs a new language detector, that 
detector could be built via configuration.  

As a separate but related item: could some language detectors statically load 
their language models once and share those models among all detector 
instances? 
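
To make the two ideas above concrete, here is a rough sketch of how a factory 
could hand out fresh stateful detectors that all share one set of models 
loaded once per JVM.  Every name in it (SharedModels, SketchDetector, 
SketchDetectorFactory, SharedModelDetectorFactory) is hypothetical and is not 
an existing Tika API, and the "model" is a toy token lookup standing in for a 
real, expensive model load.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.concurrent.atomic.AtomicInteger;

// All types below are hypothetical illustrations, not Tika classes.

/** Read-only language models, loaded once and shared by every detector. */
final class SharedModels {
    /** Counts how many times the models were loaded (for illustration only). */
    static final AtomicInteger LOAD_COUNT = new AtomicInteger();

    // Initialization-on-demand holder: the JVM initializes this class once,
    // lazily and thread-safely, on the first call to get().
    private static final class Holder {
        static final SharedModels INSTANCE = new SharedModels();
    }

    private final Map<String, String> markerToLanguage = new HashMap<>();

    private SharedModels() {
        LOAD_COUNT.incrementAndGet();
        // Toy stand-in for an expensive model load.
        markerToLanguage.put("the", "en");
        markerToLanguage.put("und", "de");
        markerToLanguage.put("les", "fr");
    }

    static SharedModels get() {
        return Holder.INSTANCE;
    }

    /** Returns the language a marker token votes for, or null if none. */
    String languageFor(String token) {
        return markerToLanguage.get(token);
    }
}

/** Stateful detector: accumulates evidence across addText() calls. */
final class SketchDetector {
    private final SharedModels models;
    private final Map<String, Integer> votes = new HashMap<>();

    SketchDetector(SharedModels models) {
        this.models = models;
    }

    void addText(String text) {
        for (String token : text.toLowerCase(Locale.ROOT).split("\\s+")) {
            String lang = models.languageFor(token);
            if (lang != null) {
                votes.merge(lang, 1, Integer::sum);
            }
        }
    }

    /** Returns the language with the most votes, or "unknown" if none. */
    String detect() {
        String best = "unknown";
        int bestCount = 0;
        for (Map.Entry<String, Integer> e : votes.entrySet()) {
            if (e.getValue() > bestCount) {
                best = e.getKey();
                bestCount = e.getValue();
            }
        }
        return best;
    }
}

/** What a configuration layer could instantiate and hand to wrapper code. */
interface SketchDetectorFactory {
    SketchDetector newDetector();
}

/** Builds a fresh detector per call; all detectors share the same models. */
final class SharedModelDetectorFactory implements SketchDetectorFactory {
    @Override
    public SketchDetector newDetector() {
        return new SketchDetector(SharedModels.get());
    }
}
```

With something along these lines, per-document code would call newDetector() 
for each document and never reuse a detector, so the accumulated state stays 
isolated while the model load cost is paid only once.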


was (Author: [email protected]):
[~kkrugler] and [~chrismattmann], should we add configuration of language 
detection to TikaConfig?

> Add configurability for language detection to BasicContentHandlerFactory
> ------------------------------------------------------------------------
>
>                 Key: TIKA-1984
>                 URL: https://issues.apache.org/jira/browse/TIKA-1984
>             Project: Tika
>          Issue Type: New Feature
>            Reporter: Tim Allison
>            Priority: Trivial
>
> The RecursiveParserWrapper needs a new handler via a ContentHandlerFactory 
> for each embedded document.  There is currently no way to configure the 
> BasicContentHandlerFactory to include language detection.  Let's add that.
> This request is driven by [[email protected]]'s request on 
> TIKA-1982.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
