[
https://issues.apache.org/jira/browse/SOLR-10402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18042007#comment-18042007
]
Eric Pugh commented on SOLR-10402:
----------------------------------
I want this!
> Add extract Streaming Expression to support scalable extraction services
> -------------------------------------------------------------------------
>
> Key: SOLR-10402
> URL: https://issues.apache.org/jira/browse/SOLR-10402
> Project: Solr
> Issue Type: New Feature
> Components: streaming expressions
> Reporter: Joel Bernstein
> Priority: Major
>
> The *extract* Streaming Expression is designed to offload extraction
> services, such as Apache Tika, to worker nodes. This will allow a separate
> Solr Cloud collection to perform the heavyweight extractions and then send
> the results to a Solr Cloud collection for indexing.
> By leveraging the Solr parallel executor framework
> (http://joelsolr.blogspot.com/2017/01/deploying-solrs-new-parallel-executor.html)
> and worker nodes we should be able to deploy massively scalable extraction
> services.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]