[
https://issues.apache.org/jira/browse/SOLR-10402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joel Bernstein updated SOLR-10402:
----------------------------------
Summary: Add extract Streaming Expression to support scalable extraction
services (was: Add extract Streaming Expression to perform heavyweight
extractions on worker nodes )
> Add extract Streaming Expression to support scalable extraction services
> -------------------------------------------------------------------------
>
> Key: SOLR-10402
> URL: https://issues.apache.org/jira/browse/SOLR-10402
> Project: Solr
> Issue Type: New Feature
> Security Level: Public(Default Security Level. Issues are Public)
> Reporter: Joel Bernstein
>
> The *extract* Streaming Expression is designed to offload extraction
> services, such as Apache Tika, to worker nodes. This will allow a separate
> Solr Cloud cluster to perform the heavyweight extractions and then send the
> results to a Solr Cloud collection for indexing.
> By leveraging the Solr parallel executor framework
> (http://joelsolr.blogspot.com/2017/01/deploying-solrs-new-parallel-executor.html)
> and worker nodes we should be able to deploy massively scalable extraction
> services.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]