[
https://issues.apache.org/jira/browse/OAK-9434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Amrit Verma updated OAK-9434:
-----------------------------
Description:
In case of large indexes, indexing takes a long time. In case of MongoDB
Document store, Currently it is a two step process - download the data from
mongodb then create index based on that data.
If something fails during this process, indexing needs to be restarted from
beginning of the download step. We should make the indexing process resumable
from the point it stopped.
Since data download from mongodb seems to be more time taking than indexing
itself, we first focus on download part.
This Jira issue is for implementing resumable/parallel download.
> MongoDB indexing: implement parallel chunk download
> ---------------------------------------------------
>
> Key: OAK-9434
> URL: https://issues.apache.org/jira/browse/OAK-9434
> Project: Jackrabbit Oak
> Issue Type: Improvement
> Components: indexing
> Affects Versions: 1.38.0
> Reporter: Amrit Verma
> Assignee: Amrit Verma
> Priority: Major
>
> In case of large indexes, indexing takes a long time. In case of MongoDB
> Document store, Currently it is a two step process - download the data from
> mongodb then create index based on that data.
> If something fails during this process, indexing needs to be restarted from
> beginning of the download step. We should make the indexing process resumable
> from the point it stopped.
> Since data download from mongodb seems to be more time taking than indexing
> itself, we first focus on download part.
> This Jira issue is for implementing resumable/parallel download.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)