[jira] [Updated] (OAK-9434) MongoDB indexing: implement parallel chunk download

Amrit Verma (Jira) Mon, 17 May 2021 01:26:09 -0700


     [ 
https://issues.apache.org/jira/browse/OAK-9434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Amrit Verma updated OAK-9434:
-----------------------------
    Description: 
In case of large indexes, indexing takes a long time. In case of MongoDB 
Document store, Currently it is a two step process - download the data from 
mongodb then create index based on that data.

If something fails during this process, indexing needs to be restarted from 
beginning of the download step. We should make the indexing process resumable 
from the point it stopped. 

Since data download from mongodb seems to be more time taking than indexing 
itself, we first focus on download part. 

This Jira issue is for implementing resumable/parallel download.

> MongoDB indexing: implement parallel chunk download
> ---------------------------------------------------
>
>                 Key: OAK-9434
>                 URL: https://issues.apache.org/jira/browse/OAK-9434
>             Project: Jackrabbit Oak
>          Issue Type: Improvement
>          Components: indexing
>    Affects Versions: 1.38.0
>            Reporter: Amrit Verma
>            Assignee: Amrit Verma
>            Priority: Major
>
> In case of large indexes, indexing takes a long time. In case of MongoDB 
> Document store, Currently it is a two step process - download the data from 
> mongodb then create index based on that data.
> If something fails during this process, indexing needs to be restarted from 
> beginning of the download step. We should make the indexing process resumable 
> from the point it stopped. 
> Since data download from mongodb seems to be more time taking than indexing 
> itself, we first focus on download part. 
> This Jira issue is for implementing resumable/parallel download.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Updated] (OAK-9434) MongoDB indexing: implement parallel chunk download

Reply via email to