[ 
https://issues.apache.org/jira/browse/OAK-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14301095#comment-14301095
 ] 

Marcel Reutegger commented on OAK-2110:
---------------------------------------

I tested the time to get documents marked with {{_deletedOnce}} with the 
generic implementation and compared it with the mongo-specific one. The 
mongo-specific version is roughly ten times faster. I think this is mostly 
because it only has to execute the query once on MongoDB and can stream the 
entire result to Oak vs. multiple queries to get the matching documents in 
batches.

To solve this, we will probably have to change the signature of 
DocumentStore.query() or introduce a new method.

> performance issues with VersionGarbageCollector
> -----------------------------------------------
>
>                 Key: OAK-2110
>                 URL: https://issues.apache.org/jira/browse/OAK-2110
>             Project: Jackrabbit Oak
>          Issue Type: Bug
>          Components: mongomk
>            Reporter: Julian Reschke
>             Fix For: 1.1.7
>
>
> This one currently special-cases Mongo. For other persistences, it
> - fetches *all* documents
> - filters by SD_TYPE
> - filters by lastmod of versions
> - deletes what remains
> This is not only inefficient but also fails with OutOfMemory for any larger 
> repo.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to