[ 
https://issues.apache.org/jira/browse/OAK-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14301375#comment-14301375
 ] 

Julian Reschke commented on OAK-2110:
-------------------------------------

Some thoughts:

- we will probably not be able to match the performance of the version that
is optimized for Mongo, so it would be good to have consensus on what *needs*
to be achieved (this is a background operation, right?)

- the limit on the batch size is mainly motivated by the size of the 
NodeDocuments returned by query(); a variant that just returns identifiers 
might help

- alternatively, we could consider a new variant of query that takes more 
complex conditions (operators other than >=, plus constraints on multiple 
indexed properties)
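
To make the last two points concrete, here is a rough sketch of a batched collection loop in which the store returns only identifiers and applies conditions on two properties itself. All names here (Store, queryIds, Doc, collect) are made up for illustration and are not existing Oak APIs; the in-memory map just stands in for a real persistence.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical sketch; none of these types or methods are Oak APIs. */
class BatchedGcSketch {

    /** Minimal stand-in for a stored document: a type marker and a last-modified time. */
    static class Doc {
        final int sdType;
        final long modified;

        Doc(int sdType, long modified) {
            this.sdType = sdType;
            this.modified = modified;
        }
    }

    /** Hypothetical store, ordered by id so that a scan can resume after the last id seen. */
    static class Store {
        final TreeMap<String, Doc> docs = new TreeMap<>();

        /**
         * Returns at most 'limit' ids greater than 'fromId' whose documents match
         * both conditions. Only ids are returned, never whole documents.
         */
        List<String> queryIds(String fromId, int sdType, long modifiedBefore, int limit) {
            List<String> ids = new ArrayList<>();
            for (Map.Entry<String, Doc> e : docs.tailMap(fromId, false).entrySet()) {
                Doc d = e.getValue();
                if (d.sdType == sdType && d.modified < modifiedBefore) {
                    ids.add(e.getKey());
                    if (ids.size() == limit) {
                        break;
                    }
                }
            }
            return ids;
        }

        void remove(List<String> ids) {
            for (String id : ids) {
                docs.remove(id);
            }
        }
    }

    /** Deletes matching documents batch by batch; holds at most 'batchSize' ids in memory. */
    static int collect(Store store, int sdType, long modifiedBefore, int batchSize) {
        int deleted = 0;
        String fromId = ""; // assumes real ids are non-empty strings
        while (true) {
            List<String> ids = store.queryIds(fromId, sdType, modifiedBefore, batchSize);
            if (ids.isEmpty()) {
                return deleted;
            }
            store.remove(ids);
            deleted += ids.size();
            fromId = ids.get(ids.size() - 1); // resume after the last id handled
        }
    }
}
```

With something along these lines the batch size would be bounded by the size of an identifier rather than by the size of a NodeDocument, and the filtering on type and modification time would happen in the store instead of in the collector.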

> performance issues with VersionGarbageCollector
> -----------------------------------------------
>
>                 Key: OAK-2110
>                 URL: https://issues.apache.org/jira/browse/OAK-2110
>             Project: Jackrabbit Oak
>          Issue Type: Bug
>          Components: mongomk
>            Reporter: Julian Reschke
>             Fix For: 1.1.7
>
>
> This one currently special-cases Mongo. For other persistences, it
> - fetches *all* documents
> - filters by SD_TYPE
> - filters by lastmod of versions
> - deletes what remains
> This is not only inefficient but also fails with OutOfMemory for any larger 
> repo.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
