[ 
https://issues.apache.org/jira/browse/OAK-6353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chetan Mehrotra updated OAK-6353:
---------------------------------
    Attachment: OAK-6353-v1.patch

[initial implementation|^OAK-6353-v1.patch] based on discussion with 
[~catholicon] [~mreutegg] and [~tomek.rekawek]

# It uses DocumentNodeStore#getNode(path, rev) - Directly to get the NodeState 
instead of via NodeStore. This ensures that intermediate nodes are not required 
to be read
# Supports filtering by path at DocumentStore level i.e. nodes which are not in 
included paths and hidden nodes are not added to NodeDocument cache

With this I am getting traversal rate of 20-30k/sec compared to 1k/sec in 
previous approach. Would get more numbers


> Use Document order traversal for reindexing performed on DocumentNodeStore 
> setups
> ---------------------------------------------------------------------------------
>
>                 Key: OAK-6353
>                 URL: https://issues.apache.org/jira/browse/OAK-6353
>             Project: Jackrabbit Oak
>          Issue Type: Technical task
>          Components: run
>            Reporter: Chetan Mehrotra
>            Assignee: Chetan Mehrotra
>             Fix For: 1.8
>
>         Attachments: OAK-6353-v1.patch
>
>
> [~tmueller] suggested 
> [here|https://issues.apache.org/jira/browse/OAK-6246?focusedCommentId=16034442&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-16034442]
>  that document order traversal can be faster compared to current mode of path 
> based traversal. Initial test indicate that such a traversal can be order of 
> magnitude faster. 
> So this task is meant to implement such an approach and see if it can be a 
> viable indexing mode used for DocumentNodeStore based setups



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to