[
https://issues.apache.org/jira/browse/OAK-6353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chetan Mehrotra updated OAK-6353:
---------------------------------
Attachment: OAK-6353-v1.patch
[initial implementation|^OAK-6353-v1.patch] based on discussion with
[~catholicon] [~mreutegg] and [~tomek.rekawek]
# It uses DocumentNodeStore#getNode(path, rev) - Directly to get the NodeState
instead of via NodeStore. This ensures that intermediate nodes are not required
to be read
# Supports filtering by path at DocumentStore level i.e. nodes which are not in
included paths and hidden nodes are not added to NodeDocument cache
With this I am getting traversal rate of 20-30k/sec compared to 1k/sec in
previous approach. Would get more numbers
> Use Document order traversal for reindexing performed on DocumentNodeStore
> setups
> ---------------------------------------------------------------------------------
>
> Key: OAK-6353
> URL: https://issues.apache.org/jira/browse/OAK-6353
> Project: Jackrabbit Oak
> Issue Type: Technical task
> Components: run
> Reporter: Chetan Mehrotra
> Assignee: Chetan Mehrotra
> Fix For: 1.8
>
> Attachments: OAK-6353-v1.patch
>
>
> [~tmueller] suggested
> [here|https://issues.apache.org/jira/browse/OAK-6246?focusedCommentId=16034442&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-16034442]
> that document order traversal can be faster compared to current mode of path
> based traversal. Initial test indicate that such a traversal can be order of
> magnitude faster.
> So this task is meant to implement such an approach and see if it can be a
> viable indexing mode used for DocumentNodeStore based setups
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)