Chetan Mehrotra created OAK-7122:
------------------------------------

             Summary: Implement script to compare lucene indexes logically
                 Key: OAK-7122
                 URL: https://issues.apache.org/jira/browse/OAK-7122
             Project: Jackrabbit Oak
          Issue Type: Task
          Components: run
            Reporter: Chetan Mehrotra
            Assignee: Chetan Mehrotra
             Fix For: 1.8


With Document Traversal based indexing we have implemented a newer indexing 
logic. To validate that the index it produces is the same as the one produced 
by the existing indexing flow, we need to implement a script that enables 
comparing the index content logically.

This was recently discussed on the Lucene mailing list [1], and the suggestion 
there was that it can be done by un-inverting the index. To enable that, we 
need to implement a script which can:

# Open a Lucene index
# Map the Lucene Document to path of node
# For each document determine which fields are associated with it (stored 
and non stored)
# Dump this content to a file sorted by path, with the field names on each 
line sorted by name
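
The dump format described in the last step could look roughly like the sketch 
below. This is a minimal Python illustration, not the actual script: it assumes 
the index has already been un-inverted into a mapping of node path to field 
name to terms (reading the Lucene index itself is left out), and the function 
name `format_dump` and the `path|field=[terms]` line layout are hypothetical 
choices made only for this example.

```python
def format_dump(docs):
    """Render one line per node path: paths sorted, fields sorted by name.

    `docs` maps node path -> {field name -> iterable of terms}. This input
    shape is an assumption standing in for an un-inverted Lucene index.
    """
    lines = []
    for path in sorted(docs):
        fields = docs[path]
        parts = []
        for name in sorted(fields):
            # Sort terms as well so the output does not depend on the order
            # in which they were extracted from the index.
            terms = ",".join(sorted(str(t) for t in fields[name]))
            parts.append(f"{name}=[{terms}]")
        lines.append(f"{path}|" + " ".join(parts))
    return lines


if __name__ == "__main__":
    # Hypothetical sample data, not taken from a real index.
    sample = {
        "/content/b": {"jcr:title": ["hello"], ":path": ["/content/b"]},
        "/content/a": {"tags": ["y", "x"]},
    }
    print("\n".join(format_dump(sample)))
```

Sorting at every level (paths, field names, terms) is what makes the dump 
deterministic, so that two logically identical indexes produce byte-identical 
files regardless of how the documents were laid out internally.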

Such dumps can then be generated for the old and new index and compared via a 
simple text diff.
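
As a rough illustration of the comparison step, the sketch below diffs two 
dumps with Python's standard `difflib` module. The function name `diff_dumps` 
and the file labels `old-index.txt` and `new-index.txt` are hypothetical; in 
practice any text diff tool would serve the same purpose.

```python
import difflib


def diff_dumps(old_lines, new_lines):
    """Return unified-diff lines between two dumps; empty if they match."""
    return list(
        difflib.unified_diff(
            old_lines,
            new_lines,
            fromfile="old-index.txt",  # hypothetical label for the old dump
            tofile="new-index.txt",    # hypothetical label for the new dump
            lineterm="",
        )
    )


if __name__ == "__main__":
    # Hypothetical dump lines, one per node path.
    old = ["/content/a|tags=[x,y]", "/content/b|jcr:title=[hello]"]
    new = ["/content/a|tags=[x,y]", "/content/b|jcr:title=[hello world]"]
    for line in diff_dumps(old, new):
        print(line)
```

An empty result means the two indexes are logically equivalent under the 
chosen dump format; any `-`/`+` pair points at the path whose fields differ.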

[1] http://lucene.markmail.org/thread/wt22gk6aufs4uz55



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
