Hello everyone,
Here is a summary of the discussions with Preston on 4th and 12th July
regarding the VXQuery indexing project.
*Modifications to the current implementation of update and delete index
processes.*
1. An XML file is used to store the index metadata instead of a binary format.
The file content would be structured as follows:
<indexes>
<index>
<collection>/home/menaka/xml</collection>
<files>
<file>
<path>/home/menaka/xml/catalog.xml</path>
<md5>44AC8A401C32384D9EB00952E1C96685</md5>
<fileName>catalog.xml</fileName>
<lastModified>10/07/2016 23:41:13</lastModified>
</file>
</files>
</index>
</indexes>
This way, a single file can store the metadata for all indexes.
2. Update the wiki regarding create, update and delete index.
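
To make the metadata format in item 1 concrete, here is a minimal sketch of how one <file> entry could be generated. The class and method names (IndexMetadataSketch, md5Hex, fileEntry) are hypothetical and not part of VXQuery, and the dd/MM/yyyy date pattern is an assumption based on the sample value above.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.text.SimpleDateFormat;
import java.util.Date;

// Hypothetical helper, not an existing VXQuery class.
public class IndexMetadataSketch {

    // Uppercase hex MD5 of the content, in the style of the <md5> element.
    public static String md5Hex(byte[] content) {
        try {
            MessageDigest digest = MessageDigest.getInstance("MD5");
            StringBuilder hex = new StringBuilder();
            for (byte b : digest.digest(content)) {
                hex.append(String.format("%02X", b & 0xFF));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("MD5 is not available", e);
        }
    }

    // Escapes the characters that would break XML element text.
    static String escape(String text) {
        return text.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    // Builds one <file> element; the date pattern is an assumption.
    public static String fileEntry(String path, String fileName,
                                   byte[] content, long lastModifiedMillis) {
        String stamp = new SimpleDateFormat("dd/MM/yyyy HH:mm:ss")
                .format(new Date(lastModifiedMillis));
        return String.join("\n",
                "<file>",
                "<path>" + escape(path) + "</path>",
                "<md5>" + md5Hex(content) + "</md5>",
                "<fileName>" + escape(fileName) + "</fileName>",
                "<lastModified>" + stamp + "</lastModified>",
                "</file>");
    }

    public static void main(String[] args) {
        byte[] content = "<catalog/>".getBytes(StandardCharsets.UTF_8);
        System.out.println(fileEntry("/home/menaka/xml/catalog.xml",
                "catalog.xml", content, System.currentTimeMillis()));
    }
}
```

In practice the content bytes would come from the file on disk and the entries would be wrapped in the <files>, <index> and <indexes> elements shown above.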
*Parallel index creation and access.*
A separate rewrite rule will be created for indexing, similar to the
collection rules.
Some modifications will also be made to the following classes so that they
can be reused in the index parallelization process.
1. AbstractCollectionRule
2. VXQueryMetadataProvider
As the first step, create the following classes:
1. IntroduceIndexingRule
2. VXQueryIndexDataSource
3. VXQueryIndexingOperatorDescriptor
Index creation, update and delete will be implemented first, because all of
these processes share a similar execution pattern.
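
The shared pattern is not spelled out in this summary. Purely as an illustration of why one code path could serve all three operations, the sketch below assumes that the md5 stored in the metadata file is compared with the md5 of the files currently in the collection, so that each file is classified as added, modified, deleted or unchanged. The class IndexChangeSketch and its Change enum are hypothetical names, not VXQuery classes.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration, not an existing VXQuery class.
public class IndexChangeSketch {

    public enum Change { ADDED, MODIFIED, DELETED, UNCHANGED }

    // Both maps go from file path to MD5 string:
    // 'stored' comes from the metadata file, 'current' from the collection directory.
    public static Map<String, Change> classify(Map<String, String> stored,
                                               Map<String, String> current) {
        Map<String, Change> result = new TreeMap<>();
        for (Map.Entry<String, String> entry : current.entrySet()) {
            String oldMd5 = stored.get(entry.getKey());
            if (oldMd5 == null) {
                result.put(entry.getKey(), Change.ADDED);      // not indexed yet
            } else if (!oldMd5.equals(entry.getValue())) {
                result.put(entry.getKey(), Change.MODIFIED);   // content changed
            } else {
                result.put(entry.getKey(), Change.UNCHANGED);
            }
        }
        for (String path : stored.keySet()) {
            if (!current.containsKey(path)) {
                result.put(path, Change.DELETED);              // file was removed
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> stored = new LinkedHashMap<>();
        stored.put("/xml/a.xml", "AAAA");
        stored.put("/xml/b.xml", "BBBB");
        Map<String, String> current = new LinkedHashMap<>();
        current.put("/xml/a.xml", "AAAA");
        current.put("/xml/c.xml", "CCCC");
        System.out.println(classify(stored, current));
    }
}
```

Under this assumption, creation is the case where the stored map is empty, update acts on the added, modified and deleted entries, and deleting the whole index treats every stored entry as deleted.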
Next week, I'll be working on changing the XML file structure and on the
parallel index creation process.
Thank you very much
Menaka
--
*Menaka Madushanka Jayawardena*
Faculty of Engineering, <http://www.pdn.ac.lk/eng>
University of Peradeniya.
LinkedIn <http://lk.linkedin.com/in/menakajayawardena>
TP:- 071 885 1183/ 071 350 5470