Hello, I modified the implementation to use only one argument for update index query.
So the new query structure would be, *update-index(index_folder)* Collection information is stored when creating the index for the first time in build-index-on-collection query and stored as metadata. Thank you very much Menaka On 24 June 2016 at 03:42, Menaka Madushanka <[email protected]> wrote: > Hello Steven, > > Almost done. :-) > > On 24 June 2016 at 03:16, Steven Jacobs <[email protected]> wrote: > >> Auto-correct is always changing your name when I don't pay attention, I >> apologize Menaka. >> Steven >> >> On Thu, Jun 23, 2016 at 2:45 PM, Steven Jacobs <[email protected]> wrote: >> >>> Melaka- One high level comment. I think it will be better to have >>> update-index take a single argument as we discussed (just the index >>> folder). The collection location can be saved as part of the metadata >>> information in the collection folder. >>> Steven >>> >>> On Wed, Jun 22, 2016 at 2:04 PM, Menaka Madushanka < >>> [email protected]> wrote: >>> >>>> Hello, >>>> >>>> This is the summary of the implementation. (Included in Pull Request >>>> message as well) >>>> >>>> *Update Index Query* >>>> The update-index query takes two arguments, collection directory and >>>> index directory. >>>> It shares some of the functionalities from build-index-on-collection >>>> query so, some changes were done to the following classes in order to use >>>> them in updating index process and to maximize code reuse >>>> >>>> >>>> 1. IndexConstructorUtil.java : Created a new function to get an >>>> instance of IndexDocumentBuilder which can be used in IndexUpdater.java >>>> class. >>>> 2. IndexDocumentBuilder.java : Added a new string filed containing >>>> the corresponding file path which is needed to retrieve a document >>>> related >>>> to an XML file. >>>> >>>> >>>> *Metadata handling* >>>> Here a POJO is created to properly manage the metadata for a file. >>>> (XmlMetadata.java) >>>> Currently it contains following fields. >>>> >>>> >>>> 1. File path >>>> 2. File Name (Not used) >>>> 3. Checksum String >>>> >>>> When storing metadata, a HashMap is created with file path as the key >>>> and XmlMetadata object. This map is then serialized and written to a file >>>> named metadata.file and stored in the same directory where the index is >>>> stored. >>>> >>>> *Update Index process* >>>> >>>> - If a file is detected as modified, the current index document >>>> related to that file is deleted and newly created index document is >>>> added. >>>> - If a new file is detected, a new index document will be created >>>> and added to the existing index. >>>> - If the file is deleted, delete the index document related to that >>>> file. >>>> - After every task, update the metadata object and after all >>>> processes completed, write the new metadata map to the file. >>>> >>>> Please review the pull request and merge. >>>> >>>> https://github.com/apache/vxquery/pull/62 >>>> >>>> Thank you >>>> Menaka >>>> >>>> >>>> -- >>>> *Menaka Madushanka Jayawardena* >>>> Faculty of Engineering, <http://www.pdn.ac.lk/eng> >>>> University of Peradeniyaya. >>>> LinkedIn <http://lk.linkedin.com/in/menakajayawardena> >>>> TP:- 071 885 1183/ 071 350 5470 >>>> >>> >>> >> > > > -- > *Menaka Madushanka Jayawardena* > Faculty of Engineering, <http://www.pdn.ac.lk/eng> > University of Peradeniyaya. > LinkedIn <http://lk.linkedin.com/in/menakajayawardena> > TP:- 071 885 1183/ 071 350 5470 > -- *Menaka Madushanka Jayawardena* Faculty of Engineering, <http://www.pdn.ac.lk/eng> University of Peradeniyaya. LinkedIn <http://lk.linkedin.com/in/menakajayawardena> TP:- 071 885 1183/ 071 350 5470
