Hello,

I modified the implementation so that the update-index query takes only
one argument.

The new query structure is:

*update-index(index_folder)*

The collection information is saved as metadata when the index is first
created by the build-index-on-collection query.
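
For illustration, here is a minimal sketch of how metadata kept in the
index folder could carry the collection location and drive change
detection. It is not the code in the pull request: the class
IndexMetadataSketch, its Metadata container, the Change enum and the use
of MD5 for the checksum are all assumptions made for this example. Only
the file name metadata.file and the idea of a serialized map keyed by
file path come from the summary quoted below.

```java
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Illustrative only: every name here is hypothetical, not taken from the pull request. */
final class IndexMetadataSketch {

    /** Hypothetical container: the collection location plus one checksum per indexed file. */
    static final class Metadata implements Serializable {
        private static final long serialVersionUID = 1L;
        final String collectionPath;
        final HashMap<String, String> checksums = new HashMap<>();

        Metadata(String collectionPath) {
            this.collectionPath = collectionPath;
        }
    }

    enum Change { NEW, MODIFIED, DELETED }

    /** Hex checksum of a file's bytes (MD5 is an assumption; the thread does not name an algorithm). */
    static String checksum(Path file) throws IOException {
        try {
            MessageDigest digest = MessageDigest.getInstance("MD5");
            StringBuilder hex = new StringBuilder();
            for (byte b : digest.digest(Files.readAllBytes(file))) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    /** Serializes the metadata to metadata.file inside the index folder. */
    static void save(Path indexFolder, Metadata metadata) throws IOException {
        try (ObjectOutputStream out =
                new ObjectOutputStream(Files.newOutputStream(indexFolder.resolve("metadata.file")))) {
            out.writeObject(metadata);
        }
    }

    /** Reads the metadata back, so the index folder is the only input an update needs. */
    static Metadata load(Path indexFolder) throws IOException, ClassNotFoundException {
        try (ObjectInputStream in =
                new ObjectInputStream(Files.newInputStream(indexFolder.resolve("metadata.file")))) {
            return (Metadata) in.readObject();
        }
    }

    /** Compares the files currently in the collection with the stored checksums. */
    static Map<String, Change> detectChanges(Metadata metadata) throws IOException {
        List<Path> files;
        try (Stream<Path> walk = Files.walk(Paths.get(metadata.collectionPath))) {
            files = walk.filter(Files::isRegularFile).collect(Collectors.toList());
        }
        Map<String, String> current = new HashMap<>();
        for (Path file : files) {
            current.put(file.toString(), checksum(file));
        }
        Map<String, Change> changes = new HashMap<>();
        for (Map.Entry<String, String> entry : current.entrySet()) {
            String stored = metadata.checksums.get(entry.getKey());
            if (stored == null) {
                changes.put(entry.getKey(), Change.NEW);
            } else if (!stored.equals(entry.getValue())) {
                changes.put(entry.getKey(), Change.MODIFIED);
            }
        }
        for (String known : metadata.checksums.keySet()) {
            if (!current.containsKey(known)) {
                changes.put(known, Change.DELETED);
            }
        }
        return changes;
    }
}
```

In this sketch an update would call load(indexFolder), pass the result to
detectChanges, apply the matching index operation for each entry, and
then write the refreshed map back with save.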

Thank you very much
Menaka

On 24 June 2016 at 03:42, Menaka Madushanka <[email protected]> wrote:

> Hello Steven,
>
> Almost done. :-)
>
> On 24 June 2016 at 03:16, Steven Jacobs <[email protected]> wrote:
>
>> Auto-correct is always changing your name when I don't pay attention, I
>> apologize Menaka.
>> Steven
>>
>> On Thu, Jun 23, 2016 at 2:45 PM, Steven Jacobs <[email protected]> wrote:
>>
>>> Melaka- One high level comment. I think it will be better to have
>>> update-index take a single argument as we discussed (just the index
>>> folder). The collection location can be saved as part of the metadata
>>> information in the collection folder.
>>> Steven
>>>
>>> On Wed, Jun 22, 2016 at 2:04 PM, Menaka Madushanka <
>>> [email protected]> wrote:
>>>
>>>> Hello,
>>>>
>>>> This is the summary of the implementation. (Included in Pull Request
>>>> message as well)
>>>>
>>>> *Update Index Query*
>>>> The update-index query takes two arguments, collection directory and
>>>> index directory.
>>>> It shares some functionality with the build-index-on-collection query,
>>>> so the following classes were changed so that they can be used in the
>>>> index update process and to maximize code reuse:
>>>>
>>>>
>>>>    1. IndexConstructorUtil.java : Created a new function to get an
>>>>    instance of IndexDocumentBuilder which can be used in IndexUpdater.java
>>>>    class.
>>>>    2. IndexDocumentBuilder.java : Added a new string field containing
>>>>    the corresponding file path, which is needed to retrieve the
>>>>    document related to an XML file.
>>>>
>>>>
>>>> *Metadata handling*
>>>> A POJO (XmlMetadata.java) is created to manage the metadata for a
>>>> file.
>>>> Currently it contains the following fields.
>>>>
>>>>
>>>>    1. File path
>>>>    2. File Name (Not used)
>>>>    3. Checksum String
>>>>
>>>> When storing metadata, a HashMap is created with the file path as the
>>>> key and the XmlMetadata object as the value. This map is then
>>>> serialized and written to a file named metadata.file, which is stored
>>>> in the same directory as the index.
>>>>
>>>> *Update Index process*
>>>>
>>>>    - If a file is detected as modified, the current index document
>>>>    related to that file is deleted and a newly created index document
>>>>    is added.
>>>>    - If a new file is detected, a new index document will be created
>>>>    and added to the existing index.
>>>>    - If the file is deleted, delete the index document related to that
>>>>    file.
>>>>    - After every task, the metadata object is updated, and after all
>>>>    processes have completed, the new metadata map is written to the file.
>>>>
>>>> Please review the pull request and merge.
>>>>
>>>> https://github.com/apache/vxquery/pull/62
>>>>
>>>> Thank you
>>>> Menaka
>>>>
>>>>
>>>> --
>>>> *Menaka Madushanka Jayawardena*
>>>> Faculty of Engineering, <http://www.pdn.ac.lk/eng>
>>>> University of Peradeniyaya.
>>>> LinkedIn <http://lk.linkedin.com/in/menakajayawardena>
>>>> TP:- 071 885 1183/ 071 350 5470
>>>>
>>>
>>>
>>
>



-- 
*Menaka Madushanka Jayawardena*
Faculty of Engineering, <http://www.pdn.ac.lk/eng>
University of Peradeniyaya.
LinkedIn <http://lk.linkedin.com/in/menakajayawardena>
TP:- 071 885 1183/ 071 350 5470
