Thank you very much Preston. :-)

I can do that.

On 24 June 2016 at 21:24, Preston Carman <[email protected]> wrote:

> What do you think about adding your description of the update process
> to the wiki [1]? We can use this as the start of documenting the
> indexing functionality. You have written a nice description and it
> would be nice to have the in a place that others can see it to learn
> about our indexing process.
>
> [1]
> https://cwiki.apache.org/confluence/display/VXQUERY/Lucene+Indexing+Project+2016
>
> On Thu, Jun 23, 2016 at 3:39 PM, Menaka Madushanka
> <[email protected]> wrote:
> > Hello,
> >
> > I modified the implementation to use only one argument for update index
> > query.
> >
> > So the new query structure would be,
> >
> > *update-index(index_folder)*
> >
> > Collection information is stored when creating the index for the first
> time
> > in build-index-on-collection query and stored as metadata.
> >
> > Thank you very much
> > Menaka
> >
> > On 24 June 2016 at 03:42, Menaka Madushanka <[email protected]>
> wrote:
> >
> >> Hello Steven,
> >>
> >> Almost done. :-)
> >>
> >> On 24 June 2016 at 03:16, Steven Jacobs <[email protected]> wrote:
> >>
> >>> Auto-correct is always changing your name when I don't pay attention, I
> >>> apologize Menaka.
> >>> Steven
> >>>
> >>> On Thu, Jun 23, 2016 at 2:45 PM, Steven Jacobs <[email protected]>
> wrote:
> >>>
> >>>> Melaka- One high level comment. I think it will be better to have
> >>>> update-index take a single argument as we discussed (just the index
> >>>> folder). The collection location can be saved as part of the metadata
> >>>> information in the collection folder.
> >>>> Steven
> >>>>
> >>>> On Wed, Jun 22, 2016 at 2:04 PM, Menaka Madushanka <
> >>>> [email protected]> wrote:
> >>>>
> >>>>> Hello,
> >>>>>
> >>>>> This is the summary of the implementation. (Included in Pull Request
> >>>>> message as well)
> >>>>>
> >>>>> *Update Index Query*
> >>>>> The update-index query takes two arguments, collection directory and
> >>>>> index directory.
> >>>>> It shares some of the functionalities from build-index-on-collection
> >>>>> query so, some changes were done to the following classes in order
> to use
> >>>>> them in updating index process and to maximize code reuse
> >>>>>
> >>>>>
> >>>>>    1. IndexConstructorUtil.java : Created a new function to get an
> >>>>>    instance of IndexDocumentBuilder which can be used in
> IndexUpdater.java
> >>>>>    class.
> >>>>>    2. IndexDocumentBuilder.java : Added a new string filed containing
> >>>>>    the corresponding file path which is needed to retrieve a
> document related
> >>>>>    to an XML file.
> >>>>>
> >>>>>
> >>>>> *Metadata handling*
> >>>>> Here a POJO is created to properly manage the metadata for a file.
> >>>>> (XmlMetadata.java)
> >>>>> Currently it contains following fields.
> >>>>>
> >>>>>
> >>>>>    1. File path
> >>>>>    2. File Name (Not used)
> >>>>>    3. Checksum String
> >>>>>
> >>>>> When storing metadata, a HashMap is created with file path as the key
> >>>>> and XmlMetadata object. This map is then serialized and written to a
> file
> >>>>> named metadata.file and stored in the same directory where the index
> is
> >>>>> stored.
> >>>>>
> >>>>> *Update Index process*
> >>>>>
> >>>>>    - If a file is detected as modified, the current index document
> >>>>>    related to that file is deleted and newly created index document
> is added.
> >>>>>    - If a new file is detected, a new index document will be created
> >>>>>    and added to the existing index.
> >>>>>    - If the file is deleted, delete the index document related to
> that
> >>>>>    file.
> >>>>>    - After every task, update the metadata object and after all
> >>>>>    processes completed, write the new metadata map to the file.
> >>>>>
> >>>>> Please review the pull request and merge.
> >>>>>
> >>>>> https://github.com/apache/vxquery/pull/62
> >>>>>
> >>>>> Thank you
> >>>>> Menaka
> >>>>>
> >>>>>
> >>>>> --
> >>>>> *Menaka Madushanka Jayawardena*
> >>>>> Faculty of Engineering, <http://www.pdn.ac.lk/eng>
> >>>>> University of Peradeniyaya.
> >>>>> LinkedIn <http://lk.linkedin.com/in/menakajayawardena>
> >>>>> TP:- 071 885 1183/ 071 350 5470
> >>>>>
> >>>>
> >>>>
> >>>
> >>
> >>
> >> --
> >> *Menaka Madushanka Jayawardena*
> >> Faculty of Engineering, <http://www.pdn.ac.lk/eng>
> >> University of Peradeniyaya.
> >> LinkedIn <http://lk.linkedin.com/in/menakajayawardena>
> >> TP:- 071 885 1183/ 071 350 5470
> >>
> >
> >
> >
> > --
> > *Menaka Madushanka Jayawardena*
> > Faculty of Engineering, <http://www.pdn.ac.lk/eng>
> > University of Peradeniyaya.
> > LinkedIn <http://lk.linkedin.com/in/menakajayawardena>
> > TP:- 071 885 1183/ 071 350 5470
>



-- 
*Menaka Madushanka Jayawardena*
Faculty of Engineering, <http://www.pdn.ac.lk/eng>
University of Peradeniyaya.
LinkedIn <http://lk.linkedin.com/in/menakajayawardena>
TP:- 071 885 1183/ 071 350 5470

Reply via email to