Hi everyone,

I came up with an implementation plan for the $subject. This will be able
to detect file content changes as well as deletions and additions.

Methodology:
1. Generate a checksum (MD5/SHA) for each file. These checksum values will
be written to a single properties file in the following format:

*path_to_the_file=checksum_string*


2. On the first run, the checksums will be calculated and the
properties file will be created.

3. When running a query,

   1. The properties file will be read and loaded into memory.
   2. The checksum values will be checked for each file.
   3. If any modification is detected, the index will be updated and the
   new checksum value will be stored.
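
As a rough illustration of steps 1 to 3, here is a minimal Python sketch. It is not the actual implementation: the function names, the choice of SHA-256, and the `update_index` callback are all assumptions made for the example, and the properties file is assumed to live outside the scanned directory. It only covers content modifications of files that are already recorded.

```python
import hashlib
import os


def file_checksum(path, algorithm="sha256"):
    """Return the hex digest of a file, read in chunks to bound memory use."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def scan(root):
    """Map every file path under root to its current checksum."""
    checksums = {}
    for directory, _subdirs, names in os.walk(root):
        for name in names:
            path = os.path.join(directory, name)
            checksums[path] = file_checksum(path)
    return checksums


def save_checksums(checksums, props_path):
    """Write one path_to_the_file=checksum_string line per file."""
    with open(props_path, "w", encoding="utf-8") as handle:
        for path in sorted(checksums):
            handle.write(f"{path}={checksums[path]}\n")


def load_checksums(props_path):
    """Read path=checksum lines back into a dict."""
    checksums = {}
    with open(props_path, encoding="utf-8") as handle:
        for line in handle:
            line = line.rstrip("\n")
            if line:
                # Split on the last '=': a hex digest never contains one.
                path, _sep, checksum = line.rpartition("=")
                checksums[path] = checksum
    return checksums


def refresh_modified(root, props_path, update_index):
    """Record checksums on the first run; afterwards re-index modified files.

    update_index is a hypothetical hook standing in for the real index update.
    Returns the list of paths whose content changed.
    """
    if not os.path.exists(props_path):
        save_checksums(scan(root), props_path)  # first run: just record
        return []
    stored = load_checksums(props_path)
    updated = []
    for path, checksum in scan(root).items():
        if path in stored and stored[path] != checksum:
            update_index(path)
            stored[path] = checksum  # keep the new checksum value
            updated.append(path)
    save_checksums(stored, props_path)
    return updated
```

Calling `refresh_modified(root, props_path, update_index)` before each query would then create the properties file on the first call and return the modified paths on later calls.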

When checking the checksums, each file's path is taken from the file on
disk and used to look up its stored checksum in the properties file.
Because we start from the actual files, file insertions and deletions can
also be detected.
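
To show how the three kinds of change could be told apart, here is a small Python sketch. It assumes that an insertion is a path on disk with no stored checksum and that a deletion is a stored path that no longer exists on disk; the function name and the sample paths and checksum strings are made up for the example.

```python
def classify_changes(stored, current):
    """Compare stored checksums with those freshly computed from disk.

    Both arguments map file path -> checksum string.
    Returns three sorted lists: (added, deleted, modified).
    """
    added = sorted(path for path in current if path not in stored)
    deleted = sorted(path for path in stored if path not in current)
    modified = sorted(
        path for path in current
        if path in stored and stored[path] != current[path]
    )
    return added, deleted, modified


# Hypothetical data: what the properties file holds vs. what is on disk now.
stored = {"docs/a.txt": "111", "docs/b.txt": "222", "docs/c.txt": "333"}
current = {"docs/a.txt": "111", "docs/b.txt": "999", "docs/d.txt": "444"}

print(classify_changes(stored, current))
# (['docs/d.txt'], ['docs/c.txt'], ['docs/b.txt'])
```

Here `docs/d.txt` is reported as added, `docs/c.txt` as deleted, and `docs/b.txt` as modified, so the index could be updated differently for each case.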

To make the process clearer, I have attached a flow diagram.

I'd be very happy to have any feedback on this approach.

Thank you very much,
Menaka

-- 
*Menaka Madushanka Jayawardena*
Faculty of Engineering, <http://www.pdn.ac.lk/eng>
University of Peradeniya.
LinkedIn <http://lk.linkedin.com/in/menakajayawardena>
TP:- 071 885 1183/ 071 350 5470
