Sir i am PG student, my research topic is to optimize the indexing file
[reduce index file size, RAM usage, CPU utilization, and create index with
payload to improve searching speed].

Currently working scope is Desktop search engine

1.i am using lucene for indexing the pdf files[indexing file name and
content]. after applying standard analyzer lucene index file size is 11 MB
for 1.77GB
and windows 8 windows.edb file size 42 MB for 1.77GB[Tested for windows
desktop environment]. So the space complexity is done.

How to do time complexity?

2. how to apply lemmatization with standard analyzer to reduce index file
size and ADD PAYLOAD during indexing.

3. from where i can find the test benchmark.

-- 
Regards

Rahevar Mrugendrasinh

Reply via email to