Sir i am PG student, my research topic is to optimize the indexing file [reduce index file size, RAM usage, CPU utilization, and create index with payload to improve searching speed].
Currently working scope is Desktop search engine 1.i am using lucene for indexing the pdf files[indexing file name and content]. after applying standard analyzer lucene index file size is 11 MB for 1.77GB and windows 8 windows.edb file size 42 MB for 1.77GB[Tested for windows desktop environment]. So the space complexity is done. How to do time complexity? 2. how to apply lemmatization with standard analyzer to reduce index file size and ADD PAYLOAD during indexing. 3. from where i can find the test benchmark. -- Regards Rahevar Mrugendrasinh