With c# .net framework, I am developing a module for full text search in 
about 1,000,000 files (each file is less than 500 KB, and search conditions 
such as AND OR, should be available to be applied to each file), and there 
is a chance that I can upload all the files onto Dictionary or List<string> 
objects(in-memory) 
when the app is started.

I am considering the following strategies.

1) List<string> (local in-memory) : Parallel.foreach and apply a regex (as 
I need indexes of search words...) for each string.

2) Open source : Lucene

3) Open source : Elastic search

4) Open source : Yara ( I am aware that it is for detecting malwares. A 
developer recommended it for me. https://github.com/stellarbear/YaraSharp)

5) Redis or DB (This seems slower than 1))

Which one is the fastest? I do not know whether Yara is appropriate for 
this case..


-- 
You received this message because you are subscribed to the Google Groups 
"YARA" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/yara-project/03d6fe44-5925-4075-a227-2142e8f7dc41%40googlegroups.com.

Reply via email to