Hi community,

 

I have a general understanding of Lucene concepts, and I'm wondering if
it's the right tool for my job:

 

- I need to extract data like e.g. time intervals ("8am - 12pm"), street
addresses from a set of files. The common issue with this data unit is
that they contain spaces and are not always definable through regexes.

 

- the extraction must take into consideration the "proximity": for
example, a mail address which is close to the work "Contacts" will
receive a higher rank, since I'm looking for contact data.

 

Do you think I can get any advantage from building a solution on Lucene?

 

  Gianluca

Reply via email to