1)use tika to index .msg files (Tika does support Microsoft outlook format
and I am already using Tika: http://lucene.apache.org/tika/formats.html).
2)while indexing you'll have to write handler to extract To, CC, Bcc values
and store it in a separate field in index.
3)when user searches on .msg files, compare if s/he is in To, Cc, Bcc field
first before returning result to the page and filter results accordingly.



Abhishek Srivastava-2 wrote:
> 
> Hello Everyone,
> 
> In my company, we store a lot of old emails (.msg files) in a database
> (done
> for the purpose of legal compliance).
> 
> The users have been asking us to give search functionality on the old
> emails.
> 
> One of the primary requirement is that when people search, they should
> only
> be able to search in their own emails (emails in which they were in the
> to,
> cc or bcc list).
> 
> How can solr be used?
> 
> from what I know about this product is that it only searches xml
> content...
> so I will have to extract the body of the email and convert it to xml
> right?
> 
> How will I limit the search results to only those emails where the user
> who
> is searching was in the to, cc or bcc list?
> 
> Please do recommend me an approach for providing a solution to our
> requirement.
> 
> 

-- 
View this message in context: 
http://old.nabble.com/Searching-.msg-files-tp26788199p26835015.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to