On 2/14/11 4:49 AM, Radhouane Aniba wrote:
Hello everyone,
This is a rather unusual request for this list: I am wondering whether there
is any analysis engine that can mine mbox-like formats, such as the well-known
Mailman mailing list archives, and structure this kind of data into messages
and their replies.
If anyone has already worked on this topic, I would be very interested in
discussing it further.
We have a Tika integration, and Tika has support for mbox.
Maybe that is good enough to do the extraction.
Jörn
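
For illustration only, here is a minimal sketch of the message/reply structuring
asked about above. It does not use the Tika integration mentioned in the reply.
Instead it uses Python's standard mailbox module and assumes that replies can be
linked to their parents through the Message-ID and In-Reply-To headers. The
function name thread_mbox is hypothetical, and real archives may need extra
handling (for example the References header, or missing Message-IDs).

```python
import mailbox
from collections import defaultdict


def thread_mbox(path):
    """Group the messages of an mbox file into roots and parent -> replies.

    Hypothetical helper: links messages only through the Message-ID and
    In-Reply-To headers, taking each header value as a single identifier.
    """
    subjects = {}                # Message-ID -> Subject
    replies = defaultdict(list)  # parent Message-ID -> child Message-IDs
    roots = []                   # Message-IDs with no known parent

    box = mailbox.mbox(path)
    try:
        for msg in box:
            msg_id = (msg.get("Message-ID") or "").strip()
            if not msg_id:
                continue  # cannot be threaded without an identifier
            subjects[msg_id] = msg.get("Subject", "")
            parent = (msg.get("In-Reply-To") or "").strip()
            if parent:
                replies[parent].append(msg_id)
            else:
                roots.append(msg_id)
    finally:
        box.close()

    # Replies whose parent is not in the archive are treated as thread roots.
    for parent in list(replies):
        if parent not in subjects:
            roots.extend(replies.pop(parent))

    return roots, dict(replies), subjects
```

Calling thread_mbox("archive.mbox") on an archive (the file name is a
placeholder) returns the thread roots, a mapping from each parent Message-ID to
the Message-IDs of its direct replies, and the subject of every message seen.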