Hi, as part of testing my FreeEed <http://freeeed.org/> open source eDiscovery engine, I am processing the 153 Enron PSTs found here <http://www.edrm.net/resources/data-sets/edrm-enron-email-data-set-v2>.
Naturally, I see a lot of errors and warnings. For example, I started with the error described here <https://issues.apache.org/jira/browse/PDFBOX-1008>. To get past it, I upgraded PDFBox from 1.5.0 to 1.6.0, since I am building with Maven from the latest SVN checkout anyway. For the future, though, my question is: is there a more systematic way to approach this? Is anybody interested in the results of all the testing that I am doing, and if so, how should I report my findings?

Thank you,
Mark
