On Thu, 9 Jun 2011, sgraessle wrote:
I am looking at integrating the Alfresco code in order to handle the images. What would be the most efficient way to do so? I went ahead and downloaded the entire Alfresco project, but I don't need all of it only the HTML rendering capacity and I would like to deal eliminate the extra code.
Just use code from the class I pointed you at, it's largely standalone. Also be aware of the license - Alfresco is not under the same license as Tika (it's LGPL instead of ASL)
You may also want to just crib off/use the Tika app code for embedded document extraction, that's already built into Tika and does everything you need
Nick
