Tika update

2006-08-16 Thread Jukka Zitting
Hi, There was recently discussion on perhaps starting a new Lucene sub-project, named Tika, to create a general-purpose library from the parser components and other features in Nutch that might interest a wider audience. To keep things rolling we've created a temporary staging area for the

Re: Tika update

2006-08-16 Thread Chris Mattmann
Hi Jukka, Thanks for your email. Indeed, there was discussion on the Lucene PMC email list, about the Tika project. It was decided by the powers that be to discuss it more on the Nutch mailing list before moving forward with any vote on making Tika a sub-project of Apache Lucene. With regards to

Re: Tika update

2006-08-16 Thread Sami Siren
Chris Mattmann wrote: However, the current Nutch software contains many value-added pieces of code that are monolithically packaged together. If the services and capabilities from the code were provided as separate, modular component libraries, such services and capabilities could benefit many

Re: Tika update

2006-08-16 Thread Jukka Zitting
Hi, On 8/16/06, Sami Siren [EMAIL PROTECTED] wrote: IMO to solve the main problem one does not need to set up another project, just refactor and repackage. I'd be happy either way, as long as I get a nice reusable library to use in Jackrabbit. :-) I think the key question on whether to