Hi,
There was recently discussion on perhaps starting a new Lucene
sub-project, named Tika, to create a general-purpose library from the
parser components and other features in Nutch that might interest a
wider audience. To keep things rolling we've created a temporary
staging area for the
Hi Jukka,
Thanks for your email. Indeed, there was discussion on the Lucene PMC email
list, about the Tika project. It was decided by the powers that be to
discuss it more on the Nutch mailing list before moving forward with any
vote on making Tika a sub-project of Apache Lucene. With regards to
Chris Mattmann wrote:
However, the current Nutch software contains many value-added pieces of
code that are monolithically packaged together. If the services and
capabilities from the code were provided as separate, modular component
libraries, such services and capabilities could benefit many
Hi,
On 8/16/06, Sami Siren [EMAIL PROTECTED] wrote:
IMO to solve the main problem one does not need to set up another
project, just refactor and repackage.
I'd be happy either way, as long as I get a nice reusable library to
use in Jackrabbit. :-)
I think the key question on whether to