Re: [jira] Lius into apache incubator

2007-03-01 Thread Rida Benjelloun
Hi, Thanks Doug, I think that your help will be very appricieted as a mentor. Regards. On 3/1/07, Doug Cutting <[EMAIL PROTECTED]> wrote: Jukka Zitting wrote: > PS. Will people mind if we use this list for fleshing out the details? > I've created a Google Group for Tika where we could also take

Re: [jira] Lius into apache incubator

2007-03-01 Thread Thorsten Scherler
Renaud forwarded me the thread and I just subscribed, so apologize for not proper responding. Thanks Renaud for the headsup. > Hi, > > On 3/1/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > > Is the Droids lab at all related to that parsing project in Nutch? > > Partly, yes. I've been looking

Re: [jira] Lius into apache incubator

2007-03-01 Thread Jukka Zitting
Hi, On 3/1/07, Doug Cutting <[EMAIL PROTECTED]> wrote: Jukka Zitting wrote: > PS. Will people mind if we use this list for fleshing out the details? > I've created a Google Group for Tika where we could also take the > discussion if that's preferred. I think the Incubator Wiki would be the best

Re: [jira] Lius into apache incubator

2007-03-01 Thread Doug Cutting
Jukka Zitting wrote: PS. Will people mind if we use this list for fleshing out the details? I've created a Google Group for Tika where we could also take the discussion if that's preferred. I think the Incubator Wiki would be the best place for this. http://wiki.apache.org/incubator/?action=fu

Re: [jira] Lius into apache incubator

2007-03-01 Thread Jukka Zitting
Hi, On 3/1/07, Rida Benjelloun <[EMAIL PROTECTED]> wrote: On 3/1/07, Jukka Zitting <[EMAIL PROTECTED]> wrote: > Would there be interest within the Lucene PMC in sponsoring a proposal > along such lines? I can volunteer to put together the proposal and act > as the champion and mentor of the proj

Re: [jira] Lius into apache incubator

2007-03-01 Thread Jukka Zitting
Hi, On 3/1/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote: Is the Droids lab at all related to that parsing project in Nutch? Partly, yes. I've been looking at Droids and so far I think it's main focus has been on the crawling part rather than on the analysis of retrieved content. A generic con

Re: [jira] Lius into apache incubator

2007-03-01 Thread Rida Benjelloun
Hi, On 3/1/07, Jukka Zitting <[EMAIL PROTECTED]> wrote: Hi, On 3/1/07, Rida Benjelloun <[EMAIL PROTECTED]> wrote: > Lius could be used as a starting point of Tika project, if Tika committers > are interested on it. We can also as mark said decouple Lius's parser logic > from it's indexing logic

Re: [jira] Lius into apache incubator

2007-03-01 Thread Grant Ingersoll
Is the Droids lab at all related to that parsing project in Nutch? There seems to be several efforts that are related here that could probably make for a nice new project under Lucene, IMO. They all seem to have to do with getting and preparing text for processing by some type of consume

Re: [jira] Lius into apache incubator

2007-03-01 Thread Jukka Zitting
Hi, On 3/1/07, Rida Benjelloun <[EMAIL PROTECTED]> wrote: Lius could be used as a starting point of Tika project, if Tika committers are interested on it. We can also as mark said decouple Lius's parser logic from it's indexing logic. I'm very interested in doing that. Another very useful code

Re: [jira] Lius into apache incubator

2007-03-01 Thread Rida Benjelloun
Hi, You could actually use Lius as text extraction API, I have implement for each Indexer a method that allows you to get the String content of the Document. Lius could be used as a starting point of Tika project, if Tika committers are interested on it. We can also as mark said decouple Lius's pa

Re: [jira] Lius into apache incubator

2007-03-01 Thread Jukka Zitting
Hi, I am interested in a Lius/Tika project that could be used not only with Lucene. As mentioned by Mark, there are a number of related efforts which leads me to believe a application-independent content analysis/parsing tool would be very helpful for many users. I'd like to propose taking the p