Hi, Stefan,

On Sun, May 16, 2004 at 07:52:17PM +0200, Stefan Groschupf wrote:
> Hi John,
> 
> nutch use the Log Api of java 1.4.x and not log4j.
> I would prefer using log4j as well, since it more flexible from my  
> point of view.
> But just for a content extractor add log4j?
> Or are there any sub dependencies in your used libraries?

My patch uses PDFBox to parse pdf files. PDFBox uses log4j.
And that's the only place log4j is used.
We may have to find a way to remove/suppress log4j if it is needed.
For now, just present the code to people who want to use it right way.

> Do you check that the license  are compatible to the nutch license?

PDFBox is BSD license, should be okay with Nutch in my opinoin.
Doug should have a final say on this.

Others are all of apache license.

> Do you notice that there is already a pdf, word and excel content  

I did not know. Where is the parser code?

> extractor but just for the still not ready plugin mechanism?

This patch is from my recent work under deadline pressure.
I could not afford waiting.

John


-------------------------------------------------------
This SF.Net email is sponsored by: SourceForge.net Broadband
Sign-up now for SourceForge Broadband and get the fastest
6.0/768 connection for only $19.95/mo for the first 3 months!
http://ads.osdn.com/?ad_id=2562&alloc_id=6184&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to