Hello,

I need to develop a "French" parser. Google indexes French documents by mapping the "é" (HTML: &eacute;) and "è" characters to "e". I think there is already a French parser for Lucene, so this part is not really a problem.
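To illustrate the kind of folding I mean, here is a minimal sketch in plain Java. It uses only the standard library (java.text.Normalizer), not any Lucene or Nutch API. The class and method names (AccentFolding, foldAccents) are made up for illustration, and it assumes that stripping Unicode combining marks is a close enough approximation of what the search engines do.

```java
import java.text.Normalizer;

public class AccentFolding {

    // Hypothetical helper, not part of Lucene or Nutch.
    // Step 1: decompose each character (NFD), so "é" becomes "e" + a combining accent.
    // Step 2: remove the combining marks, leaving the base letters.
    static String foldAccents(String input) {
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // Unicode escapes are used so the source compiles regardless of file encoding.
        // "\u00e9l\u00e8ve" is the French word for "pupil", with both accents.
        System.out.println(foldAccents("\u00e9l\u00e8ve")); // prints "eleve"
    }
}
```

In a real plugin this logic would presumably live in a token filter applied at both index and query time, so that accented and unaccented spellings match each other.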

The problem is: can it be created as a Nutch plugin? Where should I put it? Has any project along these lines already been started?

Thanks

Christophe.


_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers
