Hi Bai, As you know, were using Tika for a lot of our parsing and content extraction now. Without you expanding on your request all I can really do is direct you to the plugin central section of the wiki where you will find a comprehensive quick-start guide to developing plugins for Nutch.
On Fri, Sep 23, 2011 at 8:04 PM, Bai Shen <[email protected]> wrote: > Are there any good tutorials/examples for custom parsing? I need to parse > additional formats and also look for additional metadata. > -- *Lewis*

