You can check the following plugin resources:

http://wiki.apache.org/nutch/WritingPluginExample
http://wiki.apache.org/nutch/PluginCentral
http://sujitpal.blogspot.com/2009/07/nutch-custom-plugin-to-parse-and-add.html


Thanks and Regards,
Sonal
<https://github.com/sonalgoyal/hiho>Connect Hadoop with databases,
Salesforce, FTP servers and others <https://github.com/sonalgoyal/hiho>
Nube Technologies <http://www.nubetech.co>

<http://in.linkedin.com/in/sonalgoyal>





On Thu, Feb 10, 2011 at 4:34 PM, firespin <[email protected]> wrote:

> Hello,
>
> I am new to nutch and just recently setup my nutch 1.0 search engine.
> I would like to crawl a list of sites and index only webpages with
> specific footprints in the url and inside the html content. Can nutch
> already do this?. or do I need a plugin? (If so where can I find
> anyone who creates custom nutch plugins?)
>
>
> Terrell
> firespinguy at gmail.com
>

Reply via email to