hi,
Now i'm working in a custom plugin to add some extra field to elasticsearch
index, you can do the same for your own logic, i learn a lot by read the
code of some extisting plugin like tika, elastic indexer....



On Mon, Mar 13, 2017 at 8:25 PM, Srinivasan Ramaswamy <[email protected]>
wrote:

> Hi nutch-users,
>
> I would like to write a nutch plugin to parse each url and extract
> different elements from the page (using something like jsoup parser) and
> construct a json and write it to s3 (I am running my nutch cluster in AWS).
> I am curious to know whether there is any existing plugin that can do some
> of the work for me.
>
> I do see an example of how to write a parser plugin over at
> https://wiki.apache.org/nutch/WritingPluginExample-1.2
> I am curious to hear from people who have tried a similar use case, to
> learn from others experience.
>
> Thanks
> Srini
>



-- 
Concepteur et développeur web symfony2
https://github.com/lsroudi
http://lsroudi.com/

Reply via email to