hi, Now i'm working in a custom plugin to add some extra field to elasticsearch index, you can do the same for your own logic, i learn a lot by read the code of some extisting plugin like tika, elastic indexer....
On Mon, Mar 13, 2017 at 8:25 PM, Srinivasan Ramaswamy <[email protected]> wrote: > Hi nutch-users, > > I would like to write a nutch plugin to parse each url and extract > different elements from the page (using something like jsoup parser) and > construct a json and write it to s3 (I am running my nutch cluster in AWS). > I am curious to know whether there is any existing plugin that can do some > of the work for me. > > I do see an example of how to write a parser plugin over at > https://wiki.apache.org/nutch/WritingPluginExample-1.2 > I am curious to hear from people who have tried a similar use case, to > learn from others experience. > > Thanks > Srini > -- Concepteur et développeur web symfony2 https://github.com/lsroudi http://lsroudi.com/

