Re: Custom Parser / Indexer Starting points

2018-02-17 Thread Evert Wagenaar
single Parser returns a result, the following parsers will not
> be run.
>
> > -Original Message-
> > From: David Ferrero [mailto:david.ferr...@zion.com]
> > Sent: 12 February 2018 06:23
> > To: dev@nutch.apache.org
> > Subject: Re: Custom Parser / Indexer Star
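The truncated sentence above appears to describe a "first result wins" chain: parsers are tried in order, and once one of them returns a result the remaining ones are skipped. A minimal, self-contained sketch of that idea follows. It is not Nutch's actual code; `SimpleParser`, `parseWithFirstMatch` and the sample parsers are hypothetical names made up for illustration.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;

public class ParserChainSketch {

    /** Hypothetical stand-in for a parser: returns a result, or empty if it cannot handle the content. */
    interface SimpleParser {
        Optional<String> parse(String content);
    }

    /**
     * Tries each parser in list order and returns the first non-empty result.
     * Parsers that come after the first successful one are never invoked.
     */
    static Optional<String> parseWithFirstMatch(List<SimpleParser> parsers, String content) {
        for (SimpleParser parser : parsers) {
            Optional<String> result = parser.parse(content);
            if (result.isPresent()) {
                return result; // stop here: later parsers are not run
            }
        }
        return Optional.empty(); // no parser produced a result
    }

    public static void main(String[] args) {
        // Assumed example parsers: one only handles XML-like input, one accepts anything.
        SimpleParser xmlOnly = c -> c.startsWith("<") ? Optional.of("xml:" + c) : Optional.empty();
        SimpleParser fallback = c -> Optional.of("text:" + c);

        // The order of the list decides which parser gets the first chance.
        List<SimpleParser> chain = Arrays.asList(xmlOnly, fallback);
        System.out.println(parseWithFirstMatch(chain, "<a/>").orElse("no result"));
        System.out.println(parseWithFirstMatch(chain, "plain").orElse("no result"));
    }
}
```

In a chain like this the list order matters, which is the concern David raises about plugin ordering. How Nutch itself determines that order is not shown in these snippets.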

RE: Custom Parser / Indexer Starting points

2018-02-11 Thread Yossi Tamari
nal Message-
> From: David Ferrero [mailto:david.ferr...@zion.com]
> Sent: 12 February 2018 06:23
> To: dev@nutch.apache.org
> Subject: Re: Custom Parser / Indexer Starting points
>
> Thank you for all the tips. I think I need to understand better the pipeline of
> parse

Re: Custom Parser / Indexer Starting points

2018-02-11 Thread David Ferrero
Thank you for all the tips. I think I need to better understand the pipeline of parsers and if/how their order in plugin.includes matters.

> On Feb 11, 2018, at 1:18 AM, Yossi Tamari wrote:
>
> Hi David,
>
> The interfaces related to extending Nutch parser/indexer are

RE: Custom Parser / Indexer Starting points

2018-02-11 Thread Yossi Tamari
Hi David,

The interfaces related to extending the Nutch parser/indexer are actually very simple. However, finding up-to-date documented samples is not. Luckily, Nutch comes with plenty of them built in, so my suggestion would be to pick one and dive into its implementation. Then just copy its folder and use