Andrzej Bialecki wrote:

Philipp Suter wrote:

I would have some spare cycles starting end of july until end of august.. but I would need some short explanation where and how to integrate the flash text extractor. furthermore is there any document, whatsoever explaining the nutch deign approach? I never had a look at the sources of nutch and the design is very much tuned for performance, which does not make it easier to understand it but better to use it :-)


First, you need to check out the complete source from SVN trunk. Then you can copy one of the existing plugins and use it as a template. I attached an Eclipse project - just put these two files in the main directory (where README.txt is), import the project into Eclipse and off you go.

Thanks!

I will have a look at it as soon as I come back from my holiday (appr. 23.7.). Do the sources of Stefan Groschupf still exist or are they included somehow in another plugin? Probably they are a good starting point for a new implementation.


-------------------------------------------------------
This SF.Net email is sponsored by the 'Do More With Dual!' webinar happening
July 14 at 8am PDT/11am EDT. We invite you to explore the latest in dual
core and dual graphics technology at this free one hour event hosted by HP, AMD, and NVIDIA. To register visit http://www.hp.com/go/dualwebinar
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to