I would have some spare cycles starting end of july until end of august.. but I would need some short explanation where and how to integrate the flash text extractor. furthermore is there any document, whatsoever explaining the nutch deign approach? I never had a look at the sources of nutch and the design is very much tuned for performance, which does not make it easier to understand it but better to use it :-)

try:
http://wiki.media-style.com/display/nutchDocu/

HTH
Stefan

Reply via email to