Hello,
I'm trying to crawl a local Intranet using nutch-1.0.  The difficulty comes
from crawling bulletin board.  The bulletin-board consists of javascipt
code.  Nutch must use the JSParsefilter to parse the javascipt and move up
to next b-board page for the contents parsing, but the crawler doesn't
extract the proper link.  Anyone had similar experience? Any help will be
appreciated.

Thank You!

Reply via email to