javascript crawling

eric park Sun, 06 Jun 2010 18:13:36 -0700

Hello,
I'm trying to crawl a local Intranet using nutch-1.0.  The difficulty comes
from crawling bulletin board.  The bulletin-board consists of javascipt
code.  Nutch must use the JSParsefilter to parse the javascipt and move up
to next b-board page for the contents parsing, but the crawler doesn't
extract the proper link.  Anyone had similar experience? Any help will be
appreciated.


Thank You!

javascript crawling

Reply via email to