Dear Patrick, Sure, I was thinking of selenium. It seems that there is nutch plugin for this purpose which works with selenium. But I did not test that yet. Regards.
On Mon, Sep 1, 2014 at 6:51 PM, Patrick Kirsch <[email protected]> wrote: > Am 06.08.2014 10:24, schrieb Ali Nazemian: > > Dear all, > > Hi, > > - Some of forums use java script for identifying paging and java script > is > > a client side programming language. Somehow it should be parsed with > nutch. > Parsing of plain javascript files (plain links) is possible. > Difficult is the situation, if links will be generated (e.g. click > events) through a Javascript JQuery Framework like JQuery. > In this case Nutch needs to behave more like a browser and need the help > of selenium, phantomjs or xulrunner etc. > > - The depth method of nutch for crawling becomes useless since each page > > consider in new depth. But also infinite depth is off the choice cause it > > can be face us with infite crawling! > > - More... > > I really appreciate if somebody guide me through this subject. > > Best regards. > > > Regards, > Patrick > -- A.Nazemian

