Dear Patrick,
Sure, I was thinking of selenium. It seems that there is nutch plugin for
this purpose which works with selenium. But I did not test that yet.
Regards.


On Mon, Sep 1, 2014 at 6:51 PM, Patrick Kirsch <[email protected]> wrote:

> Am 06.08.2014 10:24, schrieb Ali Nazemian:
> > Dear all,
> > Hi,
> > - Some of forums use java script for identifying paging and java script
> is
> > a client side programming language. Somehow it should be parsed with
> nutch.
> Parsing of plain javascript files (plain links) is possible.
> Difficult is the situation, if links will be generated (e.g. click
> events) through a Javascript JQuery Framework like JQuery.
> In this case Nutch needs to behave more like a browser and need the help
> of selenium, phantomjs or xulrunner etc.
> > - The depth method of nutch for crawling becomes useless since each page
> > consider in new depth. But also infinite depth is off the choice cause it
> > can be face us with infite crawling!
> > - More...
> > I really appreciate if somebody guide me through this subject.
> > Best regards.
> >
> Regards,
>  Patrick
>



-- 
A.Nazemian

Reply via email to