Hello Talat! You can embed it in a parse filter plugin or even better, a parser plugin. There is a method to detect client side redirects via JS but also meta tags. HtmlUnit will start methods on most events such as onLoad, just like a browser would.
M. -----Original message----- > From:Talat Uyarer <[email protected]> > Sent: Monday 6th July 2015 15:38 > To: [email protected] > Subject: Re: Nutch and JS/Css rendering > > Hi Markus, > > Thanks for sharing your experience. We can use HtmlUnit four our > feature. Actually I do not understand their architecture and in our > Nutch architecture How should we position HtmlUnit ? How do they > handle Ajax based pages For example How do they know which js function > should run ? Secondly How to handle Js based Redirect pages ? Have > you any idea ? > > 2015-07-06 13:02 GMT+03:00 Markus Jelsma <[email protected]>: > > Hello Talat - we have used HtmlUnit to execute JS inside our parsers. It > > works very well but, whatever i tried, i have not been able to make events > > work on scrolldown. Since HtmlUnit is a lib, it does not require a separate > > daemon such as Selenium, which is an advantage in distributed > > fault-tolerant jobs. > > > > M. > > > > -----Original message----- > >> From:Talat Uyarer <[email protected]> > >> Sent: Monday 6th July 2015 11:34 > >> To: [email protected] > >> Subject: Nutch and JS/Css rendering > >> > >> Hi all, > >> > >> I saw in there[1] "Google decided to try to understand pages by > >> executing JavaScript." What do you think, can we give JS rendering > >> support for Nutch ? If you have an idea please share with me, I will > >> be glad. > >> > >> [1] > >> http://googlewebmastercentral.blogspot.com.tr/2014/05/understanding-web-pages-better.html > >> > >> -- > >> Talat UYARER > >> > > > > -- > Talat UYARER > Websitesi: http://talat.uyarer.com > Twitter: http://twitter.com/talatuyarer > Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304 >

