Hi Markus, Thanks for sharing your experience. We can use HtmlUnit four our feature. Actually I do not understand their architecture and in our Nutch architecture How should we position HtmlUnit ? How do they handle Ajax based pages For example How do they know which js function should run ? Secondly How to handle Js based Redirect pages ? Have you any idea ?
2015-07-06 13:02 GMT+03:00 Markus Jelsma <[email protected]>: > Hello Talat - we have used HtmlUnit to execute JS inside our parsers. It > works very well but, whatever i tried, i have not been able to make events > work on scrolldown. Since HtmlUnit is a lib, it does not require a separate > daemon such as Selenium, which is an advantage in distributed fault-tolerant > jobs. > > M. > > -----Original message----- >> From:Talat Uyarer <[email protected]> >> Sent: Monday 6th July 2015 11:34 >> To: [email protected] >> Subject: Nutch and JS/Css rendering >> >> Hi all, >> >> I saw in there[1] "Google decided to try to understand pages by >> executing JavaScript." What do you think, can we give JS rendering >> support for Nutch ? If you have an idea please share with me, I will >> be glad. >> >> [1] >> http://googlewebmastercentral.blogspot.com.tr/2014/05/understanding-web-pages-better.html >> >> -- >> Talat UYARER >> -- Talat UYARER Websitesi: http://talat.uyarer.com Twitter: http://twitter.com/talatuyarer Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304

