Hello Talat! You can embed it in a parse filter plugin or even better, a parser 
plugin. There is a method to detect client side redirects via JS but also meta 
tags. HtmlUnit will start methods on most events such as onLoad, just like a 
browser would.

M.
 
-----Original message-----
> From:Talat Uyarer <[email protected]>
> Sent: Monday 6th July 2015 15:38
> To: [email protected]
> Subject: Re: Nutch and JS/Css rendering
> 
> Hi Markus,
> 
> Thanks for sharing your experience. We can use HtmlUnit four our
> feature. Actually I do not understand their architecture and in our
> Nutch architecture How should we position HtmlUnit ? How do they
> handle Ajax based pages For example How do they know which js function
> should run ?  Secondly How to handle Js based Redirect pages ? Have
> you any idea ?
> 
> 2015-07-06 13:02 GMT+03:00 Markus Jelsma <[email protected]>:
> > Hello Talat - we have used HtmlUnit to execute JS inside our parsers. It 
> > works very well but, whatever i tried, i have not been able to make events 
> > work on scrolldown. Since HtmlUnit is a lib, it does not require a separate 
> > daemon such as Selenium, which is an advantage in distributed 
> > fault-tolerant jobs.
> >
> > M.
> >
> > -----Original message-----
> >> From:Talat Uyarer <[email protected]>
> >> Sent: Monday 6th July 2015 11:34
> >> To: [email protected]
> >> Subject: Nutch and JS/Css rendering
> >>
> >> Hi all,
> >>
> >> I saw in there[1] "Google decided to try to understand pages by
> >> executing JavaScript." What do you think, can we  give JS rendering
> >> support for Nutch ? If you have an idea please share with me, I will
> >> be glad.
> >>
> >> [1] 
> >> http://googlewebmastercentral.blogspot.com.tr/2014/05/understanding-web-pages-better.html
> >>
> >> --
> >> Talat UYARER
> >>
> 
> 
> 
> -- 
> Talat UYARER
> Websitesi: http://talat.uyarer.com
> Twitter: http://twitter.com/talatuyarer
> Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304
> 

Reply via email to