Hi Markus,

Thanks for sharing your experience. We can use HtmlUnit four our
feature. Actually I do not understand their architecture and in our
Nutch architecture How should we position HtmlUnit ? How do they
handle Ajax based pages For example How do they know which js function
should run ?  Secondly How to handle Js based Redirect pages ? Have
you any idea ?

2015-07-06 13:02 GMT+03:00 Markus Jelsma <[email protected]>:
> Hello Talat - we have used HtmlUnit to execute JS inside our parsers. It 
> works very well but, whatever i tried, i have not been able to make events 
> work on scrolldown. Since HtmlUnit is a lib, it does not require a separate 
> daemon such as Selenium, which is an advantage in distributed fault-tolerant 
> jobs.
>
> M.
>
> -----Original message-----
>> From:Talat Uyarer <[email protected]>
>> Sent: Monday 6th July 2015 11:34
>> To: [email protected]
>> Subject: Nutch and JS/Css rendering
>>
>> Hi all,
>>
>> I saw in there[1] "Google decided to try to understand pages by
>> executing JavaScript." What do you think, can we  give JS rendering
>> support for Nutch ? If you have an idea please share with me, I will
>> be glad.
>>
>> [1] 
>> http://googlewebmastercentral.blogspot.com.tr/2014/05/understanding-web-pages-better.html
>>
>> --
>> Talat UYARER
>>



-- 
Talat UYARER
Websitesi: http://talat.uyarer.com
Twitter: http://twitter.com/talatuyarer
Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304

Reply via email to