On 3/15/06, Jérôme Charron <[EMAIL PROTECTED]> wrote: > > > > I am not familiar with Rhino engine. But it is said jdk 6 adopted it > > as embeded javascript engine. Can we build one RhinoInterpreter first, > > and then evaluate the javascipt function to get the result rather than > > extracting pure text now. > > Hi Jack, > > I recently write a small article about search engine and javascript (in > french, sorry): > http://www.moteurzine.com/archives/2006/moteurzine127.html#2 > > My conslusion is simply: Ok, you can figure to you use a javascript > interpreter to extract > URLs. But in fact, how could you simulate all the user interaction? > You could you make that the nutch crawler acts as a human user? > Interpreting Javascript is one thing, knowing all the possible outputs of a > javascript is another one. > No? Hi Jérôme. Thanks for you article even I don't know french at all. I agree with you on "nutch crawler cannot simulate all the user interaction". Somthing like onClick and onKeyDown event. And now I don't how RhinoInterpreter deal with form submit and xmlhttprequest(more time need to know Rhino).
> Regards > > Jérôme > > -- > http://motrech.free.fr/ > http://www.frutch.org/ > > -- Keep Discovering ... ... http://www.jroller.com/page/jmars ------------------------------------------------------- This SF.Net email is sponsored by xPML, a groundbreaking scripting language that extends applications into web and mobile media. Attend the live webcast and join the prime developer group breaking into this new coding territory! http://sel.as-us.falkag.net/sel?cmd=lnk&kid0944&bid$1720&dat1642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
