> I am not familiar with the Rhino engine, but it is said that JDK 6 adopted it
> as its embedded JavaScript engine. Can we build a RhinoInterpreter first,
> and then evaluate the JavaScript function to get the result, rather than
> extracting pure text as we do now?

Hi Jack,

I recently wrote a short article about search engines and JavaScript
(in French, sorry):
http://www.moteurzine.com/archives/2006/moteurzine127.html#2

My conclusion is simply this: OK, you can use a JavaScript interpreter
to extract URLs. But how would you simulate all the user interactions?
How could you make the Nutch crawler act like a human user?
Interpreting JavaScript is one thing; knowing all the possible outputs
of a script is another. No?

Regards,
Jérôme

--
http://motrech.free.fr/
http://www.frutch.org/
