One way around this is to have a custom protocol implementation and get it to fetch via Selenium
J. On 21 June 2013 19:54, Lewis John Mcgibbney <[email protected]>wrote: > Hi, > Nearly all of this page is generated by JS right? > Right now my answer is no. We fetch then parse page source... which in this > case is mostly all JS. The magic happens in the browser. > ... > Lewis > > > On Tue, Jun 18, 2013 at 10:59 PM, Deals Collect <[email protected] > >wrote: > > > Hi all, > > > > Can Nutch get the HTML content generated by Javascript? For example, this > > job site > > > > > https://schneiderele.taleo.net/careersection/2/jobdetail.ftl?job=72522&lang=en > > > > > > Many thanks, > > > > > > -- > *Lewis* > -- * *Open Source Solutions for Text Engineering http://digitalpebble.blogspot.com/ http://www.digitalpebble.com http://twitter.com/digitalpebble

