RE: nutch 1.15 index multiple cores with solr 7.5

2018-12-21 Thread hany . nasr
Same issue here. What did you do with url regex & normalization?; these configurations might be changed from site to another. Kind regards, Hany Shehata Enterprise Engineer Green Six Sigma Certified Solutions Architect, Marketing and Communications IT Corporate Functions | HSBC Operations, Ser

Re: nutch 1.15 index multiple cores with solr 7.5

2018-12-21 Thread Sebastian Nagel
Hi, Nutch loads all configuration files from the Java class path and picks the first file found on the class path (and ignores other files with the same name). If there are multiple crawls with different configurations, just place a crawl-specific configuration directory in front of the classpat

Re: Apache Nutch 2.3.1 not able to fetch content rendered by ajax

2018-12-21 Thread Sebastian Nagel
Hi, sorry for the late reply. Looks like one of the really nasty dependency conflicts with incompatible class implementations resp. versions which are only observed at runtime. That's the potential conflicting candidates (from current master): runtime/local/plugins/lib-selenium/xml-apis-1.4.01.

RE: Apache Nutch 2.3.1 not able to fetch content rendered by ajax

2018-12-21 Thread Venkata MR
Hi Sebastian, Pls find the link for issue: https://issues.apache.org/jira/browse/NUTCH-2681 Thanks & Regards Venkata MR +91 98455 77125 -Original Message- From: Sebastian Nagel Sent: 21 December 2018 19:19 To: user@nutch.apache.org Cc: Venkata MR Subject: Re: Apache Nutch 2.3.1 not a