It looks like CONNECTORS-627 is going to make us spin another release candidate.

Karl

On Tue, Jan 29, 2013 at 11:41 AM, Karl Wright <[email protected]> wrote:
> I ran the multiprocess example, with httpmime.jar just as we deliver
> it in the connector-lib directory, and I did not see this issue. It
> is most likely a configuration problem.
>
> Karl
>
> On Tue, Jan 29, 2013 at 11:26 AM, Erlend Garåsen
> <[email protected]> wrote:
>>
>> I have to run now, but I will investigate this further. BTW, I have the
>> following in my lib folder, so it should work:
>> httpmime.jar
>>
>> I did not see this yesterday when I was testing RC6 with Resin. The
>> difference now is that the crawler just fetches and fetches, but nothing
>> gets posted to Solr.
>>
>> I hope it is me who has misconfigured something, but I will get back to this
>> as soon as possible.
>>
>> FATAL 2013-01-29 17:19:17,609 (Worker thread '17') - Error tossed: org/apache/http/entity/mime/content/ContentBody
>> java.lang.NoClassDefFoundError: org/apache/http/entity/mime/content/ContentBody
>>         at org.apache.manifoldcf.agents.output.solr.HttpPoster.<init>(HttpPoster.java:246)
>>         at org.apache.manifoldcf.agents.output.solr.SolrConnector.getSession(SolrConnector.java:256)
>>         at org.apache.manifoldcf.agents.output.solr.SolrConnector.removeDocument(SolrConnector.java:629)
>>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.removeDocument(IncrementalIngester.java:1598)
>>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.performIngestion(IncrementalIngester.java:469)
>>         at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:370)
>>         at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocument(WorkerThread.java:1651)
>>         at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.deleteDocument(WorkerThread.java:1672)
>>         at org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.processDocuments(WebcrawlerConnector.java:1445)
>>         at org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:423)
>>         at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:551)
>> Caused by: java.lang.ClassNotFoundException: org.apache.http.entity.mime.content.ContentBody
>>         at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
>>         at java.security.AccessController.doPrivileged(Native Method)
>>         at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
>>         at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
>>         at java.net.FactoryURLClassLoader.loadClass(URLClassLoader.java:627)
>>         at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
>>         ... 11 more
>>
>> Erlend
>>
>>
>> On 28.01.13 23.09, Karl Wright wrote:
>>>
>>> Please vote on whether or not to release ManifoldCF 1.1, RC7.
>>>
>>> The release artifact can be found at:
>>>
>>> http://people.apache.org/~kwright/apache-manifoldcf-1.1
>>>
>>> There is a tag at:
>>>
>>> https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.1-RC7
>>>
>>> This release candidate fixes a packaging problem for wars similar to
>>> CONNECTORS-619. It also fixes a problem with the CMIS connector
>>> and another SolrJ-related issue (CONNECTORS-622 and CONNECTORS-623).
>>>
>>> This release candidate provides a better workaround for
>>> CONNECTORS-616 than RC5. It also fixes CONNECTORS-617.
>>>
>>> This release candidate fixes one problem since RC4: the commit
>>> action path for Solr commits in the Solr connector was not
>>> configurable. This needed to be fixed to maintain backwards
>>> compatibility. CONNECTORS-621.
>>>
>>> This release candidate fixes two problems since RC3. The problems
>>> were in the included jars for the multiprocess example (CONNECTORS-619)
>>> and in connection leakage for JDBC handles (CONNECTORS-620).
>>>
>>> This release candidate fixes one problem since RC2. The problem is
>>> CONNECTORS-618, which relates to MySQL performance.
>>>
>>> This release candidate fixes one additional problem since RC1. The
>>> problem is CONNECTORS-616, and relates to Solr dropping connections
>>> during indexing.
>>>
>>> This release candidate fixes two other problems since RC0, both
>>> related to Solr 4.0.0 support.
>>> - CONNECTORS-613: The version of Tika used in Solr 4.0.0 cannot
>>> extract text unless told an accurate mime type. While this is
>>> probably a Tika bug, in this ticket we at least make sure a good guess
>>> as to the mime type is sent to Solr.
>>> - CONNECTORS-614: Fix logic having to do with releasing idle Solr
>>> connections. This shows up as socket timeout exceptions, because it
>>> becomes very easy to exhaust the Solr application server's thread pool
>>> when idle connections are not released in a timely way.
>>>
>>> This release includes a significant amount of long-planned upgrading
>>> and refactoring since Apache ManifoldCF 1.0.1, including:
>>> - Port to HttpComponents from commons-httpclient
>>> - Port to SolrJ from homegrown for the Solr connector, so that
>>> SolrCloud is supported
>>> - Improved NTLM support
>>> - Partial Kerberos support
>>> - Many other improvements, which are summarized in CHANGES.txt
>>>
>>
>>
>> --
>> Erlend Garåsen
>> Center for Information Technology Services
>> University of Oslo
>> P.O. Box 1086 Blindern, N-0317 OSLO, Norway
>> Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
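
For anyone trying to narrow down a NoClassDefFoundError like the one in the trace above, one way to check whether a given jar set actually exposes org.apache.http.entity.mime.content.ContentBody is a small standalone probe. The following is only a minimal sketch and is not part of ManifoldCF: the class name ClassVisibilityCheck and the method describe are made up for illustration. It also only tests the classpath it is started with. The trace shows the failing lookup going through a URLClassLoader inside the running crawler, which this probe does not reproduce, so a positive result here does not prove the deployed configuration is correct.

```java
import java.security.CodeSource;

// Hypothetical helper, not part of ManifoldCF: reports whether a class
// can be loaded by a given class loader and which location supplied it.
class ClassVisibilityCheck {

    public static String describe(String className, ClassLoader loader) {
        try {
            // initialize=false: load the class without running static initializers
            Class<?> c = Class.forName(className, false, loader);
            CodeSource src = c.getProtectionDomain().getCodeSource();
            // JDK core classes typically have no code source
            String where = (src == null || src.getLocation() == null)
                    ? "unknown location"
                    : src.getLocation().toString();
            return "FOUND " + className + " in " + where;
        } catch (ClassNotFoundException | LinkageError e) {
            // NoClassDefFoundError is a LinkageError, so it is caught here too
            return "MISSING " + className + " (" + e + ")";
        }
    }

    public static void main(String[] args) {
        // Defaults to the class named in the stack trace above
        String name = args.length > 0
                ? args[0]
                : "org.apache.http.entity.mime.content.ContentBody";
        System.out.println(describe(name, ClassVisibilityCheck.class.getClassLoader()));
    }
}
```

Assuming the compiled class sits in the current directory, it could be run against the delivered jar with something like `java -cp connector-lib/httpmime.jar:. ClassVisibilityCheck` (use `;` instead of `:` on Windows), and then again against the lib folder of the failing installation to compare the two.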
