-1 so far. Until the problem described below is solved or explained.

Running Jetty within the example folder seems to work normally, but not within the multiprocess-example folder. In both configurations I have defined a Solr Output Connector and a web crawler. The funny thing within the latter folder is that nothing is sent to Solr. The crawler just fetches and fetches, and that is the only activity I can see.

I have ran:
./start-database.sh
./initialize.sh
./start-agents.sh
./start-webapps.sh

The Solr Output connection is working and I have gone through the settings in my job - very similar configurations from my first attempt within the example folder, but nothing shows up.

When I looked in my logs, I discovered this:
FATAL 2013-01-22 14:10:31,802 (Worker thread '43') - Error tossed: Could not initialize class org.apache.solr.client.solrj.impl.HttpSolrServer java.lang.NoClassDefFoundError: Could not initialize class org.apache.solr.client.solrj.impl.HttpSolrServer at org.apache.manifoldcf.agents.output.solr.HttpPoster.<init>(HttpPoster.java:246) at org.apache.manifoldcf.agents.output.solr.SolrConnector.getSession(SolrConnector.java:256) at org.apache.manifoldcf.agents.output.solr.SolrConnector.addOrReplaceDocument(SolrConnector.java:609) at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.addOrReplaceDocument(IncrementalIngester.java:1579) at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.performIngestion(IncrementalIngester.java:504) at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:370) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocument(WorkerThread.java:1651) at org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.processDocuments(WebcrawlerConnector.java:1409) at org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:423) at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:551)

BTW, I'm running Solr 3.1, not the latest version. I don't think this has something to do with the problems described above since my Solr server does not seem to be hit my MCF at all.

Erlend

On 22.01.13 09.59, Karl Wright wrote:
Please vote on whether or not to release ManifoldCF 1.1, RC3.

The release artifact can be found at:

http://people.apache.org/~kwright/apache-manifoldcf-1.1

There is a tag at:

https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.1-RC3

Please vote on whether or not to release ManifoldCF 1.1, RC2.

The release artifact can be found at:

http://people.apache.org/~kwright/apache-manifoldcf-1.1

There is a tag at:

https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.1-RC2

This release candidate fixes one problem since RC2.  The problem is
CONNECTORS-618, which relates to MySQL performance.

This release candidate fixes one additional problem since RC1.  The
problem is CONNECTORS-616, and relates to Solr dropping connections
during
indexing.

This release candidate fixes two other problems since RC0, both
related to Solr 4.0.0 support.
- CONNECTORS-613: The version of Tika used in Solr 4.0.0 cannot
extract text unless told an accurate mime type.  While this is
probably a Tika bug, in this ticket we at least make sure a good guess
as to the mime type is sent to Solr.
- CONNECTORS-614: Fix logic having to do with releasing idle Solr
connections.  This shows up as socket timeout exceptions, because it
becomes very easy to exhaust the Solr application server's thread pool
when idle connections are not released in a timely way.

This release includes a significant amount of long-planned upgrading
and refactoring since Apache ManifoldCF 1.0.1, including:
- Port to HttpComponents from commons-httpclient
- Port to SolrJ from homegrown for the Solr connector, so that
SolrCloud is supported
- Improved NTLM support
- Partial Kerberos support
- Many other improvements, which are summarized in CHANGES.txt

Karl



--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050

Reply via email to