Oh - the other obvious reason for no indexing is that there many be no changes in the document.
Karl On Thu, Feb 6, 2014 at 5:50 AM, Karl Wright <[email protected]> wrote: > The document *will* be sent to the output connector if "Decided to ingest" > is logged, so the problem must be in the solr connector itself. The > following code on the solr side is executed: > > if (Logging.ingest.isDebugEnabled()) > Logging.ingest.debug("indexPost(): '" + documentURI + "'"); > > // The SOLR connector cannot deal with folder-level security at this > time. If they are seen, reject the document. > if (document.countDirectoryACLs() != 0) > return false; > > // If the document is too long, reject it. > if (maxDocumentLength != null && document.getBinaryLength() > > maxDocumentLength.longValue()) > return false; > > Otherwise, if a problem is seen, there should be a manifoldcf.log entry, > and also a simple history entry. Please look there first, and if that > doesn't work, turn on ingest debugging to be sure we get to this point. > > Karl > > > > On Thu, Feb 6, 2014 at 4:46 AM, Erlend Garåsen <[email protected]>wrote: > >> >> We're still having problems with this release on our test server. It runs >> stable and does not hang anymore, but nothing gets sent to Solr. Since >> there was a problem with the SSL certificate in previous RCs, maybe there >> is a similar problem related to the Solr Output Connector? We have >> configured the same certificate in order to post documents to Solr. >> >> I get entries like this in manifoldcf.log which indicates that documents >> should be indexed, but they aren't: >> DEBUG 2014-02-06 10:28:06,609 (Worker thread '29') - WEB: Decided to >> ingest 'http://www.ibsen.uio.no/varia.xhtml' >> >> In Simple history, only fetch activities are shown. Any suggestions how >> to debug what's really going on? I can try to turn on debug logging for >> Httpclient in case that helps. >> >> Erlend >> >> >> On 2/4/14 1:33 PM, Karl Wright wrote: >> >>> This is a major release of ManifoldCF that includes the following: >>> >>> - Federated authority support >>> - Multiple authorization domains >>> - ZooKeeper process coordination >>> - Multiple agents processes >>> - Support for SharePoint Claims-based authorization >>> - An Email connector >>> - A revamped look-and-feel >>> >>> Voting will remain open for 3 days. >>> >>> You can download the artifacts from >>> http://people.apache.org/~kwright/apache-manifoldcf-1.5 . There is >>> also a >>> release tag at >>> https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.5-RC7 . >>> >>> This RC includes changes to the dist directory organization so that jar >>> files are not duplicated, saving 40MB from each binary download. It also >>> fixes an issue with connection limits in the zookeeper example. Finally, >>> it fixes a limitation in the CMIS connector (CONNECTORS-864) and a maven >>> build problem (CONNECTORS-865). Also fixes CONNECTORS-866 (the lockclean >>> script), and two more Maven version issues. Finally, corrects a LiveLink >>> connector reversion described in CONNECTORS-871. Missing SolrJ >>> dependencies in CONNECTORS-873. Workaround for SolrJ runtime exception >>> being thrown in CONNECTORS-874. Throttling lockup dealt with, improved, >>> and tested in CONNECTORS-872. >>> >>> Karl >>> >>> >> >
