Documents: 5826
Active: 3
Processed: 5826.

It's these three documents which are crawled over and over again.

I'm ending this job now because I'm pretty sure it will never stop.

Erlend

On 28.09.12 11.49, Erlend Garåsen wrote:

I'm trying to start a crawl before I have to run to the airport. I just
discovered that MCF recrawls the same host over and over again when it
returns result code 500:
09-28-2012 11:40:11.024     fetch
http://foreninger.uio.no/go/oslo_open_2012_no.php
     500

It's just not this document, but several others returning the same HTTP
result code.

Meanwhile, the following is filling up my log:
FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed:
String index out of range: -1
java.lang.StringIndexOutOfBoundsException: String index out of range: -1

I'm pretty sure they are related to each other.

I will end this job before I leave because I'm afraid that MCF will try
to fetch these documents over and over again during this weekend.

Erlend

On 28.09.12 09.58, Karl Wright wrote:
Please vote +1 to release ManifoldCF 1.0, RC5.  The release artifact
can be found at:

http://people.apache.org/~kwright/apache-manifoldcf-1.0

There is also an SVN tag at:

https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC5

Fixes since RC4:

CONNECTORS-545

Fixes since RC3:

CONNECTORS-544





--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050

Reply via email to