Hi Grant

Please see attached.

Thanks,

- Chris


----- Original Message -----
From: Grant Ingersoll <gsing...@apache.org>
Sent: Fri, 28/8/2009 1:26pm
To: java-user@lucene.apache.org
Subject: Re: New "Stream closed" exception with Java 6

Are there any log messages around that spot?  Did something happen to  
the underlying Reader you are using?  Can you share that little bit of  
indexing code?

On Aug 27, 2009, at 10:11 AM, Chris Bamford wrote:

> Hi,
>
> Since moving our app to Java 6 and Tomcat 6, we have started getting  
> occasional exceptions of the form:
>
> java.io.IOException: Stream closed
>    at sun.nio.cs.StreamDecoder.ensureOpen(Unknown Source)
>    at sun.nio.cs.StreamDecoder.read(Unknown Source)
>    at java.io.InputStreamReader.read(Unknown Source)
>    at  
> org 
> .apache 
> .lucene 
> .analysis 
> .standard.StandardTokenizerImpl.zzRefill(StandardTokenizerImpl.java: 
> 443)
>    at  
> org 
> .apache 
> .lucene 
> .analysis 
> .standard 
> .StandardTokenizerImpl.getNextToken(StandardTokenizerImpl.java:629)
>    at  
> org 
> .apache 
> .lucene 
> .analysis.standard.StandardTokenizer.next(StandardTokenizer.java:140)
>    at  
> org 
> .apache 
> .lucene.analysis.standard.StandardFilter.next(StandardFilter.java:43)
>    at  
> org.apache.lucene.analysis.LowerCaseFilter.next(LowerCaseFilter.java: 
> 34)
>    at  
> org 
> .apache 
> .lucene.analysis.snowball.SnowballFilter.next(SnowballFilter.java:64)
>    at  
> org 
> .apache 
> .lucene 
> .index.DocInverterPerField.processFields(DocInverterPerField.java:134)
>    at  
> org 
> .apache 
> .lucene 
> .index 
> .DocFieldConsumersPerField 
> .processFields(DocFieldConsumersPerField.java:36)
>    at  
> org 
> .apache 
> .lucene 
> .index 
> .DocFieldProcessorPerThread 
> .processDocument(DocFieldProcessorPerThread.java:234)
>    at  
> org 
> .apache 
> .lucene.index.DocumentsWriter.updateDocument(DocumentsWriter.java:765)
>    at  
> org 
> .apache 
> .lucene.index.DocumentsWriter.addDocument(DocumentsWriter.java:743)
>    at  
> org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:1902)
>    at  
> org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:1880)
>    at  
> com 
> .scalix 
> .index.manager.BatchUpdater.performContentAdds(BatchUpdater.java:386)
>    at  
> com.scalix.index.manager.BatchUpdater.processMods(BatchUpdater.java: 
> 176)
>    at  
> com.scalix.index.manager.QueueManager.processQueue(QueueManager.java: 
> 134)
>    at com.scalix.index.manager.QueueManager.run(QueueManager.java:81)
>    at java.lang.Thread.run(Unknown Source)
>
> We are using Lucene 2.4.  Any ideas what is happening?
>
> Thanks!
>
> - Chris
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-user-h...@lucene.apache.org
>

--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)  
using Solr/Lucene:
http://www.lucidimagination.com/search


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Since my last email I have poked around and discovered two things:

1) These exceptions are related to a handful of particular documents 
(repeatable), so I suspect the content is the problem.
2) They index fine on Lucene 2.0 but not 2.4

I have added to the error handling, so we get better diagnostics:
 
            try {
                indexWriter.addDocument(doc);
            } catch (CorruptIndexException ex) {
                throw new IndexerException ("CorruptIndexException on doc: " + 
doc.toString() +
                    " - " + ex.toString());
            } catch (IOException ex) {
                throw new IndexerException ("IOException on doc: " + 
doc.toString() +
                    " - " + ex.toString());
            }

Here is an example exception (we are indexing email messages):

com.scalix.index.api.IndexerException: IOException on doc: 
Document<indexed,tokenized<return-path:fre...@scalix.com> 
indexed,tokenized<received
:from [172.28.176.173] (bigfoot.scalix.com [172.16.0.31])
    by bigfoot.scalix.com (Scalix SMTP Relay 11.4.1.11929)
    via ESMTP; Mon, 18 Aug 2008 23:52:31 -0400 (EDT)> 
indexed<CREATE-DATE:20080819035308> indexed,tokenized<date:Mon Aug 18 23:53:08 
EDT 2008>
indexed,tokenized<from:barney<barn...@scalix.com>> indexed,tokenized<cc:Stefan 
=?ISO-8859-1?Q?V=F6lkel?= <stefan.voel...@scalix.com>>  
indexed,tokenized<cc:Peter Kelly <pke...@scalix.com>> 
indexed<message-id:48aa43a4.3090...@scalix.
com> 
indexed<in-reply-to:lbea59c0eb3be48d1bcd72524401cf9ae.1219072640.mail.us.scalix....@mhs>
 indexed<references:LBEA59C0EB3BE48d1BCD72524401CF
9ae.1219072640.mail.us.scalix....@mhs> indexed,tokenized<subject:Re: PLEASE 
READ EVERYONE! Weekly Status Reports> indexed,tokenized<autoforward
ed:FALSE> indexed,tokenized<importance:Normal> 
indexed,tokenized<priority:Normal> indexed,tokenized<sensitivity:Normal> 
indexed,tokenized<x-msm
ail-priority:Medium> indexed,tokenized<x-priority:3> 
indexed<x-scalix-autoreplied:FALSE> indexed,tokenized<user-agent:Thunderbird 
1.5.0.7 (X11/
20061023)> indexed,tokenized<mime-version:1.0> indexed<SIZE-KBYTES:0000000006> 
indexed<DELIVERY-DATE:20080819035235> indexed,tokenized<x-scalix
-internaldate:Mon Aug 18 23:52:35 EDT 2008> indexed<x-scalix-has-attach:FALSE> 
indexed,tokenized<content-type:text/plain;
        charset="US-ASCII";
        format="flowed"> indexed,tokenized<content-transfer-encoding:7bit> 
indexed,tokenized<content-disposition:inline> indexed<CONTENT-TYPE:t
ext/plain> indexed,tokenized<BODY:java.io.inputstreamrea...@1c6f48c> 
indexed<x-scalix-class:IPM.Note> indexed<MSG-TYPE:email> stored/uncompress
ed,indexed<INDEXID:90010ac-4682aaad-48aa4383-54daaf>> - java.io.IOException: 
Stream closed
        at 
com.scalix.index.manager.BatchUpdater.performContentAdds(BatchUpdater.java:395)
        at 
com.scalix.index.manager.BatchUpdater.processMods(BatchUpdater.java:179)
        at 
com.scalix.index.manager.QueueManager.processQueue(QueueManager.java:134)
        at com.scalix.index.manager.QueueManager.run(QueueManager.java:81)
        at java.lang.Thread.run(Unknown Source)

Unfortunately, I cannot seem to see what is in the email BODY part (highlighted 
above) as it is hidden behind some object
reference (<BODY:java.io.inputstreamrea...@1c6f48c>).  How can I do this?

Do you need any more information?


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to