Hi Grant Please see attached.
Thanks, - Chris ----- Original Message ----- From: Grant Ingersoll <gsing...@apache.org> Sent: Fri, 28/8/2009 1:26pm To: java-user@lucene.apache.org Subject: Re: New "Stream closed" exception with Java 6 Are there any log messages around that spot? Did something happen to the underlying Reader you are using? Can you share that little bit of indexing code? On Aug 27, 2009, at 10:11 AM, Chris Bamford wrote: > Hi, > > Since moving our app to Java 6 and Tomcat 6, we have started getting > occasional exceptions of the form: > > java.io.IOException: Stream closed > at sun.nio.cs.StreamDecoder.ensureOpen(Unknown Source) > at sun.nio.cs.StreamDecoder.read(Unknown Source) > at java.io.InputStreamReader.read(Unknown Source) > at > org > .apache > .lucene > .analysis > .standard.StandardTokenizerImpl.zzRefill(StandardTokenizerImpl.java: > 443) > at > org > .apache > .lucene > .analysis > .standard > .StandardTokenizerImpl.getNextToken(StandardTokenizerImpl.java:629) > at > org > .apache > .lucene > .analysis.standard.StandardTokenizer.next(StandardTokenizer.java:140) > at > org > .apache > .lucene.analysis.standard.StandardFilter.next(StandardFilter.java:43) > at > org.apache.lucene.analysis.LowerCaseFilter.next(LowerCaseFilter.java: > 34) > at > org > .apache > .lucene.analysis.snowball.SnowballFilter.next(SnowballFilter.java:64) > at > org > .apache > .lucene > .index.DocInverterPerField.processFields(DocInverterPerField.java:134) > at > org > .apache > .lucene > .index > .DocFieldConsumersPerField > .processFields(DocFieldConsumersPerField.java:36) > at > org > .apache > .lucene > .index > .DocFieldProcessorPerThread > .processDocument(DocFieldProcessorPerThread.java:234) > at > org > .apache > .lucene.index.DocumentsWriter.updateDocument(DocumentsWriter.java:765) > at > org > .apache > .lucene.index.DocumentsWriter.addDocument(DocumentsWriter.java:743) > at > org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:1902) > at > org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:1880) > at > com > .scalix > .index.manager.BatchUpdater.performContentAdds(BatchUpdater.java:386) > at > com.scalix.index.manager.BatchUpdater.processMods(BatchUpdater.java: > 176) > at > com.scalix.index.manager.QueueManager.processQueue(QueueManager.java: > 134) > at com.scalix.index.manager.QueueManager.run(QueueManager.java:81) > at java.lang.Thread.run(Unknown Source) > > We are using Lucene 2.4. Any ideas what is happening? > > Thanks! > > - Chris > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > -------------------------- Grant Ingersoll http://www.lucidimagination.com/ Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using Solr/Lucene: http://www.lucidimagination.com/search --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
Since my last email I have poked around and discovered two things: 1) These exceptions are related to a handful of particular documents (repeatable), so I suspect the content is the problem. 2) They index fine on Lucene 2.0 but not 2.4 I have added to the error handling, so we get better diagnostics: try { indexWriter.addDocument(doc); } catch (CorruptIndexException ex) { throw new IndexerException ("CorruptIndexException on doc: " + doc.toString() + " - " + ex.toString()); } catch (IOException ex) { throw new IndexerException ("IOException on doc: " + doc.toString() + " - " + ex.toString()); } Here is an example exception (we are indexing email messages): com.scalix.index.api.IndexerException: IOException on doc: Document<indexed,tokenized<return-path:fre...@scalix.com> indexed,tokenized<received :from [172.28.176.173] (bigfoot.scalix.com [172.16.0.31]) by bigfoot.scalix.com (Scalix SMTP Relay 11.4.1.11929) via ESMTP; Mon, 18 Aug 2008 23:52:31 -0400 (EDT)> indexed<CREATE-DATE:20080819035308> indexed,tokenized<date:Mon Aug 18 23:53:08 EDT 2008> indexed,tokenized<from:barney<barn...@scalix.com>> indexed,tokenized<cc:Stefan =?ISO-8859-1?Q?V=F6lkel?= <stefan.voel...@scalix.com>> indexed,tokenized<cc:Peter Kelly <pke...@scalix.com>> indexed<message-id:48aa43a4.3090...@scalix. com> indexed<in-reply-to:lbea59c0eb3be48d1bcd72524401cf9ae.1219072640.mail.us.scalix....@mhs> indexed<references:LBEA59C0EB3BE48d1BCD72524401CF 9ae.1219072640.mail.us.scalix....@mhs> indexed,tokenized<subject:Re: PLEASE READ EVERYONE! Weekly Status Reports> indexed,tokenized<autoforward ed:FALSE> indexed,tokenized<importance:Normal> indexed,tokenized<priority:Normal> indexed,tokenized<sensitivity:Normal> indexed,tokenized<x-msm ail-priority:Medium> indexed,tokenized<x-priority:3> indexed<x-scalix-autoreplied:FALSE> indexed,tokenized<user-agent:Thunderbird 1.5.0.7 (X11/ 20061023)> indexed,tokenized<mime-version:1.0> indexed<SIZE-KBYTES:0000000006> indexed<DELIVERY-DATE:20080819035235> indexed,tokenized<x-scalix -internaldate:Mon Aug 18 23:52:35 EDT 2008> indexed<x-scalix-has-attach:FALSE> indexed,tokenized<content-type:text/plain; charset="US-ASCII"; format="flowed"> indexed,tokenized<content-transfer-encoding:7bit> indexed,tokenized<content-disposition:inline> indexed<CONTENT-TYPE:t ext/plain> indexed,tokenized<BODY:java.io.inputstreamrea...@1c6f48c> indexed<x-scalix-class:IPM.Note> indexed<MSG-TYPE:email> stored/uncompress ed,indexed<INDEXID:90010ac-4682aaad-48aa4383-54daaf>> - java.io.IOException: Stream closed at com.scalix.index.manager.BatchUpdater.performContentAdds(BatchUpdater.java:395) at com.scalix.index.manager.BatchUpdater.processMods(BatchUpdater.java:179) at com.scalix.index.manager.QueueManager.processQueue(QueueManager.java:134) at com.scalix.index.manager.QueueManager.run(QueueManager.java:81) at java.lang.Thread.run(Unknown Source) Unfortunately, I cannot seem to see what is in the email BODY part (highlighted above) as it is hidden behind some object reference (<BODY:java.io.inputstreamrea...@1c6f48c>). How can I do this? Do you need any more information?
--------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org