With debug:
[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] WARN
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 28034ms for sessionid 0x100000050ae0049
[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 28034ms for sessionid 0x100000050ae0049, closing socket connection
and attempting reconnect
[Thread-31532-SendThread(kemp-formation-solr.citya.local:2181)] WARN
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 27708ms for sessionid 0xff00000201970044
[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] WARN
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 27737ms for sessionid 0xff00000201970043
[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 27737ms for sessionid 0xff00000201970043, closing socket connection
and attempting reconnect
[Thread-31551-SendThread(kemp-formation-solr.citya.local:2181)] WARN
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 28316ms for sessionid 0x100000050ae004b
[Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] WARN
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 28394ms for sessionid 0x2000000b80d0047
[Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 28394ms for sessionid 0x2000000b80d0047, closing socket connection
and attempting reconnect
[Thread-31532-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 27708ms for sessionid 0xff00000201970044, closing socket connection
and attempting reconnect
[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Opening socket connection to server
kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to
authenticate using SASL (unknown error)
agents process ran out of memory - shutting down
[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Socket connection established to
kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session
[Thread-7538-SendThread(kemp-formation-solr.citya.local:2181)] WARN
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 36805ms for sessionid 0x2000000b80d0046
[Thread-7538-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 36805ms for sessionid 0x2000000b80d0046, closing socket connection
and attempting reconnect
java.lang.OutOfMemoryError: GC overhead limit exceeded
at java.lang.StringBuilder.toString(StringBuilder.java:407)
at
org.apache.manifoldcf.core.cachemanager.CacheManager.readSharedData(CacheManager.java:849)
at
org.apache.manifoldcf.core.cachemanager.CacheManager.hasExpired(CacheManager.java:483)
at
org.apache.manifoldcf.core.cachemanager.CacheManager.lookupObject(CacheManager.java:454)
at
org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:131)
at
org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:204)
at
org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.performQuery(DBInterfacePostgreSQL.java:862)
at
org.apache.manifoldcf.core.database.BaseTable.performQuery(BaseTable.java:236)
at
org.apache.manifoldcf.crawler.jobs.Jobs.deletingJobsPresent(Jobs.java:3133)
at
org.apache.manifoldcf.crawler.jobs.JobManager.getNextDeletableDocuments(JobManager.java:1862)
at
org.apache.manifoldcf.crawler.system.DocumentDeleteStufferThread.run(DocumentDeleteStufferThread.java:108)
[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Opening socket connection to server
kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to
authenticate using SASL (unknown error)
agents process ran out of memory - shutting down
[Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] WARN
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 27763ms for sessionid 0x100000050ae004a
[Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 27763ms for sessionid 0x100000050ae004a, closing socket connection
and attempting reconnect
[zkCallback-3-thread-7] WARN org.apache.solr.common.cloud.ConnectionManager -
Watcher org.apache.solr.common.cloud.ConnectionManager@7a5c701e name:
ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent
state:Disconnected type:None path:null path: null type: None
[zkCallback-3-thread-7] WARN org.apache.solr.common.cloud.ConnectionManager -
zkClient has disconnected
[Thread-31551-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from
server in 28316ms for sessionid 0x100000050ae004b, closing socket connection
and attempting reconnect
java.lang.OutOfMemoryError: GC overhead limit exceeded
[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Socket connection established to
kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session
[zkCallback-11-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager -
Watcher org.apache.solr.common.cloud.ConnectionManager@53181a58 name:
ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent
state:Disconnected type:None path:null path: null type: None
[zkCallback-11-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager -
zkClient has disconnected
[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] WARN
org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service,
session 0xff00000201970043 has expired
[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service,
session 0xff00000201970043 has expired, closing socket connection
[Thread-7573-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread
shut down for session: 0xff00000201970043
[zkCallback-11-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager -
Watcher org.apache.solr.common.cloud.ConnectionManager@53181a58 name:
ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent
state:Expired type:None path:null path: null type: None
[zkCallback-11-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager -
Our previous ZooKeeper session was expired. Attempting to reconnect to recover
relationship with ZooKeeper...
[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] WARN
org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service,
session 0x100000050ae0049 has expired
[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service,
session 0x100000050ae0049 has expired, closing socket connection
[zkCallback-11-thread-2] WARN
org.apache.solr.common.cloud.DefaultConnectionStrategy - Connection expired -
starting a new one...
[zkCallback-11-thread-2] INFO org.apache.zookeeper.ZooKeeper - Initiating
client connection, connectString=kemp-formation-solr:2181 sessionTimeout=60000
watcher=org.apache.solr.common.cloud.ConnectionManager@53181a58
[Thread-5234-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread
shut down for session: 0x100000050ae0049
[zkCallback-3-thread-4] WARN org.apache.solr.common.cloud.ConnectionManager -
Watcher org.apache.solr.common.cloud.ConnectionManager@7a5c701e name:
ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent
state:Expired type:None path:null path: null type: None
[zkCallback-3-thread-4] WARN org.apache.solr.common.cloud.ConnectionManager -
Our previous ZooKeeper session was expired. Attempting to reconnect to recover
relationship with ZooKeeper...
[zkCallback-3-thread-4] WARN
org.apache.solr.common.cloud.DefaultConnectionStrategy - Connection expired -
starting a new one...
[zkCallback-3-thread-4] INFO org.apache.zookeeper.ZooKeeper - Initiating client
connection, connectString=kemp-formation-solr:2181 sessionTimeout=60000
watcher=org.apache.solr.common.cloud.ConnectionManager@7a5c701e
[zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Opening socket connection to server
kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to
authenticate using SASL (unknown error)
[zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Opening socket connection to server
kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to
authenticate using SASL (unknown error)
[zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Socket connection established to
kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session
[zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Socket connection established to
kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session
[Thread-490] INFO org.eclipse.jetty.server.ServerConnector - Stopped
ServerConnector@2a640157{HTTP/1.1}{0.0.0.0:8345}
[zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Session establishment complete on server
kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid =
0x2000000b80d0049, negotiated timeout = 40000
[zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Session establishment complete on server
kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid =
0xff00000201970045, negotiated timeout = 40000
agents process ran out of memory - shutting down
java.lang.OutOfMemoryError: GC overhead limit exceeded
agents process ran out of memory - shutting down
java.lang.OutOfMemoryError: GC overhead limit exceeded
at java.util.HashMap.newNode(HashMap.java:1747)
at java.util.HashMap.putVal(HashMap.java:631)
at java.util.HashMap.put(HashMap.java:612)
at jcifs.util.transport.Transport.sendrecv(Transport.java:66)
at jcifs.smb.SmbTransport.send(SmbTransport.java:661)
at jcifs.smb.SmbSession.send(SmbSession.java:238)
at jcifs.smb.SmbTree.send(SmbTree.java:119)
at jcifs.smb.SmbFile.send(SmbFile.java:776)
at jcifs.smb.SmbFileInputStream.readDirect(SmbFileInputStream.java:181)
at jcifs.smb.SmbFileInputStream.read(SmbFileInputStream.java:142)
at
org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:903)
at
org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
[zkCallback-11-thread-2] INFO org.apache.solr.common.cloud.ConnectionManager -
Connection with ZooKeeper reestablished.
[zkCallback-3-thread-4] INFO org.apache.solr.common.cloud.ConnectionManager -
Connection with ZooKeeper reestablished.
agents process ran out of memory - shutting down
java.lang.OutOfMemoryError: GC overhead limit exceeded
[zkCallback-11-thread-2] INFO
org.apache.solr.common.cloud.DefaultConnectionStrategy - Reconnected to
ZooKeeper
[zkCallback-11-thread-2] INFO org.apache.solr.common.cloud.ConnectionManager -
Connected:true
[zkCallback-3-thread-4] INFO
org.apache.solr.common.cloud.DefaultConnectionStrategy - Reconnected to
ZooKeeper
[zkCallback-3-thread-4] INFO org.apache.solr.common.cloud.ConnectionManager -
Connected:true
[Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0x2000000b80d0046
closed
[zkCallback-21-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager -
Watcher org.apache.solr.common.cloud.ConnectionManager@381a7557 name:
ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent
state:Disconnected type:None path:null path: null type: None
[zkCallback-21-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager -
zkClient has disconnected
[Thread-7538-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread
shut down for session: 0x2000000b80d0046
agents process ran out of memory - shutting down
java.lang.OutOfMemoryError: GC overhead limit exceeded
at java.util.regex.Matcher.<init>(Matcher.java:225)
at java.util.regex.Pattern.matcher(Pattern.java:1093)
at
de.l3s.boilerpipe.util.UnicodeTokenizer.tokenize(UnicodeTokenizer.java:40)
at
de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(BoilerpipeHTMLContentHandler.java:296)
at
de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(BoilerpipeHTMLContentHandler.java:198)
at
org.apache.tika.parser.html.BoilerpipeContentHandler.characters(BoilerpipeContentHandler.java:155)
at
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
at
org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270)
at
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
at
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
at
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
at
org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46)
at
org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82)
at
org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140)
at
org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287)
at
org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279)
at
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
at
org.apache.tika.sax.xpath.MatchingContentHandler.characters(MatchingContentHandler.java:85)
at
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
at
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
at
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
at
org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270)
at
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
at
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
at
org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
at
org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46)
at
org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82)
at
org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140)
at
org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287)
at
org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279)
at
org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:306)
at
org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator$SheetTextAsHTML.cell(XSSFExcelExtractorDecorator.java:431)
[zkCallback-19-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager -
Watcher org.apache.solr.common.cloud.ConnectionManager@43f7378f name:
ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent
state:Disconnected type:None path:null path: null type: None
[zkCallback-19-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager -
zkClient has disconnected
[zkCallback-15-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager -
Watcher org.apache.solr.common.cloud.ConnectionManager@6432608f name:
ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent
state:Disconnected type:None path:null path: null type: None
[zkCallback-15-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager -
zkClient has disconnected
[zkCallback-13-thread-3] WARN org.apache.solr.common.cloud.ConnectionManager -
Watcher org.apache.solr.common.cloud.ConnectionManager@68bb3d74 name:
ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent
state:Disconnected type:None path:null path: null type: None
[zkCallback-13-thread-3] WARN org.apache.solr.common.cloud.ConnectionManager -
zkClient has disconnected
agents process ran out of memory - shutting down
java.lang.OutOfMemoryError: GC overhead limit exceeded
at sun.nio.cs.UTF_8.newEncoder(UTF_8.java:72)
at java.lang.StringCoding.encode(StringCoding.java:348)
at java.lang.String.getBytes(String.java:941)
at org.postgresql.core.Utils.encodeUTF8(Utils.java:53)
at
org.postgresql.core.v3.QueryExecutorImpl.sendParse(QueryExecutorImpl.java:1448)
at
org.postgresql.core.v3.QueryExecutorImpl.sendOneQuery(QueryExecutorImpl.java:1777)
at
org.postgresql.core.v3.QueryExecutorImpl.sendQuery(QueryExecutorImpl.java:1354)
at
org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:292)
at org.postgresql.jdbc.PgStatement.executeInternal(PgStatement.java:428)
at org.postgresql.jdbc.PgStatement.execute(PgStatement.java:354)
at
org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:301)
at
org.postgresql.jdbc.PgStatement.executeCachedSql(PgStatement.java:287)
at
org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:264)
at org.postgresql.jdbc.PgStatement.execute(PgStatement.java:260)
at
org.apache.manifoldcf.core.database.Database.execute(Database.java:876)
at
org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:696)
[Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0xff00000201970044
closed
[Thread-31532-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread
shut down for session: 0xff00000201970044
[Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Opening socket connection to server
kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to
authenticate using SASL (unknown error)
[Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Socket connection established to
kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session
[Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Session establishment complete on server
kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid =
0x100000050ae004a, negotiated timeout = 40000
[Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0x100000050ae004a
closed
[Thread-7574-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread
shut down for session: 0x100000050ae004a
[Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Opening socket connection to server
kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to
authenticate using SASL (unknown error)
[Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Socket connection established to
kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session
[Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO
org.apache.zookeeper.ClientCnxn - Session establishment complete on server
kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid =
0x2000000b80d0047, negotiated timeout = 40000
[Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0x2000000b80d0047
closed
[Thread-7602-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread
shut down for session: 0x2000000b80d0047
[Thread-490] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped
o.e.j.w.WebAppContext@44d52de2{/mcf-api-service,file:/tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-5748290590258150821.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf-trunk/bin/./../web-proprietary/war/mcf-api-service.war}
[Thread-490] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped
o.e.j.w.WebAppContext@60410cd{/mcf-authority-service,file:/tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-1380683823589504600.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf-trunk/bin/./../web-proprietary/war/mcf-authority-service.war}
Any idea?
Thanks.
From: Karl Wright [mailto:[email protected]]
Sent: Tuesday, July 24, 2018 13:15
To: [email protected]
Subject: Re: Out of memory, one file bug i think
I've opened CONNECTORS-1516 to track the Class Not Found issue, and also
created an Apache POI bugzilla ticket, which is referenced.
Karl
On Tue, Jul 24, 2018 at 6:15 AM Karl Wright <[email protected]> wrote:
The "class not found" error is probably a classloader issue with Tika: the
class is present in poi-ooxml-3.17.jar. To be fair, it could also be caused by
an out-of-memory condition.
You should be able to find the exception in the Simple History and work out
from that which document it came from. If not, look at the log just before
the exception and see what Worker Thread 1 was doing.
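For the log route, a small filter can save some scrolling. This is only a sketch under assumptions: the class name WorkerLogContext and its method are made up for illustration and are not part of ManifoldCF, each log entry is assumed to sit on one line carrying a "(Worker thread 'N')" tag as in the WARN line quoted below, and whether the preceding lines actually name the document depends on the logging level in use.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical helper for illustration; not part of ManifoldCF.
class WorkerLogContext {

    /**
     * Returns up to maxLines log lines that mention threadTag and appear
     * before the first line containing both threadTag and marker.
     * Returns an empty list if no such line is found.
     */
    static List<String> contextBefore(List<String> lines, String threadTag,
                                      String marker, int maxLines) {
        Deque<String> recent = new ArrayDeque<>();
        for (String line : lines) {
            if (!line.contains(threadTag)) {
                continue; // other threads, or stack-trace continuation lines
            }
            if (line.contains(marker)) {
                return new ArrayList<>(recent); // what the thread did just before
            }
            recent.addLast(line);
            if (recent.size() > maxLines) {
                recent.removeFirst(); // keep only the most recent lines
            }
        }
        return new ArrayList<>();
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 1) {
            System.err.println("usage: java WorkerLogContext.java <path-to-log>");
            return;
        }
        // Assumption: the log is UTF-8 text, one entry per line.
        List<String> lines = Files.readAllLines(Paths.get(args[0]));
        for (String line : contextBefore(lines, "(Worker thread '1')", "TIKA-198", 20)) {
            System.out.println(line);
        }
    }
}
```

Run it against the agents log (the path is whatever your logging configuration uses) and it prints the last 20 lines logged by Worker thread 1 before the TIKA-198 warning.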
Karl
On Tue, Jul 24, 2018 at 5:58 AM msaunier <[email protected]> wrote:
Re Karl,
I got an Out of Memory error today. I think one document is causing it. I
get this WARNING before the crash:
------------------------------------------------------------------------
WARN 2018-07-24T11:46:22,098 (Worker thread '1') - Tika: Tika exception
extracting: TIKA-198: Illegal IOException from
org.apache.tika.parser.microsoft.OfficeParser@62980adb
org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from
org.apache.tika.parser.microsoft.OfficeParser@62980adb
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:286)
~[tika-core-1.17.jar:1.17]
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
~[tika-core-1.17.jar:1.17]
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
~[tika-core-1.17.jar:1.17]
at
org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)
~[mcf-tika-connector.jar:?]
at
org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:235)
[mcf-tika-connector.jar:?]
at
org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3226)
[mcf-agents.jar:?]
at
org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)
[mcf-agents.jar:?]
at
org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2708)
[mcf-agents.jar:?]
at
org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:756)
[mcf-agents.jar:?]
at
org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1583)
[mcf-pull-agent.jar:?]
at
org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1548)
[mcf-pull-agent.jar:?]
at
org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:939)
[mcf-jcifs-connector.jar:?]
at
org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
[mcf-pull-agent.jar:?]
Caused by: java.io.IOException: java.lang.ClassNotFoundException:
org.apache.poi.poifs.crypt.agile.AgileEncryptionInfoBuilder
at
org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:150) ~[?:?]
at
org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:102) ~[?:?]
at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:203)
~[?:?]
at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132)
~[?:?]
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) ~[?:?]
... 12 more
Caused by: java.lang.ClassNotFoundException:
org.apache.poi.poifs.crypt.agile.AgileEncryptionInfoBuilder
at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
~[?:1.8.0_171]
at java.lang.ClassLoader.loadClass(ClassLoader.java:424) ~[?:1.8.0_171]
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
~[?:1.8.0_171]
at java.lang.ClassLoader.loadClass(ClassLoader.java:357) ~[?:1.8.0_171]
at
org.apache.poi.poifs.crypt.EncryptionInfo.getBuilder(EncryptionInfo.java:222)
~[?:?]
at
org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:148) ~[?:?]
at
org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:102) ~[?:?]
at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:203)
~[?:?]
at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132)
~[?:?]
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) ~[?:?]
... 12 more
I think a single file is responsible, because memory allocation behaves
strangely: within one second, ManifoldCF (or Tika) allocates an extra 6 GB of RAM.
How can I find the file?
Thanks,
Maxence