Hi Jesse,
thanks for posting the detailed log messages. We've created a bug report on
GitHub for this issue. Please subscribe to
https://github.com/Graylog2/graylog2-server/issues/1465 to get updates on
the issue.
Cheers,
Jochen
On Monday, 5 October 2015 17:10:43 UTC+2, Jesse Skrivseth wrote:
>
> Hey Jochen,
>
> Yes, I had to manually trigger the index range rebuild. Note that if you
> make an API call to /system/indexer/indices/{index}/close it also triggers
> a recalc across all indices. Since concurrent range recalcs are not
> possible, that means in our system if you make an API call to close an
> index, you have to wait 2.5 hours to try to close another. I see warnings
> like this frequently:
>
> 2015-10-05_14:47:03.98401 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4128
> 2015-10-05_14:47:03.99521 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4132
> 2015-10-05_14:47:05.33256 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4135
> 2015-10-05_14:47:05.33283 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4136
> 2015-10-05_14:48:02.03805 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4128
> 2015-10-05_14:48:02.03828 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4132
>
> Since the System->Indices page won't load, I manually made a call
> to /system/indices/ranges/rebuild to trigger it. After several hours, it
> eventually finds the problematic indices and throws these exceptions:
>
> 2015-10-05_03:03:58.03817 INFO [EsIndexRangeService] Calculated range of
> [graylog_3282] in [58ms].
> 2015-10-05_03:04:01.05886 INFO [EsIndexRangeService] Calculated range of
> [graylog_3281] in [146ms].
> 2015-10-05_03:04:05.29949 INFO [EsIndexRangeService] Calculated range of
> [graylog_3288] in [57ms].
> 2015-10-05_03:04:08.30944 ERROR [EsIndexRangeService] Error while
> calculating timestamp stats in index <graylog_4136>
> 2015-10-05_03:04:08.30947
> org.elasticsearch.action.search.SearchPhaseExecutionException: Failed to
> execute phase [query], all shards failed; shardFailures
> {[hPjzunuIQHivOLVhGMugsg][graylog_4136][0]:
> RemoteTransportException[[Cethlann][inet[/x.x.x.x:9300]][indices:data/read/search[phase/query]]];
>
> nested:
> ClassCastException[org.elasticsearch.index.fielddata.plain.PagedBytesIndexFieldData
>
> cannot be cast to org.elasticsearch.index.fielddata.IndexNumericFieldData];
> }{[2lLV-RaMTUWwl74xETQ8HA][graylog_4136][1]:
> RemoteTransportException[[Unseen][inet[/x.x.x.x:9300]][indices:data/read/search[phase/query]]];
>
> nested:
> ClassCastException[org.elasticsearch.index.fielddata.plain.PagedBytesIndexFieldData
>
> cannot be cast to org.elasticsearch.index.fielddata.IndexNumericFieldData];
> }{[2lLV-RaMTUWwl74xETQ8HA][graylog_4136][2]:
> RemoteTransportException[[Unseen][inet[/x.x.x.x:9300]][indices:data/read/search[phase/query]]];
>
> nested:
> ClassCastException[org.elasticsearch.index.fielddata.plain.PagedBytesIndexFieldData
>
> cannot be cast to org.elasticsearch.index.fielddata.IndexNumericFieldData];
> }{[hPjzunuIQHivOLVhGMugsg][graylog_4136][3]:
> RemoteTransportException[[Cethlann][inet[/x.x.x.x:9300]][indices:data/read/search[phase/query]]];
>
> nested:
> ClassCastException[org.elasticsearch.index.fielddata.plain.PagedBytesIndexFieldData
>
> cannot be cast to org.elasticsearch.index.fielddata.IndexNumericFieldData];
> }
> 2015-10-05_03:04:08.30951 at
> org.elasticsearch.action.search.type.TransportSearchTypeAction$BaseAsyncAction.onFirstPhaseResult(TransportSearchTypeAction.java:237)
> 2015-10-05_03:04:08.30951 at
> org.elasticsearch.action.search.type.TransportSearchTypeAction$BaseAsyncAction$1.onFailure(TransportSearchTypeAction.java:183)
> 2015-10-05_03:04:08.30951 at
> org.elasticsearch.search.action.SearchServiceTransportAction$6.handleException(SearchServiceTransportAction.java:249)
> 2015-10-05_03:04:08.30952 at
> org.elasticsearch.transport.netty.MessageChannelHandler.handleException(MessageChannelHandler.java:190)
> 2015-10-05_03:04:08.30952 at
> org.elasticsearch.transport.netty.MessageChannelHandler.handlerResponseError(MessageChannelHandler.java:180)
> 2015-10-05_03:04:08.30953 at
> org.elasticsearch.transport.netty.MessageChannelHandler.messageReceived(MessageChannelHandler.java:130)
> 2015-10-05_03:04:08.30953 at
> org.elasticsearch.common.netty.channel.SimpleChannelUpstreamHandler.handleUpstream(SimpleChannelUpstreamHandler.java:70)
> 2015-10-05_03:04:08.30954 at
> org.elasticsearch.common.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:564)
> 2015-10-05_03:04:08.30954 at
> org.elasticsearch.common.netty.channel.DefaultChannelPipeline$DefaultChannelHandlerContext.sendUpstream(DefaultChannelPipeline.java:791)
> 2015-10-05_03:04:08.30955 at
> org.elasticsearch.common.netty.channel.Channels.fireMessageReceived(Channels.java:296)
> 2015-10-05_03:04:08.30955 at
> org.elasticsearch.common.netty.handler.codec.frame.FrameDecoder.unfoldAndFireMessageReceived(FrameDecoder.java:462)
> 2015-10-05_03:04:08.30956 at
> org.elasticsearch.common.netty.handler.codec.frame.FrameDecoder.callDecode(FrameDecoder.java:443)
> 2015-10-05_03:04:08.30956 at
> org.elasticsearch.common.netty.handler.codec.frame.FrameDecoder.messageReceived(FrameDecoder.java:303)
> 2015-10-05_03:04:08.30956 at
> org.elasticsearch.common.netty.channel.SimpleChannelUpstreamHandler.handleUpstream(SimpleChannelUpstreamHandler.java:70)
> 2015-10-05_03:04:08.30957 at
> org.elasticsearch.common.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:564)
> 2015-10-05_03:04:08.30958 at
> org.elasticsearch.common.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:559)
> 2015-10-05_03:04:08.30958 at
> org.elasticsearch.common.netty.channel.Channels.fireMessageReceived(Channels.java:268)
> 2015-10-05_03:04:08.30958 at
> org.elasticsearch.common.netty.channel.Channels.fireMessageReceived(Channels.java:255)
> 2015-10-05_03:04:08.30959 at
> org.elasticsearch.common.netty.channel.socket.nio.NioWorker.read(NioWorker.java:88)
> 2015-10-05_03:04:08.30959 at
> org.elasticsearch.common.netty.channel.socket.nio.AbstractNioWorker.process(AbstractNioWorker.java:108)
> 2015-10-05_03:04:08.30960 at
> org.elasticsearch.common.netty.channel.socket.nio.AbstractNioSelector.run(AbstractNioSelector.java:337)
> 2015-10-05_03:04:08.30961 at
> org.elasticsearch.common.netty.channel.socket.nio.AbstractNioWorker.run(AbstractNioWorker.java:89)
> 2015-10-05_03:04:08.30961 at
> org.elasticsearch.common.netty.channel.socket.nio.NioWorker.run(NioWorker.java:178)
> 2015-10-05_03:04:08.30961 at
> org.elasticsearch.common.netty.util.ThreadRenamingRunnable.run(ThreadRenamingRunnable.java:108)
> 2015-10-05_03:04:08.30962 at
> org.elasticsearch.common.netty.util.internal.DeadLockProofWorker$1.run(DeadLockProofWorker.java:42)
> 2015-10-05_03:04:08.30963 at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> 2015-10-05_03:04:08.30963 at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> 2015-10-05_03:04:08.30964 at java.lang.Thread.run(Thread.java:745)
> 2015-10-05_03:04:08.31020 INFO [RebuildIndexRangesJob] Could not
> calculate range of index [graylog_4136]. Skipping.
> 2015-10-05_03:04:08.31022 org.elasticsearch.indices.IndexMissingException:
> [graylog_4136] missing
> 2015-10-05_03:04:08.31022 at
> org.graylog2.indexer.ranges.EsIndexRangeService.timestampStatsOfIndex(EsIndexRangeService.java:249)
> 2015-10-05_03:04:08.31022 at
> org.graylog2.indexer.ranges.EsIndexRangeService.calculateRange(EsIndexRangeService.java:217)
> 2015-10-05_03:04:08.31023 at
> org.graylog2.indexer.ranges.RebuildIndexRangesJob.execute(RebuildIndexRangesJob.java:96)
> 2015-10-05_03:04:08.31023 at
> org.graylog2.system.jobs.SystemJobManager$1.run(SystemJobManager.java:88)
> 2015-10-05_03:04:08.31024 at
> com.codahale.metrics.InstrumentedScheduledExecutorService$InstrumentedRunnable.run(InstrumentedScheduledExecutorService.java:235)
> 2015-10-05_03:04:08.31024 at
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> 2015-10-05_03:04:08.31024 at
> java.util.concurrent.FutureTask.run(FutureTask.java:266)
> 2015-10-05_03:04:08.31025 at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
> 2015-10-05_03:04:08.31025 at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
> 2015-10-05_03:04:08.31026 at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> 2015-10-05_03:04:08.31027 at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> 2015-10-05_03:04:08.31027 at java.lang.Thread.run(Thread.java:745)
> 2015-10-05_03:04:08.38399 INFO [EsIndexRangeService] Calculated range of
> [graylog_3287] in [73ms].
> 2015-10-05_03:04:11.52658 INFO [EsIndexRangeService] Calculated range of
> [graylog_4137] in [42ms].
>
> So unfortunately, those indices never have valid ranges due to the
> exception.
>
> The range recalc seems to have trouble with "index missing" exceptions,
> probably due to ES throwing other exceptions related to that index. The
> index is not missing.
>
>
> 2015-10-05_14:37:02.99050 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4128
> 2015-10-05_14:37:02.99091 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4136
> 2015-10-05_14:37:02.99214 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4293
> 2015-10-05_14:37:17.78183 INFO [SetIndexReadOnlyJob] Flushing old index
> <graylog_4292>.
> 2015-10-05_14:37:18.30641 INFO [SetIndexReadOnlyJob] Setting old index
> <graylog_4292> to read-only.
> 2015-10-05_14:37:21.84393 INFO [SystemJobManager] Submitted SystemJob
> <9664c833-6b6e-11e5-b272-0ae24e7f4757>
> [org.graylog2.indexer.indices.jobs.OptimizeIndexJob]
> 2015-10-05_14:37:21.84397 INFO [SystemJobManager] SystemJob
> <82175553-6b6e-11e5-b272-0ae24e7f4757>
> [org.graylog2.indexer.SetIndexReadOnlyJob] finished in 4062ms.
> 2015-10-05_14:37:21.84419 INFO [OptimizeIndexJob] Optimizing index
> <graylog_4292>.
> 2015-10-05_14:37:31.82487 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4128
> 2015-10-05_14:37:31.82519 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4136
> 2015-10-05_14:37:31.82658 WARN [EsIndexRangeService] Couldn't load index
> range for index graylog_4293
> 2015-10-05_14:37:31.83269 WARN [IndexHelper] Couldn't find latest
> deflector target index
> 2015-10-05_14:37:31.83270 org.graylog2.database.NotFoundException:
> Couldn't find index range for index graylog_4293
> 2015-10-05_14:37:31.83271 at
> org.graylog2.indexer.ranges.EsIndexRangeService.loadIndexRange(EsIndexRangeService.java:156)
> 2015-10-05_14:37:31.83271 at
> org.graylog2.indexer.ranges.EsIndexRangeService.access$000(EsIndexRangeService.java:79)
> 2015-10-05_14:37:31.83271 at
> org.graylog2.indexer.ranges.EsIndexRangeService$1.load(EsIndexRangeService.java:107)
> 2015-10-05_14:37:31.83272 at
> org.graylog2.indexer.ranges.EsIndexRangeService$1.load(EsIndexRangeService.java:104)
> 2015-10-05_14:37:31.83272 at
> com.google.common.cache.LocalCache$LoadingValueReference.loadFuture(LocalCache.java:3527)
> 2015-10-05_14:37:31.83272 at
> com.google.common.cache.LocalCache$Segment.loadSync(LocalCache.java:2319)
> 2015-10-05_14:37:31.83273 at
> com.google.common.cache.LocalCache$Segment.lockedGetOrLoad(LocalCache.java:2282)
> 2015-10-05_14:37:31.83273 at
> com.google.common.cache.LocalCache$Segment.get(LocalCache.java:2197)
> 2015-10-05_14:37:31.83273 at
> com.google.common.cache.LocalCache.get(LocalCache.java:3937)
> 2015-10-05_14:37:31.83273 at
> com.google.common.cache.LocalCache.getOrLoad(LocalCache.java:3941)
> 2015-10-05_14:37:31.83274 at
> com.google.common.cache.LocalCache$LocalLoadingCache.get(LocalCache.java:4824)
> 2015-10-05_14:37:31.83274 at
> org.graylog2.indexer.ranges.EsIndexRangeService.get(EsIndexRangeService.java:131)
> 2015-10-05_14:37:31.83274 at
> org.graylog2.indexer.IndexHelper.determineAffectedIndicesWithRanges(IndexHelper.java:119)
> 2015-10-05_14:37:31.83275 at
> org.graylog2.indexer.searches.Searches.search(Searches.java:241)
> 2015-10-05_14:37:31.83275 at
> org.graylog2.rest.resources.search.RelativeSearchResource.searchRelative(RelativeSearchResource.java:103)
> 2015-10-05_14:37:31.83275 at
> sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 2015-10-05_14:37:31.83276 at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 2015-10-05_14:37:31.83277 at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 2015-10-05_14:37:31.83277 at
> java.lang.reflect.Method.invoke(Method.java:497)
> 2015-10-05_14:37:31.83277 at
> org.glassfish.jersey.server.model.internal.ResourceMethodInvocationHandlerFactory$1.invoke(ResourceMethodInvocationHandlerFactory.java:81)
> 2015-10-05_14:37:31.83277 at
> org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher$1.run(AbstractJavaResourceMethodDispatcher.java:144)
> 2015-10-05_14:37:31.83278 at
> org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher.invoke(AbstractJavaResourceMethodDispatcher.java:161)
> 2015-10-05_14:37:31.83278 at
> org.glassfish.jersey.server.model.internal.JavaResourceMethodDispatcherProvider$TypeOutInvoker.doDispatch(JavaResourceMethodDispatcherProvider.java:205)
> 2015-10-05_14:37:31.83278 at
> org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher.dispatch(AbstractJavaResourceMethodDispatcher.java:99)
> 2015-10-05_14:37:31.83278 at
> org.glassfish.jersey.server.model.ResourceMethodInvoker.invoke(ResourceMethodInvoker.java:389)
> 2015-10-05_14:37:31.83279 at
> org.glassfish.jersey.server.model.ResourceMethodInvoker.apply(ResourceMethodInvoker.java:347)
> 2015-10-05_14:37:31.83279 at
> org.glassfish.jersey.server.model.ResourceMethodInvoker.apply(ResourceMethodInvoker.java:102)
> 2015-10-05_14:37:31.83279 at
> org.glassfish.jersey.server.ServerRuntime$2.run(ServerRuntime.java:308)
> 2015-10-05_14:37:31.83279 at
> org.glassfish.jersey.internal.Errors$1.call(Errors.java:271)
> 2015-10-05_14:37:31.83280 at
> org.glassfish.jersey.internal.Errors$1.call(Errors.java:267)
> 2015-10-05_14:37:31.83280 at
> org.glassfish.jersey.internal.Errors.process(Errors.java:315)
> 2015-10-05_14:37:31.83280 at
> org.glassfish.jersey.internal.Errors.process(Errors.java:297)
> 2015-10-05_14:37:31.83280 at
> org.glassfish.jersey.internal.Errors.process(Errors.java:267)
> 2015-10-05_14:37:31.83281 at
> org.glassfish.jersey.process.internal.RequestScope.runInScope(RequestScope.java:317)
> 2015-10-05_14:37:31.83281 at
> org.glassfish.jersey.server.ServerRuntime.process(ServerRuntime.java:291)
> 2015-10-05_14:37:31.83281 at
> org.glassfish.jersey.server.ApplicationHandler.handle(ApplicationHandler.java:1140)
> 2015-10-05_14:37:31.83281 at
> org.graylog2.jersey.container.netty.NettyContainer.messageReceived(NettyContainer.java:356)
> 2015-10-05_14:37:31.83281 at
> org.jboss.netty.channel.SimpleChannelUpstreamHandler.handleUpstream(SimpleChannelUpstreamHandler.java:70)
> 2015-10-05_14:37:31.83282 at
> org.jboss.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:564)
> 2015-10-05_14:37:31.83283 at
> org.jboss.netty.channel.DefaultChannelPipeline$DefaultChannelHandlerContext.sendUpstream(DefaultChannelPipeline.java:791)
> 2015-10-05_14:37:31.83283 at
> org.jboss.netty.handler.execution.ChannelUpstreamEventRunnable.doRun(ChannelUpstreamEventRunnable.java:43)
> 2015-10-05_14:37:31.83283 at
> org.jboss.netty.handler.execution.ChannelEventRunnable.run(ChannelEventRunnable.java:67)
> 2015-10-05_14:37:31.83284 at
> com.codahale.metrics.InstrumentedExecutorService$InstrumentedRunnable.run(InstrumentedExecutorService.java:176)
> 2015-10-05_14:37:31.83284 at
> org.jboss.netty.handler.execution.MemoryAwareThreadPoolExecutor$MemoryAwareRunnable.run(MemoryAwareThreadPoolExecutor.java:606)
> 2015-10-05_14:37:31.83284 at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> 2015-10-05_14:37:31.83285 at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> 2015-10-05_14:37:31.83285 at java.lang.Thread.run(Thread.java:745)
> 2015-10-05_14:37:34.54268 ERROR [AnyExceptionClassMapper] Unhandled
> exception in REST resource
> 2015-10-05_14:37:34.54272 org.elasticsearch.ElasticsearchTimeoutException:
> Timeout waiting for task.
> 2015-10-05_14:37:34.54273 at
> org.elasticsearch.action.support.AdapterActionFuture.actionGet(AdapterActionFuture.java:73)
> 2015-10-05_14:37:34.54274 at
> org.graylog2.indexer.elasticsearch.GlobalTimeoutClient$GlobalTimeoutActionFuture.actionGet(GlobalTimeoutClient.java:150)
> 2015-10-05_14:37:34.54274 at
> org.graylog2.indexer.searches.Searches.histogram(Searches.java:467)
> 2015-10-05_14:37:34.54275 at
> org.graylog2.rest.resources.search.RelativeSearchResource.histogramRelative(RelativeSearchResource.java:251)
> 2015-10-05_14:37:34.54275 at
> sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 2015-10-05_14:37:34.54275 at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 2015-10-05_14:37:34.54276 at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 2015-10-05_14:37:34.54277 at
> java.lang.reflect.Method.invoke(Method.java:497)
> 2015-10-05_14:37:34.54278 at
> org.glassfish.jersey.server.model.internal.ResourceMethodInvocationHandlerFactory$1.invoke(ResourceMethodInvocationHandlerFactory.java:81)
> 2015-10-05_14:37:34.54279 at
> org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher$1.run(AbstractJavaResourceMethodDispatcher.java:144)
> 2015-10-05_14:37:34.54280 at
> org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher.invoke(AbstractJavaResourceMethodDispatcher.java:161)
> 2015-10-05_14:37:34.54280 at
> org.glassfish.jersey.server.model.internal.JavaResourceMethodDispatcherProvider$TypeOutInvoker.doDispatch(JavaResourceMethodDispatcherProvider.java:205)
> 2015-10-05_14:37:34.54280 at
> org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher.dispatch(AbstractJavaResourceMethodDispatcher.java:99)
> 2015-10-05_14:37:34.54281 at
> org.glassfish.jersey.server.model.ResourceMethodInvoker.invoke(ResourceMethodInvoker.java:389)
> 2015-10-05_14:37:34.54281 at
> org.glassfish.jersey.server.model.ResourceMethodInvoker.apply(ResourceMethodInvoker.java:347)
> 2015-10-05_14:37:34.54281 at
> org.glassfish.jersey.server.model.ResourceMethodInvoker.apply(ResourceMethodInvoker.java:102)
> 2015-10-05_14:37:34.54282 at
> org.glassfish.jersey.server.ServerRuntime$2.run(ServerRuntime.java:308)
> 2015-10-05_14:37:34.54282 at
> org.glassfish.jersey.internal.Errors$1.call(Errors.java:271)
> 2015-10-05_14:37:34.54282 at
> org.glassfish.jersey.internal.Errors$1.call(Errors.java:267)
> 2015-10-05_14:37:34.54283 at
> org.glassfish.jersey.internal.Errors.process(Errors.java:315)
> 2015-10-05_14:37:34.54283 at
> org.glassfish.jersey.internal.Errors.process(Errors.java:297)
> 2015-10-05_14:37:34.54285 at
> org.glassfish.jersey.internal.Errors.process(Errors.java:267)
> 2015-10-05_14:37:34.54285 at
> org.glassfish.jersey.process.internal.RequestScope.runInScope(RequestScope.java:317)
> 2015-10-05_14:37:34.54286 at
> org.glassfish.jersey.server.ServerRuntime.process(ServerRuntime.java:291)
> 2015-10-05_14:37:34.54286 at
> org.glassfish.jersey.server.ApplicationHandler.handle(ApplicationHandler.java:1140)
> 2015-10-05_14:37:34.54286 at
> org.graylog2.jersey.container.netty.NettyContainer.messageReceived(NettyContainer.java:356)
> 2015-10-05_14:37:34.54287 at
> org.jboss.netty.channel.SimpleChannelUpstreamHandler.handleUpstream(SimpleChannelUpstreamHandler.java:70)
> 2015-10-05_14:37:34.54287 at
> org.jboss.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:564)
> 2015-10-05_14:37:34.54287 at
> org.jboss.netty.channel.DefaultChannelPipeline$DefaultChannelHandlerContext.sendUpstream(DefaultChannelPipeline.java:791)
> 2015-10-05_14:37:34.54288 at
> org.jboss.netty.handler.execution.ChannelUpstreamEventRunnable.doRun(ChannelUpstreamEventRunnable.java:43)
> 2015-10-05_14:37:34.54288 at
> org.jboss.netty.handler.execution.ChannelEventRunnable.run(ChannelEventRunnable.java:67)
> 2015-10-05_14:37:34.54289 at
> com.codahale.metrics.InstrumentedExecutorService$InstrumentedRunnable.run(InstrumentedExecutorService.java:176)
> 2015-10-05_14:37:34.54289 at
> org.jboss.netty.handler.execution.MemoryAwareThreadPoolExecutor$MemoryAwareRunnable.run(MemoryAwareThreadPoolExecutor.java:606)
> 2015-10-05_14:37:34.54289 at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> 2015-10-05_14:37:34.54294 at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> 2015-10-05_14:37:34.54295 at java.lang.Thread.run(Thread.java:745)
> 2015-10-05_14:37:34.66572 ERROR [AnyExceptionClassMapper] Unhandled
> exception in REST resource
> 2015-10-05_14:37:34.66574
> com.google.common.util.concurrent.UncheckedExecutionException:
> org.elasticsearch.ElasticsearchTimeoutException: Timeout waiting for task.
> 2015-10-05_14:37:34.66575 at
> com.google.common.cache.LocalCache$Segment.get(LocalCache.java:2203)
> 2015-10-05_14:37:34.66575 at
> com.google.common.cache.LocalCache.get(LocalCache.java:3937)
> 2015-10-05_14:37:34.66576 at
> com.google.common.cache.LocalCache$LocalManualCache.get(LocalCache.java:4739)
> 2015-10-05_14:37:34.66576 at
> org.graylog2.rest.resources.sources.SourcesResource.list(SourcesResource.java:89)
> 2015-10-05_14:37:34.66577 at
> sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 2015-10-05_14:37:34.66578 at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 2015-10-05_14:37:34.66578 at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 2015-10-05_14:37:34.66579 at
> java.lang.reflect.Method.invoke(Method.java:497)
> 2015-10-05_14:37:34.66579 at
> org.glassfish.jersey.server.model.internal.ResourceMethodInvocationHandlerFactory$1.invoke(ResourceMethodInvocationHandlerFactory.java:81)
> 2015-10-05_14:37:34.66579 at
> org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher$1.run(AbstractJavaResourceMethodDispatcher.java:144)
> 2015-10-05_14:37:34.66580 at
> org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher.invoke(AbstractJavaResourceMethodDispatcher.java:161)
> 2015-10-05_14:37:34.66581 at
> org.glassfish.jersey.server.model.internal.JavaResourceMethodDispatcherProvider$TypeOutInvoker.doDispatch(JavaResourceMethodDispatcherProvider.java:205)
> 2015-10-05_14:37:34.66581 at
> org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher.dispatch(AbstractJavaResourceMethodDispatcher.java:99)
> 2015-10-05_14:37:34.66582 at
> org.glassfish.jersey.server.model.ResourceMethodInvoker.invoke(ResourceMethodInvoker.java:389)
> 2015-10-05_14:37:34.66582 at
> org.glassfish.jersey.server.model.ResourceMethodInvoker.apply(ResourceMethodInvoker.java:347)
> 2015-10-05_14:37:34.66582 at
> org.glassfish.jersey.server.model.ResourceMethodInvoker.apply(ResourceMethodInvoker.java:102)
> 2015-10-05_14:37:34.66583 at
> org.glassfish.jersey.server.ServerRuntime$2.run(ServerRuntime.java:308)
> 2015-10-05_14:37:34.66583 at
> org.glassfish.jersey.internal.Errors$1.call(Errors.java:271)
> 2015-10-05_14:37:34.66583 at
> org.glassfish.jersey.internal.Errors$1.call(Errors.java:267)
> 2015-10-05_14:37:34.66584 at
> org.glassfish.jersey.internal.Errors.process(Errors.java:315)
> 2015-10-05_14:37:34.66585 at
> org.glassfish.jersey.internal.Errors.process(Errors.java:297)
> 2015-10-05_14:37:34.66585 at
> org.glassfish.jersey.internal.Errors.process(Errors.java:267)
> 2015-10-05_14:37:34.66586 at
> org.glassfish.jersey.process.internal.RequestScope.runInScope(RequestScope.java:317)
> 2015-10-05_14:37:34.66586 at
> org.glassfish.jersey.server.ServerRuntime.process(ServerRuntime.java:291)
> 2015-10-05_14:37:34.66586 at
> org.glassfish.jersey.server.ApplicationHandler.handle(ApplicationHandler.java:1140)
> 2015-10-05_14:37:34.66587 at
> org.graylog2.jersey.container.netty.NettyContainer.messageReceived(NettyContainer.java:356)
> 2015-10-05_14:37:34.66587 at
> org.jboss.netty.channel.SimpleChannelUpstreamHandler.handleUpstream(SimpleChannelUpstreamHandler.java:70)
> 2015-10-05_14:37:34.66589 at
> org.jboss.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:564)
> 2015-10-05_14:37:34.66589 at
> org.jboss.netty.channel.DefaultChannelPipeline$DefaultChannelHandlerContext.sendUpstream(DefaultChannelPipeline.java:791)
> 2015-10-05_14:37:34.66591 at
> org.jboss.netty.handler.execution.ChannelUpstreamEventRunnable.doRun(ChannelUpstreamEventRunnable.java:43)
> 2015-10-05_14:37:34.66591 at
> org.jboss.netty.handler.execution.ChannelEventRunnable.run(ChannelEventRunnable.java:67)
> 2015-10-05_14:37:34.66593 at
> com.codahale.metrics.InstrumentedExecutorService$InstrumentedRunnable.run(InstrumentedExecutorService.java:176)
> 2015-10-05_14:37:34.66595 at
> org.jboss.netty.handler.execution.MemoryAwareThreadPoolExecutor$MemoryAwareRunnable.run(MemoryAwareThreadPoolExecutor.java:606)
> 2015-10-05_14:37:34.66595 at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> 2015-10-05_14:37:34.66595 at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> 2015-10-05_14:37:34.66596 at java.lang.Thread.run(Thread.java:745)
> 2015-10-05_14:37:34.66596 Caused by:
> org.elasticsearch.ElasticsearchTimeoutException: Timeout waiting for task.
> 2015-10-05_14:37:34.66597 at
> org.elasticsearch.action.support.AdapterActionFuture.actionGet(AdapterActionFuture.java:73)
> 2015-10-05_14:37:34.66597 at
> org.graylog2.indexer.elasticsearch.GlobalTimeoutClient$GlobalTimeoutActionFuture.actionGet(GlobalTimeoutClient.java:150)
> 2015-10-05_14:37:34.66597 at
> org.graylog2.indexer.searches.Searches.terms(Searches.java:283)
> 2015-10-05_14:37:34.66598 at
> org.graylog2.indexer.searches.Searches.terms(Searches.java:298)
> 2015-10-05_14:37:34.66598 at
> org.graylog2.rest.resources.sources.SourcesResource$1.call(SourcesResource.java:93)
> 2015-10-05_14:37:34.66598 at
> org.graylog2.rest.resources.sources.SourcesResource$1.call(SourcesResource.java:89)
> 2015-10-05_14:37:34.66599 at
> com.google.common.cache.LocalCache$LocalManualCache$1.load(LocalCache.java:4742)
> 2015-10-05_14:37:34.66600 at
> com.google.common.cache.LocalCache$LoadingValueReference.loadFuture(LocalCache.java:3527)
> 2015-10-05_14:37:34.66600 at
> com.google.common.cache.LocalCache$Segment.loadSync(LocalCache.java:2319)
> 2015-10-05_14:37:34.66600 at
> com.google.common.cache.LocalCache$Segment.lockedGetOrLoad(LocalCache.java:2282)
> 2015-10-05_14:37:34.66601 at
> com.google.common.cache.LocalCache$Segment.get(LocalCache.java:2197)
> 2015-10-05_14:37:34.66601 ... 35 more
> 2015-10-05_14:37:47.23951 INFO [EsIndexRangeService] Calculated range of
> [graylog_4293] in [59453ms].
> 2015-10-05_14:37:47.24239 INFO [CreateNewSingleIndexRangeJob] Created
> ranges for index graylog_4293.
> 2015-10-05_14:37:47.24246 INFO [SystemJobManager] SystemJob
> <82175554-6b6e-11e5-b272-0ae24e7f4757>
> [org.graylog2.indexer.ranges.CreateNewSingleIndexRangeJob] finished in
> 59456ms.
> 2015-10-05_14:37:47.45459 INFO [EsIndexRangeService] Calculated range of
> [graylog_4292] in [59672ms].
>
>
>
> On Monday, October 5, 2015 at 8:17:45 AM UTC-6, Jochen Schalanda wrote:
>>
>> Hi Jesse,
>>
>> did you manually trigger the calculation of all the index ranges?
>>
>> During normal operations of Graylog 1.2.x, only the index range of the
>> latest index should be calculated and stored (in contrast to Graylog 1.1.x
>> and earlier, which most of the time calculated all index ranges).
>>
>> Cheers,
>> Jochen
>>
>> On Saturday, 3 October 2015 19:11:15 UTC+2, Jesse Skrivseth wrote:
>>>
>>> A few examples of index recalc performance differences:
>>>
>>> In 1.1.6
>>> ...
>>> 2015-09-04_03:56:38.19431 INFO [RebuildIndexRangesJob] Calculated range
>>> of [graylog_2075] in [9ms].
>>> 2015-09-04_03:56:38.20487 INFO [RebuildIndexRangesJob] Calculated range
>>> of [graylog_1800] in [10ms].
>>> 2015-09-04_03:56:38.21176 INFO [RebuildIndexRangesJob] Calculated range
>>> of [graylog_1802] in [6ms].
>>> 2015-09-04_03:56:38.22028 INFO [RebuildIndexRangesJob] Calculated range
>>> of [graylog_1801] in [8ms].
>>> 2015-09-04_03:56:38.23003 INFO [RebuildIndexRangesJob] Calculated range
>>> of [graylog_1808] in [9ms].
>>> 2015-09-04_03:56:38.23855 INFO [RebuildIndexRangesJob] Calculated range
>>> of [graylog_1807] in [8ms].
>>> 2015-09-04_03:56:38.24877 INFO [RebuildIndexRangesJob] Calculated range
>>> of [graylog_1809] in [10ms].
>>> 2015-09-04_03:56:38.26011 INFO [RebuildIndexRangesJob] Calculated range
>>> of [graylog_1804] in [11ms].
>>> 2015-09-04_03:56:38.26902 INFO [RebuildIndexRangesJob] Calculated range
>>> of [graylog_1803] in [8ms].
>>> 2015-09-04_03:56:38.28049 INFO [RebuildIndexRangesJob] Calculated range
>>> of [graylog_1806] in [11ms].
>>> 2015-09-04_03:56:38.29289 INFO [RebuildIndexRangesJob] Calculated range
>>> of [graylog_1805] in [12ms].
>>> 2015-09-04_03:56:38.41557 INFO [RebuildIndexRangesJob] Done calculating
>>> index ranges for 800 indices. Took 10608ms.
>>>
>>> In 1.2.1:
>>> ...
>>> 2015-10-03_17:00:59.70210 INFO [EsIndexRangeService] Calculated range
>>> of [graylog_3358] in [1609ms].
>>> 2015-10-03_17:01:05.63202 INFO [EsIndexRangeService] Calculated range
>>> of [graylog_3357] in [50ms].
>>> 2015-10-03_17:01:08.85062 INFO [EsIndexRangeService] Calculated range
>>> of [graylog_3359] in [47ms].
>>> 2015-10-03_17:01:12.39922 INFO [EsIndexRangeService] Calculated range
>>> of [graylog_3365] in [119ms].
>>> 2015-10-03_17:01:15.80573 INFO [EsIndexRangeService] Calculated range
>>> of [graylog_3364] in [40ms].
>>> 2015-10-03_17:01:19.29222 INFO [EsIndexRangeService] Calculated range
>>> of [graylog_3367] in [43ms].
>>> 2015-10-03_17:01:22.98674 INFO [EsIndexRangeService] Calculated range
>>> of [graylog_3366] in [142ms].
>>> 2015-10-03_17:01:29.74936 INFO [EsIndexRangeService] Calculated range
>>> of [graylog_3361] in [120ms].
>>> 2015-10-03_17:01:34.11754 INFO [EsIndexRangeService] Calculated range
>>> of [graylog_3360] in [128ms].
>>> 2015-10-03_17:01:37.91544 INFO [EsIndexRangeService] Calculated range
>>> of [graylog_3363] in [134ms].
>>> 2015-10-03_17:01:41.41116 INFO [EsIndexRangeService] Calculated range
>>> of [graylog_3362] in [237ms].
>>> 2015-10-03_17:01:45.31576 INFO [RebuildIndexRangesJob] Done calculating
>>> index ranges for 1855 indices. Took 9551409ms.
>>>
>>> For good measure, I tried scaling the ES cluster up and down.
>>> Performance is exactly the same whether I run on a single large server vs 4
>>> smaller servers.
>>>
>>> On Friday, October 2, 2015 at 9:29:37 AM UTC-6, Jesse Skrivseth wrote:
>>>>
>>>> Note that for the 1800 indices we have in this instance, the ~5 second
>>>> delay between each index during range calculation adds up to ~150 minutes
>>>> of additional delay when calculating ranges. My "> 20 minutes" comment
>>>> should be more like "> 2.5 hours", plus actual time spent calculating
>>>> ranges.
>>>>
>>>
--
You received this message because you are subscribed to the Google Groups
"Graylog Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To view this discussion on the web visit
https://groups.google.com/d/msgid/graylog2/a33a1567-721a-43eb-af47-01aebdc13f1c%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.