Keith Turner created ACCUMULO-1867:
--------------------------------------
Summary: Split failed during conditional randomwalk test
Key: ACCUMULO-1867
URL: https://issues.apache.org/jira/browse/ACCUMULO-1867
Project: Accumulo
Issue Type: Bug
Reporter: Keith Turner
Priority: Critical
Fix For: 1.6.0
I left the conditional random walk test running overnight against
1.6.0-SNAPSHOT configured to use two namenodes. After running for a few hours,
a client saw a split operation fail, and I saw the following corresponding
error message in the tserver logs.
{noformat}
2013-11-08 12:31:59,227 [util.FileUtil] DEBUG: Too many indexes (33) to open at once for null null, reducing in tmpDir = /accumulo-1.6/tmp/idxReduce_1116774712
2013-11-08 12:31:59,369 [thrift.ProcessFunction] ERROR: Internal error processing splitTablet
java.lang.IllegalArgumentException: Wrong FS: hdfs://nn2:9001/accumulo-1.6/tables/2/t-0000ew3/F0000ex7.rf, expected: hdfs://nn1:6093
    at org.apache.hadoop.fs.FileSystem.checkPath(FileSystem.java:381)
    at org.apache.hadoop.hdfs.DistributedFileSystem.getPathName(DistributedFileSystem.java:129)
    at org.apache.hadoop.hdfs.DistributedFileSystem.open(DistributedFileSystem.java:154)
    at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:427)
    at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader.getBCFile(CachableBlockFile.java:256)
    at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader.access$000(CachableBlockFile.java:143)
    at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader$MetaBlockLoader.get(CachableBlockFile.java:212)
    at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader.getBlock(CachableBlockFile.java:313)
    at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader.getMetaBlock(CachableBlockFile.java:367)
    at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader.getMetaBlock(CachableBlockFile.java:143)
    at org.apache.accumulo.core.file.rfile.RFile$Reader.<init>(RFile.java:825)
    at org.apache.accumulo.core.file.rfile.RFileOperations.openIndex(RFileOperations.java:63)
    at org.apache.accumulo.core.file.DispatchingFileFactory.openIndex(FileOperations.java:66)
    at org.apache.accumulo.server.util.FileUtil.reduceFiles(FileUtil.java:135)
    at org.apache.accumulo.server.util.FileUtil.estimatePercentageLTE(FileUtil.java:207)
    at org.apache.accumulo.tserver.Tablet.split(Tablet.java:3527)
    at org.apache.accumulo.tserver.TabletServer.splitTablet(TabletServer.java:2648)
    at org.apache.accumulo.tserver.TabletServer.access$1600(TabletServer.java:237)
    at org.apache.accumulo.tserver.TabletServer$ThriftClientHandler.splitTablet(TabletServer.java:2095)
    at sun.reflect.GeneratedMethodAccessor34.invoke(Unknown Source)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.accumulo.trace.instrument.thrift.TraceWrap$1.invoke(TraceWrap.java:63)
    at $Proxy10.splitTablet(Unknown Source)
    at org.apache.accumulo.core.tabletserver.thrift.TabletClientService$Processor$splitTablet.getResult(TabletClientService.java:2531)
    at org.apache.accumulo.core.tabletserver.thrift.TabletClientService$Processor$splitTablet.getResult(TabletClientService.java:2515)
    at org.apache.thrift.ProcessFunction.process(ProcessFunction.java:39)
    at org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:39)
    at org.apache.accumulo.server.util.TServerUtils$TimedProcessor.process(TServerUtils.java:159)
    at org.apache.thrift.server.AbstractNonblockingServer$FrameBuffer.invoke(AbstractNonblockingServer.java:478)
    at org.apache.accumulo.server.util.TServerUtils$THsHaServer$Invocation.run(TServerUtils.java:214)
    at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
    at org.apache.accumulo.trace.instrument.TraceRunnable.run(TraceRunnable.java:47)
    at org.apache.accumulo.core.util.LoggingRunnable.run(LoggingRunnable.java:34)
    at java.lang.Thread.run(Thread.java:662)
{noformat}
nn1 is the default namenode. The "Too many indexes" message may be important.
That message indicates the split code entered the special code path that
handles tablets with lots of files (FileUtil.reduceFiles in the trace above).
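
To make the failure mode concrete, here is a minimal sketch of how a "Wrong FS" error can arise when a file that lives on one namenode is opened through a filesystem handle bound to another. This is not the Accumulo or Hadoop code: WrongFsSketch, Volume, checkPath and volumeFor are hypothetical names, and the check only imitates the scheme/authority comparison implied by the error message. Whether reduceFiles actually uses the default filesystem here is an assumption, not something the trace proves.

```java
import java.net.URI;
import java.util.HashMap;
import java.util.Map;

public class WrongFsSketch {

    /** Hypothetical stand-in for a filesystem handle bound to a single namenode. */
    static final class Volume {
        final URI root;

        Volume(String root) {
            this.root = URI.create(root);
        }

        /** Rejects fully qualified paths whose scheme or authority differs from this volume's. */
        void checkPath(URI path) {
            if (path.getScheme() == null) {
                return; // unqualified paths resolve against this volume
            }
            boolean sameScheme = path.getScheme().equalsIgnoreCase(root.getScheme());
            boolean sameAuthority = path.getAuthority() != null
                    && path.getAuthority().equalsIgnoreCase(root.getAuthority());
            if (!sameScheme || !sameAuthority) {
                throw new IllegalArgumentException("Wrong FS: " + path + ", expected: " + root);
            }
        }
    }

    /** Picks the volume named by the path's authority; unqualified paths use the default. */
    static Volume volumeFor(URI path, Map<String, Volume> volumes, Volume defaultVolume) {
        if (path.getAuthority() == null) {
            return defaultVolume;
        }
        Volume v = volumes.get(path.getAuthority());
        if (v == null) {
            throw new IllegalArgumentException("No volume configured for " + path);
        }
        return v;
    }

    public static void main(String[] args) {
        Volume nn1 = new Volume("hdfs://nn1:6093"); // the default namenode
        Volume nn2 = new Volume("hdfs://nn2:9001");
        Map<String, Volume> volumes = new HashMap<>();
        volumes.put("nn1:6093", nn1);
        volumes.put("nn2:9001", nn2);

        URI file = URI.create("hdfs://nn2:9001/accumulo-1.6/tables/2/t-0000ew3/F0000ex7.rf");

        try {
            nn1.checkPath(file); // always going through the default volume fails
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }

        // Resolving the volume from the path itself passes the same check.
        Volume resolved = volumeFor(file, volumes, nn1);
        resolved.checkPath(file);
        System.out.println("resolved to " + resolved.root);
    }
}
```

In real Hadoop code the analogous choice is between a filesystem obtained for the default URI and one obtained from the path itself (for example via Path.getFileSystem(Configuration)); the sketch only shows why the former breaks once files are spread across two namenodes.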