Jianfeng Jia created ASTERIXDB-1399:
---------------------------------------
Summary: Sporadically exception happens when using the Similarity
function
Key: ASTERIXDB-1399
URL: https://issues.apache.org/jira/browse/ASTERIXDB-1399
Project: Apache AsterixDB
Issue Type: Bug
Reporter: Jianfeng Jia
Assignee: Taewoo Kim
Unfortunately, this happens only sporadically.
I created a `keyword` index and search for matching documents with
`similarity-jaccard(word-tokens($t.text_msg), word-tokens($keyword))`.
Sometimes the query throws an exception like the following:
{code}
Caused by: org.apache.hyracks.api.exceptions.HyracksDataException: java.lang.ArrayIndexOutOfBoundsException: 10
at org.apache.hyracks.storage.am.common.dataflow.IndexSearchOperatorNodePushable.close(IndexSearchOperatorNodePushable.java:203)
at org.apache.hyracks.dataflow.std.sort.AbstractExternalSortRunMerger.process(AbstractExternalSortRunMerger.java:167)
at org.apache.hyracks.dataflow.std.sort.AbstractSorterOperatorDescriptor$MergeActivity$1.initialize(AbstractSorterOperatorDescriptor.java:194)
at org.apache.hyracks.api.rewriter.runtime.SuperActivityOperatorNodePushable.lambda$initialize$0(SuperActivityOperatorNodePushable.java:83)
at org.apache.hyracks.api.rewriter.runtime.SuperActivityOperatorNodePushable$1.call(SuperActivityOperatorNodePushable.java:205)
at org.apache.hyracks.api.rewriter.runtime.SuperActivityOperatorNodePushable$1.call(SuperActivityOperatorNodePushable.java:202)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
... 3 more
Suppressed: java.lang.ArrayIndexOutOfBoundsException: 10
at org.apache.asterix.dataflow.data.nontagged.serde.AInt32SerializerDeserializer.getInt(AInt32SerializerDeserializer.java:50)
at org.apache.asterix.dataflow.data.nontagged.serde.AOrderedListSerializerDeserializer.getItemOffset(AOrderedListSerializerDeserializer.java:136)
at org.apache.asterix.runtime.evaluators.common.AsterixOrderedListIterator.getItemOffset(AsterixOrderedListIterator.java:28)
at org.apache.asterix.runtime.evaluators.common.AbstractAsterixListIterator.reset(AbstractAsterixListIterator.java:93)
at org.apache.asterix.runtime.evaluators.common.AbstractAsterixListIterator.reset(AbstractAsterixListIterator.java:153)
at org.apache.asterix.runtime.evaluators.common.SimilarityJaccardEvaluator.prepareLists(SimilarityJaccardEvaluator.java:145)
at org.apache.asterix.runtime.evaluators.common.SimilarityJaccardEvaluator.evaluate(SimilarityJaccardEvaluator.java:108)
at org.apache.asterix.runtime.evaluators.functions.GetItemDescriptor$GetItemEvalFactory$1.evaluate(GetItemDescriptor.java:97)
at org.apache.asterix.runtime.evaluators.functions.AndDescriptor$2$1.evaluate(AndDescriptor.java:92)
at org.apache.hyracks.algebricks.runtime.operators.std.StreamSelectRuntimeFactory$1.nextFrame(StreamSelectRuntimeFactory.java:137)
at org.apache.hyracks.dataflow.common.comm.io.AbstractFrameAppender.write(AbstractFrameAppender.java:93)
at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.flushAndReset(AbstractOneInputOneOutputOneFramePushRuntime.java:63)
at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.flushIfNotFailed(AbstractOneInputOneOutputOneFramePushRuntime.java:69)
at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:55)
at org.apache.hyracks.algebricks.runtime.operators.std.AssignRuntimeFactory$1.close(AssignRuntimeFactory.java:122)
at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:57)
at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:57)
at org.apache.hyracks.algebricks.runtime.operators.std.AssignRuntimeFactory$1.close(AssignRuntimeFactory.java:122)
at org.apache.hyracks.algebricks.runtime.operators.meta.AlgebricksMetaOperatorDescriptor$2.close(AlgebricksMetaOperatorDescriptor.java:153)
at org.apache.hyracks.storage.am.common.dataflow.IndexSearchOperatorNodePushable.close(IndexSearchOperatorNodePushable.java:227)
... 9 more
{code}
Any idea what the source of the problem could be?
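
For context, here is a rough sketch of the score that the `similarity-jaccard(word-tokens(...), word-tokens(...))` expression is expected to produce. It is a minimal set-based illustration, not the AsterixDB `SimilarityJaccardEvaluator`. The class `JaccardSketch`, its simple tokenizer, and the choice to return 0 for two empty inputs are all assumptions made for this example.

```java
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Illustrative only: a hypothetical stand-in, not AsterixDB code.
final class JaccardSketch {

    // Assumed tokenizer: lower-case the text and split on runs of
    // non-alphanumeric characters. The real word-tokens() may differ.
    static Set<String> wordTokens(String text) {
        Set<String> tokens = new HashSet<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{Alnum}]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Set-based Jaccard similarity: |A ∩ B| / |A ∪ B|.
    static double jaccard(Set<String> a, Set<String> b) {
        if (a.isEmpty() && b.isEmpty()) {
            return 0.0; // assumption: define the empty/empty case as 0
        }
        Set<String> intersection = new HashSet<>(a);
        intersection.retainAll(b);
        int unionSize = a.size() + b.size() - intersection.size();
        return (double) intersection.size() / unionSize;
    }
}
```

With this sketch, `JaccardSketch.jaccard(JaccardSketch.wordTokens("hello big world"), JaccardSketch.wordTokens("Hello, world"))` shares two tokens out of three distinct ones, so the score is 2/3.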