I am aware of this JIRA and have gone through it and the patch. I would
like to point out that I am not facing the problem in the #4 step. I am
facing it in the #17 step i.e. build cube. Also neither is the cardinality
of  my column high nor is it hitting the 1GB mark.

I am using Kylin 1.5.2.1 but I don't think that should be a problem because
I am not hitting any of the limits that would cause the problem.

Thanks,
-Joel



On Mon, Oct 10, 2016 at 11:55 PM, Alberto Ramón <[email protected]>
wrote:

> can you check this bug: KYLIN-1834
> <https://issues.apache.org/jira/browse/KYLIN-1834>
> check if your actual version is 1.5.2  there is a new version 1.5.4.2
>
> good luck, Alb
>
> 2016-10-10 18:38 GMT+02:00 Joel Victor <[email protected]>:
>
>> I have come across this error where I get a exception in the cube build
>> step (step #17) which says that a particular key does not exist in the trie
>> dictionary.
>>
>> The build dictionary step says that the value is present in the
>> dictionary. I have deduced this by looking at the Kylin debug logs.
>> Following is the log
>> 2016-10-10 05:43:06,956 DEBUG [pool-5-thread-6]
>> dict.DictionaryGenerator:86 : Dictionary value samples: =>0,
>> 20160628082452279-7bdd009d55a794c=>17606, 
>> 20140225082303539-a91c9daed8602d1=>1,
>> 20140225082452582-d55ca8b438418c4=>2, 20140225082509763-e9c208ceff68
>> ea1=>3
>>
>>
>> Also I went back and checked the input for the build dimension dictionary
>> step. In particular the -input paramter
>>
>> I found that the id for which the error is surfacing has a occurs twice in 
>> the -input file.
>>
>> I wanted to ask that whether there Is a chance that duplicate values in the 
>> input file while creating the dimension dictionary would cause the error 
>> mentioned above in the build cube (#17) step?
>>
>> From what I can tell there shouldn't be a duplicate values since the step 
>> before that is to extract distinct values.
>>
>> Following is the stack trace for the cube build step for reference:
>>
>> 2016-10-10 05:45:17,461 ERROR [Thread-11] 
>> org.apache.kylin.dict.TrieDictionary: Not a valid value:
>> 20160628082452279-7bdd009d55a794c
>> 2016-10-10 05:45:18,462 ERROR [pool-5-thread-1] 
>> org.apache.kylin.cube.inmemcubing.DoggedCubeBuilder: Dogged Cube Build error
>> java.io.IOException: java.lang.IllegalArgumentException: Value not exists!
>>      at 
>> org.apache.kylin.cube.inmemcubing.DoggedCubeBuilder$BuildOnce.abort(DoggedCubeBuilder.java:193)
>>      at 
>> org.apache.kylin.cube.inmemcubing.DoggedCubeBuilder$BuildOnce.checkException(DoggedCubeBuilder.java:166)
>>      at 
>> org.apache.kylin.cube.inmemcubing.DoggedCubeBuilder$BuildOnce.build(DoggedCubeBuilder.java:113)
>>      at 
>> org.apache.kylin.cube.inmemcubing.DoggedCubeBuilder.build(DoggedCubeBuilder.java:72)
>>      at 
>> org.apache.kylin.cube.inmemcubing.AbstractInMemCubeBuilder$1.run(AbstractInMemCubeBuilder.java:74)
>>      at 
>> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
>>      at java.util.concurrent.FutureTask.run(FutureTask.java:262)
>>      at 
>> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>>      at 
>> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>>      at java.lang.Thread.run(Thread.java:745)
>> Caused by: java.lang.IllegalArgumentException: Value not exists!
>>      at 
>> org.apache.kylin.dimension.Dictionary.getIdFromValueBytes(Dictionary.java:160)
>>      at 
>> org.apache.kylin.dict.TrieDictionary.getIdFromValueImpl(TrieDictionary.java:158)
>>      at 
>> org.apache.kylin.dimension.Dictionary.getIdFromValue(Dictionary.java:96)
>>      at 
>> org.apache.kylin.dimension.DictionaryDimEnc$DictionarySerializer.serialize(DictionaryDimEnc.java:120)
>>      at 
>> org.apache.kylin.cube.gridtable.CubeCodeSystem.encodeColumnValue(CubeCodeSystem.java:122)
>>      at 
>> org.apache.kylin.cube.gridtable.CubeCodeSystem.encodeColumnValue(CubeCodeSystem.java:111)
>>      at org.apache.kylin.gridtable.GTRecord.setValues(GTRecord.java:99)
>>      at org.apache.kylin.gridtable.GTRecord.setValues(GTRecord.java:87)
>>      at 
>> org.apache.kylin.cube.inmemcubing.InMemCubeBuilderInputConverter.convert(InMemCubeBuilderInputConverter.java:75)
>>      at 
>> org.apache.kylin.cube.inmemcubing.InMemCubeBuilder$InputConverter$1.next(InMemCubeBuilder.java:540)
>>      at 
>> org.apache.kylin.cube.inmemcubing.InMemCubeBuilder$InputConverter$1.next(InMemCubeBuilder.java:521)
>>      at 
>> org.apache.kylin.gridtable.GTAggregateScanner.iterator(GTAggregateScanner.java:133)
>>      at 
>> org.apache.kylin.cube.inmemcubing.InMemCubeBuilder.createBaseCuboid(InMemCubeBuilder.java:337)
>>      at 
>> org.apache.kylin.cube.inmemcubing.InMemCubeBuilder.build(InMemCubeBuilder.java:164)
>>      at 
>> org.apache.kylin.cube.inmemcubing.InMemCubeBuilder.build(InMemCubeBuilder.java:133)
>>      at 
>> org.apache.kylin.cube.inmemcubing.DoggedCubeBuilder$SplitThread.run(DoggedCubeBuilder.java:281)
>>
>>
>> Thanks,
>>
>> -Joel
>>
>>
>>
>

Reply via email to