Github user xuchuanyin commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/2252#discussion_r194603907
  
    --- Diff: 
core/src/main/java/org/apache/carbondata/core/datastore/chunk/store/impl/safe/SafeVariableLengthDimensionDataChunkStore.java
 ---
    @@ -56,7 +57,8 @@ public SafeVariableLengthDimensionDataChunkStore(boolean 
isInvertedIndex, int nu
        * @param invertedIndexReverse inverted index reverse to be stored
        * @param data                 data to be stored
        */
    -  @Override public void putArray(final int[] invertedIndex, final int[] 
invertedIndexReverse,
    +  @Override
    +  public void putArray(final int[] invertedIndex, final int[] 
invertedIndexReverse,
    --- End diff --
    
    @kumarvishal09 yeah, you are right. Considering that scenario will make the 
code much more complex.
    As we all know, a maximum of string/bytearray is about 2GB. I think it is 
enough for current scenarios. 2GB can support average 67104 bytes per column 
(2GB/32000) in one column page. 
    
    Having discussed with @jackylk , we have the following conclusions:
    1. Add an error message to indicate that the maximum size of one column 
page is 2GB.
    2. We can reduce the row size in the page to support longer characters, for 
example if the content is 67104*2, user can reduce the row size from 32000 to 
16000.
    
    1 will be included in this PR and 2 will be implemented in another PR.
    Is that OK? @kumarvishal09 


---

Reply via email to