Wsyzxxn opened a new issue #1884: bucket number  not correct 
URL: https://github.com/apache/incubator-doris/issues/1884
 
 
   **Describe the bug**
   All buckets are selected for the scan, and the reported bucket count is doubled.
   
   **To Reproduce**
   Steps to reproduce the behavior:
   1.  CREATE TABLE `table_name` (
     `a` date NOT NULL ,
     `b` tinyint(4) NOT NULL DEFAULT "0" ,
     `c` tinyint(4) NULL ,
     `d` int(11) NOT NULL DEFAULT "0" ,
     `e` int(11) NOT NULL DEFAULT "0" ,
     `f` varchar(2560) NULL ,
     `g` bigint(20) NULL DEFAULT "0" ,
     `h` int NULL ,
     `i` hll HLL_UNION NOT NULL ,
     `j` varchar(0) BITMAP_UNION NOT NULL
   ) ENGINE=OLAP
   AGGREGATE KEY(`a`, `b`, `c`, `d`, `e`, `f`, `g`, `h`)
   PARTITION BY RANGE(`a`)
   (
   PARTITION p20190923 VALUES LESS THAN ('2019-09-24'),
   PARTITION p20190924 VALUES LESS THAN ('2019-09-25'),
   PARTITION p20190925 VALUES LESS THAN ('2019-09-26'),
   PARTITION p20190926 VALUES LESS THAN ('2019-10-01')
   )
   DISTRIBUTED BY HASH(`d`) BUCKETS 400
   PROPERTIES (
   "storage_type" = "COLUMN",
   "replication_num"="3"
   );
   2.  ALTER TABLE db.table_name 
           ADD ROLLUP rollup_name (a,b,c,d,e,f,g,h)
           PROPERTIES("storage_type"="column");
   3.  select count(distinct d) from table_name ;
   +---------------------------------+
   | count(DISTINCT `d`)             |
   +---------------------------------+
   |                            9249 |
   +---------------------------------+
   1 row in set (0.28 sec)
   4. See error
   explain select d from table_name where d = 1195;
   +----------------------------------------------------+
   | Explain String                                     |
   +----------------------------------------------------+
   | PLAN FRAGMENT 0                                    |
   |  OUTPUT EXPRS:`d` |                                |
   |   PARTITION: UNPARTITIONED                         |
   |                                                    |
   |   RESULT SINK                                      |
   |                                                    |
   |   1:EXCHANGE                                       |
   |      tuple ids: 0                                  |
   |                                                    |
   | PLAN FRAGMENT 1                                    |
   |  OUTPUT EXPRS:                                     |
   |   PARTITION: RANDOM                                |
   |                                                    |
   |   STREAM DATA SINK                                 |
   |     EXCHANGE ID: 01                                |
   |     UNPARTITIONED                                  |
   |                                                    |
   |   0:OlapScanNode                                   |
   |      TABLE: table_name                             |
   |      PREAGGREGATION: OFF. Reason: No AggregateInfo |
   |      PREDICATES: `d` = 1195                        |
   |      partitions=2/4                                |
   |      rollup: rollup_name                           |
   |      buckets=800/800                               |
   |      cardinality=412492234                         |
   |      avgRowSize=20.0389                            |
   |      numNodes=42                                   |
   |      tuple ids: 0                                  |
   +----------------------------------------------------+
   28 rows in set (0.00 sec)
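   **Issues**
   count(distinct d) is 9249, but when filtering on a single value of d, the EXPLAIN output shows that all buckets were selected (buckets=800/800).
   The bucket count is also doubled: the table was created with 400 buckets, but 800 are reported. This is not limited to this query; almost all queries on this table show the same problem.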
   
   **Expected behavior**
   Number of selected buckets = count(distinct d) / total number of buckets
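
One way to read this expectation, sketched in Python: with `DISTRIBUTED BY HASH(d)`, an equality predicate on `d` should pin a single bucket in each selected partition. The CRC32 hash, the 4-byte encoding, and the helper names `bucket_for` and `expected_scanned_buckets` are assumptions made for illustration only; they are not taken from the Doris source. The sketch also assumes the denominator in `buckets=` counts buckets across the selected partitions, in which case 2 partitions × 400 buckets would account for the 800 in the EXPLAIN output above.

```python
import zlib

NUM_BUCKETS = 400  # BUCKETS 400 from the CREATE TABLE statement


def bucket_for(value: int, num_buckets: int = NUM_BUCKETS) -> int:
    """Map a distribution-column value to a bucket index.

    Assumption: CRC32 over the 4-byte little-endian value stands in for
    whatever hash the engine really uses.
    """
    data = value.to_bytes(4, "little", signed=True)
    return zlib.crc32(data) % num_buckets


def expected_scanned_buckets(value: int, selected_partitions: int) -> int:
    """An equality predicate pins one bucket in each selected partition."""
    buckets = {(p, bucket_for(value)) for p in range(selected_partitions)}
    return len(buckets)


# d = 1195 with partitions=2/4, as in the EXPLAIN output above
scanned = expected_scanned_buckets(1195, selected_partitions=2)
total = 2 * NUM_BUCKETS
print(f"buckets={scanned}/{total}")  # prints "buckets=2/800"
```

Under these assumptions the plan would show `buckets=2/800` rather than `buckets=800/800`, so the pruning step is the part that looks wrong here, more than the total.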
   
   **Desktop (please complete the following information):**
    - OS: [e.g. Centos7]
    - Version [doris@aaabf97471527a81be4329c3bc6fb7c037352ec6]
   
