In the cube design, the COUNT_DISTINCT measure uses the global dictionary.
When a cube is built, a dictionary file is generated on HDFS under the
directory
"/resources/GlobalDict/dict/database.tableName/column/". At the same time, a
metadata entry is written to the HBase metadata store with the rowkey
"/dict/database.tableName/column/". The value of that entry
records the corresponding HDFS path.
The problem is that neither the HDFS data nor the HBase metadata is deleted
when the cube is deleted. In addition, neither the HDFS path nor the HBase
rowkey identifies a specific cube, so if the same table column uses a global
dictionary in more than one cube, those cubes end up sharing a single
dictionary.
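
To illustrate the sharing described above, here is a minimal sketch that only models the key layout from this report. The helper names, the in-memory `metadata` dict standing in for the HBase metadata store, and the sample cube, database, table and column names are all hypothetical; none of this is Kylin's actual code.

```python
# Hypothetical model of the key layout described in this report.
# None of these names are real Kylin APIs.

def global_dict_rowkey(database, table, column):
    # Rowkey as described: "/dict/database.tableName/column/".
    # Note that the cube name does not appear anywhere in it.
    return f"/dict/{database}.{table}/{column}/"

def global_dict_hdfs_dir(database, table, column):
    # HDFS directory as described: "/resources/GlobalDict" + rowkey.
    return "/resources/GlobalDict" + global_dict_rowkey(database, table, column)

# Stand-in for the HBase metadata store: rowkey -> HDFS path.
metadata = {}

def register_dict(cube, database, table, column):
    # The cube argument is accepted but cannot influence the key,
    # which is exactly the issue being reported.
    key = global_dict_rowkey(database, table, column)
    metadata[key] = global_dict_hdfs_dir(database, table, column)
    return key

key_a = register_dict("cube_a", "db", "sales", "user_id")
key_b = register_dict("cube_b", "db", "sales", "user_id")
```

Both cubes resolve to the same rowkey and the same HDFS directory, so there is only one entry, and nothing in that entry says which cube owns it.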
