Kent Yao created SPARK-34944:
--------------------------------

             Summary: Employ correct data type for web_returns and 
store_returns in TPCDS tests
                 Key: SPARK-34944
                 URL: https://issues.apache.org/jira/browse/SPARK-34944
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.2.0
            Reporter: Kent Yao



{noformat}
2.2.2 Datatype
2.2.2.1 Each column employs one of the following datatypes:
a) Identifier means that the column shall be able to hold any key value 
generated for that column.
b) Integer means that the column shall be able to exactly represent integer 
values (i.e., values in increments of
1) in the range of at least ( − 2n − 1) to (2n − 1 − 1), where n is 64.
c) Decimal(d, f) means that the column shall be able to represent decimal 
values up to and including d digits,
of which f shall occur to the right of the decimal place; the values can be 
either represented exactly or
interpreted to be in this range.
d) Char(N) means that the column shall be able to hold any string of characters 
of a fixed length of N.
Comment: If the string that a column of datatype char(N) holds is shorter than 
N characters, then trailing
spaces shall be stored in the database or the database shall automatically pad 
with spaces upon retrieval such
that a CHAR_LENGTH() function will return N.
e) Varchar(N) means that the column shall be able to hold any string of 
characters of a variable length with a
maximum length of N. Columns defined as "varchar(N)" may optionally be 
implemented as "char(N)".
f) Date means that the column shall be able to express any calendar day between 
January 1, 1900 and
December 31, 2199.
2.2.2.2 The datatypes do not correspond to any specific SQL-standard datatype. 
The definitions are provided to
highlight the properties that are required for a particular column. The 
benchmark implementer may employ any internal representation or SQL datatype 
that meets those requirements.
{noformat}

one thing might be clear that we should replace bigint type which is now used 
in web_returns and store_returns with int type.
another thing that might need to be further discussed is - shall we use bigint 
to meet 2.2.2.1 b)?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to