Kent Yao created SPARK-34944:
--------------------------------
Summary: Employ correct data type for web_returns and
store_returns in TPCDS tests
Key: SPARK-34944
URL: https://issues.apache.org/jira/browse/SPARK-34944
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 3.2.0
Reporter: Kent Yao
{noformat}
2.2.2 Datatype
2.2.2.1 Each column employs one of the following datatypes:
a) Identifier means that the column shall be able to hold any key value
generated for that column.
b) Integer means that the column shall be able to exactly represent integer
values (i.e., values in increments of
1) in the range of at least ( − 2n − 1) to (2n − 1 − 1), where n is 64.
c) Decimal(d, f) means that the column shall be able to represent decimal
values up to and including d digits,
of which f shall occur to the right of the decimal place; the values can be
either represented exactly or
interpreted to be in this range.
d) Char(N) means that the column shall be able to hold any string of characters
of a fixed length of N.
Comment: If the string that a column of datatype char(N) holds is shorter than
N characters, then trailing
spaces shall be stored in the database or the database shall automatically pad
with spaces upon retrieval such
that a CHAR_LENGTH() function will return N.
e) Varchar(N) means that the column shall be able to hold any string of
characters of a variable length with a
maximum length of N. Columns defined as "varchar(N)" may optionally be
implemented as "char(N)".
f) Date means that the column shall be able to express any calendar day between
January 1, 1900 and
December 31, 2199.
2.2.2.2 The datatypes do not correspond to any specific SQL-standard datatype.
The definitions are provided to
highlight the properties that are required for a particular column. The
benchmark implementer may employ any internal representation or SQL datatype
that meets those requirements.
{noformat}
one thing might be clear that we should replace bigint type which is now used
in web_returns and store_returns with int type.
another thing that might need to be further discussed is - shall we use bigint
to meet 2.2.2.1 b)?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]