[ https://issues.apache.org/jira/browse/SPARK-34944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kent Yao reassigned SPARK-34944: -------------------------------- Assignee: Kent Yao > Employ correct data type for web_returns and store_returns in TPCDS tests > ------------------------------------------------------------------------- > > Key: SPARK-34944 > URL: https://issues.apache.org/jira/browse/SPARK-34944 > Project: Spark > Issue Type: Improvement > Components: SQL > Affects Versions: 3.2.0 > Reporter: Kent Yao > Assignee: Kent Yao > Priority: Major > > {noformat} > 2.2.2 Datatype > 2.2.2.1 Each column employs one of the following datatypes: > a) Identifier means that the column shall be able to hold any key value > generated for that column. > b) Integer means that the column shall be able to exactly represent integer > values (i.e., values in increments of > 1) in the range of at least ( − 2n − 1) to (2n − 1 − 1), where n is 64. > c) Decimal(d, f) means that the column shall be able to represent decimal > values up to and including d digits, > of which f shall occur to the right of the decimal place; the values can be > either represented exactly or > interpreted to be in this range. > d) Char(N) means that the column shall be able to hold any string of > characters of a fixed length of N. > Comment: If the string that a column of datatype char(N) holds is shorter > than N characters, then trailing > spaces shall be stored in the database or the database shall automatically > pad with spaces upon retrieval such > that a CHAR_LENGTH() function will return N. > e) Varchar(N) means that the column shall be able to hold any string of > characters of a variable length with a > maximum length of N. Columns defined as "varchar(N)" may optionally be > implemented as "char(N)". > f) Date means that the column shall be able to express any calendar day > between January 1, 1900 and > December 31, 2199. > 2.2.2.2 The datatypes do not correspond to any specific SQL-standard > datatype. The definitions are provided to > highlight the properties that are required for a particular column. The > benchmark implementer may employ any internal representation or SQL datatype > that meets those requirements. > {noformat} > one thing might be clear that we should replace bigint type which is now used > in web_returns and store_returns with int type. > another thing that might need to be further discussed is - shall we use > bigint to meet 2.2.2.1 b)? -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org