[ https://issues.apache.org/jira/browse/PHOENIX-5066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17615876#comment-17615876 ]
Istvan Toth commented on PHOENIX-5066: -------------------------------------- I made quite a bit of progress, but I'm a bit stuck now. On the server side a ThreadLocal seems to work well, as each scan has a dedicated thread. However, on the client side there is no dedicated thread for the query, some expressions (at least standard upserts) are evaluated in the application thread, so a ThreadLocal set from the Connection cannot solve the problem there. My current WIP patch at [https://github.com/apache/phoenix/pull/1504/commits/4a0994078933b7b6b80b7022aed48c522c747966] sets the ThreadLocal when the connection is created, but this is not a workable solution. The main problem is that the Expression and PType system is designed to be a proper functional system, without access to globals like Connection, or Statement. I was able to push the necessary context to some expression nodes, but a lot of the conversions happen in the PDate.toObject() ... methods. PDate.INSTANCE is designed to be a singleton. I am looking at the following solutions: - Setting and unsetting a ThreadLocal with the context at the JDBC entry/exit points as needed. - Adding an extra StatementContext parameter to most methods in the Expression objects, so that we eventually have the context when PDate.toObject() is called. – huge volume of changes, and very ugly - Pushing the StatementContext to every Expression object created, and using that when calling the PDate.toObject() methods - Breaking the rule that the PDataTypes are singletons, and injecting the context to PDataType objects as needed. I am looking for input on what would be the best of the above, or even better, if you have a more elegant solution. Fore reference, a client side stack trace of where we need to have ExpressionContext available: {noformat} java.lang.NullPointerException at org.apache.phoenix.util.ThreadExpressionCtx.parseTimestamp(ThreadExpressionCtx.java:46) at org.apache.phoenix.schema.types.PTimestamp.toObject(PTimestamp.java:135) at org.apache.phoenix.expression.LiteralExpression.newConstant(LiteralExpression.java:187) at org.apache.phoenix.expression.LiteralExpression.newConstant(LiteralExpression.java:172) at org.apache.phoenix.expression.LiteralExpression.newConstant(LiteralExpression.java:159) at org.apache.phoenix.compile.UpsertCompiler$UpdateColumnCompiler.visit(UpsertCompiler.java:970) at org.apache.phoenix.compile.ExpressionCompiler.visit(ExpressionCompiler.java:1) at org.apache.phoenix.parse.LiteralParseNode.accept(LiteralParseNode.java:80) at org.apache.phoenix.compile.UpsertCompiler.compile(UpsertCompiler.java:840) at org.apache.phoenix.jdbc.PhoenixStatement$ExecutableUpsertStatement.compilePlan(PhoenixStatement.java:1008) at org.apache.phoenix.jdbc.PhoenixStatement$ExecutableUpsertStatement.compilePlan(PhoenixStatement.java:1) at org.apache.phoenix.jdbc.PhoenixStatement$3.call(PhoenixStatement.java:539) at org.apache.phoenix.jdbc.PhoenixStatement$3.call(PhoenixStatement.java:1) at org.apache.phoenix.call.CallRunner.run(CallRunner.java:53) at org.apache.phoenix.jdbc.PhoenixStatement.executeMutation(PhoenixStatement.java:519) at org.apache.phoenix.jdbc.PhoenixStatement.executeMutation(PhoenixStatement.java:507) at org.apache.phoenix.jdbc.PhoenixStatement.executeUpdate(PhoenixStatement.java:2151) at org.apache.phoenix.end2end.CompliantDateTimeIT.testUpsertTZ(CompliantDateTimeIT.java:233) <JUnit setup deleted for brevity>{noformat} > The TimeZone is incorrectly used during writing or reading data > --------------------------------------------------------------- > > Key: PHOENIX-5066 > URL: https://issues.apache.org/jira/browse/PHOENIX-5066 > Project: Phoenix > Issue Type: Bug > Affects Versions: 5.0.0, 4.14.1 > Reporter: Jaanai Zhang > Assignee: Istvan Toth > Priority: Critical > Fix For: 5.3.0 > > Attachments: DateTest.java, PHOENIX-5066.4x.v1.patch, > PHOENIX-5066.4x.v2.patch, PHOENIX-5066.4x.v3.patch, > PHOENIX-5066.master.v1.patch, PHOENIX-5066.master.v2.patch, > PHOENIX-5066.master.v3.patch, PHOENIX-5066.master.v4.patch, > PHOENIX-5066.master.v5.patch, PHOENIX-5066.master.v6.patch > > Time Spent: 20m > Remaining Estimate: 0h > > We have two methods to write data when uses JDBC API. > #1. Uses _the exceuteUpdate_ method to execute a string that is an upsert SQL. > #2. Uses the _prepareStatement_ method to set some objects and execute. > The _string_ data needs to convert to a new object by the schema information > of tables. we'll use some date formatters to convert string data to object > for Date/Time/Timestamp types when writes data and the formatters are used > when reads data as well. > > *Uses default timezone test* > Writing 3 records by the different ways. > {code:java} > UPSERT INTO date_test VALUES (1,'2018-12-10 15:40:47','2018-12-10 > 15:40:47','2018-12-10 15:40:47') > UPSERT INTO date_test VALUES (2,to_date('2018-12-10 > 15:40:47'),to_time('2018-12-10 15:40:47'),to_timestamp('2018-12-10 15:40:47')) > stmt.setInt(1, 3);stmt.setDate(2, date);stmt.setTime(3, > time);stmt.setTimestamp(4, ts); > {code} > Reading the table by the getObject(getDate/getTime/getTimestamp) methods. > {code:java} > 1 | 2018-12-10 | 23:45:07 | 2018-12-10 23:45:07.0 > 2 | 2018-12-10 | 23:45:07 | 2018-12-10 23:45:07.0 > 3 | 2018-12-10 | 15:45:07 | 2018-12-10 15:45:07.66 > {code} > Reading the table by the getString methods > {code:java} > 1 | 2018-12-10 15:45:07.000 | 2018-12-10 15:45:07.000 | 2018-12-10 > 15:45:07.000 > 2 | 2018-12-10 15:45:07.000 | 2018-12-10 15:45:07.000 | 2018-12-10 > 15:45:07.000 > 3 | 2018-12-10 07:45:07.660 | 2018-12-10 07:45:07.660 | 2018-12-10 > 07:45:07.660 > {code} > *Uses GMT+8 test* > Writing 3 records by the different ways. > {code:java} > UPSERT INTO date_test VALUES (1,'2018-12-10 15:40:47','2018-12-10 > 15:40:47','2018-12-10 15:40:47') > UPSERT INTO date_test VALUES (2,to_date('2018-12-10 > 15:40:47'),to_time('2018-12-10 15:40:47'),to_timestamp('2018-12-10 15:40:47')) > stmt.setInt(1, 3);stmt.setDate(2, date);stmt.setTime(3, > time);stmt.setTimestamp(4, ts); > {code} > Reading the table by the getObject(getDate/getTime/getTimestamp) methods. > {code:java} > 1 | 2018-12-10 | 23:40:47 | 2018-12-10 23:40:47.0 > 2 | 2018-12-10 | 15:40:47 | 2018-12-10 15:40:47.0 > 3 | 2018-12-10 | 15:40:47 | 2018-12-10 15:40:47.106 {code} > Reading the table by the getString methods > {code:java} > 1 | 2018-12-10 23:40:47.000 | 2018-12-10 23:40:47.000 | 2018-12-10 > 23:40:47.000 > 2 | 2018-12-10 15:40:47.000 | 2018-12-10 15:40:47.000 | 2018-12-10 > 15:40:47.000 > 3 | 2018-12-10 15:40:47.106 | 2018-12-10 15:40:47.106 | 2018-12-10 > 15:40:47.106 > {code} > > _We_ have a historical problem, we'll parse the string to > Date/Time/Timestamp objects with timezone in #1, which means the actual data > is going to be changed when stored in HBase table。 -- This message was sent by Atlassian Jira (v8.20.10#820010)