1309637127 commented on PR #4772:
URL: https://github.com/apache/seatunnel/pull/4772#issuecomment-1591106417

   This cause 2 problems:
   1、Seatunnel push data to only part of the nodes
   We have 6 nodes in the product environment and only 4 nodes have data, the 
other have no data.
   2、We expect the xxHash have the same xxHash result as Clickhouse
   We have further ETL job in Clickhouse, we use cust_no as the sharding key, 
we expect all the data concerning with the same cust_no stay in the same node, 
but we've test the xxHash result with Clickhouse result they are not equal.
   
   Below is the java xxHash and Clickhouse xxHash result "00000186620663":
   java xxHash: -1055457840867779448
   java xxHash & Long.MAX result: 8167914195986996360
   Clickhouse xxHash: 17391286232841772168
   
   And with the xxHash result seatunnel push the customer to shard0 and in 
clickhouse put the customer is in shard 5


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to