Davis Zhang created HUDI-9587:
---------------------------------

             Summary: Secondary index design recap - Is it a good choice to 
track everything as String?
                 Key: HUDI-9587
                 URL: https://issues.apache.org/jira/browse/HUDI-9587
             Project: Apache Hudi
          Issue Type: Bug
          Components: index
            Reporter: Davis Zhang
             Fix For: 1.2.0


context https://issues.apache.org/jira/browse/HUDI-9566

The issue is record in data table can be of any data type

The record value tracked by secondary index is all converted to string via 
"toString" call.

What make things worse is the toString call can be invoked on various data 
type, based on how the read path and write path prepare keys to look up / 
updated keys for SI update.

 

We need to recap the design. For SI index to work, the acceptance criteria is

record matching mechanism in data table should be the same as record matching 
mechanism in the MDT.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to