Hi devs, I'd like to introduce a python SDK for paimon (paimon-python). Python users can use it to access paimon data more easily.
In the first version, I would leverage py4j to wrap Java SDK with python codes. Briefly speaking, py4j can start a JVM and load Java classes, so we can use it to access Paimon table Java API and get results in Python code. An example is flink-python: https://github.com/apache/flink/tree/master/flink-python I'd like to give an paimon example: ``` class FileStoreTable(object): @classmethod def create(cls, context: CatalogContext) -> 'FileStoreTable': *# gateway is built via py4j to access JVM* gateway = get_gateway() *# use gateway.jvm to access java classes* j_table = gateway.jvm.FileStoreTableFactory.create(context.to_j_catalog_context()) return FileStoreTable(j_table) def __init__(self, j_table): self.__j_table = j_table # wrap Java method def primary_keys(self) -> List[str]: return self.__j_table.primaryKeys() ``` Then we can wrap scan, read interface to read table and write, commit interface to write table via Python. Looking forward to your suggestions. Best Regards, Zelin Yu
