yihua commented on code in PR #18876: URL: https://github.com/apache/hudi/pull/18876#discussion_r3322268391
########## website/docs/writing_data.md: ########## @@ -467,6 +467,22 @@ The following advanced storage configuration options were added in Hudi 1.2.0: | Config | Default | Description | |---|---|---| | `hoodie.parquet.write.config.injector.class` | (none) | Fully-qualified class name of a custom `HoodieParquetConfigInjector` implementation. Use this to inject custom Parquet writer properties (e.g., disable dictionary encoding, set bloom filter sizes) without modifying the Hudi source. The implementing class must implement `org.apache.hudi.io.HoodieParquetConfigInjector`. | +| `hoodie.table.base.file.format` | `parquet` | Base file format for the table. Accepts `parquet`, `orc`, `hfile`, or `lance`. See [Lance File Format](lance_file_format.md) for the Lance-specific options. | + +### Writing VECTOR, BLOB, and VARIANT Columns + +Hudi 1.2.0 introduces three new column types that participate in writes the same way as standard +SQL types: + +- **`VECTOR(dim[, FLOAT|DOUBLE|INT8])`** — written as an array of floats matching the declared + dimension. With the DataFrame API, stamp `hudi_type=VECTOR(dim)` metadata on the column. Review Comment: Example added -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
