NineSue opened a new pull request, #782: URL: https://github.com/apache/incubator-graphar/pull/782
## PR Description ### Reason for this PR Implement direct LDBC SNB Datagen to GraphAr conversion pipeline in Spark, enabling in-memory data flow without intermediate CSV files. ### What changes are included in this PR? - Streaming bridge architecture for LDBC Datagen Spark API to GraphAr conversion. - Batch processors for static entities and streaming architecture for dynamic entities. - Unified ID management and schema auto-detector (**GraphAr v1.0 compliant**). - 9/22 entities generated (5 vertices + 4 edges) with **Parquet + YAML** output. - Example tests added (documentation pending upload). ### Are these changes tested? Yes ### Are there any user-facing changes? - **BREAKING CHANGE**: Remove legacy `converter/` package, restructure to `stream/` architecture. - **New APIs**: `LdbcStreamingBridge`, `LdbcGraphArBridge`, `LdbcConfigUtils`. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
