adsharma commented on issue #679:
URL: 
https://github.com/apache/incubator-graphar/issues/679#issuecomment-3320973762

   I will read through the GAR spec and align where it makes sense. For example 
use `vertices.parquet` instead of `nodes.parquet`. Remove filename prefix and 
rely on the directory name for the prefix etc.
   
   Where we diverge:
   
   * I'm exclusively focusing on CSR (sort by source, then by edge attributes 
including type, then by destination).
   * Use kuzu catalog for metadata (avoid json, yaml). Similar to DuckLake.
   * Potential support for duckdb single file storage in the future
   * uv + python based tooling to take any arbitrary billion scale graph, 
configurable sort as explained above and produce a directory in the standard 
format.
   
   Open datalakes for graphs is an exciting idea. Thank you for bringing it up!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@graphar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@graphar.apache.org
For additional commands, e-mail: commits-h...@graphar.apache.org

Reply via email to