yuqi1129 commented on code in PR #9173:
URL: https://github.com/apache/gravitino/pull/9173#discussion_r2598226610

##########
docs/lance-rest-service.md:
##########
@@ -0,0 +1,397 @@
+---
+title: "Lance REST service"
+slug: /lance-rest-service
+keywords:
+  - Lance REST
+  - Lance datasets
+  - REST API
+license: "This software is licensed under the Apache License version 2."
+---
+
+import Tabs from '@theme/Tabs';
+import TabItem from '@theme/TabItem';
+
+## Overview
+
+The Lance REST service provides a RESTful interface for managing Lance datasets through HTTP endpoints. Introduced in Gravitino version 1.1.0, this service enables seamless interaction with Lance datasets for data operations and metadata management.
+
+The service implements the [Lance REST API specification](https://editor-next.swagger.io/?url=https://raw.githubusercontent.com/lancedb/lance-namespace/refs/heads/main/docs/src/rest.yaml). For detailed specification documentation, see the [official Lance REST documentation](https://lance.org/format/namespace/impls/rest/).
+
+### What is Lance?
+
+[Lance](https://lancedb.github.io/lance/) is a modern columnar data format designed for AI/ML workloads. It provides:
+
+- **High-performance vector search**: Native support for similarity search on high-dimensional embeddings
+- **Columnar storage**: Optimized for analytical queries and machine learning pipelines
+- **Fast random access**: Efficient row-level operations unlike traditional columnar formats
+- **Version control**: Built-in dataset versioning and time-travel capabilities
+- **Incremental updates**: Append and update data without full rewrites
+
+### Architecture
+
+The Lance REST service acts as a bridge between Lance datasets and applications:
+
+```
+┌─────────────────┐
+│  Applications   │
+│  (Python/Java)  │
+└────────┬────────┘
+         │ HTTP/REST
+         ▼
+┌─────────────────┐
+│   Lance REST    │◄──── Gravitino Metalake
+│     Service     │      (Metadata Backend)
+└────────┬────────┘
+         │ File System Operations
+         ▼
+┌─────────────────┐
+│ Lance Datasets  │
+│ (S3/HDFS/Local) │

Review Comment:
   I removed the word `HDFS` in the docs
