myfjdthink commented on issue #10452:
URL: https://github.com/apache/doris/issues/10452#issuecomment-1503032968
Update:
I kept trying on version 1.2.2 and configured the catalog as follows:
```sql
CREATE CATALOG iceberg PROPERTIES (
"type"="iceberg",
"warehouse" = "gs://gcs-bucket-xxx/hive-warehouse",
"iceberg.catalog.type"="hms",
"hive.metastore.uris" = "thrift://10.106.199.213:9083",
"AWS_ACCESS_KEY" = "xxx",
"AWS_SECRET_KEY" = "xxx",
"AWS_ENDPOINT" = "https://storage.googleapis.com/",
"AWS_REGION" = "us-east4",
"use_path_style" = "true"
);
```
With this configuration the Hive metastore is reachable, and listing databases and tables works fine:
```sql
SWITCH iceberg;
show databases;
```
This works.
```sql
use public;
show tables;
```
This works too.
Reading data from a table, however, ran into the following problems:
1. `org.apache.hadoop.fs.UnsupportedFileSystemException: No FileSystem for scheme "gs"`
Resolved by adding the gcs-connector-hadoop3-latest.jar dependency.
2. Unknown file gs location for hms table
Resolved by changing the location check in
fe/fe-core/src/main/java/org/apache/doris/planner/external/iceberg/IcebergScanProvider.java
so that it also accepts GCS.
3. The BE does not recognize gs files
> errCode = 2, detailMessage = failed to init reader for file gs://gcs-bucket-xxx/hive-warehouse/public.db/table/data/xxx_bucket=6/20230411_075903_15214_h7ekb-1c62bdb7-6982-412d-8114-aeb0da5a6647.orc, err: s3 uri is invalid: gs://gcs-bucket-xxx/hive-warehouse/public.db/xxx-7efe259cb5274eeabfb6d6ba803f1843/data/xxx_bucket=6/20230411_075903_15214_h7ekb-1c62bdb7-6982-412d-8114-aeb0da5a6647.orc
I searched for the error keywords but could not find where in the code this error is raised, so I am blocked here.
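
For reference, here is a minimal sketch of the kind of handling items 2 and 3 seem to call for. It is not the actual Doris code: `GcsLocationSketch`, `FileType`, `classify` and `toS3Uri` are hypothetical names, and the example path is made up. It assumes that a `gs://` location can be treated like an S3 location because the catalog above points `AWS_ENDPOINT` at `https://storage.googleapis.com/`. Whether the BE would then read the file successfully is untested.

```java
import java.net.URI;

class GcsLocationSketch {
    // Hypothetical stand-in for the file type that the FE passes to the BE.
    enum FileType { S3, HDFS }

    // Classify a table location by its URI scheme. Treating "gs" as S3 is an
    // assumption based on the S3-style endpoint configured in the catalog.
    static FileType classify(String location) {
        String scheme = URI.create(location).getScheme();
        if (scheme == null) {
            throw new IllegalArgumentException("Location has no scheme: " + location);
        }
        switch (scheme) {
            case "s3":
            case "s3a":
            case "s3n":
            case "gs":
                return FileType.S3;
            case "hdfs":
                return FileType.HDFS;
            default:
                throw new IllegalArgumentException("Unknown file location: " + location);
        }
    }

    // Rewrite gs://bucket/key to s3://bucket/key so that an S3 URI parser
    // accepts it. Any other location is returned unchanged.
    static String toS3Uri(String location) {
        String prefix = "gs://";
        return location.startsWith(prefix)
                ? "s3://" + location.substring(prefix.length())
                : location;
    }

    public static void main(String[] args) {
        // Hypothetical path, shaped like the one in the error message.
        String loc = "gs://gcs-bucket-xxx/hive-warehouse/public.db/table/data/file.orc";
        System.out.println(classify(loc));
        System.out.println(toS3Uri(loc));
    }
}
```

If this direction is right, the rewrite would have to happen before the path reaches the BE's S3 reader, which is where the "s3 uri is invalid" message appears to come from.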