GitHub user avamingli added a comment to the discussion: Extend the gpfdist
tool to support SFTP/HDFS protocols for high-performance multi-source data
ingestion
Hi, interesting!
As far as I know, gpfdist is a protocol for external tables, and the gpfdist
tool is an implementation of that protocol. Other tools, like GPSS (Greenplum
Streaming Server), also implement the gpfdist protocol for external tables. The
data format is specified by the format option when defining the external table.
```sql
CREATE EXTERNAL TABLE ext1 (d varchar(20)) location ('gpfdist://9727/d.dat')
format 'csv' (DELIMITER '|');
```
Are you planning to support additional protocols(ex: SFTP) for external tables
:
```sql
CREATE EXTERNAL TABLE ext1 (d varchar(20)) location ('SFTP://9727/d.dat')
```
or provide more format options?
```sql
CREATE EXTERNAL TABLE ext1 (d varchar(20)) location ('gpfdist://9727/d.dat')
format 'SFTP' ;
```
GitHub link:
https://github.com/apache/cloudberry/discussions/1205#discussioncomment-13636225
----
This is an automatically sent email for [email protected].
To unsubscribe, please send an email to: [email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]