zhdech opened a new issue, #8478: URL: https://github.com/apache/seatunnel/issues/8478
### Search before asking - [X] I had searched in the [issues](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22bug%22) and found no similar issues. ### What happened When reading tar.gz files through the sftp method, in the file_fFormat type="text" mode, if the files inside do not have an extension, seatunnel will not be able to read the file by default. It is expected that when reading the file, it does not depend on the file extension 通过sftp 方式读取tar.gz 文件时,在file_format_type = "text" 模式下,如果里面的文件没有扩展名时,seatunnel 默认读不到该文件,期望:读文件时不依赖于文件扩展名 ### SeaTunnel Version dev ### SeaTunnel Config ```conf env { parallelism = 1 job.mode = "BATCH" } source { SftpFile { host = "192.168.1.1" port = 22 user = "root" password = "1231223" path = "/data/20241213060019.tar.gz" archive_compress_codec = "TAR_GZ" file_format_type = "text" result_table_name = "sftp" schema = { fields { ATTR = "string" } } } } sink { Console { source_table_name = "sftp" parallelism = 1 } } ``` ### Running Command ```shell bin/seatunnel.sh --config /config/v2.batch.config.template ``` ### Error Exception ```log No error reported, unable to read file 没有报错,无法读取文件 ``` ### Zeta or Flink or Spark Version _No response_ ### Java or Scala Version 1.8 ### Screenshots _No response_ ### Are you willing to submit PR? - [X] Yes I am willing to submit a PR! ### Code of Conduct - [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
