Chosen123Wang opened a new issue, #9113: URL: https://github.com/apache/seatunnel/issues/9113
### Search before asking

- [x] I had searched in the [issues](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22bug%22) and found no similar issues.

### What happened

Hive export data to MySQL

I am using a Zeta cluster to export a Hive table stored as Parquet to MySQL. While reading the Hive file, the job fails with the error: Illegal initial character: 7_work_day_gmv

### SeaTunnel Version

2.3.8

### SeaTunnel Config

```conf
env {
  parallelism = 1
  job.mode = "BATCH"
}

source {
  Hive {
    table_name = "ads.table"
    metastore_uri = "thrift://localhost:7004"
    hive.hadoop.conf-path = "/app/hadoop/etc/hadoop/"
    read_columns = [shelf_id,7_work_day_gmv]
    read_partitions = ["dt=20250402"]
    compress_codec = "parquet"
  }
}

sink {
  jdbc {
    url = "jdbc:mysql://locahost:3306/fe_dm?useUnicode=true&characterEncoding=UTF-8&rewriteBatchedStatements=true"
    driver = "com.mysql.cj.jdbc.Driver"
    user = "xx"
    password = "xx"
    batch_size = 50000
    query = "insert into xxx (shelf_id,7_work_day_gmv) values (?,?)"
    enable_upsert = "false"
  }
}
```

### Running Command

```shell
/app/seatunnel/bin/seatunnel.sh --config /tmp/test.conf --name test
```

### Error Exception

```log
Exception in thread "main" org.apache.seatunnel.core.starter.exception.CommandExecuteException: SeaTunnel job executed failed
	at org.apache.seatunnel.core.starter.seatunnel.command.ClientExecuteCommand.execute(ClientExecuteCommand.java:213)
	at org.apache.seatunnel.core.starter.SeaTunnel.run(SeaTunnel.java:40)
	at org.apache.seatunnel.core.starter.seatunnel.SeaTunnelClient.main(SeaTunnelClient.java:34)
Caused by: org.apache.seatunnel.engine.common.exception.SeaTunnelEngineException: org.apache.seatunnel.connectors.seatunnel.file.exception.FileConnectorException: ErrorCode:[FILE-08], ErrorDescription:[File read failed] - Read data from this file [table_hdfs:///table/dt=20250402/000000_0] failed
	at org.apache.seatunnel.connectors.seatunnel.hive.source.reader.MultipleTableHiveSourceReader.pollNext(MultipleTableHiveSourceReader.java:87)
	at org.apache.seatunnel.engine.server.task.flow.SourceFlowLifeCycle.collect(SourceFlowLifeCycle.java:159)
	at org.apache.seatunnel.engine.server.task.SourceSeaTunnelTask.collect(SourceSeaTunnelTask.java:127)
	at org.apache.seatunnel.engine.server.task.SeaTunnelTask.stateProcess(SeaTunnelTask.java:168)
	at org.apache.seatunnel.engine.server.task.SourceSeaTunnelTask.call(SourceSeaTunnelTask.java:132)
	at org.apache.seatunnel.engine.server.TaskExecutionService$BlockingWorker.run(TaskExecutionService.java:693)
	at org.apache.seatunnel.engine.server.TaskExecutionService$NamedTaskWrapper.run(TaskExecutionService.java:1018)
	at org.apache.seatunnel.api.tracing.MDCRunnable.run(MDCRunnable.java:39)
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
	at java.lang.Thread.run(Thread.java:748)
Caused by: org.apache.seatunnel.shade.connector.hive.org.apache.avro.SchemaParseException: Illegal initial character: 7_work_day_gmv
	at org.apache.seatunnel.shade.connector.hive.org.apache.avro.Schema.validateName(Schema.java:1562)
	at org.apache.seatunnel.shade.connector.hive.org.apache.avro.Schema.access$400(Schema.java:91)
	at org.apache.seatunnel.shade.connector.hive.org.apache.avro.Schema$Name.<init>(Schema.java:701)
	at org.apache.seatunnel.shade.connector.hive.org.apache.avro.Schema.createFixed(Schema.java:257)
	at org.apache.seatunnel.shade.connector.hive.org.apache.parquet.avro.AvroSchemaConverter$1.convertFIXED_LEN_BYTE_ARRAY(AvroSchemaConverter.java:348)
	at org.apache.seatunnel.shade.connector.hive.org.apache.parquet.avro.AvroSchemaConverter$1.convertFIXED_LEN_BYTE_ARRAY(AvroSchemaConverter.java:313)
	at org.apache.seatunnel.shade.connector.hive.org.apache.parquet.schema.PrimitiveType$PrimitiveTypeName$8.convert(PrimitiveType.java:369)
	at org.apache.seatunnel.shade.connector.hive.org.apache.parquet.avro.AvroSchemaConverter.convertField(AvroSchemaConverter.java:312)
	at org.apache.seatunnel.shade.connector.hive.org.apache.parquet.avro.AvroSchemaConverter.convertFields(AvroSchemaConverter.java:290)
	at org.apache.seatunnel.shade.connector.hive.org.apache.parquet.avro.AvroSchemaConverter.convert(AvroSchemaConverter.java:279)
	at org.apache.seatunnel.shade.connector.hive.org.apache.parquet.avro.AvroReadSupport.prepareForRead(AvroReadSupport.java:134)
	at org.apache.seatunnel.shade.connector.hive.org.apache.parquet.hadoop.InternalParquetRecordReader.initialize(InternalParquetRecordReader.java:190)
	at org.apache.seatunnel.shade.connector.hive.org.apache.parquet.hadoop.ParquetReader.initReader(ParquetReader.java:166)
	at org.apache.seatunnel.shade.connector.hive.org.apache.parquet.hadoop.ParquetReader.read(ParquetReader.java:135)
	at org.apache.seatunnel.connectors.seatunnel.file.source.reader.ParquetReadStrategy.read(ParquetReadStrategy.java:111)
	at org.apache.seatunnel.connectors.seatunnel.hive.source.reader.MultipleTableHiveSourceReader.pollNext(MultipleTableHiveSourceReader.java:83)
	... 12 more
	at org.apache.seatunnel.core.starter.seatunnel.command.ClientExecuteCommand.execute(ClientExecuteCommand.java:205)
	... 2 more
2025-04-03 14:01:46,464 INFO [s.c.s.s.c.ClientExecuteCommand] [ForkJoinPool.commonPool-worker-2] - run shutdown hook because get close signal
```

### Zeta or Flink or Spark Version

zeta

### Java or Scala Version

1.8

### Screenshots

_No response_

### Are you willing to submit PR?

- [ ] Yes I am willing to submit a PR!

### Code of Conduct

- [x] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)

--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.
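The innermost cause in the trace is Avro's `Schema.validateName`, reached through `Schema.createFixed` while `AvroSchemaConverter` converts a `FIXED_LEN_BYTE_ARRAY` Parquet field. The Avro specification requires a name to start with `[A-Za-z_]` and to contain only `[A-Za-z0-9_]` afterwards, which `7_work_day_gmv` does not satisfy. The following is a minimal sketch of that rule applied to the two columns from `read_columns`. It is a regex restatement of the specification, not the Avro source, and the class and method names (`AvroNameCheck`, `isValidAvroName`) are hypothetical.

```java
import java.util.regex.Pattern;

public class AvroNameCheck {
    // Naming rule as stated in the Avro specification (assumed here, not copied
    // from Avro's code): first character [A-Za-z_], the rest [A-Za-z0-9_].
    private static final Pattern AVRO_NAME = Pattern.compile("[A-Za-z_][A-Za-z0-9_]*");

    // Hypothetical helper: true if the name would pass the spec's naming rule.
    public static boolean isValidAvroName(String name) {
        return name != null && AVRO_NAME.matcher(name).matches();
    }

    public static void main(String[] args) {
        // Column names taken from read_columns in the reported config.
        String[] readColumns = {"shelf_id", "7_work_day_gmv"};
        for (String column : readColumns) {
            String verdict = isValidAvroName(column) ? "valid" : "invalid";
            System.out.println(column + " -> " + verdict);
        }
    }
}
```

Running it prints `shelf_id -> valid` and `7_work_day_gmv -> invalid`, matching the column named in the `SchemaParseException`.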
