apallerlamudi commented on PR #28564: URL: https://github.com/apache/airflow/pull/28564#issuecomment-1374316862
[vchiapaikeo](https://github.com/vchiapaikeo)

```
from airflow import DAG
from airflow.providers.google.cloud.transfers.gcs_to_bigquery import GCSToBigQueryOperator

# default_args, GCS_BUCKET, pull_date, and wi_project_id are defined elsewhere in the DAG file (not shown).
with DAG(
    'sample',
    schedule_interval='0 13 * * *',
    catchup=False,
    is_paused_upon_creation=True,
    default_args=default_args,
) as dag:
    gcs_to_gbq = GCSToBigQueryOperator(
        task_id='gcs_to_gbq',
        bucket=GCS_BUCKET,
        source_objects="landing/sample_{d}.json".format(d=pull_date),
        autodetect=True,
        destination_project_dataset_table='{}.{}.{}'.format(wi_project_id, 'dataset', 'table'),
        source_format='NEWLINE_DELIMITED_JSON',
        allow_quoted_newlines=True,
        encoding='UTF-8',
        write_disposition='WRITE_TRUNCATE',
        dag=dag,
    )
```

**Error:**

[2023-01-06, 23:26:20 UTC] {standard_task_runner.py:92} ERROR - Failed to execute job 161546 for task gcs_to_gbq (400 POST https://bigquery.googleapis.com/bigquery/v2/projects/jobs?prettyPrint=false: Field "amount": "1"} already exists in schema; 167458)
[2023-01-06, 23:26:20 UTC] {local_task_job.py:156} INFO - Task exited with return code 1
[2023-01-06, 23:26:21 UTC] {local_task_job.py:279} INFO - 0 downstream tasks scheduled from follow-on schedule check

I'm not sure why the operator throws a schema error for a load that uses `write_disposition='WRITE_TRUNCATE'` and `autodetect=True`. This never happened before; the same DAG worked until last week. I hope this information helps.
