apallerlamudi commented on PR #28564:
URL: https://github.com/apache/airflow/pull/28564#issuecomment-1374316862

   [vchiapaikeo](https://github.com/vchiapaikeo)
   
   ```
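   from airflow import DAG
   from airflow.providers.google.cloud.transfers.gcs_to_bigquery import GCSToBigQueryOperator

   # default_args, GCS_BUCKET, pull_date and wi_project_id are defined elsewhere in the DAG file (not shown).
   with DAG('sample', schedule_interval='0 13 * * *', catchup=False,
            is_paused_upon_creation=True, default_args=default_args) as dag:

       # Load one newline-delimited JSON file from GCS, replacing the table contents.
       gcs_to_gbq = GCSToBigQueryOperator(
           task_id='gcs_to_gbq',
           bucket=GCS_BUCKET,
           source_objects="landing/sample_{d}.json".format(d=pull_date),
           autodetect=True,
           destination_project_dataset_table='{}.{}.{}'.format(wi_project_id, 'dataset', 'table'),
           source_format='NEWLINE_DELIMITED_JSON',
           allow_quoted_newlines=True,
           encoding='UTF-8',
           write_disposition='WRITE_TRUNCATE',
           dag=dag,
       )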
   ```
   
   **Error:**
   
   [2023-01-06, 23:26:20 UTC] {standard_task_runner.py:92} ERROR - Failed to execute job 161546 for task gcs_to_gbq (400 POST https://bigquery.googleapis.com/bigquery/v2/projects/jobs?prettyPrint=false: Field  "amount": "1"} already exists in schema; 167458)
   [2023-01-06, 23:26:20 UTC] {local_task_job.py:156} INFO - Task exited with return code 1
   [2023-01-06, 23:26:21 UTC] {local_task_job.py:279} INFO - 0 downstream tasks scheduled from follow-on schedule check
   
   I'm not sure why the operator raises a schema error for a load that uses write_disposition='WRITE_TRUNCATE' and autodetect=True.
   
   This error never occurred before; the task worked until last week.
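   
   One observation, offered as a guess rather than a confirmed diagnosis: the field name in the error (` "amount": "1"}`) looks like the tail of a JSON record, not a column name. The minimal sketch below uses a hypothetical record (the real file's contents are not shown here) to illustrate that splitting a JSON line on commas produces that kind of fragment, which might mean the file is being parsed as CSV rather than as NEWLINE_DELIMITED_JSON.

```python
import json

# Hypothetical record for illustration only; not taken from the actual file.
record = {"id": "42", "amount": "1"}

# One line of a newline-delimited JSON file.
line = json.dumps(record)

# Split the line on commas, the way a naive CSV reader would.
csv_like_fields = line.split(",")

# The last piece resembles the "field" named in the error message.
print(repr(csv_like_fields[-1]))
```

   If the fragment printed here matches the shape of the name in the log, it may be worth checking whether source_format is actually reaching the load job configuration.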
   
   I hope this information helps.

