adrienpacifico opened a new pull request, #45013:
URL: https://github.com/apache/arrow/pull/45013

   Improve error log to display the list of duplicated columns instead of the 
full columns list.
   
   
   ### Rationale for this change
   
   Getting the full list of columns of the dataframe is not usefull, knowing 
which columns is duplicated is.
   I tought that having all duplicated columns of the dataframe is better than 
having only the set of duplicated column names 
(`df.columns[df.columns.duplicated()].unique().tolist()` )
   
   ### Are these changes tested?
   No, very minor changes. Checked that f-strings are compatible with Python 
3.9 (oldest compatible Python version in the pyproject.toml .
   
   ### Are there any user-facing changes?
   No


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to