[
https://issues.apache.org/jira/browse/SPARK-43270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Beishao Cao updated SPARK-43270:
--------------------------------
Attachment: Screenshot 2023-04-23 at 6.48.46 PM-1.png
> Implement __dir__() in pyspark.sql.dataframe.DataFrame to include columns
> -------------------------------------------------------------------------
>
> Key: SPARK-43270
> URL: https://issues.apache.org/jira/browse/SPARK-43270
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 3.5.0
> Reporter: Beishao Cao
> Priority: Major
> Attachments: Screenshot 2023-04-23 at 6.48.46 PM.png
>
> Original Estimate: 24h
> Remaining Estimate: 24h
>
> Currently, completing {{df.|}} suggests only the methods of {{DataFrame}}
> (see the attached screenshot of a Databricks notebook), even though
> {{df.column_name}} is also valid.
> !image-2023-04-24-13-44-33-716.png|width=389,height=248!
> We should therefore override the inherited {{__dir__}} method on the Python
> {{DataFrame}} class to include column names. The benefit is that any engine
> that uses {{dir()}} to generate autocomplete suggestions (e.g. the IPython
> kernel, Databricks Notebooks) will then suggest column names when completing
> {{df.|}}.
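
A minimal sketch of the proposed override, for illustration only. It uses a hypothetical stand-in class (FrameSketch) rather than the real pyspark.sql.dataframe.DataFrame, and the filtering of names that are not valid Python identifiers is an assumption, not something the issue specifies.

```python
import keyword


class FrameSketch:
    """Hypothetical stand-in for a DataFrame; not the PySpark class."""

    def __init__(self, columns):
        self.columns = list(columns)

    def __getattr__(self, name):
        # Runs only when normal lookup fails, mimicking df.column_name access.
        if name in self.__dict__.get("columns", ()):
            return f"Column<{name}>"
        raise AttributeError(name)

    def __dir__(self):
        # Start from the inherited attribute list, then add column names.
        attrs = set(super().__dir__())
        # Assumption: only names that can be typed after "df." are useful,
        # so skip non-identifiers and Python keywords.
        attrs.update(
            c
            for c in self.columns
            if isinstance(c, str) and c.isidentifier() and not keyword.iskeyword(c)
        )
        return sorted(attrs)


df = FrameSketch(["age", "name", "first name", "class"])
suggestions = dir(df)
```

Here dir(df) would include "age" and "name" alongside the regular attributes, while "first name" and "class" are left out because they cannot follow "df." in source code; such columns would still need bracket-style access.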