Hi all, I have been investigating the improvements for Pandas API on Spark specifically in UI. I chatted with a couple of people, and decided to send an email here to discuss more.
Currently, both SQL and DataFrame API are shown in “SQL” tab as below: [image: Screen Shot 2022-03-25 at 12.18.14 PM.png] which makes sense to developers because DataFrame API shares the same SQL core but I do believe this makes less sense to end users. Please consider two more points: - Spark ML users will run DataFrame-based MLlib API, but they will have to check the "SQL" tab. - Pandas API on Spark arguably has no link to SQL itself conceptually. It makes less sense to users of pandas API. So I would like to propose to rename: - "SQL" to "SQL/DataFrame" - "Query" to "Execution" There's a PR open at https://github.com/apache/spark/pull/35973. Please let me know your thoughts on this. Thanks.