[ https://issues.apache.org/jira/browse/SPARK-36559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hyukjin Kwon resolved SPARK-36559. ---------------------------------- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 33807 [https://github.com/apache/spark/pull/33807] > Allow column pruning on distributed sequence index (pandas API on Spark) > ------------------------------------------------------------------------ > > Key: SPARK-36559 > URL: https://issues.apache.org/jira/browse/SPARK-36559 > Project: Spark > Issue Type: Improvement > Components: PySpark, SQL > Affects Versions: 3.2.0 > Reporter: Hyukjin Kwon > Assignee: Hyukjin Kwon > Priority: Major > Fix For: 3.2.0 > > > https://issues.apache.org/jira/browse/SPARK-36338 implemented distributed > sequence implementation on JVM side. However, it disables leveraging Spark > SQL engine because it creates an RDD directly, and truncate the SQL plans. > We should move the logic into a proper SQL plan to leverage other > optimizations such as column pruning. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org