Hyukjin Kwon created SPARK-50041:
------------------------------------

             Summary: Show proper error message in memory profiler
                 Key: SPARK-50041
                 URL: https://issues.apache.org/jira/browse/SPARK-50041
             Project: Spark
          Issue Type: Bug
          Components: PySpark
    Affects Versions: 4.0.0
            Reporter: Hyukjin Kwon


Running

{code}
from pyspark.sql.functions import pandas_udf
df = spark.range(10)

@pandas_udf("long")
def add1(x):
  return x + 1

spark.conf.set("spark.sql.pyspark.udf.profiler", "memory")

added = df.select(add1("id"))
added.show()
spark.profile.show(type="memory")
{code}

produces a raw traceback like the following instead of a clear error message:

{code}
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/.../python/pyspark/sql/profiler.py", line 320, in show
    self.profiler_collector.show_memory_profiles(id)
  File "/.../python/pyspark/sql/profiler.py", line 145, in show_memory_profiles
    show(id)
  File "/.../python/pyspark/sql/profiler.py", line 139, in show
    MemoryProfiler._show_results(cm)
  File "/.../python/pyspark/profiler.py", line 497, in _show_results
    tmp = template.format(lineno, total_mem, inc, occurrences, all_lines[lineno - 1])
{code}
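
The traceback above is cut off before its final exception line, so the actual failure is not shown here. As a rough illustration only, assume the indexing `all_lines[lineno - 1]` fails because the source lines of the UDF are unavailable (for example, when it was defined interactively). Under that assumption, a "proper error message" could look like the sketch below. The helper `format_profile_line`, the template string, and the sample values are all hypothetical and are not taken from PySpark.

```python
# Hypothetical sketch; NOT the actual PySpark implementation.
# Assumption: the failure is an out-of-range index into `all_lines`.
def format_profile_line(template, lineno, total_mem, inc, occurrences, all_lines):
    """Format one profile row, or fail with a readable message if the source line is missing."""
    if not 1 <= lineno <= len(all_lines):
        raise RuntimeError(
            f"Cannot display memory profile: source line {lineno} is not available "
            f"({len(all_lines)} source lines found)."
        )
    return template.format(lineno, total_mem, inc, occurrences, all_lines[lineno - 1])


# Made-up template and values, for illustration only.
template = "{0:>6} {1:>12} {2:>12} {3:>10}   {4}"
source = ["def add1(x):", "  return x + 1"]

# Normal case: the requested source line exists.
print(format_profile_line(template, 2, "100.0 MiB", "0.5 MiB", 1, source))

# Failure case: no source lines were found, so a descriptive error is raised.
try:
    format_profile_line(template, 2, "100.0 MiB", "0.5 MiB", 1, [])
except RuntimeError as e:
    print(e)
```

The bounds check turns a bare indexing failure into a message that names the missing line, which is the kind of behavior the issue title asks for. Whether the real fix should raise, warn, or skip the line is left open here.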




--
This message was sent by Atlassian Jira
(v8.20.10#820010)
