[ https://issues.apache.org/jira/browse/SPARK-37189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445960#comment-17445960 ]
pralabhkumar commented on SPARK-37189: -------------------------------------- IMHO , the issue is in pyspark.pandas.plot plotly.py plot_histogram method . arguments (kwargs) , passed by user are not passed to plotly when creating the figure. Therefore this issue is not just with title but can happen with other arguments like "activeshape" , "font". Once I passes the user argument to go.Layout , title issue is not happening(provided user passes title). [~yikunkero] . Please let me know , if I my understanding is correct , I can create a PR for it . > pyspark.pandas histogram accepts the range option but does not use it > --------------------------------------------------------------------- > > Key: SPARK-37189 > URL: https://issues.apache.org/jira/browse/SPARK-37189 > Project: Spark > Issue Type: Sub-task > Components: PySpark > Affects Versions: 3.2.0 > Reporter: Chuck Connell > Priority: Major > > In pyspark.pandas if you write a line like this > {quote}DF.plot.hist(bins=30, range=[0, 20], title="US Counties -- > DeathsPer100k (<20)") > {quote} > it compiles and runs, but the plot does not respect the range. All the values > are shown. > The workaround is to create a new DataFrame that pre-selects just the rows > you want, but line above should work also. -- This message was sent by Atlassian Jira (v8.20.1#820001) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org