[
https://issues.apache.org/jira/browse/SPARK-37189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445960#comment-17445960
]
pralabhkumar edited comment on SPARK-37189 at 11/18/21, 2:52 PM:
-
was (Author: pralabhkumar):
IMHO , the issue is in pyspark.pandas.plot plotly.py plot_histogram method .
arguments (kwargs) , passed by user are not passed to plotly when creating the
figure. Therefore this issue is not just with title but can happen with other
arguments like "activeshape" , "font".
Once I passes the user argument to go.Layout , title issue is not
happening(provided user passes title).
[~yikunkero] . Please let me know , if I my understanding is correct , I can
create a PR for it .
> pyspark.pandas histogram accepts the range option but does not use it
> -
>
> Key: SPARK-37189
> URL: https://issues.apache.org/jira/browse/SPARK-37189
> Project: Spark
> Issue Type: Sub-task
> Components: PySpark
>Affects Versions: 3.2.0
>Reporter: Chuck Connell
>Priority: Major
>
> In pyspark.pandas if you write a line like this
> {quote}DF.plot.hist(bins=30, range=[0, 20], title="US Counties --
> DeathsPer100k (<20)")
> {quote}
> it compiles and runs, but the plot does not respect the range. All the values
> are shown.
> The workaround is to create a new DataFrame that pre-selects just the rows
> you want, but line above should work also.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)
-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org